Query lcl|NC_018838.1_cdsid_YP_006906407.1 [gene=6] [protein=putative capsid] [protein_id=YP_006906407.1] [location=4656..5603] Match_columns 315 No_of_seqs 124 out of 755 Neff 9.1 Searched_HMMs 1612 Date Thu Nov 7 12:43:09 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80684 Length: 315 100.0 2.2E-83 1.3E-86 473.8 32.6 315 1-315 1-315 (315) 2 protein:vir:8187 Length: 311 # 100.0 3.3E-67 2E-70 385.1 30.9 298 1-307 1-311 (311) 3 protein:vir:9574 Length: 300 # 100.0 8.8E-67 5.5E-70 382.7 30.5 296 1-306 1-300 (300) 4 protein:vir:9759 Length: 303 # 100.0 2.2E-65 1.4E-68 375.0 31.1 297 1-306 1-303 (303) 5 protein:vir:1638 Length: 298 # 100.0 2.3E-65 1.5E-68 374.9 30.5 292 1-305 1-298 (298) 6 protein:vir:94771 Length: 298 100.0 1.2E-64 7.6E-68 371.0 30.1 292 1-305 1-298 (298) 7 protein:vir:99920 Length: 311 100.0 1E-63 6.5E-67 365.9 30.1 298 1-306 1-311 (311) 8 protein:vir:78523 Length: 338 100.0 5.1E-63 3.2E-66 362.1 30.2 301 1-309 10-338 (338) 9 protein:vir:78223 Length: 333 100.0 6.9E-63 4.3E-66 361.4 30.1 299 1-307 10-333 (333) 10 protein:vir:7771 Length: 330 # 100.0 3.3E-61 2E-64 352.2 31.2 302 1-313 10-330 (330) 11 protein:vir:5739 Length: 366 # 100.0 1.4E-61 8.9E-65 354.2 29.0 291 1-306 64-366 (366) 12 protein:vir:97148 Length: 324 100.0 4.2E-61 2.6E-64 351.6 31.2 292 1-315 27-324 (324) 13 protein:vir:41 Length: 299 # N 100.0 3.8E-61 2.3E-64 351.9 30.3 288 1-307 6-299 (299) 14 protein:vir:94142 Length: 304 100.0 7.2E-61 4.4E-64 350.3 30.4 286 1-305 9-304 (304) 15 protein:vir:105905 Length: 304 100.0 7.2E-61 4.4E-64 350.3 30.4 286 1-305 9-304 (304) 16 protein:vir:2504 Length: 305 # 100.0 4.9E-61 3E-64 351.2 29.2 293 1-313 1-305 (305) 17 protein:vir:104085 Length: 320 100.0 1.9E-60 1.2E-63 347.9 30.4 297 1-309 14-320 (320) 18 protein:vir:80376 Length: 435 100.0 1.4E-60 8.7E-64 348.7 28.4 295 1-308 132-435 (435) 19 protein:vir:1433 Length: 435 # 100.0 1.6E-60 9.7E-64 348.5 28.2 295 1-308 132-435 (435) 20 protein:vir:103955 Length: 324 100.0 6.4E-60 4E-63 345.1 31.2 292 1-315 27-324 (324) 21 protein:vir:78830 Length: 324 100.0 8E-60 5E-63 344.6 30.8 292 1-315 27-324 (324) 22 protein:vir:96392 Length: 324 100.0 8E-60 5E-63 344.6 30.8 292 1-315 27-324 (324) 23 protein:vir:2344 Length: 397 # 100.0 6E-60 3.7E-63 345.3 28.6 299 1-315 10-321 (397) 24 protein:vir:100247 Length: 425 100.0 3.8E-60 2.4E-63 346.3 27.4 281 1-307 130-425 (425) 25 protein:vir:99749 Length: 324 100.0 1.9E-59 1.2E-62 342.5 31.2 292 1-315 27-324 (324) 26 protein:vir:9309 Length: 324 # 100.0 2.1E-59 1.3E-62 342.3 31.0 292 1-315 27-324 (324) 27 protein:vir:105038 Length: 428 100.0 7.9E-60 4.9E-63 344.6 27.5 291 1-306 125-428 (428) 28 protein:vir:93616 Length: 645 100.0 1.9E-59 1.2E-62 342.5 28.3 288 1-314 338-645 (645) 29 protein:vir:96223 Length: 324 100.0 6.4E-59 4E-62 339.6 30.9 292 1-315 27-324 (324) 30 protein:vir:485 Length: 407 # 100.0 3.4E-59 2.1E-62 341.1 28.6 287 1-313 106-407 (407) 31 protein:vir:4226 Length: 326 # 100.0 1.1E-58 6.9E-62 338.3 30.2 293 1-309 20-326 (326) 32 protein:vir:4456 Length: 401 # 100.0 2.8E-59 1.7E-62 341.6 26.9 280 1-306 107-401 (401) 33 protein:vir:2430 Length: 318 # 100.0 1.7E-58 1.1E-61 337.3 30.7 296 1-311 14-318 (318) 34 protein:vir:101650 Length: 497 100.0 1E-58 6.3E-62 338.5 28.4 287 1-310 151-497 (497) 35 protein:vir:7855 Length: 497 # 100.0 1E-58 6.3E-62 338.5 28.4 287 1-310 151-497 (497) 36 protein:vir:95763 Length: 297 100.0 9.2E-58 5.7E-61 333.3 30.4 282 1-310 9-297 (297) 37 protein:vir:8102 Length: 543 # 100.0 1.1E-57 6.5E-61 333.0 28.0 289 1-307 250-543 (543) 38 protein:vir:4511 Length: 409 # 100.0 9.8E-58 6.1E-61 333.1 27.3 289 1-309 117-409 (409) 39 protein:vir:81227 Length: 413 100.0 2.5E-57 1.6E-60 330.8 29.1 287 1-309 118-413 (413) 40 protein:vir:95376 Length: 425 100.0 8E-58 5E-61 333.6 26.4 281 1-313 138-425 (425) 41 protein:vir:6242 Length: 390 # 100.0 1.5E-57 9.5E-61 332.1 27.3 276 1-307 110-390 (390) 42 protein:vir:96762 Length: 632 100.0 6.6E-58 4.1E-61 334.1 25.1 271 1-305 357-632 (632) 43 protein:vir:102082 Length: 392 100.0 8.5E-57 5.3E-60 328.0 28.6 281 1-314 106-392 (392) 44 protein:vir:102873 Length: 392 100.0 8.5E-57 5.3E-60 328.0 28.6 281 1-314 106-392 (392) 45 protein:vir:105004 Length: 392 100.0 8.5E-57 5.3E-60 328.0 28.6 281 1-314 106-392 (392) 46 protein:vir:107593 Length: 392 100.0 8.5E-57 5.3E-60 328.0 28.6 281 1-314 106-392 (392) 47 protein:vir:102119 Length: 404 100.0 2.9E-56 1.8E-59 325.1 29.1 289 1-310 110-404 (404) 48 protein:vir:100135 Length: 418 100.0 4E-56 2.5E-59 324.3 29.6 281 1-314 135-418 (418) 49 protein:vir:1328 Length: 392 # 100.0 2.1E-56 1.3E-59 325.8 28.0 278 1-307 110-392 (392) 50 protein:vir:81160 Length: 371 100.0 5.8E-56 3.6E-59 323.4 28.4 274 1-306 91-371 (371) 51 protein:vir:4953 Length: 397 # 100.0 9.7E-56 6E-59 322.2 28.2 281 1-315 109-394 (397) 52 protein:vir:4339 Length: 395 # 100.0 1.6E-55 9.8E-59 321.0 29.2 278 1-306 113-395 (395) 53 protein:vir:104256 Length: 458 100.0 1.1E-55 7E-59 321.8 28.2 283 1-306 162-458 (458) 54 protein:vir:4856 Length: 293 # 100.0 2.1E-55 1.3E-58 320.4 29.5 281 1-315 5-290 (293) 55 protein:vir:1025 Length: 408 # 100.0 2.3E-55 1.5E-58 320.1 29.3 280 1-315 116-402 (408) 56 protein:vir:81070 Length: 390 100.0 5.6E-55 3.4E-58 318.0 29.0 275 1-304 113-390 (390) 57 protein:vir:97053 Length: 390 100.0 4.9E-55 3E-58 318.3 28.5 275 1-304 113-390 (390) 58 protein:vir:1886 Length: 385 # 100.0 7.2E-55 4.5E-58 317.4 28.8 278 1-307 105-385 (385) 59 protein:vir:191 Length: 385 # 100.0 7.2E-55 4.5E-58 317.4 28.8 278 1-307 105-385 (385) 60 protein:vir:4997 Length: 397 # 100.0 7.1E-55 4.4E-58 317.5 28.7 281 1-315 109-397 (397) 61 protein:vir:3991 Length: 404 # 100.0 1.2E-54 7.2E-58 316.3 28.9 282 1-315 116-404 (404) 62 protein:vir:4830 Length: 397 # 100.0 1.1E-54 6.9E-58 316.4 28.2 281 1-315 109-394 (397) 63 protein:vir:6212 Length: 434 # 100.0 7.4E-55 4.6E-58 317.3 26.7 285 1-309 143-434 (434) 64 protein:vir:4600 Length: 415 # 100.0 2.2E-54 1.3E-57 314.8 29.2 286 1-315 121-410 (415) 65 protein:vir:4700 Length: 415 # 100.0 2.2E-54 1.3E-57 314.8 29.2 286 1-315 121-410 (415) 66 protein:vir:3845 Length: 395 # 100.0 2.4E-54 1.5E-57 314.6 28.6 281 1-315 107-392 (395) 67 protein:vir:10364 Length: 390 100.0 2.8E-54 1.7E-57 314.2 28.9 275 1-304 113-390 (390) 68 protein:vir:4092 Length: 390 # 100.0 1.6E-54 9.8E-58 315.6 26.6 279 1-315 84-379 (390) 69 protein:vir:1268 Length: 397 # 100.0 3.2E-54 2E-57 313.9 27.6 271 1-306 123-397 (397) 70 protein:vir:7409 Length: 408 # 100.0 5.4E-54 3.4E-57 312.6 28.8 279 1-315 116-404 (408) 71 protein:vir:79987 Length: 415 100.0 1.4E-53 8.8E-57 310.3 29.0 286 1-315 121-410 (415) 72 protein:vir:98339 Length: 415 100.0 1.4E-53 8.8E-57 310.3 29.0 286 1-315 121-410 (415) 73 protein:vir:81100 Length: 415 100.0 1.4E-53 8.8E-57 310.3 29.0 286 1-315 121-410 (415) 74 protein:vir:9410 Length: 415 # 100.0 2.9E-53 1.8E-56 308.6 28.3 286 1-315 121-410 (415) 75 protein:vir:98635 Length: 377 100.0 7E-54 4.3E-57 312.0 21.6 280 1-306 79-377 (377) 76 protein:vir:101607 Length: 379 100.0 1.3E-52 7.9E-56 305.1 28.3 271 1-306 107-379 (379) 77 protein:vir:100172 Length: 394 100.0 2.1E-52 1.3E-55 303.8 28.4 279 1-315 111-392 (394) 78 protein:vir:100884 Length: 389 100.0 3.7E-52 2.3E-55 302.6 27.6 278 1-313 109-389 (389) 79 protein:vir:78350 Length: 383 100.0 5.1E-53 3.2E-56 307.3 22.3 287 1-314 83-383 (383) 80 protein:vir:78640 Length: 352 100.0 9.9E-53 6.1E-56 305.7 23.7 269 1-312 83-352 (352) 81 protein:vir:101291 Length: 381 100.0 1.5E-52 9.3E-56 304.7 24.3 277 1-315 76-379 (381) 82 protein:vir:9509 Length: 381 # 100.0 1.5E-52 9.3E-56 304.7 24.3 277 1-315 76-379 (381) 83 protein:vir:94673 Length: 419 100.0 1.2E-51 7.6E-55 299.7 28.8 281 1-308 123-419 (419) 84 protein:vir:100632 Length: 381 100.0 2.6E-52 1.6E-55 303.4 24.3 275 1-315 76-375 (381) 85 protein:vir:95963 Length: 395 100.0 6.8E-52 4.2E-55 301.1 26.4 280 1-315 86-387 (395) 86 protein:vir:8420 Length: 477 # 100.0 4.9E-52 3E-55 301.9 25.3 296 1-312 156-477 (477) 87 protein:vir:1383 Length: 421 # 100.0 1.3E-51 8E-55 299.6 26.7 276 1-315 114-401 (421) 88 protein:vir:9704 Length: 394 # 100.0 2E-51 1.2E-54 298.6 26.3 264 1-310 128-394 (394) 89 protein:vir:3870 Length: 400 # 100.0 6.8E-51 4.2E-54 295.6 25.6 264 1-307 133-400 (400) 90 protein:vir:96978 Length: 387 100.0 2.3E-51 1.4E-54 298.2 22.3 268 1-312 118-387 (387) 91 protein:vir:94424 Length: 387 100.0 2.3E-51 1.4E-54 298.2 22.3 268 1-312 118-387 (387) 92 protein:vir:2685 Length: 387 # 100.0 2.3E-51 1.4E-54 298.2 22.3 268 1-312 118-387 (387) 93 protein:vir:93881 Length: 387 100.0 5.3E-51 3.3E-54 296.2 23.8 267 1-312 118-387 (387) 94 protein:vir:1084 Length: 437 # 100.0 1.3E-50 7.9E-54 294.1 24.8 278 1-315 156-437 (437) 95 protein:vir:9361 Length: 402 # 100.0 6.5E-51 4E-54 295.7 21.8 268 1-312 133-402 (402) 96 protein:vir:9643 Length: 377 # 100.0 7.5E-50 4.7E-53 289.9 26.2 269 1-306 79-377 (377) 97 protein:vir:80128 Length: 466 100.0 2.8E-49 1.7E-52 286.8 23.7 282 1-315 148-462 (466) 98 protein:vir:962 Length: 397 # 100.0 1.9E-48 1.2E-51 282.2 22.6 263 1-306 132-397 (397) 99 protein:vir:4197 Length: 314 # 100.0 3.5E-41 2.1E-44 242.4 25.4 287 1-309 14-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 1.3E-40 7.9E-44 239.3 23.9 281 1-303 19-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 2.7E-37 1.7E-40 221.1 25.4 295 1-315 18-321 (321) 102 protein:vir:97397 Length: 517 100.0 9.9E-37 6.1E-40 218.0 21.6 278 1-314 239-517 (517) 103 protein:vir:3033 Length: 272 # 100.0 2.1E-35 1.3E-38 210.7 25.2 268 1-309 1-272 (272) 104 protein:vir:9820 Length: 272 # 100.0 2.1E-35 1.3E-38 210.7 25.2 268 1-309 1-272 (272) 105 protein:vir:4074 Length: 480 # 99.9 3E-31 1.9E-34 187.9 14.2 264 1-309 211-480 (480) 106 protein:vir:93742 Length: 274 99.9 3.5E-26 2.2E-29 160.1 23.0 269 1-310 1-274 (274) 107 protein:vir:80930 Length: 278 99.9 3E-25 1.8E-28 155.0 21.9 274 1-307 1-278 (278) 108 protein:vir:96123 Length: 274 99.9 7.3E-25 4.5E-28 152.9 22.4 269 1-310 1-274 (274) 109 protein:vir:3613 Length: 272 # 99.9 4.4E-25 2.7E-28 154.1 20.3 267 1-306 1-272 (272) 110 protein:vir:105334 Length: 276 99.9 2E-24 1.2E-27 150.5 21.9 272 1-314 1-276 (276) 111 protein:vir:97433 Length: 274 99.9 5.6E-23 3.5E-26 142.6 23.4 269 1-310 1-274 (274) 112 protein:vir:94494 Length: 274 99.9 5.6E-23 3.5E-26 142.6 23.4 269 1-310 1-274 (274) 113 protein:vir:96833 Length: 275 99.9 3.8E-23 2.4E-26 143.5 21.2 270 1-310 1-275 (275) 114 protein:vir:1239 Length: 274 # 99.8 6.9E-22 4.3E-25 136.6 22.0 269 1-310 1-274 (274) 115 protein:vir:94933 Length: 330 99.8 1.2E-21 7.4E-25 135.3 22.5 290 1-307 25-330 (330) 116 protein:vir:96262 Length: 274 99.8 1.4E-21 8.8E-25 134.9 22.6 269 1-310 1-274 (274) 117 protein:vir:95898 Length: 274 99.8 1.4E-21 8.8E-25 134.9 22.6 269 1-310 1-274 (274) 118 protein:vir:95107 Length: 270 99.8 1.1E-19 7E-23 124.5 21.5 265 1-311 1-270 (270) 119 protein:vir:79928 Length: 393 99.7 1.1E-17 6.8E-21 113.6 18.0 293 1-315 74-393 (393) 120 protein:vir:97255 Length: 310 99.6 4.7E-16 2.9E-19 104.6 23.2 288 1-306 1-310 (310) 121 protein:vir:739 Length: 231 # 99.6 1.7E-16 1.1E-19 107.0 17.0 231 35-306 1-231 (231) 122 protein:vir:7990 Length: 273 # 99.5 1.2E-15 7.1E-19 102.5 18.8 266 1-306 1-273 (273) 123 protein:vir:105822 Length: 273 99.5 1.5E-15 9.5E-19 101.8 19.1 266 1-306 1-273 (273) 124 protein:vir:102605 Length: 273 99.5 1.5E-15 9.5E-19 101.8 19.1 266 1-306 1-273 (273) 125 protein:vir:99424 Length: 360 99.5 3.3E-15 2E-18 100.0 20.5 288 1-309 23-360 (360) 126 protein:vir:108211 Length: 318 99.5 1.6E-15 9.9E-19 101.7 18.1 283 1-307 1-318 (318) 127 protein:vir:94622 Length: 341 99.4 6.9E-15 4.3E-18 98.2 15.8 298 1-308 1-341 (341) 128 protein:vir:80213 Length: 334 99.4 2.9E-14 1.8E-17 94.8 18.2 289 1-308 1-334 (334) 129 protein:vir:2201 Length: 345 # 99.4 3.4E-14 2.1E-17 94.4 17.3 284 1-306 1-345 (345) 130 protein:vir:5974 Length: 324 # 99.4 3.2E-13 2E-16 89.1 22.2 280 1-315 1-297 (324) 131 protein:vir:10450 Length: 344 99.4 2.7E-14 1.6E-17 95.0 15.9 286 1-306 1-344 (344) 132 protein:vir:94576 Length: 347 99.4 3.8E-14 2.3E-17 94.2 16.7 287 1-306 1-347 (347) 133 protein:vir:8885 Length: 347 # 99.4 3.5E-14 2.2E-17 94.3 16.5 287 1-307 1-347 (347) 134 protein:vir:1583 Length: 351 # 99.4 2.5E-13 1.6E-16 89.7 20.8 284 1-315 1-300 (351) 135 protein:vir:94711 Length: 347 99.4 2E-14 1.2E-17 95.7 14.6 287 1-307 1-347 (347) 136 protein:vir:103323 Length: 364 99.4 5.7E-13 3.5E-16 87.7 22.0 292 1-315 1-345 (364) 137 protein:vir:6324 Length: 335 # 99.4 2.4E-13 1.5E-16 89.8 19.0 293 1-313 1-335 (335) 138 protein:vir:78935 Length: 335 99.3 2.2E-13 1.4E-16 89.9 18.6 293 1-313 1-335 (335) 139 protein:vir:102944 Length: 330 99.3 1.2E-12 7.5E-16 85.9 21.7 288 1-315 1-302 (330) 140 protein:vir:3364 Length: 347 # 99.3 3.6E-13 2.3E-16 88.8 17.9 288 1-308 1-347 (347) 141 protein:vir:78739 Length: 332 99.3 1.5E-12 9.2E-16 85.4 19.0 289 1-304 7-332 (332) 142 protein:vir:1541 Length: 347 # 99.3 1.4E-12 8.8E-16 85.5 18.0 289 1-308 1-347 (347) 143 protein:vir:80180 Length: 381 99.2 9.1E-12 5.7E-15 81.1 18.4 289 1-315 1-347 (381) 144 protein:vir:100057 Length: 375 99.1 5.6E-11 3.5E-14 76.8 21.0 296 1-310 1-375 (375) 145 protein:vir:9927 Length: 295 # 99.1 1.1E-11 6.6E-15 80.8 14.8 273 1-314 1-295 (295) 146 protein:vir:99675 Length: 324 99.1 1.4E-11 9E-15 80.0 15.2 262 34-315 1-307 (324) 147 protein:vir:97031 Length: 402 99.1 7.9E-11 4.9E-14 76.0 19.2 295 1-315 1-344 (402) 148 protein:vir:7019 Length: 401 # 99.0 5.2E-11 3.2E-14 77.0 16.1 295 1-315 1-343 (401) 149 protein:vir:3136 Length: 322 # 99.0 4.4E-11 2.7E-14 77.3 15.2 288 1-310 1-322 (322) 150 protein:vir:103285 Length: 296 98.9 1.2E-09 7.2E-13 69.6 18.3 279 1-307 1-296 (296) 151 protein:vir:105645 Length: 400 98.8 1E-09 6.2E-13 69.9 17.8 295 1-315 1-342 (400) 152 protein:vir:102655 Length: 322 98.8 1.4E-09 8.8E-13 69.1 18.5 284 1-307 1-322 (322) 153 protein:vir:107687 Length: 319 98.8 3.6E-09 2.2E-12 66.9 18.0 276 1-304 23-319 (319) 154 protein:vir:80068 Length: 301 98.7 6.9E-09 4.3E-12 65.3 19.3 275 1-304 1-301 (301) 155 protein:vir:106647 Length: 303 98.7 3.3E-09 2E-12 67.1 17.2 282 1-313 1-303 (303) 156 protein:vir:8324 Length: 410 # 98.7 2.7E-09 1.7E-12 67.6 15.6 265 1-304 131-410 (410) 157 protein:vir:93858 Length: 400 98.6 1.4E-09 8.8E-13 69.1 12.4 274 1-304 117-400 (400) 158 protein:vir:104342 Length: 314 98.6 2.7E-08 1.7E-11 62.1 17.3 279 1-307 19-314 (314) 159 protein:vir:99075 Length: 392 98.5 4E-08 2.5E-11 61.2 17.8 285 1-315 1-324 (392) 160 protein:vir:79642 Length: 329 98.5 1.1E-07 7E-11 58.7 18.6 279 1-307 29-329 (329) 161 protein:vir:105374 Length: 423 98.4 4.4E-07 2.7E-10 55.4 20.1 289 1-315 1-323 (423) 162 protein:vir:9875 Length: 296 # 98.4 4.6E-08 2.9E-11 60.8 14.7 275 1-307 1-296 (296) 163 protein:vir:78387 Length: 349 98.4 9.9E-07 6.1E-10 53.5 22.6 285 1-315 1-322 (349) 164 protein:vir:94989 Length: 349 98.4 1.1E-06 7E-10 53.2 23.0 286 1-315 1-322 (349) 165 protein:vir:80446 Length: 367 98.3 1.3E-06 7.8E-10 52.9 20.6 293 1-315 1-342 (367) 166 protein:vir:174 Length: 423 # 98.3 9.2E-07 5.7E-10 53.7 19.6 285 1-315 1-323 (423) 167 protein:vir:95318 Length: 328 98.3 4E-07 2.5E-10 55.6 17.0 230 1-242 1-328 (328) 168 protein:vir:3525 Length: 423 # 98.1 4.4E-06 2.8E-09 49.9 20.0 280 1-315 1-323 (423) 169 protein:vir:95131 Length: 325 98.1 2.4E-06 1.5E-09 51.4 18.2 286 1-315 1-300 (325) 170 protein:vir:8843 Length: 317 # 98.0 2.4E-06 1.5E-09 51.4 17.5 284 1-308 1-317 (317) 171 protein:vir:5255 Length: 304 # 97.9 1.4E-06 8.9E-10 52.6 14.4 279 6-303 1-304 (304) 172 protein:vir:97331 Length: 319 97.9 1.1E-05 6.8E-09 47.8 20.5 280 1-315 19-303 (319) 173 protein:vir:94800 Length: 319 97.9 1.1E-05 6.8E-09 47.8 20.5 280 1-315 19-303 (319) 174 protein:vir:107388 Length: 331 97.8 1.6E-05 9.8E-09 46.9 18.3 228 1-242 1-331 (331) 175 protein:vir:98525 Length: 331 97.8 1.6E-05 9.8E-09 46.9 18.3 228 1-242 1-331 (331) 176 protein:vir:107826 Length: 331 97.8 1.6E-05 9.8E-09 46.9 18.3 228 1-242 1-331 (331) 177 protein:vir:107120 Length: 329 97.8 2E-05 1.2E-08 46.4 21.0 281 1-315 30-315 (329) 178 protein:vir:108303 Length: 418 97.6 3.4E-05 2.1E-08 45.1 19.5 278 1-315 1-318 (418) 179 protein:vir:79548 Length: 652 97.6 4.5E-05 2.8E-08 44.4 18.0 272 1-303 359-652 (652) 180 protein:vir:7324 Length: 335 # 97.6 2.6E-05 1.6E-08 45.7 16.0 230 1-243 1-335 (335) 181 protein:vir:103759 Length: 330 97.5 3.6E-05 2.2E-08 44.9 16.0 229 1-242 1-330 (330) 182 protein:vir:95512 Length: 693 97.4 6.8E-05 4.2E-08 43.4 16.7 272 1-304 394-693 (693) 183 protein:vir:3643 Length: 336 # 97.4 1.2E-05 7.3E-09 47.6 12.1 274 1-304 31-336 (336) 184 protein:vir:105522 Length: 423 97.3 0.0001 6.3E-08 42.5 20.2 281 1-315 1-323 (423) 185 protein:vir:94070 Length: 339 97.2 5.5E-05 3.4E-08 43.9 14.3 273 1-304 35-339 (339) 186 protein:vir:101557 Length: 336 97.1 4.7E-05 2.9E-08 44.3 12.5 274 1-304 31-336 (336) 187 protein:vir:107732 Length: 379 97.0 0.00014 8.9E-08 41.7 14.7 278 1-304 54-379 (379) 188 protein:vir:1781 Length: 221 # 96.7 0.00013 8.3E-08 41.8 12.5 200 82-315 1-210 (221) 189 protein:vir:96079 Length: 382 96.7 0.00038 2.4E-07 39.3 14.7 281 1-304 61-382 (382) 190 protein:vir:1829 Length: 355 # 96.6 0.00044 2.7E-07 39.0 15.5 286 1-315 16-355 (355) 191 protein:vir:78558 Length: 336 96.6 0.0002 1.3E-07 40.8 12.6 275 1-304 31-336 (336) 192 protein:vir:99576 Length: 388 96.6 0.00024 1.5E-07 40.4 12.9 281 1-304 65-388 (388) 193 protein:vir:95451 Length: 313 96.4 0.00069 4.3E-07 37.9 17.4 291 1-307 1-313 (313) 194 protein:vir:96792 Length: 315 96.3 0.00076 4.7E-07 37.7 17.9 269 1-315 1-286 (315) 195 protein:vir:98566 Length: 355 96.2 0.00083 5.2E-07 37.5 15.3 290 1-315 16-355 (355) 196 protein:vir:106734 Length: 336 96.1 0.00047 2.9E-07 38.8 11.9 274 1-304 31-336 (336) 197 protein:vir:1153 Length: 338 # 95.9 0.0013 8.3E-07 36.3 15.4 279 1-308 16-338 (338) 198 protein:vir:6061 Length: 357 # 95.8 0.0015 9.2E-07 36.1 14.0 286 1-315 16-351 (357) 199 protein:vir:78777 Length: 358 95.3 0.0024 1.5E-06 35.0 14.3 286 1-315 22-350 (358) 200 protein:vir:2016 Length: 357 # 95.3 0.0024 1.5E-06 35.0 13.7 287 1-315 16-351 (357) 201 protein:vir:100331 Length: 342 95.2 0.0025 1.6E-06 34.8 14.3 281 1-310 16-342 (342) 202 protein:vir:5694 Length: 357 # 95.0 0.003 1.8E-06 34.4 13.8 287 1-315 16-351 (357) 203 protein:vir:95875 Length: 401 94.9 0.0033 2.1E-06 34.2 18.6 299 1-310 2-401 (401) 204 protein:vir:104011 Length: 337 94.8 0.0034 2.1E-06 34.1 16.3 280 1-309 16-337 (337) 205 protein:vir:103886 Length: 302 94.8 0.0034 2.1E-06 34.1 17.1 269 1-306 1-302 (302) 206 protein:vir:79171 Length: 337 94.8 0.0036 2.2E-06 34.0 16.3 280 1-309 18-337 (337) 207 protein:vir:3746 Length: 336 # 93.9 0.006 3.7E-06 32.8 13.9 283 1-312 13-336 (336) 208 protein:vir:79157 Length: 339 93.9 0.0062 3.8E-06 32.7 15.1 281 1-310 16-339 (339) 209 protein:vir:3783 Length: 336 # 93.0 0.009 5.6E-06 31.8 14.1 283 1-312 13-336 (336) 210 protein:vir:78186 Length: 337 92.9 0.0094 5.8E-06 31.7 14.0 280 1-309 16-337 (337) 211 protein:vir:348 Length: 321 # 91.6 0.015 9.4E-06 30.6 14.0 287 1-304 1-321 (321) 212 protein:vir:96442 Length: 418 89.3 0.027 1.7E-05 29.2 15.8 284 1-314 69-418 (418) 213 protein:vir:79008 Length: 299 89.3 0.027 1.7E-05 29.2 20.3 278 1-306 1-299 (299) 214 protein:vir:94870 Length: 318 87.0 0.041 2.5E-05 28.2 9.5 272 1-302 35-318 (318) 215 protein:vir:861 Length: 318 # 86.6 0.018 1.1E-05 30.1 7.4 276 1-302 35-318 (318) 216 protein:vir:1663 Length: 393 # 86.6 0.012 7.7E-06 31.0 6.5 276 1-302 110-393 (393) 217 protein:vir:270 Length: 341 # 86.4 0.046 2.8E-05 27.9 13.9 284 1-315 22-341 (341) 218 protein:vir:98856 Length: 343 83.7 0.066 4.1E-05 27.0 13.5 286 1-315 16-341 (343) 219 protein:vir:103370 Length: 418 77.0 0.13 8E-05 25.5 14.9 280 1-314 69-418 (418) 220 protein:vir:93966 Length: 400 72.1 0.19 0.00012 24.6 8.5 276 1-302 117-400 (400) 221 protein:vir:99888 Length: 309 69.4 0.22 0.00014 24.2 12.7 290 6-307 1-309 (309) 222 protein:vir:96666 Length: 462 53.9 0.52 0.00032 22.2 17.3 303 1-315 26-403 (462) 223 protein:vir:99311 Length: 463 49.9 0.62 0.00039 21.7 14.9 282 1-315 26-371 (463) 224 protein:vir:95603 Length: 463 49.9 0.62 0.00039 21.7 14.9 282 1-315 26-371 (463) 225 protein:vir:102335 Length: 312 48.1 0.68 0.00042 21.5 19.6 284 1-314 1-312 (312) 226 protein:vir:78920 Length: 290 38.0 1.1 0.00068 20.4 19.8 274 1-306 1-290 (290) 227 protein:vir:93696 Length: 364 35.3 1.2 0.00077 20.1 17.4 289 1-303 1-364 (364) 228 protein:vir:100851 Length: 514 33.4 1.4 0.00084 19.9 11.1 255 1-315 45-343 (514) 229 protein:vir:102823 Length: 470 31.9 1.5 0.00091 19.7 8.2 267 1-315 1-307 (470) 230 protein:vir:79712 Length: 285 29.2 1.7 0.001 19.3 17.3 265 1-306 1-285 (285) 231 protein:vir:105464 Length: 346 28.6 1.7 0.0011 19.3 18.9 283 1-315 1-316 (346) 232 protein:vir:80491 Length: 467 25.6 2 0.0013 18.9 13.4 278 1-315 31-332 (467) 233 protein:vir:63741 Length: 468 25.6 2 0.0013 18.9 13.2 278 1-315 32-333 (468) No 1 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=2.2e-83 Score=473.78 Aligned_cols=315 Identities=97% Similarity=1.407 Sum_probs=301.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+++++++||++||++++++||+.+++.|+|+++|+++|+++++++||++++++.++||+|++.+|+++++|++++|.+| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|||+++..+...+|+++|.+++++++++++|.++++|++++++..+.++.......++.++.+...|+|+ T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 160 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATADL 160 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchHHH Confidence 99999999999999999999999999999999999999999999999998888888888888777777777888889999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++...+++|+|||+++..|+++++.+|++++++++||+...+++++|+|+||+++++||.....+.+++.. T Consensus 161 ~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~ 240 (315) T protein:vir:80 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) T ss_pred HHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccccE Confidence 99999887766666778999999999999999999999999999999999999999999999999999988888888889 Q ss_pred EEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) +++|||++++|+++++++++++++++.++.++++|++|+++||+++|+||+|+||+||++|+.+++|+|+||++| T Consensus 241 ~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 241 AIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred EEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=3.3e-67 Score=385.06 Aligned_cols=298 Identities=38% Similarity=0.589 Sum_probs=256.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+.. ++||++||++++++||+.+++.|+++++|++++++++++++|+.+++++++|++|++++|+++++|++++|.+| T Consensus 1 mat~--~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~ 78 (311) T protein:vir:81 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) T ss_pred Ccee--cCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeE Confidence 7766 77899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccc----cch Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDAT----DSA 156 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 156 (315) |+++++++|+|+|+++.++ ...|+++|.+++++++++++|.++++|++++++..+.++.+......+.+..+ ... T Consensus 79 kl~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:81 79 KVQVTQRFSQEVKWADESR-QLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) T ss_pred EEEEeehhhHHHhhcCccc-HHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchH Confidence 9999999999999887544 45589999999999999999999999998888888888777654444333322 233 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc-- Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS-- 234 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~-- 234 (315) +.++.+++.++...+ ..+++|+||++++.+|++|||++|+|+|.. ....+.+++|+|+||++++.||...... T Consensus 158 ~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~----~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~ 232 (311) T protein:vir:81 158 DLAVEAAVGLVLGDN-LSPDGVALDNTFSFMLATQRDSQGRKLYPE----LGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) T ss_pred HHHHHHHHHHhhhcC-CCceEEEEcHHHHHHHHhhhccCCCeeecC----ccccCCCceecceeEEeccccccccccccc Confidence 566777777776544 455679999999999999999999988743 3345677899999999999999765432 Q ss_pred -------ccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 235 -------PASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 235 -------~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) ...+..+++|||++++|+.+++++++++++.+.++. +++|++|++++|+++|+||+|+||+||++|++++.. T Consensus 233 ~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred ccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcc-hhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 234557899999999999999999999999876654 689999999999999999999999999999998876 No 3 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=8.8e-67 Score=382.71 Aligned_cols=296 Identities=31% Similarity=0.470 Sum_probs=251.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) ||+++.++|. +||++++.+||+.+++.|+++++|++++++++++++|+.+++++++||+|++++|+++++|+++++++| T Consensus 1 ma~~t~~~G~-lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 79 (300) T protein:vir:95 1 MSEAQLSKGN-LFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPL 79 (300) T ss_pred CcccccCCcc-eechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeE Confidence 9999988755 689999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc--cccccc-cccccchh Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL--DKTTKT-VDATDSAT 157 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~ 157 (315) |++++++||+|||+++.+ ....|+++|.+++++++++++|.++++|++++++.......... ...... .......| T Consensus 80 k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 80 KVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 999999999999987643 35568999999999999999999999998765554433322221 111112 12234567 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...+ ..+++|+|||+++.+|++++|++|+|+|.. ....+.+++|+|+||+++++||... ... T Consensus 159 ~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~----~~~~~~~~~l~G~Pv~~s~~v~~~~---~~~ 230 (300) T protein:vir:95 159 ESMEDAVGMIDGSE-RDITGAILDPIFTTALSKMKNAEGGKLYPE----LAWGGVPDAINGLAVDKNRTVSYSQ---TDP 230 (300) T ss_pred HHHHHHHHHhhhcC-CCccEEEECHHHHHHHHHhhccCCCeeccC----ccccCCCceecceeeEEecCCCCCC---CCC Confidence 88999998886554 445689999999999999999999887632 3345678899999999999998543 344 Q ss_pred cceEEEecccceE-EEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 238 GVKAIVGDFSRVH-WGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 238 ~~~~~~gDf~~~~-i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +..+++|||+++. ++.|++++++++++.+.++.++++|++|++++|+++|+||++.||+||++||+++. T Consensus 231 ~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 231 KNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 5678899998754 99999999999999999999999999999999999999999999999999999988 No 4 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=2.2e-65 Score=375.03 Aligned_cols=297 Identities=30% Similarity=0.468 Sum_probs=252.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+ +.++||++||++++++||+.+++.|+++++|+++||+++..++|+.++++.++||+|++++|+++++|+++++++| T Consensus 1 m~--t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MG--TETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPI 78 (303) T ss_pred Cc--ccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeE Confidence 77 4467899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccc--cc--ccccccccccccch Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVK--VS--LDKTTKTVDATDSA 156 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~--~~--~~~~~~~~~~~~~~ 156 (315) |+++++++|+|||+++. |...+|+++|.+++++++++++|.++++|+++.++....... .. ..........+... T Consensus 79 kl~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (303) T protein:vir:97 79 KVEYGARLSDEFLYATE-EEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDA 157 (303) T ss_pred EEEEeehhhHHHhhcCc-cchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccch Confidence 99999999999998774 445668999999999999999999999998765554332222 11 11122222234556 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccccccccc-CCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGF-AGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~-~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) |+++.+++.++..++ ..+++|+|||+++.+|+++||++|+|++ +|+... +++++|+|+||+++++||....... T Consensus 158 ~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~lkd~~g~~~~----~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 232 (303) T protein:vir:97 158 DANIEAAVNLIQGAE-GVVTGLAMDTEFSTALAKVTNGEMGPKM----YPELAWGANPDSINGLKSSVNTTVGAGADEAE 232 (303) T ss_pred HHHHHHHHHHHhhcC-CCccEEEEcHHHHHHHHHhhccCCCeEE----ecCccCCCCCceecceeeEEecccCCccccCC Confidence 899999999887654 4557899999999999999999998875 344443 3557999999999999997654443 Q ss_pred cccceEEEecc-cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 236 ASGVKAIVGDF-SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 236 ~~~~~~~~gDf-~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ....+++||| +.+.++.+++++++++++.+.++..+++|++|++++|+++|+||+|+||+||++||++.. T Consensus 233 -~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 233 -SKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred -CccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 3457889999 567899999999999999999999999999999999999999999999999999999886 No 5 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=2.3e-65 Score=374.90 Aligned_cols=292 Identities=29% Similarity=0.432 Sum_probs=249.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+ .+||++||++++++|++.++++|+++++|++++++++..++|+.++.++++||+|++++|+++++|+++++.+| T Consensus 1 ma----~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:16 1 MV----LNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred Cc----ccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeee Confidence 65 56789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccc-----ccccccccccc Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLD-----KTTKTVDATDS 155 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 155 (315) |+++++++|+|+|+++. +...+|+++|.+++++++++++|.++++|++++++.+......... ........... T Consensus 77 k~a~~~~iS~ell~~s~-d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:16 77 KVEYGARISDEFMYASD-EEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred eEEEeehhhHHHhhcCc-ccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccccc Confidence 99999999999998875 3456689999999999999999999999988776655443332211 11122223334 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .+.++.+++.++..++. .+.+|+||++++..|++++|++|+|++. +....+.+++|+|+||+++++||.... T Consensus 156 ~~~~i~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~----~~~~~~~~~~l~G~PV~~~~~v~~~~~--- 227 (298) T protein:vir:16 156 PNGAIENAVELLTGVDA-DVTGIAINPSFRSALAKQKDLQDNALFP----ELKWGATPDTINGLPVDVNKTVSDMSL--- 227 (298) T ss_pred HHHHHHHHHHHhhhcCC-CccEEEEcHHHHHHHHHhhccCCCeeec----CcccCCCCceecceeeEEecccccccC--- Confidence 46789999998876554 4567999999999999999999988764 234567788999999999999986433 Q ss_pred cccceEEEecccce-EEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 236 ASGVKAIVGDFSRV-HWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 236 ~~~~~~~~gDf~~~-~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ..+..+++|||+++ .++.+++++++++++.+.++..+++|++||+++|+++|+||+++||+||++||.++ T Consensus 228 ~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 34567899999874 59999999999999998888899999999999999999999999999999999999 No 6 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.2e-64 Score=370.97 Aligned_cols=292 Identities=30% Similarity=0.451 Sum_probs=249.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+. +||++||++++++|++.++++|+++++|+++++++++.++|+.+++++++||+|++++|+++++|+++++.+| T Consensus 1 ma~----~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred Cee----ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeee Confidence 554 6789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccc--cccccc---ccccccc Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVS--LDKTTK---TVDATDS 155 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~--~~~~~~---~~~~~~~ 155 (315) |+++++++|+|+|+++.++ ..+|+++|.+++++++++++|.++++|++++++....+.... ....++ ..+.... T Consensus 77 k~~~~~~iS~ell~~~~~~-~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:94 77 KVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred EEEEeeehhHHHhccCCcc-HHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccccccccccc Confidence 9999999999999877544 556899999999999999999999999876666554443321 111111 1222334 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .++++.+++.++..++. .+.+|+||++++.+|++++|++|+|++.. ....+.+++|+|+||++++.||.... T Consensus 156 ~~~~i~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l~~~----~~~~~~~~tl~G~PV~~~~~v~~~~~--- 227 (298) T protein:vir:94 156 PNGAIENAVELLTGVDA-DVTGIAINPSFRSALAKQKDLQGNALFPE----LKWGATPDTINGLPVDVNKTVSDMSL--- 227 (298) T ss_pred HHHHHHHHHHhhhhcCC-CccEEEEcHHHHHHHHHhhccCCCeeecC----cccCCCCceecceeeEEecccccccC--- Confidence 57789999999876654 45679999999999999999999887643 34566778999999999999986533 Q ss_pred cccceEEEecccce-EEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 236 ASGVKAIVGDFSRV-HWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 236 ~~~~~~~~gDf~~~-~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ..+..+++|||+++ .++++++++++++++.+.++..+++|++|++++|+++|+||+++||+||++||+++ T Consensus 228 ~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 34567899999875 59999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1e-63 Score=365.88 Aligned_cols=298 Identities=33% Similarity=0.505 Sum_probs=249.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+..+ ++||++||++++++|++.+++.|+++++|+++|+++++.+||+.+++++++||+|++++|+++++|+++++.+| T Consensus 1 Mat~t-t~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~ 79 (311) T protein:vir:99 1 MATFG-TGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPK 79 (311) T ss_pred Cceec-CCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeE Confidence 99665 67888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccc----cch Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDAT----DSA 156 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 156 (315) |+++++++|+|||+++. |...+|+++|++++++++++++|+++++|+|++++..+.+..+.....++.++.+ ... T Consensus 80 k~~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 80 KAQVTMRFNEEVQWADE-DYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEEeehhhHHHhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchh Confidence 99999999999998763 4456689999999999999999999999999777776666555433333333222 223 Q ss_pred hHHHHHHHHHhhhcc-cccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc- Q lcl|NC_018838. 157 TTDLVKAVGLIAGAG-LQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS- 234 (315) Q Consensus 157 ~~di~~~~~~~~~~~-~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~- 234 (315) +.++.+++..+..+. ....++|+||++++..|++++|++|+|++. +....+.+++|+|+||++++++|...... T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~ 234 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFP----ELGLGIGVSSFEGIDASVSDTVNGGDEADP 234 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeec----CcccCCCCceecceeeEeeccccccccccc Confidence 567777887776543 344568999999999999999999998764 34455667899999999999998655432 Q ss_pred ------ccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 235 ------PASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 235 ------~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) .+....+++|||++ +.++++++++++++++++.+ ..+++|++|++++|+++|+||+|.|| +|++++.++| T Consensus 235 ~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 235 DDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPD-GQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCC-cchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 33455678999986 56999999999999988755 45789999999999999999999986 7888988888 No 8 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=5.1e-63 Score=362.09 Aligned_cols=301 Identities=22% Similarity=0.310 Sum_probs=252.7 Q ss_pred CCCCccCCCce------EcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCce--------eEEeecccccC Q lcl|NC_018838. 1 MADDFLSAGKL------ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPR--------AKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~~s~Gg~------~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~--------a~wv~Eg~~~~ 66 (315) |+.++...|++ +||++++++|++.+++.|+|+++|+++||+++.+++|+.+..+. +.|++|++.+| T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~ 89 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKP 89 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccccccccccccccc Confidence 66666555554 79999999999999999999999999999999999999876544 55677999999 Q ss_pred CCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccc Q lcl|NC_018838. 67 SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKT 146 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~ 146 (315) +++++|+++++++||+++++++|+|+|+++..+ ++++|.+++++++++++|.++++|+|+.+...+.++....... T Consensus 90 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~----~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~ 165 (338) T protein:vir:78 90 LSGTAWDTRSVAPIKLATIVTVSEEFARMNPSG----LYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIV 165 (338) T ss_pred ccccceeEEEEEEEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccc Confidence 999999999999999999999999999877654 7889999999999999999999999987777676665542221 Q ss_pred c-cc----cccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHH---HHhhccCccccccccccccccCCCccccc Q lcl|NC_018838. 147 T-KT----VDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALS---TEVYPKGSPLAGQPMYPAAGFAGLDNWRG 218 (315) Q Consensus 147 ~-~~----~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~---~l~d~~g~~~~~~~~~~~~~~~~~~~l~G 218 (315) . .. .......++++.+++..+..+.....++|+||++++..|+ +++|.+|+|++.. ....+.+++|+| T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~----~~~~~~~~~l~G 241 (338) T protein:vir:78 166 NTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTR----INLAASAGDLLG 241 (338) T ss_pred cccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecc----cccCCCCceeee Confidence 1 11 1112344678888887776655556678999999988774 5678888887532 345667889999 Q ss_pred eeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEe Q lcl|NC_018838. 219 LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAI 292 (315) Q Consensus 219 ~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v 292 (315) +||+++++||.+.....+++..+++|||++++++++++++++++++++ .+...+++|++|++++|+++|+||++ T Consensus 242 ~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v 321 (338) T protein:vir:78 242 LPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLL 321 (338) T ss_pred eeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEe Confidence 999999999998888888888999999999999999999999999874 34556899999999999999999999 Q ss_pred ecccceEEEeeccCCCC Q lcl|NC_018838. 293 ESLDSFAVVKEKAAPKP 309 (315) Q Consensus 293 ~~~~af~~l~~~~a~~~ 309 (315) .||+||++|+++++|.. T Consensus 322 ~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 322 GDKQAFVKFVDDEDPDA 338 (338) T ss_pred ecccceEEEecccCCCC Confidence 99999999999999988 No 9 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=6.9e-63 Score=361.38 Aligned_cols=299 Identities=22% Similarity=0.303 Sum_probs=252.6 Q ss_pred CCCCccCCCce------EcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecc--------cccC Q lcl|NC_018838. 1 MADDFLSAGKL------ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEG--------EVKP 66 (315) Q Consensus 1 m~~~~~s~Gg~------~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg--------~~~~ 66 (315) |+.++...|++ +||+++.++|++.+++.|+++++|++++++++..++|+.++.+.++|++|+ +.++ T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~ 89 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKP 89 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCccccccccccccc Confidence 56666555554 899999999999999999999999999999999999999999999998876 4578 Q ss_pred CCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccc- Q lcl|NC_018838. 67 SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDK- 145 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~- 145 (315) +++++|+++++.+||+++++++|+|+|+++..+ ++++|+++|++++++++|.++++|+|..+...+.++.+.... T Consensus 90 ~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~----~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~ 165 (333) T protein:vir:78 90 LSGTAWDTRSVSPIKLATIVTVSEEFARMNPSG----LYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIA 165 (333) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccc Confidence 889999999999999999999999999877654 788999999999999999999999997766666665543211 Q ss_pred ----cccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHH---HhhccCccccccccccccccCCCccccc Q lcl|NC_018838. 146 ----TTKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALST---EVYPKGSPLAGQPMYPAAGFAGLDNWRG 218 (315) Q Consensus 146 ----~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~---l~d~~g~~~~~~~~~~~~~~~~~~~l~G 218 (315) ...........++++++++..+..+.....++|+|||+++..|++ ++|.+|++++. +....+++++|+| T Consensus 166 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~----~~~~~~~~~~l~G 241 (333) T protein:vir:78 166 NTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPS----RINLAAQTGDVLG 241 (333) T ss_pred ccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeec----CccccCCCceeec Confidence 112222334457889999888876666667789999999887765 56788877653 2455677899999 Q ss_pred eeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCC---ccccchhhhhcCcEEEEEEEEeccEeecc Q lcl|NC_018838. 219 LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD---PDQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) Q Consensus 219 ~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~---~~~~~~~~f~~~~v~~r~~~r~~~~v~~~ 295 (315) +||+++++||.+...+.+++..+++|||++++++++++++++++++++ .++..+++|++|++++|+++|+||++.|| T Consensus 242 ~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~ 321 (333) T protein:vir:78 242 LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDK 321 (333) T ss_pred eeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecc Confidence 999999999998888888888999999999999999999999999875 34556789999999999999999999999 Q ss_pred cceEEEeeccCC Q lcl|NC_018838. 296 DSFAVVKEKAAP 307 (315) Q Consensus 296 ~af~~l~~~~a~ 307 (315) +||++|+.+++| T Consensus 322 ~a~~~l~~~~a~ 333 (333) T protein:vir:78 322 QAFVKFVDDEQP 333 (333) T ss_pred cceEEEeccCCC Confidence 999999999998 No 10 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3.3e-61 Score=352.17 Aligned_cols=302 Identities=20% Similarity=0.205 Sum_probs=242.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) ++..+.+ +|.+||++++++|++.+++.++|++++++++++++..++|+.+++++++|++|++++|+++++|+++++.+| T Consensus 10 ~~~~t~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 88 (330) T protein:vir:77 10 QVALTGD-FSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSFGKQELEPV 88 (330) T ss_pred hccccCC-CcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccccccceeeEEEEeEE Confidence 4444444 455677788899999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccc--------cccccc Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKT--------TKTVDA 152 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~--------~~~~~~ 152 (315) |++++++||+|+|+++..+ ++++|.+++++++++++|.++++|+|. +.++.++.+.+... ...... T Consensus 89 k~~~~~~is~ell~ds~~~----~~~~i~~~l~~ai~~~~~~~~l~G~g~--~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 162 (330) T protein:vir:77 89 KITTIFAESAEVVRLNPLN----YLNTMRTKIAEAIALKFDAAAIHGIDK--PSAFKGYLAETTKVVSLADTNLTTASGP 162 (330) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhcccCC--CCccccccccccccceeecccccccccc Confidence 9999999999999876544 789999999999999999999999984 44444544332211 111122 Q ss_pred ccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccc-cccCCCccccceeeEeecccCccc Q lcl|NC_018838. 153 TDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPA-AGFAGLDNWRGLNVGASSTVSGAP 231 (315) Q Consensus 153 ~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~-~~~~~~~~l~G~Pv~~s~~v~~~~ 231 (315) ....++++.+++..+..++ ..+++|+||++++..|+++||++|+|++....... ......++|+|+||+++++||.. T Consensus 163 ~~~~~~~l~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~- 240 (330) T protein:vir:77 163 QGNAYLAVNNALSLLVNSG-KKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNG- 240 (330) T ss_pred cchhHHHHHHHHHhhhhcC-CCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCC- Confidence 2334678888888876554 34568999999999999999999999876543322 22234569999999999999853 Q ss_pred cccccccceEEEecccceEEEeeccceEEEeccCCc----------cccchhhhhcCcEEEEEEEEeccEeecccceEEE Q lcl|NC_018838. 232 EMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP----------DQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVV 301 (315) Q Consensus 232 ~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~----------~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l 301 (315) ...++..+++|||++++++++++++++++++.+. ....+++|++|++++|+++|+||++.||+||++| T Consensus 241 --~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i 318 (330) T protein:vir:77 241 --TVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKL 318 (330) T ss_pred --CCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEE Confidence 2345678899999999999999999999988652 2345689999999999999999999999999999 Q ss_pred eeccCCCCCCCC Q lcl|NC_018838. 302 KEKAAPKPNPPA 313 (315) Q Consensus 302 ~~~~a~~~~~~~ 313 (315) +.+++-++-++. T Consensus 319 ~~~~~~~~~~~~ 330 (330) T protein:vir:77 319 TDQVAGTDPEEE 330 (330) T ss_pred EeccCCcCCCCC Confidence 987644333333 No 11 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.4e-61 Score=354.16 Aligned_cols=291 Identities=15% Similarity=0.130 Sum_probs=236.6 Q ss_pred CCCC-ccCCCceEcchhHHHHHHHHHHhccchhhh-cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADD-FLSAGKLELPGSMIGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~-~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) |+.+ +.++||++||+++.++|++.+++.++++++ ++++++.++++++|+.++++.++||+|++++|+++++|+++++. T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~ 143 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLS 143 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEEe Confidence 2222 335699999999999999999999999998 88999999999999999999999999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc--cccch Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD--ATDSA 156 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 156 (315) +||+++++++|+|+|+++.. +++++|++++++++++++|+++++|+| ++..+.|+.+.......... .+... T Consensus 144 ~~k~~~~~~iS~ell~ds~~----~~~~~i~~~l~~a~~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~t~~~ 217 (366) T protein:vir:57 144 AKTMIALVPVSNQLIGRAGF----NVEQLLLGDILSAIATREDKAFLRDDG--TGDTPKGMKAVATAANRLVAWTGTAIN 217 (366) T ss_pred eEEEEEeehhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHhhccCC--CCccccceeeccccccceeeccccccc Confidence 99999999999999977654 478999999999999999999999997 45567787765443322221 12222 Q ss_pred hHHHHHHHHHh----hhcc-cccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccc Q lcl|NC_018838. 157 TTDLVKAVGLI----AGAG-LQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAP 231 (315) Q Consensus 157 ~~di~~~~~~~----~~~~-~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~ 231 (315) +.++..++..+ ...+ ......|+||+.++.+|++++|++|+++| ++. +.++|+|+||+++++||.+. T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~-----~~~---~~g~l~G~Pvv~s~~ip~~~ 289 (366) T protein:vir:57 218 LTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVY-----PEM---SQGILKGYPIQRTSAIPANL 289 (366) T ss_pred hhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceec-----cCC---CCCeecceeeEEcccccccc Confidence 33333222222 1122 22345799999999999999999998765 333 33689999999999999876 Q ss_pred cccccccceEEEecccceEEEeeccceEEEeccCC---ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 232 EMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD---PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 232 ~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~---~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) .... +...++||||++++|+++++++++++++++ .++..+++|++|++++|+++|+||+++||+||++|+...= T Consensus 290 ~~~~-~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 290 GDDG-NESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ccCC-CccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 4443 345688999999999999999999999864 4566789999999999999999999999999999998776 No 12 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=4.2e-61 Score=351.59 Aligned_cols=292 Identities=16% Similarity=0.142 Sum_probs=251.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +...+.++|+++||++++++|++.+++.++++++|+++|++++.+++|+.++.+.++|++|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~ 106 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeE Confidence 44555677999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++..+ ++++|.+++++++++++|+++++|+| ++..+.++...... .+....+...|+++ T Consensus 107 k~~~~~~is~ell~ds~~~----l~~~i~~~l~~aia~~~d~a~l~G~g--~~~~~~gi~~~~~~-~~~~~~~~~~~~~i 179 (324) T protein:vir:97 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIEK-TNKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhccCC--CCccCccccccccc-cceeccccCCHHHH Confidence 9999999999999877544 78899999999999999999999987 34455555554333 33444566679999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++ ..+..|+||++++..|++++|.+|++++. .+.+++|+|+||++++..+. ++.. T Consensus 180 ~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~lkd~~g~~~~~--------~~~~~tl~G~PV~~~~~~~~-------~~~~ 243 (324) T protein:vir:97 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERIY--------DRNSDTLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhcCCCceeec--------CCCCccccceeeEeecCCCC-------Ccce Confidence 99999987654 44568999999999999999999987642 34567899999998775542 3446 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++.+ .++..+++|++|+++||+++|+||++.+|+||++|+.+....+.||+. T Consensus 244 ~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) T protein:vir:97 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCC Confidence 88999999999999999999999875 355678999999999999999999999999999999999988888888 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:97 324 V 324 (324) T ss_pred C Confidence 8 No 13 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=3.8e-61 Score=351.85 Aligned_cols=288 Identities=17% Similarity=0.201 Sum_probs=251.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+.++.+.||.+||++++++|++.+++.++++++|+++|++++..++|+.++ +.++||+|++++|+++++|+++++.+| T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~f~~v~l~~~ 84 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG-VGAFWVDEAERIQTSKPTFTKAKMRSK 84 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC-CceeeeecCccccccccceeEEEEeeE Confidence 8888889999999999999999999999999999999999999999998865 789999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++.. +++++|.+.+++++++++|.++++|+|.+ .+.++........+....+...|+++ T Consensus 85 k~~~~~~is~ell~ds~~----~~~~~i~~~l~~a~~~~~d~a~l~G~g~~---~~~gil~~~~~~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 85 KMGVIIPTTKENLNYSVT----NFFSLMQAEIVEAFYKKFDQAVFTGVESP---YNWNILKSATDASNLVEETANKYDDL 157 (299) T ss_pred EEEEeehhhHHHHhcCHH----HHHHHHHHHHHHHHHHHHHHHHhhcccCc---ccccccccccccceeeccccccHHHH Confidence 999999999999987654 47889999999999999999999999743 34566666555556666667789999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++..+..++ ..+++|+||++++.+|++++|.+|+|++... . .++.++|+|+||+++++||.. +++.. T Consensus 158 ~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~----~-~~~~~~l~G~PV~~~~~~~~~-----~~~~~ 226 (299) T protein:vir:41 158 NEAIGLIEAED-LEPNGIATIRKQRVKYRSTKDGNGMPIFNTA----T-SNGVDDVLGLPIAYTPKYTFG-----DKDIS 226 (299) T ss_pred HHHHHhhhccc-CCcCEEEEcHHHHHHHHHhhccCCceeecCC----c-CCCCceecceeeEEecccCCC-----CCceE Confidence 99999987655 4567899999999999999999999876432 2 234468999999999999853 34567 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) +++|||++++++++++++++++++.+ .++..+++|++|++++|+++|+||++.+|+||++|+.+++- T Consensus 227 ~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 227 ELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 89999999999999999999999774 35667889999999999999999999999999999999887 No 14 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=7.2e-61 Score=350.32 Aligned_cols=286 Identities=15% Similarity=0.153 Sum_probs=243.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +..++++.||++||++++++|++.+++.++++++|+++|++++..+||++++++.++|++|++++|+++++|++++++++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~ 88 (304) T protein:vir:94 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAK 88 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEE Confidence 55666788899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc--ccccccccccccccccccccchhH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK--PAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) |++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|..... .+.++...........+.+...|+ T Consensus 89 k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (304) T protein:vir:94 89 KIGVIIPLSKEFLKWTAKD----FFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYV 164 (304) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccccccchHH Confidence 9999999999999877544 7889999999999999999999999853322 112222222333333344556689 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+..++. .+.+|+||++++.+|++++|++|+|++.. .+++|+|+||+++++||... ++ T Consensus 165 ~i~~~~~~l~~~~~-~~~~~v~~~~~~~~L~~lkd~~G~~l~~~---------~~~~l~G~PV~~~~~~~~~~-----~~ 229 (304) T protein:vir:94 165 DLSALMATIEDEEL-DPNGVLTTRSFRSKMRNALDANDRPLFDA---------NGNEIMGLPLSYTGADVYDK-----KK 229 (304) T ss_pred HHHHHHHHhhhccC-CcCEEEEcHHHHHHHHHhhccCCcEeecC---------CCccccceeeEEecccccCC-----CC Confidence 99999999876553 45679999999999999999999987632 34689999999999998642 34 Q ss_pred ceEEEecccceEEEeeccceEEEeccCC--------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGD--------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~--------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ..+++|||+++++++++++++++++++. .++..+++|++|++++|+++|+|++++||+||++||.+- T Consensus 230 ~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 230 SLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 5688999999999999999999998763 466678899999999999999999999999999999887 No 15 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=7.2e-61 Score=350.32 Aligned_cols=286 Identities=15% Similarity=0.153 Sum_probs=243.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +..++++.||++||++++++|++.+++.++++++|+++|++++..+||++++++.++|++|++++|+++++|++++++++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~ 88 (304) T protein:vir:10 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAK 88 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEE Confidence 55666788899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc--ccccccccccccccccccccchhH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK--PAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) |++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|..... .+.++...........+.+...|+ T Consensus 89 k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (304) T protein:vir:10 89 KIGVIIPLSKEFLKWTAKD----FFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYV 164 (304) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccccccchHH Confidence 9999999999999877544 7889999999999999999999999853322 112222222333333344556689 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+..++. .+.+|+||++++.+|++++|++|+|++.. .+++|+|+||+++++||... ++ T Consensus 165 ~i~~~~~~l~~~~~-~~~~~v~~~~~~~~L~~lkd~~G~~l~~~---------~~~~l~G~PV~~~~~~~~~~-----~~ 229 (304) T protein:vir:10 165 DLSALMATIEDEEL-DPNGVLTTRSFRSKMRNALDANDRPLFDA---------NGNEIMGLPLSYTGADVYDK-----KK 229 (304) T ss_pred HHHHHHHHhhhccC-CcCEEEEcHHHHHHHHHhhccCCcEeecC---------CCccccceeeEEecccccCC-----CC Confidence 99999999876553 45679999999999999999999987632 34689999999999998642 34 Q ss_pred ceEEEecccceEEEeeccceEEEeccCC--------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGD--------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~--------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ..+++|||+++++++++++++++++++. .++..+++|++|++++|+++|+|++++||+||++||.+- T Consensus 230 ~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 230 SLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 5688999999999999999999998763 466678899999999999999999999999999999887 No 16 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=4.9e-61 Score=351.24 Aligned_cols=293 Identities=15% Similarity=0.167 Sum_probs=241.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccc-----cCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEV-----KPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~-----~~~s~~~~~~v 75 (315) |+.++.++||++||++++++|++.+++.++|++++++++++++++++|+.++++.++||+|++. +|.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 9999999999999999999999999999999999999999999999999999999999999986 45678999999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccc-- Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDAT-- 153 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~-- 153 (315) ++++||++++++||+|+|+++..+ ++++|++++++++++++|.++++|+|++.+..+.++.............. T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~----~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVA----VLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccccccc Confidence 999999999999999999877554 78999999999999999999999998766554444433322222221111 Q ss_pred cchhHHHHHHHHHhh---hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcc Q lcl|NC_018838. 154 DSATTDLVKAVGLIA---GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGA 230 (315) Q Consensus 154 ~~~~~di~~~~~~~~---~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~ 230 (315) ...+.++...+.... .......++|+||++++..|++++|++|+|+|. +++|+|+||++++++|.. T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~-----------~~~l~G~Pv~~~~~~~~~ 225 (305) T protein:vir:25 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR-----------DDSFAGFRTFFNRNGAWD 225 (305) T ss_pred chhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeec-----------CCcccccceEEcCccCCC Confidence 222334333333221 223344567999999999999999999998763 358999999999998753 Q ss_pred ccccccccceEEEecccceEEEeeccceEEEeccCC--ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 231 PEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD--PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 231 ~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~--~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) .++..+++|||++++++++++++++++++.+ .+...+++|++|++++|+++|+||.|.||+||++++...... T Consensus 226 -----~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~ 300 (305) T protein:vir:25 226 -----ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) T ss_pred -----CCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccc Confidence 2345788999999999999999999999874 355567899999999999999999999999999999987766 Q ss_pred CCCCC Q lcl|NC_018838. 309 PNPPA 313 (315) Q Consensus 309 ~~~~~ 313 (315) ++|-+ T Consensus 301 ~~pa~ 305 (305) T protein:vir:25 301 VAPAA 305 (305) T ss_pred cCCCC Confidence 66666 No 17 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.9e-60 Score=347.95 Aligned_cols=297 Identities=19% Similarity=0.174 Sum_probs=241.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+.++++.+|.+||++++++|++.+++.++|+++|++++++++++++|+.+++++++|++|++++|+++++|+++++.+| T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~ 93 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPH 93 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeE Confidence 88888888888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc-ccccccc--hh Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK-TVDATDS--AT 157 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~ 157 (315) |++++++||+|+|+++..+ |+++|.+++++++++++|+++++|+|.+....+.++......... ....... .. T Consensus 94 k~~~~~~is~ell~ds~~~----l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (320) T protein:vir:10 94 KIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYD 169 (320) T ss_pred EEEEeehhhHHHHhcChHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccccHH Confidence 9999999999999877654 788999999999999999999999986544444333332221111 1111111 12 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccc-cCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG-FAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~-~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) .++.+++..+... ...+++|+|||+++.+|++++|++|++++......... .....+++|+||++++++|.+ T Consensus 170 ~~~~~~~~~~~~~-~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~------ 242 (320) T protein:vir:10 170 AVAVNGLSLLVNA-KKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADG------ 242 (320) T ss_pred HHHHHHHhhhhcc-cCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCC------ Confidence 3455555555433 34567899999999999999999999887544332211 112357999999999998753 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) +..+++|||++++++++++++++++++.+ .++..+++|++|++++|+++|+||++.||+||++|+.+++|+. T Consensus 243 -~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 243 -TTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred -ceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 34578999999999999999999998875 3455678999999999999999999999999999999998777 No 18 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.4e-60 Score=348.73 Aligned_cols=295 Identities=16% Similarity=0.133 Sum_probs=245.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhh-cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +..++...||++||++++++|++.+++.++++++ ++++|+.++.+++|+.++++.+.||+|++.+|+++++|++|++.+ T Consensus 132 ~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~ 211 (435) T protein:vir:80 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTA 211 (435) T ss_pred hcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEee Confidence 5667777899999999999999999999999998 788999999999999999999999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc----ccc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA----TDS 155 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~----~~~ 155 (315) ||++++++||+|+|+++..+ ..|+++|.+++++++++++|.++++|+| ++..|.|+............. ... T Consensus 212 ~k~~~~~~is~ell~ds~~~--~~l~~~i~~~l~~a~~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (435) T protein:vir:80 212 KKMAALVPIANDLIKYAGVN--PNVDQIVVGDLTAAIGAREDKAFIRDDG--TANTPKGLRFWALPGNVITASDGSTLQK 287 (435) T ss_pred EEEEEeehhhHHHHHhhccc--HHHHHHHHHHHHHHHHHHHHHHhhccCC--CCCcccceeecccccceeecccccchhh Confidence 99999999999999876432 2478999999999999999999999987 445677776543332211111 122 Q ss_pred hhHHHHHHHHHhhhccc-ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGL-QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS 234 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~-~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 234 (315) .+.++.+++..+..... ..+++|+||+.++.+|++++|++|+|++ |+. +.++|+|+||+++++||.+...+ T Consensus 288 ~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~-----~~~---~~~~l~G~pv~~~~~~p~~~~~~ 359 (435) T protein:vir:80 288 IETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVY-----PEL---ANGMLKGYPVGKTTQVPINLGEA 359 (435) T ss_pred HHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceec-----cCC---CCCeEeeeeeEEeccccccccCC Confidence 34577777777755443 3356799999999999999999998765 433 33589999999999999876544 Q ss_pred ccccceEEEecccceEEEeeccceEEEeccCC---ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 235 PASGVKAIVGDFSRVHWGFQRNFPIELIEYGD---PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 235 ~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~---~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) . +...+++|||+++++++++++++++++++. .++..+++|++|+++||++.|+||++.||+||++|++..-.+ T Consensus 360 ~-~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 360 G-KESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred C-CcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 3 455788999999999999999999999874 345567899999999999999999999999999999987776 No 19 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.6e-60 Score=348.45 Aligned_cols=295 Identities=15% Similarity=0.136 Sum_probs=246.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhh-cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..++...||++||+++.++|++.+++.++++++ ++.+++.++.+++|+.++++.++||+|++.+|+++++|+++++.+ T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~ 211 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTA 211 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeee Confidence 6777788899999999999999999999999998 788999999999999999999999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc--c--ccccccc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT--K--TVDATDS 155 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~--~--~~~~~~~ 155 (315) ||++++++||+|+|+++..+ .+|+++|.+++++++++++|.++++|+| ++..+.|+........ . ....... T Consensus 212 ~k~~~~~~iS~ell~ds~~~--~~l~~~i~~~l~~ai~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (435) T protein:vir:14 212 KKMAALVPIANDLIKYAGVN--PNVDQIVVGDLTAAIGAREDKAFIRDDG--TANTPKGLRFWALPSNVITASDASTLQK 287 (435) T ss_pred EEEEEeehhhHHHHHhhccC--HHHHHHHHHHHHHHHHHHHHHHhhccCC--CCccccceeecccccceeccccccchhh Confidence 99999999999999876432 3588999999999999999999999987 4456777765432211 1 1111222 Q ss_pred hhHHHHHHHHHhhhccc-ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGL-QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS 234 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~-~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 234 (315) .+.++.+++..+..... ..+++|+||+.++..|++++|++|+|++ |+. ..++|+|+||++++.||.+...+ T Consensus 288 ~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~-----~~~---~~g~l~G~Pv~~~~~~p~~~~~~ 359 (435) T protein:vir:14 288 IETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVY-----PEL---ANGMLKGYPVGKTTQVPINLGET 359 (435) T ss_pred HHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceec-----cCC---CCCeeecceeEeeccccccccCC Confidence 34577778777765432 3456799999999999999999998765 333 34589999999999999876554 Q ss_pred ccccceEEEecccceEEEeeccceEEEeccCC---ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 235 PASGVKAIVGDFSRVHWGFQRNFPIELIEYGD---PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 235 ~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~---~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) . ....+++|||++|++++|+++++++++++. .++..+++|++|++++|+++|+||++.+|+||++|++++-.+ T Consensus 360 ~-~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 360 G-KESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred C-ccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 3 345688999999999999999999999874 234567889999999999999999999999999999998887 No 20 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=6.4e-60 Score=345.11 Aligned_cols=292 Identities=16% Similarity=0.141 Sum_probs=249.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +...+.+.++.+||++++++|++.+++.|+++++|+++|++++.++||+.++.+.+.|++|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:10 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeE Confidence 44445566778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |+++++++|+|+|+++..+ ++++|.+++++++++++|.++++|+|. +..+.++.+..... .....+...++++ T Consensus 107 k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~ai~~~~d~a~l~G~g~--~~~~~~i~~~~~~~-~~~~~~~~t~~~i 179 (324) T protein:vir:10 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEKT-NKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhhcCCC--CccCcccccccccc-ceeccccCCHHHH Confidence 9999999999999877544 788999999999999999999999873 33445555543333 3344456678999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++..+..++ ..+++|+||++++..|++++|++|++++ ..+.+++|+|+||++++.++. ++.. T Consensus 180 ~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~l~d~~g~~~~--------~~~~~~~l~G~PV~~~~~~~~-------~~~~ 243 (324) T protein:vir:10 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERI--------YDRNSDTLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCceee--------cCCCCccccceeEEeecCCCC-------Ccce Confidence 99999997654 3456899999999999999999998753 234567899999998766542 3456 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++.+ .++..+++|++|++++|+++|+||++.+|+||++|+.+++....+|+- T Consensus 244 ~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:10 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 88999999999999999999999874 355667899999999999999999999999999999998888777777 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:10 324 V 324 (324) T ss_pred C Confidence 7 No 21 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=8e-60 Score=344.58 Aligned_cols=292 Identities=16% Similarity=0.142 Sum_probs=245.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +...+.++||++||+++.++|++.+++.|++++++++++++++.++||+.++++.++||+|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeE Confidence 66666788999999999999999999999999999999999889999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+| ++..+.++...... ......+...++++ T Consensus 107 k~~~~~~is~ell~ds~~~----l~~~i~~~la~ai~~~~d~a~l~G~g--~~~~~~gi~~~~~~-~~~~~~~~~t~~~i 179 (324) T protein:vir:78 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIEK-TNKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhccCC--CCCcCccccccccc-cceeccccccHHHH Confidence 9999999999999877544 78899999999999999999999987 33444555544333 23344456679999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++ ..+++|+||++++..|++++|.+|++++ ..+.+++|+|+||+++..++. ++.. T Consensus 180 ~~~~~~l~~~~-~~~~~~vmn~~~~~~L~~l~d~~G~~~~--------~~~~~~~l~G~PV~~~~~~~~-------~~~~ 243 (324) T protein:vir:78 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERI--------YDRNSDSLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCCeee--------cCCCCCcccceeeEeeCCCCC-------Ccce Confidence 99999887654 4456899999999999999999998764 234567899999998765542 3456 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++++ .++..+++|++|+++||+++|+||++.||+||++|+.+..-.-..|++ T Consensus 244 ~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:78 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 88999999999999999999999874 356678999999999999999999999999999999864433223333 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:78 324 V 324 (324) T ss_pred C Confidence 3 No 22 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=8e-60 Score=344.58 Aligned_cols=292 Identities=16% Similarity=0.142 Sum_probs=245.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +...+.++||++||+++.++|++.+++.|++++++++++++++.++||+.++++.++||+|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeE Confidence 66666788999999999999999999999999999999999889999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+| ++..+.++...... ......+...++++ T Consensus 107 k~~~~~~is~ell~ds~~~----l~~~i~~~la~ai~~~~d~a~l~G~g--~~~~~~gi~~~~~~-~~~~~~~~~t~~~i 179 (324) T protein:vir:96 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIEK-TNKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhccCC--CCCcCccccccccc-cceeccccccHHHH Confidence 9999999999999877544 78899999999999999999999987 33444555544333 23344456679999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++ ..+++|+||++++..|++++|.+|++++ ..+.+++|+|+||+++..++. ++.. T Consensus 180 ~~~~~~l~~~~-~~~~~~vmn~~~~~~L~~l~d~~G~~~~--------~~~~~~~l~G~PV~~~~~~~~-------~~~~ 243 (324) T protein:vir:96 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERI--------YDRNSDSLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCCeee--------cCCCCCcccceeeEeeCCCCC-------Ccce Confidence 99999887654 4456899999999999999999998764 234567899999998765542 3456 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++++ .++..+++|++|+++||+++|+||++.||+||++|+.+..-.-..|++ T Consensus 244 ~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 88999999999999999999999874 356678999999999999999999999999999999864433223333 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:96 324 V 324 (324) T ss_pred C Confidence 3 No 23 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=6e-60 Score=345.26 Aligned_cols=299 Identities=19% Similarity=0.189 Sum_probs=244.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+.++.++++.+||++++++||+.+++.++|++++++++|++++++||++++++.++||+|++.+|+++++|+++++.+| T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 89 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPA 89 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccccccccceeEEEEeeE Confidence 77777777777788889999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++..+ ++++|++++++++++++|+++++|+|. +.++.++...... .........++++ T Consensus 90 k~~~~v~iS~ell~ds~~~----l~~~i~~~l~~aia~~~d~a~l~G~gt--~~~~~~~~~~~~~--~~~~~~~~~~~~~ 161 (397) T protein:vir:23 90 KIATIFVASAETVRANPAN----YLGTMRTKVATAIAMAFDNAALHGTNA--PSAFQGYLDQSNK--TQSISPNAYQGLG 161 (397) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhhcccC--Ccccccccccccc--eeeecccchhHHH Confidence 9999999999999877544 788999999999999999999999984 3333333332221 1222334456677 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccc-cccCCCccccceeeEeecccCccccccccccc Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPA-AGFAGLDNWRGLNVGASSTVSGAPEMSPASGV 239 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~-~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~ 239 (315) .+++..+..+. ..+++|+||++++..|+++||++|+|++....... .....+++|+|+||+++++||.+ +. T Consensus 162 ~~~~~~l~~~~-~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g-------~~ 233 (397) T protein:vir:23 162 VSGLTKLVTDG-KKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG-------DV 233 (397) T ss_pred HHHHHhhhhcc-cCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCC-------ce Confidence 77777776554 44578999999999999999999999876544322 12233468999999999999843 34 Q ss_pred eEEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC----- Q lcl|NC_018838. 240 KAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK----- 308 (315) Q Consensus 240 ~~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~----- 308 (315) .+++|||++++++++++++++++++.+ ..+..+++|++|+++||+++|+||+++||+||++++..+.+. T Consensus 234 ~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 313 (397) T protein:vir:23 234 VGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALD 313 (397) T ss_pred EEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeec Confidence 678999999999999999999999875 244567899999999999999999999999999999754433 Q ss_pred -CCCCCCC Q lcl|NC_018838. 309 -PNPPAGN 315 (315) Q Consensus 309 -~~~~~~~ 315 (315) +.+.+|+ T Consensus 314 ~~~~~~~~ 321 (397) T protein:vir:23 314 LDGASAGN 321 (397) T ss_pred ccccCcce Confidence 3344444 No 24 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=3.8e-60 Score=346.33 Aligned_cols=281 Identities=11% Similarity=0.074 Sum_probs=240.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCc-cceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSAS-VDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~-~~~~~v~l~~ 79 (315) |..++.+.||++||++++.+|++.+++.++|+++|++++++++..++|+.++++.++|++|++.+|+++ ++|+++++.+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~ 209 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFAS 209 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccccccccccccceeeeeh Confidence 889999999999999999999999999999999999999999999999999999999999999999876 7999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc------------ Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT------------ 147 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~------------ 147 (315) ||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|. + .|.|+.+...... T Consensus 210 ~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~ai~~~~d~~~l~G~G~--~-~p~Gil~~~~~~~~~~~~~~~~~~~ 282 (425) T protein:vir:10 210 GEIYANPAATQQILDDAEID----LESWLATEVQTEFAKQEGKAFLAGDGT--N-KPNGLLTYIAGGANAAKHPFGAIEV 282 (425) T ss_pred eeeEeehHhHHHHHhcchhH----HHHHHHHHHHHHHHHHHHhhhhcccCC--C-Ccceeeecccccccccccccccccc Confidence 99999999999999776544 789999999999999999999999983 2 4666665433222 Q ss_pred -cccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 148 -KTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 148 -~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) .....+...++++++++..+.+.+ ..+..|+||++++.+|++++|.+|+|+|. |+...+.+++|+|+||+++++ T Consensus 283 ~~~~~~~~~~~d~l~~l~~~l~~~~-~~~a~~vmn~~~~~~L~~lkD~~G~~l~~----~~~~~g~~~~l~G~PV~~~~~ 357 (425) T protein:vir:10 283 VNSGAAADITSDGIIDLVYDLPSAF-TGNARFAMNRNTQRQVRKLKDGQGNYLWQ----PSYVAGQPATLAGYPVTEVPD 357 (425) T ss_pred ccccccccccHHHHHHHHhhhhhhh-ccCCEEEEchHHHHHHHHhhcCCCceeec----cCccCCCCceecceeeEEecC Confidence 122344567889999998886543 34557999999999999999999998864 456677888999999999999 Q ss_pred cCccccccccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ||... .....++||||++ |+++++.++++..+++ |++|++.||++.|+|++|++|+||++|+.++ T Consensus 358 ~p~~~----~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~----------~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 358 MPDVA----ANSTPILFGDFQQTYLIIDRIGVRVLRDPY----------TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAA 423 (425) T ss_pred cCCcc----CCccEEEEEehhccEEEEEecceEEEeccc----------ccCCcEEEEEEEEeccEeecccceEEEEeec Confidence 98542 2235678899987 7888898887765543 7789999999999999999999999999999 Q ss_pred CC Q lcl|NC_018838. 306 AP 307 (315) Q Consensus 306 a~ 307 (315) ++ T Consensus 424 s~ 425 (425) T protein:vir:10 424 SE 425 (425) T ss_pred cC Confidence 88 No 25 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.9e-59 Score=342.54 Aligned_cols=292 Identities=16% Similarity=0.143 Sum_probs=247.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +...+.+.++.+||++++++|++.+++.++|+++|+++|+++++++||+.++++.++|++|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:99 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeE Confidence 44455566778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |+++++++|+|+|+++..+ ++++|.+++++++++++|.++++|+|. +..+.++.+..... .....+...++++ T Consensus 107 k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~ai~~~~d~~~l~G~g~--~~~~~~~~~~~~~~-~~~~~~~~~~~~i 179 (324) T protein:vir:99 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEKT-NKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhhcCCC--CccCcccccccccc-ceeccccCCHHHH Confidence 9999999999999877544 788999999999999999999999873 33445555443333 3444456778999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++..+.+.+ ..++.|+|||+++..|++++|++|++++ + .+.+++|+|+||++++.++. ++.. T Consensus 180 ~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~l~d~~g~~~~-----~---~~~~~~l~G~PVv~~~~~~~-------~~~~ 243 (324) T protein:vir:99 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERI-----Y---DRNSDTLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhcCCCceee-----c---CCCCccccceeEEeecCCCC-------Ccce Confidence 99999997654 3456799999999999999999998753 2 34567899999998876653 3456 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++.. .++..+++|++|++++|+++|+||++.||+||++|+.++.....+|+- T Consensus 244 ~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~ 323 (324) T protein:vir:99 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 88999999999999999999999874 345567899999999999999999999999999999987766666666 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:99 324 V 324 (324) T ss_pred C Confidence 6 No 26 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.1e-59 Score=342.29 Aligned_cols=292 Identities=16% Similarity=0.151 Sum_probs=244.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +..++.++++++||++++++|++.+++.|+++++|+++|++++.++||+.++.+.++|++|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 106 (324) T protein:vir:93 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeE Confidence 44455667788999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++.. +++++|.+++++++++++|.++++|+|. +..+.++...... +.....+...++++ T Consensus 107 k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~aia~~~d~a~l~G~g~--~~~~~~~~~~~~~-~~~~~~~~~~~~~i 179 (324) T protein:vir:93 107 KLGVILPVTKEFLNYTYS----QFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEK-TNKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchH----HHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCccccccccc-cceeccccccHHHH Confidence 999999999999987754 4788999999999999999999999873 3344455544333 23444556678999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++. .+.+|+||++++..|++++|++|++++. .+.+++|+|+||+++...+ .++.. T Consensus 180 ~~~~~~l~~~~~-~~~~~v~n~~~~~~L~~l~d~~G~~~~~--------~~~~~~l~G~PVv~~~~~~-------~~~~~ 243 (324) T protein:vir:93 180 IDLEALLEDDEL-EANAFISKTQNRSLLRKIVDPETKERIY--------DRNSDSLDGLPVVNLKSSN-------LKRGE 243 (324) T ss_pred HHHHHhhhhccC-CCCEEEEcHHHHHHHHHhhCCCCCeeec--------CCCCCcccceeeEeecCCC-------CCcce Confidence 999999876654 4568999999999999999999987632 3456789999999876543 23456 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||++++++++++++++++++.. .++..+++|++|++++|+++|+||++.||+||++|+.+..-.-..|+. T Consensus 244 i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:93 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCC Confidence 88999999999999999999999874 356678999999999999999999999999999999765433223333 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:93 324 V 324 (324) T ss_pred C Confidence 3 No 27 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=7.9e-60 Score=344.59 Aligned_cols=291 Identities=12% Similarity=0.131 Sum_probs=234.6 Q ss_pred CC-CCccCCCceEcchhHHHHHHHHHHhccchhhh-cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MA-DDFLSAGKLELPGSMIGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~-~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) ++ .++.+.||++||+++.++||+.+++.++++++ ++++|+.++.+++|+.++++.++|++|++.+|+++++|++|++. T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~ 204 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLT 204 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEee Confidence 22 23334689999999999999999999999999 78899999999999999999999999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc---cccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD---ATDS 155 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~---~~~~ 155 (315) ++|++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+| ++..|.|+.+.......... .... T Consensus 205 ~~k~~~~v~is~ell~ds~~----~l~~~i~~~l~~ai~~~~d~~~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 278 (428) T protein:vir:10 205 AKTMIAMVPISNALIGRAGF----NVEQLVLQDILTAISVREDKAFMRDDG--TGDTPIGMKARATQWNRLLPWAADAAV 278 (428) T ss_pred eEEEEEeehhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHHhccCC--CCccccccccccccccccccccccccc Confidence 99999999999999976544 478999999999999999999999987 55667777665433221111 1112 Q ss_pred hhHHHHHHHHHhh-----hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcc Q lcl|NC_018838. 156 ATTDLVKAVGLIA-----GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGA 230 (315) Q Consensus 156 ~~~di~~~~~~~~-----~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~ 230 (315) .++.+...+..+. ......+..|+||+.++.+|++++|++|+|++. +. ..++|+|+||+++++||.+ T Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~-----~~---~~g~l~G~pv~~~~~~p~~ 350 (428) T protein:vir:10 279 NLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYP-----EM---AQGMLKGYPIQRTSAIPAN 350 (428) T ss_pred cHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceecc-----CC---CCCeeeceeeEEecccccc Confidence 2223323222221 112223457999999999999999999988763 22 2358999999999999987 Q ss_pred ccccccccceEEEecccceEEEeeccceEEEeccCC---ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 231 PEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD---PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 231 ~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~---~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ...+ .+...++||||+++++++++++++++++++. .++..+++|++|++++|+++|+||++.||+||+++++..= T Consensus 351 ~~~~-~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 351 LGEG-GKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ccCC-CccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 5544 3456788999999999999999999999864 3445678999999999999999999999999999998776 No 28 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.9e-59 Score=342.54 Aligned_cols=288 Identities=17% Similarity=0.128 Sum_probs=229.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecC----CCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTI----FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~----~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |..++.+.||+++|+++.++||+.+++.+++++++....++ .+++++|+.++++.++||+|++.+|+++++|++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~ 417 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESIT 417 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEE Confidence 55566677999999999999999999999999998664333 35689999999999999999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cccccccccccccccccccccccc Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA-TGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~-~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) |.+||+++++++|+|||+++..+ ++++|++++++++++++|.++|+|++.+ .+..+.++...+ ........ T Consensus 418 l~~~kla~~~~iS~ell~ds~~~----~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~----~~~~~~~~ 489 (645) T protein:vir:93 418 FSHAKVSAIAVLTEELIRFSSPA----ADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV----KGTASSGN 489 (645) T ss_pred EeeEEEEEeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc----cccccccc Confidence 99999999999999999877554 7899999999999999999999988643 223445544322 22223334 Q ss_pred hhHHHHHHHHHhhhccccc-ceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQV-PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS 234 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~-~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 234 (315) .+.|+.+++..+..++... .++|+|||.++.+|++++|++|+++ ||.+. ...++|+|+||+++++||++ T Consensus 490 ~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~-----~~~~~-~~~~tL~G~PV~~s~~vp~~---- 559 (645) T protein:vir:93 490 PDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKE-----YPDMT-LLGGSFQGLPVIVSQYVGDQ---- 559 (645) T ss_pred hHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCcee-----ecCCC-CCCceeeceeeEEeccCCcc---- Confidence 5678888888876655443 3479999999999999999988765 44432 23469999999999999853 Q ss_pred ccccceEEEecccceEEEeeccceEEEeccCCc--------------cccchhhhhcCcEEEEEEEEeccEeecccceEE Q lcl|NC_018838. 235 PASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP--------------DQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAV 300 (315) Q Consensus 235 ~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~--------------~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~ 300 (315) +++|||+++++|.+.++.+.++++++. .+..++|||+||++||+++|+||+++||+||++ T Consensus 560 ------~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~ 633 (645) T protein:vir:93 560 ------LVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAV 633 (645) T ss_pred ------eeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEE Confidence 567899999999999988888776542 224589999999999999999999999999999 Q ss_pred EeeccCCCCCCCCC Q lcl|NC_018838. 301 VKEKAAPKPNPPAG 314 (315) Q Consensus 301 l~~~~a~~~~~~~~ 314 (315) |+++.==.. -+| T Consensus 634 lt~~~~g~~--~~~ 645 (645) T protein:vir:93 634 ITGVNYGSA--SGG 645 (645) T ss_pred EecccCCcc--cCC Confidence 996421000 011 No 29 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=6.4e-59 Score=339.63 Aligned_cols=292 Identities=16% Similarity=0.140 Sum_probs=243.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +.....+.++.+||++++++|++.+++.|+++++++++|++++.++||+.++.++++||+|++.+|+++++|+++++.+| T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~ 106 (324) T protein:vir:96 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeE Confidence 33344456778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) |++++++||+|+|+++.. +|+++|.+++++++++++|.++|+|+| ++..+.++....... .....+...|+++ T Consensus 107 k~~~~~~is~ell~ds~~----~l~~~i~~~l~~aia~~~d~~~l~G~g--~~~~~~~~~~~~~~~-~~~~~~~~~~~~i 179 (324) T protein:vir:96 107 KLGVILPVTKEFLNYTYS----QFFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIKKT-NKVIKGDFTQDNI 179 (324) T ss_pred EEEEeehhhHHHHhcchH----HHHHHHHHHHHHHHHHHHHHHhhhcCC--CCCcCcccccccccc-ceecccccchHHH Confidence 999999999999987654 488999999999999999999999987 334445555443333 3344455678999 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccce Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 240 (315) .+++.++..++ ..+++|+||++++..|++++|++|+++. ..+.+++|+|+||+++...+. ++.. T Consensus 180 ~~~~~~i~~~~-~~~~~~i~n~~~~~~L~~lkd~~G~~~~--------~~~~~~~l~G~PV~~~~~~~~-------~~~~ 243 (324) T protein:vir:96 180 IDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERI--------YDRNSDSLDGLPVVNLKSSNL-------KRGE 243 (324) T ss_pred HHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhCCCCCeee--------cCCCCCcccceeeEeecCCCC-------Ccce Confidence 99999987654 4456899999999999999999998763 234567899999998765542 3446 Q ss_pred EEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 241 ~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) +++|||+++++++++++++++++++. .++..+++|++|++++|+++|+||++.+|+||++|+.+..-.-..|+. T Consensus 244 ~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCC Confidence 88999999999999999999999875 356678999999999999999999999999999999775443334443 Q ss_pred C Q lcl|NC_018838. 315 N 315 (315) Q Consensus 315 ~ 315 (315) - T Consensus 324 ~ 324 (324) T protein:vir:96 324 V 324 (324) T ss_pred C Confidence 3 No 30 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=3.4e-59 Score=341.10 Aligned_cols=287 Identities=12% Similarity=0.071 Sum_probs=241.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCc-cceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSAS-VDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~-~~~~~v~l~~ 79 (315) |..++.++||++||++++++|++.+++.++|+++|++++++++...+|+.++++.++|++|++.+|+++ ++|+++++.+ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~ 185 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFM 185 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeee Confidence 888999999999999999999999999999999999999999999999999999999999999999865 8999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc------------ Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT------------ 147 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~------------ 147 (315) ||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|. + .|.|+........ T Consensus 186 ~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~~~~a~l~G~G~--~-~p~Gil~~~~~~~~~~~~~~~~~~~ 258 (407) T protein:vir:48 186 GEIYGNPQATQKMLDDAFFN----VEDWINSELALEFAEQEEIAFTSGDGS--K-KPKGFLAYESTDEDDKTRAFGKLQH 258 (407) T ss_pred eeeEeehhhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhhhccCCC--C-ccceeeecccccccccccccccccc Confidence 99999999999999776544 788999999999999999999999984 2 4566654422211 Q ss_pred -cccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 148 -KTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 148 -~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) .....+...++++.+++..+...+ ..+..|+||++++..|++++|.+|+|++. |++..+.+++|+|+||+++++ T Consensus 259 ~~~~~~~~~~~d~i~~l~~~l~~~~-~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~----~~~~~g~~~~l~G~PV~~~~~ 333 (407) T protein:vir:48 259 IASGAASGVTADAIIKLIYTLRKAH-RSGAKFMMNNSSLFAIRLLKDNDGNYLWR----PGIELGQPSSLAGYGIVENEQ 333 (407) T ss_pred cccccccccChHHHHHHHHhhchhh-hcCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCCceecceeeEEecC Confidence 122233456889999999886653 33457999999999999999999998753 456778889999999999999 Q ss_pred cCccccccccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ||... .....++||||++ |.+.++.++++..+++ |++|++.||+++|+|+++++|+||++|+.++ T Consensus 334 ~p~~~----~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~a 399 (407) T protein:vir:48 334 MPDIA----ADAKAIAFGNFKRGYTIVDRIGTRILRDPY----------TNKPFVGFYTTKRTGGMLVDSQAIKLMKIGA 399 (407) T ss_pred cCCcc----CCccEEEEEeccccEEEEEeeceEEEeecc----------ccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 98532 2334678899975 8888999988876553 7889999999999999999999999999887 Q ss_pred CCCCCCCC Q lcl|NC_018838. 306 APKPNPPA 313 (315) Q Consensus 306 a~~~~~~~ 313 (315) +....--+ T Consensus 400 a~~~~~~~ 407 (407) T protein:vir:48 400 ATRQKAAA 407 (407) T ss_pred cCCCCCCC Confidence 66665555 No 31 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.1e-58 Score=338.30 Aligned_cols=293 Identities=19% Similarity=0.159 Sum_probs=234.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |..++.+.|+ +||++++++||+.+++.++|+++|+++||++++.++|+.++++.++||+|++.+|+++++|+++++.+| T Consensus 20 ~~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 98 (326) T protein:vir:42 20 AQTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPH 98 (326) T ss_pred eeccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeE Confidence 6666655555 589999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc-----ccccc Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV-----DATDS 155 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~ 155 (315) |+++++++|+|+|+++..+ ++++|.+++++++++++|+++++|+|. + .+.++.+......... ..... T Consensus 99 k~~~~v~iS~ell~~s~~~----~~~~i~~~l~~a~~~~~d~a~l~G~gs--~-~p~gi~~~~~~~~~~~~~~~~~~~~~ 171 (326) T protein:vir:42 99 KIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDNAAINGTDS--P-FPTFLAQTTKEVSLVDPDGTGSNADL 171 (326) T ss_pred EEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHhhcccCC--C-ccccccccccccceeecccccccccc Confidence 9999999999999887654 788999999999999999999999984 2 2344443322211111 11112 Q ss_pred hhHHH--HHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccc-cCCCccccceeeEeecccCcccc Q lcl|NC_018838. 156 ATTDL--VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG-FAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 156 ~~~di--~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~-~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) .+.++ ..++..+ .......++|+||++++.+|+++||++|+|++......... ....++|+|+||+++++||.+ T Consensus 172 ~~~~~~~~~~~~~~-~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-- 248 (326) T protein:vir:42 172 TVYDAVAVNALSLL-VNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASG-- 248 (326) T ss_pred hhHHHHHHHHHhhh-hhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCC-- Confidence 23332 2333333 23334456799999999999999999999887654432211 112357999999999999853 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCc------cccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP------DQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~------~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +..+++|||++++++++++++++++++.+. +...+++|++|++++|+++|+||++.||+||++|+.+++ T Consensus 249 -----~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 249 -----TVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred -----ceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 456789999999999999999999988652 345678999999999999999999999999999999888 Q ss_pred CCC Q lcl|NC_018838. 307 PKP 309 (315) Q Consensus 307 ~~~ 309 (315) .+. T Consensus 324 ~~~ 326 (326) T protein:vir:42 324 TEA 326 (326) T ss_pred cCC Confidence 887 No 32 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=2.8e-59 Score=341.59 Aligned_cols=280 Identities=11% Similarity=0.053 Sum_probs=237.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCC-ccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l~~ 79 (315) |+.++++.||++||++++++|++.+++.++|+++|++++++++..++|+..+++.++|++|++.+|.+ .++|+++++.+ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~ 186 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFM 186 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeeh Confidence 99999999999999999999999999999999999999999999999999999999999999999975 48999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc------------ Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT------------ 147 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~------------ 147 (315) ||+++++++|+|+|+++.. +|+++|.++|++++++++|.++++|+|. + .|.|+.+...... T Consensus 187 ~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~ai~~~~~~~~l~G~G~--~-~p~Gil~~~~~~~~~~~~~~~~~~~ 259 (401) T protein:vir:44 187 GEIYGNPQATQKMLDDAFF----NVEAWINSELATEFAEQEEIAFTTGDGT--K-KPKGFLAYESTEESDKARAFGKLQH 259 (401) T ss_pred hheeeehhhhHHHHhcchH----HHHHHHHHHHHHHHHHHHHhhhhccCCC--C-ccceeeccccccccccccccccccc Confidence 9999999999999977654 4788999999999999999999999984 2 4556654322111 Q ss_pred -cccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 148 -KTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 148 -~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) .....+...|+++.+++..+...+ ..+..|+||++++..|++++|.+|+|++. |+.+.|.+++|+|+||+++++ T Consensus 260 ~~t~~~~~~~~d~i~~~~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~l~~----~~~~~g~~~~l~G~PVv~~~~ 334 (401) T protein:vir:44 260 IVSGEATAVTADAIIKLIYTLRKAH-RTGAKFMMNNNSLFAIRLLKDTEGNYLWR----PGLELGQPSSLAGYGIAENEQ 334 (401) T ss_pred cccccccccCHHHHHHHHHhcchhh-hcCCEEEEcHHHHHHHHHhhccCCceeec----CCcCCCCCceecceeeEEecC Confidence 112233456889999998886543 33457999999999999999999998753 456677888999999999999 Q ss_pred cCccccccccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) ||... .+...++||||++ |++..+.++++.+++ +|++|++.||++.|+|+++.+++||++|+.++ T Consensus 335 ~p~~~----~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~----------~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~a 400 (401) T protein:vir:44 335 MPDIA----ADAKAIAFGNFKRGYTIVDRIGTRILRDP----------YTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAA 400 (401) T ss_pred cCCcc----CCccEEEEeehhccEEEEEecceEEeeec----------cccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 98642 2334678899975 788999998887654 37899999999999999999999999999999 Q ss_pred C Q lcl|NC_018838. 306 A 306 (315) Q Consensus 306 a 306 (315) + T Consensus 401 a 401 (401) T protein:vir:44 401 A 401 (401) T ss_pred C Confidence 8 No 33 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.7e-58 Score=337.30 Aligned_cols=296 Identities=19% Similarity=0.163 Sum_probs=239.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) |+.++.+++|.+||++++++||+.+++.++|+++|+++|++++.++||+.++++.++|++|++++++++++|+++++.+| T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 93 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPH 93 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeE Confidence 88888888899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc--cccccccchhH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT--KTVDATDSATT 158 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 158 (315) |+++++++|+|+|+++..+ ++++|.+++++++++++|.++++|+|.+. +.++........ .........+. T Consensus 94 k~~~~~~iS~e~l~ds~~~----~~~~i~~~l~~~~~~~~d~a~l~G~g~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (318) T protein:vir:24 94 KIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDGAAMHGTDSPF---PTYIGQTTKAISIADTTGATTVYDQ 166 (318) T ss_pred EEEEeehhhHHHhhcChHH----HHHHHHHHHHHHHHHHHHHhhhcccCCCC---CcccccccccccccccccccchHHH Confidence 9999999999999877654 78899999999999999999999997433 233333222211 11112222334 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccc-cCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG-FAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~-~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) ++.+++..+... ...+..|+|||+++..|+++||++|+|++.+....... .....+++|+||+++++++.+ T Consensus 167 ~~~~~~~~~~~~-~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~------- 238 (318) T protein:vir:24 167 VAVNGLSLLVND-GKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEG------- 238 (318) T ss_pred HHHHHHHhhccc-cCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCC------- Confidence 556666665443 34456899999999999999999999886543221111 111247999999999988743 Q ss_pred cceEEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNP 311 (315) Q Consensus 238 ~~~~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~ 311 (315) +..+++|||++++++++++++++++++.+ .++..+++|++|++++|+++|+||++.+|+||++|+.+++--..- T Consensus 239 ~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 239 TTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred ccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 34678999999999999999999999865 245567899999999999999999999999999999877544443 No 34 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1e-58 Score=338.51 Aligned_cols=287 Identities=16% Similarity=0.107 Sum_probs=231.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..+++++||++||+++..+||+.+++.++|++++++++++++.++||+.++ .+.++||+|++.+|+++++|++|++.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 7888889999999999999999999999999999999999999999999876 468999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccccc----- Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATD----- 154 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~----- 154 (315) ||++++++||+|||+++ .+|+++|.++++++|++++|.++++|+|. + .+.|+.+............. T Consensus 231 ~k~a~~~~iS~ell~d~-----~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--~-~p~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:10 231 GKVANALTITDEGLRDA-----PELFNFVQGRLLEGIQRKEEVQLLAGGGY--P-GVNGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eeeEeecHhHHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHhhcCCCc--c-cccccccccccccccccccchhhhh Confidence 99999999999999643 24899999999999999999999999973 3 35666554322111100000 Q ss_pred --------------------------------------------------chhHHHHHHHHHhhhcccccceEEEEeHHH Q lcl|NC_018838. 155 --------------------------------------------------SATTDLVKAVGLIAGAGLQVPNGVALDPAF 184 (315) Q Consensus 155 --------------------------------------------------~~~~di~~~~~~~~~~~~~~~~~~~m~~~~ 184 (315) ....++..++..+...++..+++|+||+.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~ 382 (497) T protein:vir:10 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHH Confidence 001123344444555556667789999999 Q ss_pred HHHHHHHhhccCcccccccccc--ccccCCCccccceeeEeecccCccccccccccceEEEecccc--eEEEeeccceEE Q lcl|NC_018838. 185 SFALSTEVYPKGSPLAGQPMYP--AAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSR--VHWGFQRNFPIE 260 (315) Q Consensus 185 ~~~L~~l~d~~g~~~~~~~~~~--~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~--~~i~~~~~~~v~ 260 (315) +..|++++|++|+|+|+.+... ....+...+|+|+||++++.||.. .+++|||++ +.++++++++|+ T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~---------~~~~Gd~~~~~~~i~~r~~~~v~ 453 (497) T protein:vir:10 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------TILVGHFAPSVIQTARREGVTMQ 453 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC---------ceEEeecccceEEEEEecccEEE Confidence 9999999999999998765321 112334569999999999999853 367899986 557789999999 Q ss_pred EeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 261 ~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ++++.. .+|++|+++||++.|+||+|++|+||++|+.+++.+.- T Consensus 454 ~~~~~~------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 454 MTNSNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eecccc------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 998743 25999999999999999999999999999986655544 No 35 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1e-58 Score=338.51 Aligned_cols=287 Identities=16% Similarity=0.107 Sum_probs=231.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..+++++||++||+++..+||+.+++.++|++++++++++++.++||+.++ .+.++||+|++.+|+++++|++|++.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 7888889999999999999999999999999999999999999999999876 468999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccccc----- Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATD----- 154 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~----- 154 (315) ||++++++||+|||+++ .+|+++|.++++++|++++|.++++|+|. + .+.|+.+............. T Consensus 231 ~k~a~~~~iS~ell~d~-----~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--~-~p~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:78 231 GKVANALTITDEGLRDA-----PELFNFVQGRLLEGIQRKEEVQLLAGGGY--P-GVNGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eeeEeecHhHHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHhhcCCCc--c-cccccccccccccccccccchhhhh Confidence 99999999999999643 24899999999999999999999999973 3 35666554322111100000 Q ss_pred --------------------------------------------------chhHHHHHHHHHhhhcccccceEEEEeHHH Q lcl|NC_018838. 155 --------------------------------------------------SATTDLVKAVGLIAGAGLQVPNGVALDPAF 184 (315) Q Consensus 155 --------------------------------------------------~~~~di~~~~~~~~~~~~~~~~~~~m~~~~ 184 (315) ....++..++..+...++..+++|+||+.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~ 382 (497) T protein:vir:78 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHH Confidence 001123344444555556667789999999 Q ss_pred HHHHHHHhhccCcccccccccc--ccccCCCccccceeeEeecccCccccccccccceEEEecccc--eEEEeeccceEE Q lcl|NC_018838. 185 SFALSTEVYPKGSPLAGQPMYP--AAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSR--VHWGFQRNFPIE 260 (315) Q Consensus 185 ~~~L~~l~d~~g~~~~~~~~~~--~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~--~~i~~~~~~~v~ 260 (315) +..|++++|++|+|+|+.+... ....+...+|+|+||++++.||.. .+++|||++ +.++++++++|+ T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~---------~~~~Gd~~~~~~~i~~r~~~~v~ 453 (497) T protein:vir:78 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------TILVGHFAPSVIQTARREGVTMQ 453 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC---------ceEEeecccceEEEEEecccEEE Confidence 9999999999999998765321 112334569999999999999853 367899986 557789999999 Q ss_pred EeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 261 ~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ++++.. .+|++|+++||++.|+||+|++|+||++|+.+++.+.- T Consensus 454 ~~~~~~------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 454 MTNSNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eecccc------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 998743 25999999999999999999999999999986655544 No 36 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=9.2e-58 Score=333.28 Aligned_cols=282 Identities=16% Similarity=0.094 Sum_probs=240.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |...+.+++|.+||++++++|++.+++.|+++++|++++++++ ...+|+.++++.++|++|++.+|+++++|+++++++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 88 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVTLKA 88 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccceeEEEEee Confidence 7777778899999999999999999999999999999999765 478899999999999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTD 159 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 159 (315) ||+++++++|+|+|+++..+ ++++|.+++++++++++|.++++|+|... +.++.+.... ......+...|++ T Consensus 89 ~k~~~~~~is~ell~ds~~~----l~~~i~~~la~ai~~~~d~a~l~G~g~~~---~~gi~~~~~~-~~~~~~~~~t~~~ 160 (297) T protein:vir:95 89 HKLGIILVTSREALNYTWKK----FFEDMKPQIVEAFYKKIDEAGLLGHDTPF---ANSVAKAAKD-ANKVIGGPINYDN 160 (297) T ss_pred EEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHHhcccCCcc---cccccccccc-cceecccccCHHH Confidence 99999999999999877544 78999999999999999999999987432 3455544333 2344455667999 Q ss_pred HHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccc Q lcl|NC_018838. 160 LVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV 239 (315) Q Consensus 160 i~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~ 239 (315) +.+++.++..++ ..+++|+||++++.+|++++|.+|++++. +.+++|+|+||+++...+. ++. T Consensus 161 i~~~~~~l~~~~-~~~~~~v~~~~~~~~L~~l~d~~G~~i~~---------~~~~~l~G~Pv~~~~~~~~-------~~~ 223 (297) T protein:vir:95 161 ILKLQDALYDAD-VEPNAFVSKIQNRSALREARDGNKVSIYD---------KAANTIDGITTVDLKSARF-------EKG 223 (297) T ss_pred HHHHHHHhhhcc-CCcCEEEEcHHHHHHHHHhhccCCceeec---------CCCCcccceeeEeecCCCC-------CCc Confidence 999999997655 44578999999999999999999988753 2346899999997765442 234 Q ss_pred eEEEecccceEEEeeccceEEEeccCC------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 240 KAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 240 ~~~~gDf~~~~i~~~~~~~v~~~~~~~------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) .+++|||++++++++++++++++++.+ .++..+++|++|++++|+++|+||++.+|+||++||.++ |. T Consensus 224 ~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at---~~ 297 (297) T protein:vir:95 224 DLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAE---RV 297 (297) T ss_pred eEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecC---CC Confidence 688999999999999999999999874 345667899999999999999999999999999998776 33 No 37 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=1.1e-57 Score=332.95 Aligned_cols=289 Identities=13% Similarity=0.040 Sum_probs=241.6 Q ss_pred CCCCccCCCceEcchhHHHHHH-HHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVR-DRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii-~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) ....+.++||++||++++.+|| +.+++.++++++++++++ ++...+|+.++++.++||+|++.+|+++++|+++++.+ T Consensus 250 ~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 328 (543) T protein:vir:81 250 AMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPV 328 (543) T ss_pred hcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCccccccccccceeeeee Confidence 2334567899999999998876 557888999999988765 57899999999999999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc---cccccccch Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT---KTVDATDSA 156 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 156 (315) +|++++++||+|+|+++ . +|.++|.++|++++++++|.++++|+| ++..+.|+.+...... ......... T Consensus 329 ~k~~~~~~is~ell~d~-~----~~~~~i~~~l~~~~~~~~d~ail~G~G--t~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 401 (543) T protein:vir:81 329 KKAQGFVPISIEALQDE-A----NVTETVALLFAEGKDELEAVTLTTGTG--QGNQPTGIVTALAGTAAEIAPVTAETFA 401 (543) T ss_pred eeeEeeehhhHHHHhcc-H----HHHHHHHHHHHHHHHHHHHHHHhccCC--CCcccccchhhccccccccccccccccc Confidence 99999999999999654 2 488999999999999999999999997 4556777765433222 223344567 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccc-c Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS-P 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~-~ 235 (315) ++++.+++..+...+ .....|+||++++..|++++|++|+|+|. .+..+.+++|+|+||+++++||.+.... . T Consensus 402 ~~~~~~~~~~l~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~l~~-----~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~ 475 (543) T protein:vir:81 402 LADVYAVYEQLAARH-RRQGAWLANNLIYNKIRQFDTQGGAGLWT-----TIGNGEPSQLLGRPVGEAEAMDANWNTSAS 475 (543) T ss_pred HHHHHHHHHhhhccc-cCCcEEEEcHHHHHHHHHhhcCCCceecc-----CcCCCCCccccceeeEEecccccccccccc Confidence 889999999886554 34457999999999999999999988764 3456677899999999999999765432 3 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) .....++||||++|+|+++++++|+++++.+.+ +.|.+|++.||+++|+||++.+|+||++|+.+++. T Consensus 476 ~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 476 ADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGT----NRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred CCcceEEEeeccceeEEeecccEEEEecccccc----chhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 345578899999999999999999999986543 35889999999999999999999999999988877 No 38 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=9.8e-58 Score=333.12 Aligned_cols=289 Identities=17% Similarity=0.128 Sum_probs=245.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCC-ceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~-~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) |..+++++||++||++++++|++.+++.++|+++|++++++++. +.+|+..+. ..+.|++|++.+|+++++|++++|. T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~ 196 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLG 196 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeee Confidence 78888899999999999999999999999999999999998765 455555543 4678999999999999999999999 Q ss_pred eEEEE-EeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 79 PIKVV-TQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 79 ~~kl~-~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) ++|++ ++++||+|+|+++.. +|+++|.++++++++.++|.++++|+|.+....+.|+.+...........+...+ T Consensus 197 ~~k~~~~~i~is~ell~ds~~----~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~ 272 (409) T protein:vir:45 197 ALKMTSKIIRVSNELLQDSAI----DMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKW 272 (409) T ss_pred eeeeeeeehhhhHHHHhccHH----HHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccch Confidence 99985 678999999977644 4889999999999999999999999997666778888877666656665666778 Q ss_pred HHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) +++.+++..+...+..... .|+||+.++.+|++|+|++|+|++. ++...+++.+|+|+||+++++||... . T Consensus 273 d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~----~~~~~~~~~~l~G~PV~~~~~~p~~~----~ 344 (409) T protein:vir:45 273 QEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWL----PDIVGVAPASVLNVPYVIDQEIDDIG----A 344 (409) T ss_pred HHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeec----cCcCCCCCceecceeeEEecCcCCcc----C Confidence 9999999988765543322 3678999999999999999998753 45667788899999999999998532 2 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) ....++||||++|+++.+++++++.+.+. +|++|++.||++.|+|+++.+|+||++|+.+++... T Consensus 345 ~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~--------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 345 GKKFMFCGDFDRFIIRRVRYMILKRLVER--------YAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred CccEEEEeehhhhheeeccceEEEEeecc--------cccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 34468889999999999999999988763 588999999999999999999999999998876666 No 39 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.5e-57 Score=330.85 Aligned_cols=287 Identities=17% Similarity=0.099 Sum_probs=241.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC----ceeEEeecccccCCCc-cceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV----PRAKIVGEGEVKPSAS-VDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~----~~a~wv~Eg~~~~~s~-~~~~~v 75 (315) ++.++.+.|+++||++++++|++.+++.++|++++++++++++.+++|+.... ..++||+|++.+|+++ ++|+++ T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i 197 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIV 197 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceee Confidence 55666778999999999999999999999999999999999998999998754 4679999999999987 689999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) ++.+||++++++||+|+|+++ ..|+++|++++++++++++|.++++|+| ++.++.|+.+.....+.....+.. T Consensus 198 ~~~~~k~~~~~~iS~ell~ds-----~~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~~~~Gi~~~~~~~~~~~~~~~~ 270 (413) T protein:vir:81 198 TESLSKIAGLTKITDEMIEDY-----DFLVSYINARLLEELAIEEERQLLLGDG--TGNNLTGLLKRDGIQTLAVSNKDE 270 (413) T ss_pred EeeeeeEEEeehhhHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHhccCC--CCCcccccccccccccccccccch Confidence 999999999999999999653 2489999999999999999999999987 555677777665444444444445 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccc---cCCCccccceeeEeecccCcccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG---FAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~---~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) .++++.+++..+..+..+.+++|+||++++.+|+++||++|+|++.+++.+... ...+++|+|+||+++++||.+ T Consensus 271 ~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~-- 348 (413) T protein:vir:81 271 LADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVG-- 348 (413) T ss_pred hHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcc-- Confidence 567777777666555556677899999999999999999999998776654322 223468999999999999843 Q ss_pred ccccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 233 MSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) .+++|||++ |+++.+++++++++++.. .+|++|++.||+++|+|+++.+|+||++++.+++..| T Consensus 349 -------~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 349 -------KPVVGAFRSAASVLRKGGVRIDSTNTNV------DDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred -------cEEEEecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 378999986 778889999999998754 2699999999999999999999999999998777666 No 40 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=8e-58 Score=333.60 Aligned_cols=281 Identities=18% Similarity=0.199 Sum_probs=233.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCc-cceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSAS-VDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~-~~~~~v~l~~ 79 (315) ++.++.++||++||+++.++|++.+++.++|+++|++++++ ++.++|+..+.+.++|++|++++|+++ ++|++|++.+ T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~ 216 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDG 216 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ceeEEEEecCCccccccccccccccccccccceeeeeh Confidence 55566678999999999999999999999999999999986 678999999999999999999999887 7899999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc-ccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK-TVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 158 (315) ||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|.++. .|.|+.+....... ...+....++ T Consensus 217 ~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~-~p~Gil~~~~~~~~~~~~~~~~~~~ 291 (425) T protein:vir:95 217 FKVGKVTFVDNYLLQDSII----NLDDYVTKKIARAIAKALDLAIVKGTGAANK-QPLGIIPSLPPENQVTVEADNNLLK 291 (425) T ss_pred eeeeeeehhhHHHHhccHH----HHHHHHHHHHHHHHHHHHHHHhhccCCCCcc-ccceeecccccccccccccccchHH Confidence 9999999999999977654 4889999999999999999999999985432 34566654333322 2233455688 Q ss_pred HHHHHHHHhhhccccc-ceEEEEeHHHH----HHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQV-PNGVALDPAFS----FALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) Q Consensus 159 di~~~~~~~~~~~~~~-~~~~~m~~~~~----~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 233 (315) ++.+++..+....... ..+|+||+.++ ..|+.++|.+|+|+|..+ .+..++|+|+||++++.||.. T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~------~~~~~~l~G~pvv~~~~~~~~--- 362 (425) T protein:vir:95 292 NLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLP------NLRTPDLLGLRVVFNNFLDDD--- 362 (425) T ss_pred HHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccC------CCCCccccceeeEEcCcCCCc--- Confidence 8999888776554333 45699999884 357788999999987532 344578999999999999853 Q ss_pred cccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCC Q lcl|NC_018838. 234 SPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPA 313 (315) Q Consensus 234 ~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~ 313 (315) .++||||++|+++++++++++++++. .|.+|++.||+..|+|+++.+|+||++++..+ |+.++ T Consensus 363 ------~i~~Gd~~~~~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~---~~~g~ 425 (425) T protein:vir:95 363 ------TVLFGEFEQYTLVERENITIDSSTHV--------KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD---PVQGA 425 (425) T ss_pred ------cEEEEecccEEEEeecceEEEeeccc--------ccccCceEEEEEEeeCcEeecccceEEEEecC---cCCCC Confidence 37889999999999999999999874 59999999999999999999999999998887 44444 No 41 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.5e-57 Score=332.07 Aligned_cols=276 Identities=13% Similarity=0.084 Sum_probs=226.0 Q ss_pred CCCCccC-CCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLS-AGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s-~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) ...++.+ +|++++|+.+...|++.++..++++++|+++++.++ .+++|+.++.+.++||+|++.+|+++++|+++++. T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~ 189 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMG 189 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEee Confidence 2334444 445555555555566677888889999999999764 48999999999999999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc---cccccccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT---KTVDATDS 155 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~---~~~~~~~~ 155 (315) +||++++++||+|+|+++.. +|+++|..++++++++++|.++++|+|. |.|+.+...... .....+.. T Consensus 190 ~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~~l~G~G~-----p~Gi~~~~~~~~~~~~~~~~~~~ 260 (390) T protein:vir:62 190 GFKYGFASVVSYEFATDQVL----DLVGFLVSDAGPAIGDAMGRHFITGTGQ-----PRGILTDASPATATFLATDTDSK 260 (390) T ss_pred eeeEEeehHHHHHHHhhhhH----HHHHHHHHHHHHHHHHHHHhhhhccCCc-----cccccccccccccceeccccccc Confidence 99999999999999977644 4788999999999999999999999874 345544332222 22223445 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .++++.+++..+...+ ..+..|+||++++..|++|+|.+|+|+|. |++..+.+++|+|+||++++.+|.+ T Consensus 261 ~~~~l~~~~~~l~~~~-~~~a~~vmn~~~~~~L~~lkd~~g~~l~~----~~~~~g~~~~l~G~Pv~~~~~~p~~----- 330 (390) T protein:vir:62 261 VSDALIDLFHEVPSAY-RANAKYVVNDLRAAQMRKLKDANGQYLWQ----SGLTVGAPSLFNGKVVETDDGMPAD----- 330 (390) T ss_pred chHHHHHHHHhhhhhh-hcCCEEEEchHHHHHHHHhhccCCCeeec----CCcCCCccceecccceEEecCCCCc----- Confidence 6889999998886543 33446999999999999999999998763 4566777889999999999999853 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) .++||||++|+++.+++++++.+.+. +|++|++.||++.|+|+++++|+||++|+.+++. T Consensus 331 ----~i~~gd~s~~~i~~~~~~~v~~~~~~--------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 331 ----KILFADLSKYRVRFAGSLRVDRSVDA--------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ----cEEEeeccceeEEeecceEEEeeccc--------cccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 37789999999999999999999874 6999999999999999999999999999977766 No 42 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=6.6e-58 Score=334.07 Aligned_cols=271 Identities=14% Similarity=0.143 Sum_probs=235.1 Q ss_pred CCCCccCCCceEcchhH-HHHHHHHHHhccchhhh-cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSM-IGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~-~~~ii~~~~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) |..++.++||++||+++ .++||+.+++.++++++ ++++|+.+++++||+.+++++++||+|++.+++++++|++++|. T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~ 436 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEee Confidence 77888899999999886 68999999999999998 68889999999999999999999999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|||+++..+ ++++|+++|+.++++++|.++|+|+| ++..|.|+.+...........+...|+ T Consensus 437 ~~k~~~~v~iS~ell~ds~~~----~~~~i~~~l~~a~~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 510 (632) T protein:vir:96 437 PKTIAGAVPVTRKLRKQSSIH----VENLIREDLIEGIGVALDLAMLTGTG--LANDPVGLLNMTGVPALTYPAGGVDWA 510 (632) T ss_pred eeEEEEehhhHHHHHhccchH----HHHHHHHHHHHHHHHHHHHHhhcccC--CCCccceeeecccccceecccccCCHH Confidence 999999999999999876544 78899999999999999999999987 344577777654444334444556788 Q ss_pred HHHHHHHHhhhccccc-ceEEEEeHHHHHHHHH--HhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQV-PNGVALDPAFSFALST--EVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 159 di~~~~~~~~~~~~~~-~~~~~m~~~~~~~L~~--l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+++.++...+... +.+|+||+.+...|++ ++|.+|+|+|. +++|+|+||+++++||.+ T Consensus 511 ~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~-----------~~~l~G~pv~~s~~ip~~----- 574 (632) T protein:vir:96 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-----------NNEVNGYRAEASNQIPAD----- 574 (632) T ss_pred HHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeec-----------CCeecccceEeccccccC----- Confidence 9999998887665433 4579999998887775 67888887752 358999999999999854 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) .+++|||+++++++++++++.++++. .|.+|++.||+++|+|++++||++|+++|.++ T Consensus 575 ----~~~~gd~s~~~i~~~~~~~i~~~~~~--------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 575 ----TWIFGDWSQIVIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ----cEEEeecceEEEEEecceEEEEcccc--------ccccCceEEEEEeecCceeechhhhhheeecC Confidence 37899999999999999999999985 48899999999999999999999999999988 No 43 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=8.5e-57 Score=327.99 Aligned_cols=281 Identities=12% Similarity=0.043 Sum_probs=233.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) |+.+++++||++||++++.+|++.+++.|+|+++|++++++++. ..+|+.++++.++||+|++.++++ .++|+++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEe Confidence 88888999999999999999999999999999999999997554 567777888899999999999976 599999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|..+ .. +...| T Consensus 186 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~~----------------~~~~~ 243 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDSDQ----NILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQ----------------AIKSL 243 (392) T ss_pred eeeeEEEeehhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc----------------CccCH Confidence 999999999999999976644 478899999999999999999999987322 11 12346 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe-ecc-cCccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA-SST-VSGAPEMSP 235 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~-s~~-v~~~~~~~~ 235 (315) +++.+++.......+..+..|+|||+++.+|+++||++|+|++. ++...+.+++|+|+|+++ ++. ++.... .. T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~tllG~~~v~~~~~~~~~~~~-~~ 318 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ----SDPTQKNKKLFAGTNPVVVVSNRFLKSKG-TT 318 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee----cCccCCccccccCcccEEEecccccCCCc-cc Confidence 78888775443444445567999999999999999999998763 456677788999986665 333 333332 33 Q ss_pred cccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 236 ~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) .....+++|||++ |.++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++..++...++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4456788999987 778999999999998754 369999999999999999999999999999988888888888 No 44 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=8.5e-57 Score=327.99 Aligned_cols=281 Identities=12% Similarity=0.043 Sum_probs=233.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) |+.+++++||++||++++.+|++.+++.|+|+++|++++++++. ..+|+.++++.++||+|++.++++ .++|+++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEe Confidence 88888999999999999999999999999999999999997554 567777888899999999999976 599999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|..+ .. +...| T Consensus 186 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~~----------------~~~~~ 243 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDSDQ----NILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQ----------------AIKSL 243 (392) T ss_pred eeeeEEEeehhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc----------------CccCH Confidence 999999999999999976644 478899999999999999999999987322 11 12346 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe-ecc-cCccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA-SST-VSGAPEMSP 235 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~-s~~-v~~~~~~~~ 235 (315) +++.+++.......+..+..|+|||+++.+|+++||++|+|++. ++...+.+++|+|+|+++ ++. ++.... .. T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~tllG~~~v~~~~~~~~~~~~-~~ 318 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ----SDPTQKNKKLFAGTNPVVVVSNRFLKSKG-TT 318 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee----cCccCCccccccCcccEEEecccccCCCc-cc Confidence 78888775443444445567999999999999999999998763 456677788999986665 333 333332 33 Q ss_pred cccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 236 ~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) .....+++|||++ |.++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++..++...++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4456788999987 778999999999998754 369999999999999999999999999999988888888888 No 45 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=8.5e-57 Score=327.99 Aligned_cols=281 Identities=12% Similarity=0.043 Sum_probs=233.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) |+.+++++||++||++++.+|++.+++.|+|+++|++++++++. ..+|+.++++.++||+|++.++++ .++|+++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEe Confidence 88888999999999999999999999999999999999997554 567777888899999999999976 599999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|..+ .. +...| T Consensus 186 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~~----------------~~~~~ 243 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDSDQ----NILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQ----------------AIKSL 243 (392) T ss_pred eeeeEEEeehhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc----------------CccCH Confidence 999999999999999976644 478899999999999999999999987322 11 12346 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe-ecc-cCccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA-SST-VSGAPEMSP 235 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~-s~~-v~~~~~~~~ 235 (315) +++.+++.......+..+..|+|||+++.+|+++||++|+|++. ++...+.+++|+|+|+++ ++. ++.... .. T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~tllG~~~v~~~~~~~~~~~~-~~ 318 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ----SDPTQKNKKLFAGTNPVVVVSNRFLKSKG-TT 318 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee----cCccCCccccccCcccEEEecccccCCCc-cc Confidence 78888775443444445567999999999999999999998763 456677788999986665 333 333332 33 Q ss_pred cccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 236 ~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) .....+++|||++ |.++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++..++...++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4456788999987 778999999999998754 369999999999999999999999999999988888888888 No 46 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=8.5e-57 Score=327.99 Aligned_cols=281 Identities=12% Similarity=0.043 Sum_probs=233.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) |+.+++++||++||++++.+|++.+++.|+|+++|++++++++. ..+|+.++++.++||+|++.++++ .++|+++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEe Confidence 88888999999999999999999999999999999999997554 567777888899999999999976 599999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|..+ .. +...| T Consensus 186 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~~----------------~~~~~ 243 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDSDQ----NILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQ----------------AIKSL 243 (392) T ss_pred eeeeEEEeehhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc----------------CccCH Confidence 999999999999999976644 478899999999999999999999987322 11 12346 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe-ecc-cCccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA-SST-VSGAPEMSP 235 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~-s~~-v~~~~~~~~ 235 (315) +++.+++.......+..+..|+|||+++.+|+++||++|+|++. ++...+.+++|+|+|+++ ++. ++.... .. T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~tllG~~~v~~~~~~~~~~~~-~~ 318 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ----SDPTQKNKKLFAGTNPVVVVSNRFLKSKG-TT 318 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee----cCccCCccccccCcccEEEecccccCCCc-cc Confidence 78888775443444445567999999999999999999998763 456677788999986665 333 333332 33 Q ss_pred cccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 236 ~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) .....+++|||++ |.++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++..++...++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4456788999987 778999999999998754 369999999999999999999999999999988888888888 No 47 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=2.9e-56 Score=325.09 Aligned_cols=289 Identities=12% Similarity=0.106 Sum_probs=241.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecC--CCceEEEEEeCCceeEEeecccccCCC--ccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTI--FGPVKGAVFSGVPRAKIVGEGEVKPSA--SVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~--~~~~~ip~~~~~~~a~wv~Eg~~~~~s--~~~~~~v~ 76 (315) |..+++++||++||++++++|++.+++.++|++++++++++ ++.+.+|+.++.+.++|++|++.++.+ +++|++++ T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~ 189 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFN 189 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeE Confidence 88899999999999999999999999999999999999886 456889999999999999999999876 58999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+| ++.++.|+.......+... .+... T Consensus 190 ~~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~il~G~g--~~~~~~gi~~~~~~~~~~~-~~~~~ 262 (404) T protein:vir:10 190 FKLKDLADFMSIPNDLLKFADK----SLEDWIINWFVDKVRITRNAEILYGAG--GDEHATGIMTANKFKKITL-PKSPA 262 (404) T ss_pred eeheeeEeeehhhHHHHhhcHH----HHHHHHHHHHHHHHHHHHHHHHhhcCC--CCCcccceeeccccceeec-ccccc Confidence 9999999999999999976544 488999999999999999999999987 5566777766544433222 33445 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEee-cccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGAS-STVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s-~~v~~~~~~~~ 235 (315) ++++.+++.......+..+..|+|||+++..|++++|++|+|++. |++..+.+++|+|+||++. +.++. .. T Consensus 263 ~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~~l~G~PV~~~~~~~~~----~~ 334 (404) T protein:vir:10 263 LKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQ----PDPKDPTQYRFLGLPVIELPNDLLL----ST 334 (404) T ss_pred HHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCCccccceeeEEecccccC----CC Confidence 788888777544455555567999999999999999999998753 4566778889999999854 44442 23 Q ss_pred cccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) .++..+++|||++ +.++.+++++++++++.+ ..|++|++.||+++|+|+++.+|+||++++.+++.+|. T Consensus 335 ~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 335 ESAIPVLLGDTKEAYKYVSDGAYELATTNIGA------GAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred CCccEEEEEeccccEEEEEecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 4456788999986 778899999999998754 35999999999999999999999999999988877766 No 48 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=4e-56 Score=324.31 Aligned_cols=281 Identities=15% Similarity=0.085 Sum_probs=236.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) ...++.++||++||++++.+|++.+++.++|++++++++++++.+++|+.++ ++.+.|++|++.+|+++++|+++++.+ T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 214 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPV 214 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEee Confidence 4455667789999999999999999999999999999999988899999877 578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc-ccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA-TDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 158 (315) ||++++++||+|+|+++ .+|+++|.+++++++++++|.++++|+| ++..+.|+.+........... +...++ T Consensus 215 ~k~~~~~~is~ell~ds-----~~l~~~i~~~l~~a~~~~~d~a~l~G~g--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (418) T protein:vir:10 215 RTIAHLFKASRQILDDA-----PALQSYIDGRARYGLQLTEEGQILKGDG--TGANILGILPQASAFMPSITLANATPID 287 (418) T ss_pred eeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHhccCC--CCccccccccccccccccccccccccHH Confidence 99999999999999643 2488999999999999999999999987 445577777654443333322 234577 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+ ..+++|+|||.++..|++++|++|+|++ ++...+++++|+|+||+++++||.+ T Consensus 288 ~i~~~~~~~~~~~-~~~~~~v~n~~~~~~L~~lkd~~G~~i~-----~~~~~~~~~~l~G~pV~~~~~~p~~-------- 353 (418) T protein:vir:10 288 KIRLALLQAVLAE-FPATGIVLNPIDWASIELTKDSQGRYIV-----GNPVNGTTPRLWNLPVVETQAMTAN-------- 353 (418) T ss_pred HHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceec-----cccccCCCceecceeeEEcCCCCCC-------- Confidence 8888888776544 4456799999999999999999988765 4555677889999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) .+++|||++ |+++++++++++++++.. .+|++|++.||+++|+||++++|+||++++.+++.. | T Consensus 354 -~~~~gd~s~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~-----g 418 (418) T protein:vir:10 354 -EFLVGAFSMAAQIFDRMEIEVLLSTENV------DDFEKNMVSIRAEERLALAVYRPESFVTGALVEQAG-----G 418 (418) T ss_pred -cEEEeeccceEEEEEecceEEEEecccc------hhhhcCceEEEEEEeeccEEecccceEEEEeccCCC-----C Confidence 378899997 778889999999988753 369999999999999999999999999998765332 2 No 49 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=2.1e-56 Score=325.82 Aligned_cols=278 Identities=16% Similarity=0.134 Sum_probs=229.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHH-HhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRA-IDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~-~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) ...++.+++|.++|+++..++|..+ +..++++.+++++++..+ .+.+|+.++.+.++||+|++++|+++++|+++++. T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~ 189 (392) T protein:vir:13 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMG 189 (392) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEee Confidence 4445555666666777777776665 556678899999988654 58999999999999999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc---cccccccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT---KTVDATDS 155 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~---~~~~~~~~ 155 (315) +||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|. + .|.|+.+...... .....+.. T Consensus 190 ~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~~d~~~l~G~Gt--~-~p~Gil~~~~~~~~~~~~~~~~~~ 262 (392) T protein:vir:13 190 GFKYGFASVVSYEFATDQVLD----LVGFLVSDAGPAIGDAMGRHFLTGTGT--G-QPRGILTDATGANAAFGEADADSK 262 (392) T ss_pred eeeEEeeehhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhcccCC--c-cccccccccccccccccccccccc Confidence 999999999999999876544 788999999999999999999999873 3 4566765543222 22334456 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .|+++.+++..+.... ..++.|+||++++..|++++|++|+|++. |+++.+.+++|+|+||+++++||.+ T Consensus 263 ~~d~l~~~~~~l~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~l~~----~~~~~g~~~~l~G~Pv~~~~~~~~~----- 332 (392) T protein:vir:13 263 VSDALIDLFHEVPSAY-RKNAKFVVNDLRAAQMRKLKDANGQYLWQ----SALTVGAPDTFNGKVVETDDGMPAD----- 332 (392) T ss_pred cHHHHHHHHHhhhhhh-hcCCEEEEcHHHHHHHHHhhccCCceeec----CCcCCCCCceecceeeEEcCCCCCC----- Confidence 6889999988886543 34567999999999999999999998764 4567778889999999999999853 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) .+++|||++|+++.+++++++.+.+. +|++|++.||++.|+|+++.||+||+.++.+++. T Consensus 333 ----~i~~Gdf~~~~i~~~~~~~i~~~~~~--------~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 333 ----KVLFADLSKYRVRFAGSLRVDRSVDA--------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred ----cEEEeeccceeEEeecceEEEeeccc--------cccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 37889999999999999999988763 5999999999999999999999999999887666 No 50 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=5.8e-56 Score=323.40 Aligned_cols=274 Identities=16% Similarity=0.078 Sum_probs=228.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeCCceeEEeecccccCC-CccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSGVPRAKIVGEGEVKPS-ASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~-s~~~~~~v~l 77 (315) |+.++.+.||++||++++.+|++.+++.++|+++++++||+++. ..+++..+.+.++||+|++.+|+ ++++|+++++ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~ 170 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQY 170 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEe Confidence 99999999999999999999999999999999999999998655 45566667789999999999986 5799999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) ++||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|... +.+ ...+ T Consensus 171 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~---~~~---------------~~~~ 228 (371) T protein:vir:81 171 QVKKYAGFFRVTNELLNDSTE----AIVNTLVRWIGDESRVTRNGLIINVLNTKA---KTA---------------IADL 228 (371) T ss_pred eeeEEEEeehhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccc---------------cccH Confidence 999999999999999976644 478899999999999999999999987322 111 1235 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc---cc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE---MS 234 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~---~~ 234 (315) +++..++.......+..+..|+|||+++..|++++|++|+|++. |++..+.+++|+|+||++++++|.... .. T Consensus 229 ~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~ 304 (371) T protein:vir:81 229 DGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQ----PSISSPTGRQLLGLPVVIVSNKVLANRVDGGT 304 (371) T ss_pred HHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeee----cccCCCCCceecceeEEEecccccCccccccc Confidence 66776665433334445568999999999999999999998764 355677789999999999999984321 22 Q ss_pred ccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 235 PASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 235 ~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ......+++|||++ +.++.+++++++++++.+ ++|++|++.||++.|+||++.+|+||++++.++| T Consensus 305 ~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 305 GAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM------DAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cCCcceEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 23455689999986 778899999999998754 3699999999999999999999999999999998 No 51 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=9.7e-56 Score=322.18 Aligned_cols=281 Identities=12% Similarity=0.046 Sum_probs=234.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC--CceEEEEEeC-CceeEEeecccccCC-CccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF--GPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~--~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~ 76 (315) |+.++.++||++||++++.+|++.+++.++|+++|+++++++ +.+.+|+... .+.++||+|++.+|+ ++++|++++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~ 188 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIK 188 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEE Confidence 888999999999999999999999999999999999999875 4466666554 468999999999996 579999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++.. .|+++|.+++++++++++|.++++|+|..+ ... +... T Consensus 189 ~~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~--~~~---------------~~~~ 247 (397) T protein:vir:49 189 YTIKRYAGISTVTNSLLADSAE----NILAWLSGWIAKKVVVTRNKAILEAIAALP--TKP---------------TLTK 247 (397) T ss_pred eeeeeEEeeehhHHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHHhhccccc--ccc---------------cccc Confidence 9999999999999999977654 478899999999999999999999987322 211 1234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) |+++.+++..+.+++ ..+++|+||++++.+|++++|++|+|++. |++..+.+++|+|+||++++...- ..+.. T Consensus 248 ~d~i~~~~~~l~~~~-~~~a~~vmn~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~PV~~~~~~~~--~~~~~ 320 (397) T protein:vir:49 248 WDDIIDLEAKVDPAI-KQTSFFLTNTSGFTALKKVKNALGDYLME----RDVKSPTGYSIDGFAVKEVADRWL--ANGTG 320 (397) T ss_pred HHHHHHHHHhhhhhh-cCCCEEEEcHHHHHHHHHhhcCCCceeec----cCcCCCCCceecceeeEEeccccc--ccccC Confidence 788999999887654 44578999999999999999999998753 356677788999999998654221 11223 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ....+++|||++ |.++.+++++++++++.. ++|++|++.||++.|+|+++.+|+||++++.+++.++++..+- T Consensus 321 ~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGS 394 (397) T ss_pred CceeEEEeeccceEEEEeecceEEEEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCccc Confidence 445688999996 788999999999998754 3699999999999999999999999999999888877777777 No 52 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.6e-55 Score=321.02 Aligned_cols=278 Identities=14% Similarity=0.065 Sum_probs=235.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC-ceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +..++...+|++||++++.+|++.+++.++|+++|++++++++.+++|+.++. +.++||+|++.+|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~ 192 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPV 192 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEee Confidence 44556677889999999999999999999999999999999888999998774 68999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc---ccccch Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV---DATDSA 156 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 156 (315) ||++++++||+|+|+++ .+|+++|.+++++++++++|.++++|+| ++.++.|+.+......... ..+... T Consensus 193 ~k~~~~~~is~ell~d~-----~~l~~~v~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 265 (395) T protein:vir:43 193 RTIAHLFKASRQILDDA-----SALQSYIDARARYGLMLVEECQLLYGNG--TGANLHGIIPQAQAYAPPSGVVVTAEQR 265 (395) T ss_pred eeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccccccccccccccccccchh Confidence 99999999999999543 2488999999999999999999999987 5556777776544333222 222334 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++..+...+ ..+++|+|||+++..|++++|++|+|++ ++...+.+++|+|+||++++.||.+ T Consensus 266 ~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~-----~~~~~~~~~~l~G~pVv~~~~~~~~------ 333 (395) T protein:vir:43 266 IDRIRLAILQAQLAE-FPASGIVLNPIDWALIELNKDAENRYII-----GSPQNGTTPTLWRLPVVETQAITQD------ 333 (395) T ss_pred HHHHHHHHHhhcccc-CCCcEEEEcHHHHHHHHHhhccCCceec-----cccccCCCceecceeeEEcCCCCCC------ Confidence 678888888886554 3456899999999999999999988765 3455667789999999999999853 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) .+++|||++ +.+.++++++++++++.. ++|++|++.||+++|+||++++|+||++++.+++ T Consensus 334 ---~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 334 ---EFLTGAFSLGAQIFDRMDIEVLVSTEND------KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred ---cEEEEeccceEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 378899987 668889999999998753 3699999999999999999999999999999888 No 53 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=1.1e-55 Score=321.81 Aligned_cols=283 Identities=11% Similarity=-0.012 Sum_probs=236.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCC------ccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSA------SVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s------~~~~~~ 74 (315) ...++.+.||++||++++++|++.+++.++++++|+++|++++...+|+.++++.++||+|++.++++ +++|++ T Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~ 241 (458) T protein:vir:10 162 NQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKE 241 (458) T ss_pred hhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeeccccccccccccccccccccee Confidence 23345567999999999999999999999999999999999999999999999999999999988864 578999 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccc-------c Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKT-------T 147 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~-------~ 147 (315) +++.+||++++++||+|+|+++.. +|.++|.++|++++++++|.++++|+|. + .|.|+.+..... . T Consensus 242 i~~~~~k~~~~v~is~ell~ds~~----~~~~~i~~~l~~~i~~~~d~~~l~G~G~--~-~p~Gi~~~~~~~~~~~~~~~ 314 (458) T protein:vir:10 242 IHFSTYKLAAKSFITDETEEDAIF----SLLPLLRKRLIEAHAVSIEEAFMTGDGS--G-KPKGLLTLASEDSAKVVTEA 314 (458) T ss_pred eEeeeeeEEeeehhhHHHHhcchH----HHHHHHHHHHHHHHHHHHHHHhhcCCCC--C-ccceeeecccccccceeecc Confidence 999999999999999999977654 4789999999999999999999999873 2 455665543221 1 Q ss_pred cccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeeccc Q lcl|NC_018838. 148 KTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTV 227 (315) Q Consensus 148 ~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v 227 (315) .........|+++.+++..+..++. .++.|+||+.++.+|++++|++|+|++.....+....+.+++|+|+||+++++| T Consensus 315 ~~~~~~~~~~~~i~~~~~~l~~~~~-~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 393 (458) T protein:vir:10 315 KADGSVLVTAKTISKLRRKLGRHGL-KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYF 393 (458) T ss_pred cccccccccHHHHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccc Confidence 2222334568899999998876543 456899999999999999999999998766555566777889999999999999 Q ss_pred CccccccccccceEEEeccc-ceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 228 SGAPEMSPASGVKAIVGDFS-RVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 228 ~~~~~~~~~~~~~~~~gDf~-~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) |... +...+++|||+ .|.++++.+++++++++ +.+|++.||++.|+|+.+.+|+||++.+.+++ T Consensus 394 p~~~-----~~~~~~~~~f~~~~~~~~~~~~~v~~d~~----------~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 394 PAKA-----NSAEFAVIVYKDNFVMPRQRAVTVERERQ----------AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cccc-----CCcceEEEEecccEEEEEeeceEEEeecc----------cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 9642 22346789995 58899999999987654 56899999999999999999999999999888 No 54 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=2.1e-55 Score=320.39 Aligned_cols=281 Identities=12% Similarity=0.022 Sum_probs=231.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC--ceEEEEEe-CCceeEEeecccccCC-CccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG--PVKGAVFS-GVPRAKIVGEGEVKPS-ASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~--~~~ip~~~-~~~~a~wv~Eg~~~~~-s~~~~~~v~ 76 (315) |+.+++++||++||++++++|++.++++++|+++|++++++.+ ...+|+.. .++.++||+|++++|+ ++++|++++ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~i~ 84 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSLIK 84 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccceeEEE Confidence 9999999999999999999999999999999999999998754 46677765 4578999999999997 579999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||+++++++|+|+|+++.. .|+++|.+++++++++++|.++++|++..+. ..+... T Consensus 85 l~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~-----------------~~~~~~ 143 (293) T protein:vir:48 85 YTIKRYAGISTVTNSLLADSAE----NILAWLSGWIAKKVVVTRNKAILGVVDKLPT-----------------KPTLTK 143 (293) T ss_pred EeeeEEEEeehhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHhHHhhccccccc-----------------cccccC Confidence 9999999999999999977654 4888999999999999999999998763211 112345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) |+++.+++.++..++ ..++.|+||++++..|++++|++|+|++. +++..+.+++|+|+||++++..+-. .... T Consensus 144 ~d~i~~~~~~l~~~~-~~~a~~vmn~~~~~~L~~lkd~~g~~l~~----~~~~~~~~~~l~G~Pv~~~~~~~~~--~~~~ 216 (293) T protein:vir:48 144 WDDIIDLEAKVDPAI-KQTSFFLTNTSGFTALKKVKNALGDYLME----RDVKSPTGYSIAGFAVKEISDRWLP--NASS 216 (293) T ss_pred HHHHHHHHHhhhhhh-cCCCEEEEcHHHHHHHHHhhccCCceEee----cCcCCCCCceecceeeEEecccccC--CccC Confidence 889999999987554 34567999999999999999999998764 3556677889999999986554321 1123 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ....+++|||++ |+++.+++++++++++.. ++|++|++.+|+++|+|+++.+|+||++++.+++..|..-.+- T Consensus 217 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~ 290 (293) T protein:vir:48 217 GVMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGS 290 (293) T ss_pred CceEEEEEeccceEEEEEecceEEEEecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccc Confidence 455688999986 678899999999998753 4699999999999999999999999999987664433322222 No 55 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.3e-55 Score=320.09 Aligned_cols=280 Identities=10% Similarity=0.035 Sum_probs=227.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEE--EEe-CCceeEEeecccccCCC-ccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGA--VFS-GVPRAKIVGEGEVKPSA-SVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip--~~~-~~~~a~wv~Eg~~~~~s-~~~~~~v~ 76 (315) |..++.++||++||++++++|++.+++.++|+++|++++++++..++| +.. ..+.+.|++|++.+|++ .++|++|+ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~ 195 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEE Confidence 888999999999999999999999999999999999999987665555 443 34678999999999975 58999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|..+ +. .+... T Consensus 196 ~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~~~~il~g~g~~~--~~---------------~~~~~ 254 (408) T protein:vir:10 196 YLIKRYAGIITATNTSLKDTAEN----ILAWLSSWIAKKVVVTRNQAIIEVMKAAP--KK---------------PTIAK 254 (408) T ss_pred eeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhcccccc--cc---------------ccccc Confidence 99999999999999999876544 78889999999999999999999987321 11 11224 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc--cCcccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST--VSGAPEMS 234 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~--v~~~~~~~ 234 (315) ++++.+++.......+..+..|+||++++.+|++++|++|+|+|. ++...+.+++|+|+||+++++ +|.. T Consensus 255 ~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~----~~~~~~~~~~l~G~PV~~~~~~~~~~~---- 326 (408) T protein:vir:10 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLE----PDPTKPNSYLIKGKQVIVVADRWLPNT---- 326 (408) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEec----cCcCCCCCceecceeeEEecccccCcc---- Confidence 678877765433334444557999999999999999999998864 345667788999999999664 4432 Q ss_pred ccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCC Q lcl|NC_018838. 235 PASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPA 313 (315) Q Consensus 235 ~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~ 313 (315) .+....+++|||++ |.++++++++++++++.+ ..|++|++.||+++|+|+++.+|+||++++.+++.+++|=. T Consensus 327 ~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 400 (408) T protein:vir:10 327 GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) T ss_pred CCCceEEEEEehhccEEEEEecceEEEEccccc------chhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCCCC Confidence 23456789999996 779999999999998764 35999999999999999999999999999987754333322 Q ss_pred CC Q lcl|NC_018838. 314 GN 315 (315) Q Consensus 314 ~~ 315 (315) +- T Consensus 401 ~~ 402 (408) T protein:vir:10 401 KT 402 (408) T ss_pred CC Confidence 22 No 56 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=5.6e-55 Score=318.03 Aligned_cols=275 Identities=16% Similarity=0.082 Sum_probs=234.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC-ceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +..++.+.+|.++|+++..+|++.+++.++|+++|++++++++.+++|+.++. +.++|++|++.+|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEee Confidence 56666778888889999999999999999999999999999999999999875 68999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc-cccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT-VDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 158 (315) ||++++++||+|+|+++ .+++++|.++|++++++++|.++++|+| ++..+.|+.+........ .......++ T Consensus 193 ~k~~~~~~is~ell~d~-----~~~~~~i~~~l~~~~~~~~d~a~l~G~g--~~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 265 (390) T protein:vir:81 193 HVIAHTMKATRQILSDA-----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGLIPQATTYAAPTTIAGATRVD 265 (390) T ss_pred eEEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCcccceeecccccccccccccchhHH Confidence 99999999999999643 2489999999999999999999999987 556677877654433322 233345577 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+ ..+++|+|||+++..|++++|++|+|+|. +...+..++|+|+||++++.||.+ T Consensus 266 ~~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~-----~~~~~~~~~l~G~pv~~~~~~p~~-------- 331 (390) T protein:vir:81 266 QLRLAMLQASLAE-YNPSGIVINPIDWAAIELAKDANNQYLIG-----NARGTLTPTLWGLPVVATQAMAPG-------- 331 (390) T ss_pred HHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceeec-----CcccccCceecceeeEEcCCCCCC-------- Confidence 8888888886554 45668999999999999999999988764 344556679999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) .+++|||++ |.+..+++++++.+++.. +|++|++.||+++|+||++++|+||++++.+ T Consensus 332 -~~~~gd~~~~~~~~~~~~~~v~~~~~~~-------~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 332 -EFLVGAFDLAAQIFDQWDARVEIGYVGE-------DFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred -cEEEEehhceEEEEEecceEEEEecccc-------hhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 378999986 678889999999887642 6999999999999999999999999999999 No 57 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=4.9e-55 Score=318.35 Aligned_cols=275 Identities=16% Similarity=0.070 Sum_probs=234.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC-ceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +..++..+||++||++++..|++.+++.++|++++++++++++..++|+.++. +.+.||+||+.+|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEee Confidence 66777788899999999999999999999999999999999999999999764 68999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc-cccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT-VDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 158 (315) ||+++++++|+|+|+++ .+|+++|.+++++++++++|.++++|+| ++..+.|+.+........ ...+...++ T Consensus 193 ~k~~~~~~is~ell~ds-----~~l~~~i~~~la~a~~~~~d~a~l~G~g--~~~~p~Gi~~~~~~~~~~~~~~~~~~~d 265 (390) T protein:vir:97 193 HVIAHTMKATRQILSDA-----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGLIPQATTYAAPTTIAGATRVD 265 (390) T ss_pred eeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCccccceeeccccccccccccccchHH Confidence 99999999999999654 2488999999999999999999999987 455677877654433322 223344567 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+ ..+++|+|||+++..|++++|++|+|++. +...++.++|+|+||++++.||.+ T Consensus 266 ~~~~~~~~~~~~~-~~~~~~v~n~~~~~~L~~lkd~~G~~l~~-----~~~~~~~~~l~G~pV~~~~~~~~~-------- 331 (390) T protein:vir:97 266 QLRLAMLQASLAE-YPASGIVINPIDWAAIELAKDANNQYLIG-----NARGTLTPTLWGLPVVATQAMAPG-------- 331 (390) T ss_pred HHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceeec-----CccCCCCceecceeeEEcCCCCCC-------- Confidence 7888887776544 34668999999999999999999988764 334556679999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) .+++|||++ |.++.+++++++++++. .+|++|+++||+++|+||++.+|+||++++.+ T Consensus 332 -~~~~gd~~~~~~~~~~~~~~i~~~~~~-------~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 332 -EFLVGAFDLAAQIFDQWDARVEIGYVN-------DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred -cEEEEeccceEEEEEecceEEEEeecc-------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 378999986 77889999999998653 25999999999999999999999999999999 No 58 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=7.2e-55 Score=317.41 Aligned_cols=278 Identities=14% Similarity=0.075 Sum_probs=234.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..++++ +|.+||++++..|++.+++.++|+++|++++++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.+ T Consensus 105 ~~~~~~~-~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:18 105 LGSDADS-AGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hcccccc-CCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 5555544 45567778999999999999999999999999988899999876 578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc-cccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD-ATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 158 (315) ||++++++||+|+|+++ .+|+++|.+++++++++++|.++++|+| ++.++.|+.+.......... .+...++ T Consensus 184 ~k~~~~~~is~ell~d~-----~~l~~~i~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:18 184 KTIAHWVQASRQVMDDA-----PMLQSYINNRLMYGLALKEEGQLLNGDG--TGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred eeEEEeehhhHHHHhhH-----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCcccccccccccccccccccccchHH Confidence 99999999999998643 3488999999999999999999999987 45667777765443332222 2344577 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+... +..++.|+|||+++.+|++++|++|+|++ +++..+.+++|+|+||+++++||.+ T Consensus 257 ~i~~~~~~l~~~-~~~~~~~~~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~l~G~pV~~~~~~p~~-------- 322 (385) T protein:vir:18 257 IIAHAIYQVTES-EFSASGIVLNPRDWHNIALLKDNEGRYIF-----GGPQAFTSNIMWGLPVVPTKAQAAG-------- 322 (385) T ss_pred HHHHHHHhhccc-cCCCCEEEEcHHHHHHHHHhhcCCCceec-----cCcccCCCceecceeeEEcCcCCCC-------- Confidence 888888888654 44567899999999999999999998765 4456777889999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) .+++|||++ |+++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++.+++. T Consensus 323 -~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 323 -TFTVGGFDMASQVWDRMDATVEVSREDR------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -cEEEeecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 378999986 788899999999887653 35999999999999999999999999999998887 No 59 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=7.2e-55 Score=317.41 Aligned_cols=278 Identities=14% Similarity=0.075 Sum_probs=234.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..++++ +|.+||++++..|++.+++.++|+++|++++++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.+ T Consensus 105 ~~~~~~~-~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:19 105 LGSDADS-AGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hcccccc-CCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 5555544 45567778999999999999999999999999988899999876 578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc-cccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD-ATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 158 (315) ||++++++||+|+|+++ .+|+++|.+++++++++++|.++++|+| ++.++.|+.+.......... .+...++ T Consensus 184 ~k~~~~~~is~ell~d~-----~~l~~~i~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:19 184 KTIAHWVQASRQVMDDA-----PMLQSYINNRLMYGLALKEEGQLLNGDG--TGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred eeEEEeehhhHHHHhhH-----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCcccccccccccccccccccccchHH Confidence 99999999999998643 3488999999999999999999999987 45667777765443332222 2344577 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+... +..++.|+|||+++.+|++++|++|+|++ +++..+.+++|+|+||+++++||.+ T Consensus 257 ~i~~~~~~l~~~-~~~~~~~~~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~l~G~pV~~~~~~p~~-------- 322 (385) T protein:vir:19 257 IIAHAIYQVTES-EFSASGIVLNPRDWHNIALLKDNEGRYIF-----GGPQAFTSNIMWGLPVVPTKAQAAG-------- 322 (385) T ss_pred HHHHHHHhhccc-cCCCCEEEEcHHHHHHHHHhhcCCCceec-----cCcccCCCceecceeeEEcCcCCCC-------- Confidence 888888888654 44567899999999999999999998765 4456777889999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) .+++|||++ |+++.+++++++++++.. ++|++|++.||+++|+||++.+|+||++++.+++. T Consensus 323 -~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 323 -TFTVGGFDMASQVWDRMDATVEVSREDR------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -cEEEeecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 378999986 788899999999887653 35999999999999999999999999999998887 No 60 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=7.1e-55 Score=317.46 Aligned_cols=281 Identities=13% Similarity=0.035 Sum_probs=230.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeC-CceeEEeecccccCCCc-cceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSG-VPRAKIVGEGEVKPSAS-VDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~-~~~a~wv~Eg~~~~~s~-~~~~~v~ 76 (315) |+.++.++||++||+++..+|++.+++.++|+++|++++++++. +.+|+... .+.++||+|++.+|+++ ++|++|+ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~ 188 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIR 188 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeE Confidence 99999999999999999999999999999999999999998655 45565544 46799999999999875 8999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|..+ +. .+... T Consensus 189 ~~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~~~~~~d~ail~G~g~~~--~~---------------~~~~~ 247 (397) T protein:vir:49 189 YAIKRYAGISTVTNSLLADSAE----NILAWLSGWIAKKVVVTRNKAILEAIGTLP--NK---------------PTLAK 247 (397) T ss_pred eeeeeeEeehhhHHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHHHhcccccc--cc---------------ccccC Confidence 9999999999999999976654 478899999999999999999999987422 11 12235 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) |+++.+++..+..++ ..++.|+|||+++..|++++|++|+|++. |++..+.+++|+|+||++++..+-. ...+ T Consensus 248 ~d~i~~~~~~l~~~~-~~~a~~v~n~~~~~~l~~lkd~~g~~l~~----~~~~~g~~~~l~G~pV~~~~~~~~~--~~~~ 320 (397) T protein:vir:49 248 WDDIIDLQAKVDPAI-KQTSLFLTNTSGFTALKKVKNAMGDYLME----RDVKSPTGYSIDGFVVKEISDRFLP--NGTG 320 (397) T ss_pred HHHHHHHHHhhhhhh-cCCCEEEEcHHHHHHHHHhhccCCceeec----ccccCCCCceecceeeEEecccccc--cccC Confidence 789999998887654 45678999999999999999999998753 3566777889999999986643311 2223 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc---CCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA---APKPNPP 312 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~---a~~~~~~ 312 (315) .+..++||||++ |+++++++++++++++.. ++|++|++.||+++|+||++.+|+||++++.++ +++.+.. T Consensus 321 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKLST 394 (397) T ss_pred CceeEEEeeccceEEEEeecccEEEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEecccccccCcccc Confidence 456789999986 789999999999998754 469999999999999999999999999998543 3334444 Q ss_pred CCC Q lcl|NC_018838. 313 AGN 315 (315) Q Consensus 313 ~~~ 315 (315) .|- T Consensus 395 ~~~ 397 (397) T protein:vir:49 395 AGA 397 (397) T ss_pred cCC Confidence 444 No 61 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=1.2e-54 Score=316.29 Aligned_cols=282 Identities=11% Similarity=0.039 Sum_probs=228.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEE--Ee-CCceeEEeecccccCC-CccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAV--FS-GVPRAKIVGEGEVKPS-ASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~--~~-~~~~a~wv~Eg~~~~~-s~~~~~~v~ 76 (315) |..++.++||++||++++.+|++.+++.++|+++|+++|++++..++|+ .. ..+.++||+|++.+|+ ++++|++++ T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~ 195 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIK 195 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEE Confidence 8888999999999999999999999999999999999999877666554 33 3467899999999997 579999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++..+ |+++|.++|++++++++|.++++|+|... +. .+... T Consensus 196 ~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d~~il~g~g~~~--~~---------------~~~~~ 254 (404) T protein:vir:39 196 YLIKRYAGIITATNTLLKDTAEN----ILAWLSSWIAKKVVVTRNQAIIAAMGTVP--KK---------------PTIAK 254 (404) T ss_pred eeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHHhcccccc--cc---------------ccccc Confidence 99999999999999999776544 78899999999999999999999987321 11 11223 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++.......+..+..|+|||+++..|++++|++|+|++. +++..+.+++|+|+||+++++..- ..... T Consensus 255 ~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~--~~~~~ 328 (404) T protein:vir:39 255 FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLE----PDPTKPNSYLIKGKKVIVVADRWL--PNSGS 328 (404) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCcceecceeEEEeccccc--CccCC Confidence 677777776444444445567999999999999999999998753 355667778999999999775421 12233 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC--CCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA--PKPNPPA 313 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a--~~~~~~~ 313 (315) ....+++|||++ +.++++++++++++++.. ++|++|++.+|++.|+|+.+.+|+||++++.+++ +.-+-|+ T Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~ 402 (404) T protein:vir:39 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTA 402 (404) T ss_pred CccEEEEEeccccEEEEeecceEEEEeccch------hhhhhceeeEEEEeeeccEEecccceEEEEeeccccCCCCCCC Confidence 455689999986 778999999999999754 3699999999999999999999999999986653 2333455 Q ss_pred CC Q lcl|NC_018838. 314 GN 315 (315) Q Consensus 314 ~~ 315 (315) |- T Consensus 403 ~~ 404 (404) T protein:vir:39 403 GK 404 (404) T ss_pred CC Confidence 55 No 62 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.1e-54 Score=316.37 Aligned_cols=281 Identities=12% Similarity=0.021 Sum_probs=231.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEE---eCCceeEEeecccccCCC-ccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVF---SGVPRAKIVGEGEVKPSA-SVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~---~~~~~a~wv~Eg~~~~~s-~~~~~~v~ 76 (315) |+.+++++||++||++++.+|++.+++.++|+++|++++++++...+|+. +..+.++|++|++.++++ +++|++|+ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~ 188 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIR 188 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEE Confidence 88888899999999999999999999999999999999998776665543 344679999999999987 59999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.++|++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|..+ .. .+... T Consensus 189 ~~~~k~~~~~~iS~ell~ds~~~----l~~~v~~~l~~~~~~~~d~~il~G~g~~~--~~---------------~~~~~ 247 (397) T protein:vir:48 189 YAIKRYAGISTVTNSLLADSAEN----ILAWLSGWIAKKVVVTRNKAILEAIATLP--TK---------------PTLTK 247 (397) T ss_pred eeheeeeeehhhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhcccccc--cc---------------ccccc Confidence 99999999999999999876544 78899999999999999999999987322 11 12234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++..+...+ ..++.|+||++++..|++++|++|+|++. +++..+.+++|+|+||++++..+- ..... T Consensus 248 ~d~i~~~~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~i~~----~~~~~~~~~~l~G~PV~~~~~~~~--~~~~~ 320 (397) T protein:vir:48 248 WDDIIDLQAKVDPAI-KQTSFFLTNTSGFTALKKVKNAFGDYLME----RDVKSPTGYSIDGFAVKEVADRWL--ANASS 320 (397) T ss_pred HHHHHHHHHHhhhhh-cCCCEEEECHHHHHHHHHhhcCCCceeec----cCcCCCCCceeccceeEEeccccc--CCcCC Confidence 788999988887654 45678999999999999999999998753 456677889999999998654321 12233 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ....+++|||++ +.++.+++++++++++.+ ++|++|++.||+.+|+|+++.+|+||++++.+++.++.+--+. T Consensus 321 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:48 321 GAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGS 394 (397) T ss_pred CceEEEEEeccceEEEEeecceEEEEeccch------hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCccc Confidence 456788999996 678999999999998754 3699999999999999999999999999988776555444444 No 63 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=7.4e-55 Score=317.35 Aligned_cols=285 Identities=14% Similarity=0.064 Sum_probs=227.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEe---ecccccCCCccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIV---GEGEVKPSASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv---~Eg~~~~~s~~~~~~v~l 77 (315) ++. ++++||++||++++++|++.++++++|+++|++++++ +++++|+...++.+.|+ +|++.+|+++++|+++++ T Consensus 143 ~~~-~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~ 220 (434) T protein:vir:62 143 LGL-VTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIEL 220 (434) T ss_pred hcc-cccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-CceEEEEEecCCcccceecccccccccccccceeeEEe Confidence 332 2356899999999999999999999999999999887 56999999888888775 567889999999999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+| ++.++.++.+...... .......+ T Consensus 221 ~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~~~~~d~~~l~G~G--~~~~~~g~~~~~~~~~--~~~~~~~~ 292 (434) T protein:vir:62 221 SPTEFDALATVTKKLLARTGLP----IEQIVMDELKKAYVRKETQYMVNGDE--ANNINDGALAKKAVEF--KTDEKNLY 292 (434) T ss_pred eheeeEeehhhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhccCC--CCccccceeecccccc--cccccchh Confidence 9999999999999999776544 78999999999999999999999998 4445555554322222 22334568 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++++++..+...+ ..+..|+||++++.+|++++|++|+|+|.+.. ....+.+.+|+|+||++++.||... .++ T Consensus 293 d~l~~l~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~--~~~~g~~~tl~G~pV~~~~~~~~~~---~~~ 366 (434) T protein:vir:62 293 DALVKMKNTPVKEV-RKKARWVLNTAALTKIETMKTDDGFPLLRPFN--QAEGGIGYTLLGFPVEEEDAIDIPD---SPD 366 (434) T ss_pred hHHHHHHhhcchhh-hcCCEEEEcHHHHHHHHHhhccCCCEeeccCC--CccCCCCceecceeeEEecCccCcc---CCC Confidence 89999999886553 33457999999999999999999999875421 3455677899999999999998432 223 Q ss_pred cceEEEecccceEEEeec-cceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeec-ccceEEEee--ccCCCC Q lcl|NC_018838. 238 GVKAIVGDFSRVHWGFQR-NFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIES-LDSFAVVKE--KAAPKP 309 (315) Q Consensus 238 ~~~~~~gDf~~~~i~~~~-~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~-~~af~~l~~--~~a~~~ 309 (315) ...++||||++|+++++. .++++++.+. +|.+|+|.||++.|+|+++++ |+++++++. +++... T Consensus 367 ~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~--------~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 367 TPVFYFGDFSKFYIQDVIGSLEVQKLVEL--------FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ceEEEEeeccceEEEEeeceeEEEeehhh--------hcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 356778999999999876 5778888763 589999999999999999775 998887743 333333 No 64 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.2e-54 Score=314.81 Aligned_cols=286 Identities=13% Similarity=0.020 Sum_probs=230.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEE--eCCceeEEeecccccCC-CccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVF--SGVPRAKIVGEGEVKPS-ASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~--~~~~~a~wv~Eg~~~~~-s~~~~~~v~l 77 (315) .+..+.++|+++||++++++|++.+++.++|+++|++++++++..++|+. ++...++|++|++.+|+ +.++|+++++ T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:46 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEe Confidence 23334567889999999999999999999999999999999887777764 56678999999999997 4689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. .|+++|.+++++++++++|.++++|+|.+ .+..+..............+...+ T Consensus 201 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~il~g~g~g--~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:46 201 DINTHRGYFRISREAIEDAKV----NVLQELKLWMARTIAATRNKAIIDVITKG--STGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeeeeEeeehhhHHHHhhchH----HHHHHHHHHHHHHHHHHHHHHHhhccccC--Cccccccccccccceeccccccch Confidence 999999999999999977654 47889999999999999999999998743 333333332222233333445668 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...++ .+++|+||++++.+|++++|++|+|++. |++..+.+++|+|+||++++++|.. ..+ T Consensus 275 ~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~i~~----~~~~~~~~~~l~G~pV~~~~~~~~~----~~~ 345 (415) T protein:vir:46 275 DDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLG----QKG 345 (415) T ss_pred HHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCeeec----cCcCCCCCccccceeeEEecccccc----CCC Confidence 899999998876554 4678999999999999999999998764 4566777889999999999998742 233 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...+++|||++ |+++.+++++++.++ |.++++.+|+.+|+|+++.+|+||++++..+ +..|.|+ T Consensus 346 ~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~---~~~~~~~ 410 (415) T protein:vir:46 346 NNTLIIGNLKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEeec-----------cccCceEEEEEEEeccEEeccccEEEEEeec---cCCCCCC Confidence 45689999997 678889999998875 4566788999999999999999999998877 3344455 No 65 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.2e-54 Score=314.81 Aligned_cols=286 Identities=13% Similarity=0.020 Sum_probs=230.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEE--eCCceeEEeecccccCC-CccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVF--SGVPRAKIVGEGEVKPS-ASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~--~~~~~a~wv~Eg~~~~~-s~~~~~~v~l 77 (315) .+..+.++|+++||++++++|++.+++.++|+++|++++++++..++|+. ++...++|++|++.+|+ +.++|+++++ T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:47 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEe Confidence 23334567889999999999999999999999999999999887777764 56678999999999997 4689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. .|+++|.+++++++++++|.++++|+|.+ .+..+..............+...+ T Consensus 201 ~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~l~~~i~~~~d~~il~g~g~g--~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:47 201 DINTHRGYFRISREAIEDAKV----NVLQELKLWMARTIAATRNKAIIDVITKG--STGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeeeeEeeehhhHHHHhhchH----HHHHHHHHHHHHHHHHHHHHHHhhccccC--Cccccccccccccceeccccccch Confidence 999999999999999977654 47889999999999999999999998743 333333332222233333445668 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...++ .+++|+||++++.+|++++|++|+|++. |++..+.+++|+|+||++++++|.. ..+ T Consensus 275 ~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~i~~----~~~~~~~~~~l~G~pV~~~~~~~~~----~~~ 345 (415) T protein:vir:47 275 DDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLG----QKG 345 (415) T ss_pred HHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCeeec----cCcCCCCCccccceeeEEecccccc----CCC Confidence 899999998876554 4678999999999999999999998764 4566777889999999999998742 233 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...+++|||++ |+++.+++++++.++ |.++++.+|+.+|+|+++.+|+||++++..+ +..|.|+ T Consensus 346 ~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~---~~~~~~~ 410 (415) T protein:vir:47 346 NNTLIIGNLKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEeec-----------cccCceEEEEEEEeccEEeccccEEEEEeec---cCCCCCC Confidence 45689999997 678889999998875 4566788999999999999999999998877 3344455 No 66 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=2.4e-54 Score=314.59 Aligned_cols=281 Identities=10% Similarity=0.020 Sum_probs=224.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEE--EEeC-CceeEEeecccccCCC-ccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGA--VFSG-VPRAKIVGEGEVKPSA-SVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip--~~~~-~~~a~wv~Eg~~~~~s-~~~~~~v~ 76 (315) ...++.++||++||++++.+|++.+++.++|+++|++++|+++...++ +... .+.++|++|++.+|++ +++|++++ T Consensus 107 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~ 186 (395) T protein:vir:38 107 SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVK 186 (395) T ss_pred hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEE Confidence 334445578999999999999999999999999999999986655544 4433 4678999999999976 59999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++.. +|+++|.++|++++++++|.++++|+|.+. +.. +... T Consensus 187 ~~~~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~il~g~g~~~--~~~---------------~~~~ 245 (395) T protein:vir:38 187 YLIHRYAGITTVTNTLLKDTVD----NIIQWLVNWAAKKDVVTRNAKILEVMGKAP--KKP---------------TISQ 245 (395) T ss_pred eeeeeeEeehhhHHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccc---------------cccc Confidence 9999999999999999977654 478899999999999999999999987432 211 1123 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++.......+..+..|+|||+++..|++++|++|+|++. +++..+.+++|+|+||+++++++.... . T Consensus 246 ~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~~~---~ 318 (395) T protein:vir:38 246 FDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQ----PDVTSPDKYLIDGKPVIRIADKWLPDV---S 318 (395) T ss_pred HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCcceeccceeEEecccccCcC---C Confidence 677777776444444445567999999999999999999998864 345667788999999999988754322 2 Q ss_pred ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 237 ~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ....+++|||++ |+++++++++++++++.+ .+|++|++.||++.|+|+++.+|+||++++.+++....|..-. T Consensus 319 ~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~ 392 (395) T protein:vir:38 319 GSHPLYFGDLKQGITLFDRQQMQIDTTNVGA------GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQGTAG 392 (395) T ss_pred CcceEEEEeccccEEEEEecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCCccC Confidence 344688999986 788999999999998754 3599999999999999999999999999997765433333212 No 67 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=2.8e-54 Score=314.17 Aligned_cols=275 Identities=16% Similarity=0.073 Sum_probs=229.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC-ceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +..++..++|.++|+++..+||+.+++.++|+++|++++++++.+++|+.++. +.+.|++|++.+|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEee Confidence 33344445556677788899999999999999999999999999999998875 68999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc-cccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT-VDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 158 (315) ||++++++||+|+|+++ .+|+++|.+++++++++++|.++++|+| ++..+.|+.+........ ...+...++ T Consensus 193 ~k~~~~~~is~ell~d~-----~~l~~~i~~~l~~~~~~~~~~~il~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 265 (390) T protein:vir:10 193 HVIAHTMKATRQILSDA-----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGLIPQATTYAAPTTIAGATRVD 265 (390) T ss_pred EEEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCccccccccccccccccccccccchHH Confidence 99999999999999653 2488999999999999999999999987 455677877654433322 223344567 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+ ..+++|+|||+++..|++++|++|+|++.. ...+++++|+|+||++++.||.+ T Consensus 266 ~~~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~-----~~~~~~~~l~G~pv~~~~~~p~~-------- 331 (390) T protein:vir:10 266 QLRLAMLQASLAE-YPASGIVINPIDWAAIELAKDANNQYLIGN-----ARGTLTPTLWGLPVVATQAMAPG-------- 331 (390) T ss_pred HHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceeecC-----CcCcCCceecceeeEEcCCCCCC-------- Confidence 8888888886544 456789999999999999999999987643 33455679999999999999853 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) .+++|||++ |.+.++++++++++++. .+|++|++.||++.|+||++++|+||++++.+ T Consensus 332 -~~~~gdf~~~~~~~~~~~~~i~~~~~~-------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 332 -EFLVGAFDLAAQIFDQWDARVEIGYVN-------DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred -cEEEEeccceEEEEEecceEEEEeecc-------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 378899986 66888999999988753 25999999999999999999999999999999 No 68 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.6e-54 Score=315.55 Aligned_cols=279 Identities=13% Similarity=0.017 Sum_probs=224.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCC-CccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPS-ASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~-s~~~~~~v~l~~ 79 (315) ++.++.++||++||+++.++|++.+++.|+|+++|+++|++++...+|+.++.+.+.|++|++.+++ ++++|++++|.+ T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~ 163 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGM 163 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeee Confidence 7788889999999999999999999999999999999999999999999999999999999988875 689999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc----cccccc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT----VDATDS 155 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~----~~~~~~ 155 (315) ||++++++||+|+|+++..+ |+++|.++++++++.++|.++++|+|. + .|.|+.+.....+.. ...... T Consensus 164 ~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~~i~~~~~~a~l~G~G~--~-~P~Gil~~~~~~~~~~~~~~~~~~~ 236 (390) T protein:vir:40 164 YKLSAYIPVCNAMLDLGPSW----LDQYVRTILGEAMALGLEAGIVNGSGK--D-QPIGMMRDLNNVTAGEHPVKTATPL 236 (390) T ss_pred eeEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhhhcccCC--C-ccceeeecccccccccccccccccc Confidence 99999999999999877554 789999999999999999999999973 3 456776543322211 122233 Q ss_pred hhHHHHHHHHHhhhc------ccccceEEEEeHHHH----HHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 156 ATTDLVKAVGLIAGA------GLQVPNGVALDPAFS----FALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 156 ~~~di~~~~~~~~~~------~~~~~~~~~m~~~~~----~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) .+.+..+++..+... ....+..|+||+++. ..++.++|.+|+|++.. .++|+||++++ T Consensus 237 t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~------------~~~g~pvv~~~ 304 (390) T protein:vir:40 237 TDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI------------LPVPLEIVQSV 304 (390) T ss_pred chhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc------------CCCceeEEEcC Confidence 445555555444321 123345699999874 34567889998877532 35799999999 Q ss_pred ccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~ 305 (315) +||.+ .++||||++|+++++++++++++++. +|.+|++.||+..|+|+++.+++||++|+.++ T Consensus 305 ~~p~~---------~i~~Gd~s~~~i~~~~~~~v~~~~~~--------~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~ 367 (390) T protein:vir:40 305 AVPVG---------KAVAGRAKDYFMGIGSEQVIRTSTEY--------RLLDDETLYYAKQYANGRPKDNSSFLVFDITG 367 (390) T ss_pred CCCCC---------cEEEEeeceEEEEeecceEEEecchh--------hhhcCcEEEEEEEEeCCEEecccceEEEEeec Confidence 99853 27789999999999999999999874 59999999999999999999999999997554 Q ss_pred C--CCCCCCCCC Q lcl|NC_018838. 306 A--PKPNPPAGN 315 (315) Q Consensus 306 a--~~~~~~~~~ 315 (315) . ..+.||.+- T Consensus 368 ~~~~~~~~~~~~ 379 (390) T protein:vir:40 368 LEGSPAIDVNVV 379 (390) T ss_pred cCCCCCCCccee Confidence 4 224444433 No 69 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3.2e-54 Score=313.87 Aligned_cols=271 Identities=12% Similarity=-0.007 Sum_probs=225.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC--CceEEEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF--GPVKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~--~~~~ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) |+.++.++||++||+++..+|++.+++.++|+++|+++++++ +.+.+|+.++.+.++||+|++.+|++ .++|+++++ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~ 202 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSY 202 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEe Confidence 888889999999999999999999999999999999999874 56778888888999999999999975 699999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+++++..+ |+++|.+++++++++++|.++++|+|.. . +.+ ...+ T Consensus 203 ~~~k~~~~~~is~e~l~ds~~~----l~~~i~~~l~~~~~~~~d~~il~G~g~~--~-~~g---------------~~~~ 260 (397) T protein:vir:12 203 SIIDYGGIMTLSNSMLNDSDQA----IMTYVAKWFAKKSVVTRNNLILAAIASL--K-KVD---------------IDGL 260 (397) T ss_pred eheeeEeeehhhHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHHhccccc--c-ccc---------------cccH Confidence 9999999999999999876554 7889999999999999999999998732 1 111 1236 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++.......+..+..|+|||+++.+|++++|++|+|++. |++..+.+++|+|+||++++.+.... ... T Consensus 261 ~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~----~~~~~g~~~~l~G~pv~~~~~~~~~~---~~~ 333 (397) T protein:vir:12 261 DGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQ----PDPTNPTKKLLDGRPVVPFTNRVLKT---QKG 333 (397) T ss_pred HHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeec----ccccCCCCccccceeeEEeccccccc---CCC Confidence 77887775333344445567999999999999999999998754 35567778899999999776542221 233 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +..+++|||++ |.++++++++++++++.+ +.|++|++.||+++|+||++.+|+||++++..+- T Consensus 334 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 334 KAPLIIGNLKEAIVLFDREQQSIASTDTGA------GAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccEEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 45688999997 568889999999988754 3699999999999999999999999999987765 No 70 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=5.4e-54 Score=312.60 Aligned_cols=279 Identities=10% Similarity=0.045 Sum_probs=224.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc--eEEEEEeC-CceeEEeecccccCC-CccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~ 76 (315) |..++.+.||++||++++.+|++.+++.++|+++|+++|++++. +.+++..+ +..+.|++|++.+++ ++++|++++ T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~ 195 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIK 195 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEE Confidence 88889999999999999999999999999999999999998655 45555544 457789999999997 569999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|.. .+.. +... T Consensus 196 ~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d~~il~G~G~~--~~~~---------------~~~~ 254 (408) T protein:vir:74 196 YLIKRYAGIITATNTLLKDTAEN----ILAWLSSWIAKKVVVTRNQAIIAAMGTV--PKKP---------------TIAN 254 (408) T ss_pred eeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhccccc--cccc---------------cccc Confidence 99999999999999999776544 7889999999999999999999998732 2211 1224 Q ss_pred hHHHHHHHH-HhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc--cCccccc Q lcl|NC_018838. 157 TTDLVKAVG-LIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST--VSGAPEM 233 (315) Q Consensus 157 ~~di~~~~~-~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~--v~~~~~~ 233 (315) ++++.+++. .+.+ .+..+..|+|||+++.+|++++|++|+|++. +++..+.+++|+|+||++++. +|.. T Consensus 255 ~~~i~~~~~~~l~~-~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~~~--- 326 (408) T protein:vir:74 255 FDDVITMINTSVDP-AIIATSSLLTNQSGLNKLALVKTAEGKYLLE----PDPTKPNSYLIKGKQVIVVADRWLPNS--- 326 (408) T ss_pred HHHHHHHHHHhhhh-hhcCCCEEEEcHHHHHHHHHhhcCCCceEec----cCcCCCCCceecceeeEEecCcccccc--- Confidence 677877764 4443 3344567999999999999999999998764 456677788999999998764 4432 Q ss_pred cccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeecc--CCCCC Q lcl|NC_018838. 234 SPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA--APKPN 310 (315) Q Consensus 234 ~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~--a~~~~ 310 (315) .++...+++|||++ |.++++++++++++++.. ..|++|++.+|+++|+||++.+|+||++++.++ .+++. T Consensus 327 -~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~ 399 (408) T protein:vir:74 327 -GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGN 399 (408) T ss_pred -cCCcceEEEEehhccEEEEEecceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCC Confidence 24456789999986 778999999999998754 359999999999999999999999999998644 22222 Q ss_pred CCCCC Q lcl|NC_018838. 311 PPAGN 315 (315) Q Consensus 311 ~~~~~ 315 (315) -|..- T Consensus 400 ~~~~~ 404 (408) T protein:vir:74 400 FKTTT 404 (408) T ss_pred CCCCc Confidence 22222 No 71 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.4e-53 Score=310.32 Aligned_cols=286 Identities=12% Similarity=0.019 Sum_probs=229.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceE--EEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVK--GAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~--ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) ....+.++||++||+++.+.|++.+++.++|++++++++|+++..+ +|+.++...++|++|++.+|++ .++|+++++ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:79 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEe Confidence 3334556789999999999999999999999999999999866554 5556677889999999999975 689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|.+. +..+..............+...| T Consensus 201 ~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:79 201 DINTHRGYFRISREAIEDAKVN----VLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeeeeEeeehhhHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhccccCc--cccccccccccccccccccccch Confidence 9999999999999999776544 78899999999999999999999987433 33333332233333333445678 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...+ ..+++|+||++++.+|++++|++|+|++. |+...+.+++|+|+||++++++|.. ... T Consensus 275 ~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~----~~~ 345 (415) T protein:vir:79 275 DDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLG----QKG 345 (415) T ss_pred hHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCCceecceeeEEecccccC----CCC Confidence 99999998886654 34678999999999999999999998764 3556677789999999999988743 223 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...++||||++ |+++++++++++.+++ .++++.+|+.+|+|+++.||+||++++..++ ..|.|+ T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~---~~~~~~ 410 (415) T protein:vir:79 346 NNTLIIGNLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDS---ERGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEecc---CCCCCc Confidence 45689999987 6688899999988763 4556789999999999999999999988773 334455 No 72 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.4e-53 Score=310.32 Aligned_cols=286 Identities=12% Similarity=0.019 Sum_probs=229.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceE--EEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVK--GAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~--ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) ....+.++||++||+++.+.|++.+++.++|++++++++|+++..+ +|+.++...++|++|++.+|++ .++|+++++ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:98 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEe Confidence 3334556789999999999999999999999999999999866554 5556677889999999999975 689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|.+. +..+..............+...| T Consensus 201 ~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:98 201 DINTHRGYFRISREAIEDAKVN----VLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeeeeEeeehhhHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhccccCc--cccccccccccccccccccccch Confidence 9999999999999999776544 78899999999999999999999987433 33333332233333333445678 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...+ ..+++|+||++++.+|++++|++|+|++. |+...+.+++|+|+||++++++|.. ... T Consensus 275 ~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~----~~~ 345 (415) T protein:vir:98 275 DDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLG----QKG 345 (415) T ss_pred hHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCCceecceeeEEecccccC----CCC Confidence 99999998886654 34678999999999999999999998764 3556677789999999999988743 223 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...++||||++ |+++++++++++.+++ .++++.+|+.+|+|+++.||+||++++..++ ..|.|+ T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~---~~~~~~ 410 (415) T protein:vir:98 346 NNTLIIGNLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDS---ERGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEecc---CCCCCc Confidence 45689999987 6688899999988763 4556789999999999999999999988773 334455 No 73 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.4e-53 Score=310.32 Aligned_cols=286 Identities=12% Similarity=0.019 Sum_probs=229.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceE--EEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVK--GAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~--ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) ....+.++||++||+++.+.|++.+++.++|++++++++|+++..+ +|+.++...++|++|++.+|++ .++|+++++ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:81 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEe Confidence 3334556789999999999999999999999999999999866554 5556677889999999999975 689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++..+ |+++|.+++++++++++|.++++|+|.+. +..+..............+...| T Consensus 201 ~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:81 201 DINTHRGYFRISREAIEDAKVN----VLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeeeeEeeehhhHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHHhhccccCc--cccccccccccccccccccccch Confidence 9999999999999999776544 78899999999999999999999987433 33333332233333333445678 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...+ ..+++|+||++++.+|++++|++|+|++. |+...+.+++|+|+||++++++|.. ... T Consensus 275 ~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~----~~~ 345 (415) T protein:vir:81 275 DDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLG----QKG 345 (415) T ss_pred hHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCceeec----cCcCCCCCceecceeeEEecccccC----CCC Confidence 99999998886654 34678999999999999999999998764 3556677789999999999988743 223 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...++||||++ |+++++++++++.+++ .++++.+|+.+|+|+++.||+||++++..++ ..|.|+ T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~---~~~~~~ 410 (415) T protein:vir:81 346 NNTLIIGNLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDS---ERGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEecc---CCCCCc Confidence 45689999987 6688899999988763 4556789999999999999999999988773 334455 No 74 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2.9e-53 Score=308.61 Aligned_cols=286 Identities=12% Similarity=0.006 Sum_probs=228.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceE--EEEEeCCceeEEeecccccCCC-ccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVK--GAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~--ip~~~~~~~a~wv~Eg~~~~~s-~~~~~~v~l 77 (315) ...++.++||++||++++.+|++.+++.++|+++|++++|+++..+ +++.++.+.++|++|++.+|++ .++|+++++ T Consensus 121 ~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~ 200 (415) T protein:vir:94 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred hhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEe Confidence 2333456789999999999999999999999999999999876555 5556677899999999999965 689999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|+|+++.. +|+++|.+++++++++++|.++++|+|.+.. ..+..............+...| T Consensus 201 ~~~k~~~~~~is~ell~ds~~----~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~--~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:94 201 DINTHRGYFRISREAIEDAKV----NVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eheeeeeechhhHHHHhhchH----HHHHHHHHHHHHHHHHHHHHHHhhccccCcc--ccccccccccccccccccccch Confidence 999999999999999977654 4788999999999999999999999874332 2222222222223333344568 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++..+...++ .+++|+||++++.+|++++|++|+|++. |+...+.+++|+|+||++++++|... .. T Consensus 275 ~~i~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~~----~~ 345 (415) T protein:vir:94 275 DDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLGQ----KG 345 (415) T ss_pred HHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCeeec----cCcCCCCCceecceeeEEecccccCC----CC Confidence 899999998866543 4678999999999999999999998754 35566778899999999999988432 23 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ...+++|||++ |+++.+++++++.++ |.++++.+|+++|+|+++.+|+||++++..++. .|.|+ T Consensus 346 ~~~i~~gd~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~---~~~~~ 410 (415) T protein:vir:94 346 NNTLIIGNLKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE---RGEGD 410 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEec-----------cccCceEEEEEEEeccEEeccccEEEEEEeccC---CCCCc Confidence 45689999997 677889999998775 456678899999999999999999999876633 33444 No 75 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=7e-54 Score=312.01 Aligned_cols=280 Identities=11% Similarity=-0.028 Sum_probs=227.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) +..++.++||++||+++.++|++.+++.|+|+++|++++++ +..++|+.++.+.+.|++|+++++ +++++|++++|.+ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-cceEEEEecCCcceeEeecccccCcccCccceeEeecc Confidence 88999999999999999999999999999999999999986 568999999999999999988765 5789999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc---cc--- Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD---AT--- 153 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~---~~--- 153 (315) ||++++++||+|||.++..+ |+++|++++++++++++|.+|++|+| ++ .|.|+.+.....+.... .+ T Consensus 158 ~kl~a~~~is~elL~ds~~~----ie~~i~~~la~~~a~~~~~a~i~G~G--~~-qP~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) T protein:vir:98 158 FKLTAFVVIPKDALKFGPKW----IKQFITEQLKEAIAVALELAIVKGDG--LL-QPVGLLKDLSQPTVDQSTGRDITTY 230 (377) T ss_pred eeEEeeecccHHhhhccHhH----HHHHHHHHHHHHHHHHHhhceEeccC--CC-cceeeeecccccccccccccccccc Confidence 99999999999999766554 88999999999999999999999998 33 56777764332221111 11 Q ss_pred cchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccc------ccccc----cccCCCccccceee-- Q lcl|NC_018838. 154 DSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQ------PMYPA----AGFAGLDNWRGLNV-- 221 (315) Q Consensus 154 ~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~------~~~~~----~~~~~~~~l~G~Pv-- 221 (315) ...++.+.++...+... +....+|+||+.+...+++++|.+|+++|.. .++|. ...|++.+++|+|+ T Consensus 231 ~~~~~~~~~l~~~~~~~-~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~v 309 (377) T protein:vir:98 231 KTDKEAIADLSDLTPDN-APKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITI 309 (377) T ss_pred cchhhhHhhhhhhchhH-HHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceE Confidence 11223455555555433 2233469999999999999999999999831 01121 13566779999995 Q ss_pred EeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEE Q lcl|NC_018838. 222 GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVV 301 (315) Q Consensus 222 ~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l 301 (315) +.+++||.+ .++||||++|.++++++++++++++. +|.+|++.||+.+|+|+++++++||++| T Consensus 310 v~s~~~p~~---------~i~fgdf~~Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dg~~~~~~a~~vl 372 (377) T protein:vir:98 310 LESLAVETG---------KAIAFVANRYDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKDNHTAALL 372 (377) T ss_pred EecCCCCcc---------cEEEEEecceeEEeecceEEEeechh--------hhhcCceEEEEEEEEcCEEeccCcEEEE Confidence 567777742 37899999999999999999999874 6999999999999999999999999999 Q ss_pred eeccC Q lcl|NC_018838. 302 KEKAA 306 (315) Q Consensus 302 ~~~~a 306 (315) +.+-. T Consensus 373 ~i~~~ 377 (377) T protein:vir:98 373 TLAGG 377 (377) T ss_pred EEecC Confidence 87776 No 76 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.3e-52 Score=305.10 Aligned_cols=271 Identities=14% Similarity=0.053 Sum_probs=223.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCC--ceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV--PRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~--~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) -+.++++.++.+||++++.+|++.++..++|+++|+++++.++.++||+.++. ..+.|++||+.+|+++++|+++++. T Consensus 107 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~ 186 (379) T protein:vir:10 107 GDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVN 186 (379) T ss_pred cccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEee Confidence 22244455666899999999999999999999999999999999999998754 4678999999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|+|+++ .+|+++|.++|++++++++|.+++.|+|..+. .+... .......+ T Consensus 187 ~~k~~~~~~iS~ell~D~-----~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~---~~~~~---------~~~~~~~d 249 (379) T protein:vir:10 187 TDFIAGFTRYSKKMANNL-----PFLTSFIPNALRRDYAKAENAAFNAVLAANAT---ASTEI---------ITNKNKVE 249 (379) T ss_pred eeeEEeeehhhHHHHhhH-----HHHHHHHHHHHHHHHHHHHHHHHhcccccccc---ccccc---------ccCcccHH Confidence 999999999999999653 35899999999999999999999988763211 11010 01122346 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+... +..+++|+|||.++..|+++||++|+|++.... ....+++.+|+|+||++++.||.+ T Consensus 250 ~i~~~~~~~~~~-~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~--~~~~~~~~~l~G~pvv~s~~~~ag-------- 318 (379) T protein:vir:10 250 MLINEIAKQENL-DFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGV--VTQDNGVLRINGIPLFRATWLAAN-------- 318 (379) T ss_pred HHHHHHHhhhhc-cCCCCEEEEcHHHHHHHHHhhccCCceeccCCc--cCCCCCcceecceeeEecCCCCCC-------- Confidence 788877777554 345678999999999999999999999875332 334566779999999999999743 Q ss_pred ceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) .+++|||+++.+.++++++++++++.. ++|++|++.||+++|+|++|+||+||++++.++. T Consensus 319 -~~~~gdf~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 319 -KYYVGDWTRVTKVTTEGLSLEFSEVEG------TNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred -ceEEeecccEEEEEEeceEEEEeeccc------ccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 378999999999999999999998753 3599999999999999999999999999998876 No 77 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=2.1e-52 Score=303.85 Aligned_cols=279 Identities=15% Similarity=0.087 Sum_probs=225.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l~ 78 (315) +..++.++||++||++++++|++.+++.++|+++|++++++++..++|+... ...+.|++|++++|+ ++++|++|++. T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~ 190 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWS 190 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEee Confidence 7778889999999999999999999999999999999999999899998765 467899999999996 67999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|+|+++.. +|+++|.++|++++++++|.++++|+|.+. . .+. .+...++ T Consensus 191 ~~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~il~g~g~~~--~-~~~------------~~~~~~d 251 (394) T protein:vir:10 191 VSTYRGAIPLSEEAIADSAV----DLTSLVGQSINEKSVNTYNAMIAPVLQSFT--A-KAT------------TTDTLVD 251 (394) T ss_pred eeeeEeeehhHHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc--c-ccc------------cccccHH Confidence 99999999999999977654 478899999999999999999999986321 1 111 1223466 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++.......+ .+.|+||++++.+|++++|++|+|++..........+.+++|+|+||++++.+... ...++ T Consensus 252 ~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~---~~~~~ 326 (394) T protein:vir:10 252 SLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLG---SAAGD 326 (394) T ss_pred HHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccC---CCCCc Confidence 77777765444332 46799999999999999999999998766544445567789999999987654211 12334 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) ..+++|||++ |+++.+++++++++++.+ |.+ .+|+.+|+|+++.+|+||+.++..++.+ -|++++ T Consensus 327 ~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--------~~~---~~~~~~r~d~~~~~~~ai~~~~~~~~~~-~~~~~~ 392 (394) T protein:vir:10 327 QKAFVGDLKRGVLFADRQQVTLAWEDSKI--------YGR---YLGAAFRFGVKQADSNAGYFVTNTDAAS-GSTSGT 392 (394) T ss_pred eEEEEeeccccEEEEeecceEEEEecccc--------cce---eEEEEEEeccEEeccccEEEEEeecccC-CCCCCC Confidence 5688999986 778889999999887642 443 5889999999999999999999777655 566666 No 78 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=3.7e-52 Score=302.58 Aligned_cols=278 Identities=14% Similarity=0.061 Sum_probs=223.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l~ 78 (315) |+.++.++||++||+++..+|++.++++++|+++|+++|++++..++|+... ...+.|++|++.+++ ++++|+++++. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~ 188 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWS 188 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeee Confidence 8899999999999999999999999999999999999999999999999875 456689999999985 68999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||+++++++|+|+|+++.. +|+++|.+++++++++++|.++++|.+... . . ...+...++ T Consensus 189 ~~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--~-~------------~~~~~~~~d 249 (389) T protein:vir:10 189 VATYRGAIPLSEEAIADSAV----DLTALVGQSIKEKSVNTYNAMIAPVLQSFT--A-K------------KTTTDTLVD 249 (389) T ss_pred heeeEeeehhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc--c-c------------cccccccHH Confidence 99999999999999977654 478889999999999999999998876321 1 0 011233467 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++.......+ .++|+||++++.+|++++|++|+|++.+........+.+++|+|+||++++..... ..+++ T Consensus 250 ~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~ 324 (389) T protein:vir:10 250 SLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLG---SLAGD 324 (389) T ss_pred HHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccC---CCCCc Confidence 78777764333322 46899999999999999999999998765433334566789999999876543211 12334 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPA 313 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~ 313 (315) ..+++|||++ |+++++++++++++++.+ |. ..+|+.+|+|+++.+|+||++++..+++..+|-- T Consensus 325 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~---~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 325 QKAFVGDLKRGVLFTDRQQVTLAWEDSKI--------YG---KYLGAAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred eEEEEeeccccEEEEeecceEEEeecccc--------cc---ceEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 5689999997 789999999999988642 44 3578999999999999999999876665555444 No 79 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=5.1e-53 Score=307.28 Aligned_cols=287 Identities=10% Similarity=-0.035 Sum_probs=221.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) |..+++++||++||+++.++|++.+++.|+|+++|+++++++ ..++|+.++.+.+.|++|+++++ +++++|++++|.+ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 161 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQ 161 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-ceEEEEEcCCcceEEeecccccccccCcceeeEeecc Confidence 889999999999999999999999999999999999999875 58999999999999999987765 5789999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc--------c Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV--------D 151 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~--------~ 151 (315) ||++++++||+|||+++.. +|+++|++++++++++++|.+|++|+| ++ .|.|+...+....... . T Consensus 162 ~kl~~~i~is~ell~Ds~~----~ie~~i~~~l~~~~a~~~~~a~i~G~G--~~-qP~Gil~~~~~~~~~~~~~~~~~~~ 234 (383) T protein:vir:78 162 NKLTAFVVVPKDLEKFGPA----WVKRFVVTQIEEAFAVALESAYIVGDG--ND-KPIGLNRKVGKGSTVVDGVYAEKAA 234 (383) T ss_pred eeeEeeccchHHHhhccHH----HHHHHHHHHHHHHHHHHHhhheEeccC--CC-CceeeeeccCCcccccccccccccc Confidence 9999999999999976654 488999999999999999999999997 33 4667765432221111 1 Q ss_pred cccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccC--cc-ccccccccccccCCCcccccee--eEeecc Q lcl|NC_018838. 152 ATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKG--SP-LAGQPMYPAAGFAGLDNWRGLN--VGASST 226 (315) Q Consensus 152 ~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g--~~-~~~~~~~~~~~~~~~~~l~G~P--v~~s~~ 226 (315) .+...+.++..++..+... .....|.||..+...+++++.-.. .+ .+..........|++.+++|+| |+.+++ T Consensus 235 ~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~ 312 (383) T protein:vir:78 235 TGTLTFANPKTTVNELTDV--YKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLF 312 (383) T ss_pred cchhhhhhhHHHHHHHHHH--HhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCC Confidence 2233455666666655432 222335555555555555442111 11 1111111112345666788777 556778 Q ss_pred cCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ||.+ .++||||++|+++++++++++++++. +|.+|++.||+.+|+|+++++++||++|+.+.+ T Consensus 313 ~p~~---------~iifgdfs~Y~i~~r~~~~i~~~~~~--------~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 313 VPEK---------KAISYVAERYDALIGGPLDIGTYDQT--------LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred CCcc---------cEEEeeccceEEEecccceEEecchh--------hhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 8743 27889999999999999999999874 699999999999999999999999999999999 Q ss_pred CCCCCCCC Q lcl|NC_018838. 307 PKPNPPAG 314 (315) Q Consensus 307 ~~~~~~~~ 314 (315) ++++.|+| T Consensus 376 ~~~~~~~~ 383 (383) T protein:vir:78 376 PAEQTPEG 383 (383) T ss_pred CCCCCCCC Confidence 99999999 No 80 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=9.9e-53 Score=305.69 Aligned_cols=269 Identities=12% Similarity=0.053 Sum_probs=214.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..++.++||++||+++.++|++.++.+++||++|+++++++ ..+|+.+. .+.++||+|++.+++++++|+++++.+ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 160 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTT 160 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEEEecCCCcccccccccccccccccceeeeecc Confidence 899999999999999999999999999999999999988753 56787665 468999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTD 159 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 159 (315) ||++++++||+|||+++.. +|+++|.++|+++++++++.. ++++|.+++.+..++.+. . ...++ +...|++ T Consensus 161 ~k~~~~i~is~ell~Ds~~----~l~~~i~~~la~~~~~~e~~~-~~~~g~g~~~~~g~l~~~-~--~~~~t-~~~~~d~ 231 (352) T protein:vir:78 161 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKD-ALAVSPKSGLEHMSFYNG-S--VKEVE-GANMYDA 231 (352) T ss_pred eeEEeechhhHHHHhhhhH----HHHHHHHHHHHHHHHHHHHHh-hhhcCCCCcccccceecc-c--ccccc-ccchHHH Confidence 9999999999999976654 488889999999998875443 334443454443333222 1 12222 3345888 Q ss_pred HHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccc Q lcl|NC_018838. 160 LVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV 239 (315) Q Consensus 160 i~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~ 239 (315) +.+++..+...+. .+..|+||+.++..|+++++.+|++++ .+.+++|+|+||++++.++ T Consensus 232 i~~~~~~l~~~~~-~~a~~~mn~~t~~~l~~~~~~~~~~~~---------~~~~~~llG~PV~~~~~~~----------- 290 (352) T protein:vir:78 232 IINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF---------DTPAEKVFGKPVVFTDAAV----------- 290 (352) T ss_pred HHHHHhccChhhh-cCCEEEEehHHHHHHHHHHhccCCccc---------ccCCccccccceEEecCCC----------- Confidence 9999988865543 345799999999999999988887764 2456789999999988653 Q ss_pred eEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 240 KAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 240 ~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||++++++ +.++.++...+ ..++++.|++.+|+|+++++|+||+.++.+++..++|- T Consensus 291 ~~~~Gdf~~~~~~-~~~~~~~~~~~----------~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 291 KPIVGDFNYFGIN-YDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred ceeEeehhhhhhh-hhhheeeeecc----------ccCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 2678999988775 56666665554 23688999999999999999999999999998888888 No 81 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=1.5e-52 Score=304.71 Aligned_cols=277 Identities=10% Similarity=-0.014 Sum_probs=214.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) |..+++++||++||++++++|++.+++.|+||++|++++++ +..++|+.++.+.++|++|+++++ +++++|++++|.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cceEEEEecCCcceeeecccccccccccccceeeeecc Confidence 88899999999999999999999999999999999999987 568999999999999999988876 5689999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc--------cc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT--------VD 151 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~--------~~ 151 (315) ||++++++||+|||+++.. +|+++|++++++++++++|.+|++|+| ++ .|.|+.+.+...... .. T Consensus 155 ~kl~~~~~is~elL~Ds~~----~ie~~i~~~la~~~a~~~~~a~i~G~G--~~-qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:10 155 NKLTAFVVLPKDLNDFGPA----WIERFVRVQIEEAFAVALETAFLKGTG--KD-QPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) T ss_pred eeEEeechhhHHHhhcCHH----HHHHHHHHHHHHHHHHHhhheeEeccC--CC-CceeeeeccCccccccccccccccc Confidence 9999999999999976654 488999999999999999999999998 33 456676543221111 01 Q ss_pred cccchh-------HHHHHHHHHhhhc-----ccccce-EEEEeHHHHHHHHHHhh---ccCccccccccccccccCCCcc Q lcl|NC_018838. 152 ATDSAT-------TDLVKAVGLIAGA-----GLQVPN-GVALDPAFSFALSTEVY---PKGSPLAGQPMYPAAGFAGLDN 215 (315) Q Consensus 152 ~~~~~~-------~di~~~~~~~~~~-----~~~~~~-~~~m~~~~~~~L~~l~d---~~g~~~~~~~~~~~~~~~~~~~ 215 (315) .....+ ..+.+++..+... ..+..+ .|+||+.+...|+.+++ .+|+|+|.. T Consensus 228 ~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l------------- 294 (381) T protein:vir:10 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL------------- 294 (381) T ss_pred ccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC------------- Confidence 111122 2333333333211 112223 59999999999987764 344443311 Q ss_pred ccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecc Q lcl|NC_018838. 216 WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) Q Consensus 216 l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~ 295 (315) -+|++|+.++.||.+ .++||||++|++++|++++++++++. +|.+|++.||+.+|+|++++++ T Consensus 295 ~~g~~vv~s~~~p~~---------~iifgDfs~Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:10 295 PFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred CCCceEEecCCCCcC---------cEEEEecccEEEEEecccEEEeechh--------HhhcCCeEEEEEEEEcCEEecC Confidence 146678889999843 27899999999999999999999984 6999999999999999999999 Q ss_pred cceEEEeecc--CCCCCCCCCC Q lcl|NC_018838. 296 DSFAVVKEKA--APKPNPPAGN 315 (315) Q Consensus 296 ~af~~l~~~~--a~~~~~~~~~ 315 (315) +||++++.+. +++.+|-.-| T Consensus 358 ~A~~v~~l~~~~~~~~~~~~~~ 379 (381) T protein:vir:10 358 KVAAVWKLDLKGHKPALEGTEE 379 (381) T ss_pred ceEEEEEEEecCCCcCcccccc Confidence 9999976544 4433333344 No 82 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=1.5e-52 Score=304.71 Aligned_cols=277 Identities=10% Similarity=-0.014 Sum_probs=214.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) |..+++++||++||++++++|++.+++.|+||++|++++++ +..++|+.++.+.++|++|+++++ +++++|++++|.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cceEEEEecCCcceeeecccccccccccccceeeeecc Confidence 88899999999999999999999999999999999999987 568999999999999999988876 5689999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc--------cc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT--------VD 151 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~--------~~ 151 (315) ||++++++||+|||+++.. +|+++|++++++++++++|.+|++|+| ++ .|.|+.+.+...... .. T Consensus 155 ~kl~~~~~is~elL~Ds~~----~ie~~i~~~la~~~a~~~~~a~i~G~G--~~-qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:95 155 NKLTAFVVLPKDLNDFGPA----WIERFVRVQIEEAFAVALETAFLKGTG--KD-QPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) T ss_pred eeEEeechhhHHHhhcCHH----HHHHHHHHHHHHHHHHHhhheeEeccC--CC-CceeeeeccCccccccccccccccc Confidence 9999999999999976654 488999999999999999999999998 33 456676543221111 01 Q ss_pred cccchh-------HHHHHHHHHhhhc-----ccccce-EEEEeHHHHHHHHHHhh---ccCccccccccccccccCCCcc Q lcl|NC_018838. 152 ATDSAT-------TDLVKAVGLIAGA-----GLQVPN-GVALDPAFSFALSTEVY---PKGSPLAGQPMYPAAGFAGLDN 215 (315) Q Consensus 152 ~~~~~~-------~di~~~~~~~~~~-----~~~~~~-~~~m~~~~~~~L~~l~d---~~g~~~~~~~~~~~~~~~~~~~ 215 (315) .....+ ..+.+++..+... ..+..+ .|+||+.+...|+.+++ .+|+|+|.. T Consensus 228 ~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l------------- 294 (381) T protein:vir:95 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL------------- 294 (381) T ss_pred ccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC------------- Confidence 111122 2333333333211 112223 59999999999987764 344443311 Q ss_pred ccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecc Q lcl|NC_018838. 216 WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) Q Consensus 216 l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~ 295 (315) -+|++|+.++.||.+ .++||||++|++++|++++++++++. +|.+|++.||+.+|+|++++++ T Consensus 295 ~~g~~vv~s~~~p~~---------~iifgDfs~Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:95 295 PFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred CCCceEEecCCCCcC---------cEEEEecccEEEEEecccEEEeechh--------HhhcCCeEEEEEEEEcCEEecC Confidence 146678889999843 27899999999999999999999984 6999999999999999999999 Q ss_pred cceEEEeecc--CCCCCCCCCC Q lcl|NC_018838. 296 DSFAVVKEKA--APKPNPPAGN 315 (315) Q Consensus 296 ~af~~l~~~~--a~~~~~~~~~ 315 (315) +||++++.+. +++.+|-.-| T Consensus 358 ~A~~v~~l~~~~~~~~~~~~~~ 379 (381) T protein:vir:95 358 KVAAVWKLDLKGHKPALEGTEE 379 (381) T ss_pred ceEEEEEEEecCCCcCcccccc Confidence 9999976544 4433333344 No 83 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.2e-51 Score=299.69 Aligned_cols=281 Identities=16% Similarity=0.159 Sum_probs=228.0 Q ss_pred CCCCcc-CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC--------CceeEEeecccccCCCccc Q lcl|NC_018838. 1 MADDFL-SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG--------VPRAKIVGEGEVKPSASVD 71 (315) Q Consensus 1 m~~~~~-s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~--------~~~a~wv~Eg~~~~~s~~~ 71 (315) +..++. +.|+.++|..+...|+...+..+.++++++++++.++.+++|+.++ .+.++||+||+.+|+++++ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 202 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS 202 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccc Confidence 344433 3445677777788778888889999999999999999899988654 3468899999999999999 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccc----- Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKT----- 146 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~----- 146 (315) |+++++.+||++++++||+|+|+++ ..|+++|.+++++++++++|.++++|+|. + .+.|+.+..... T Consensus 203 ~~~i~~~~~k~~~~~~is~ell~d~-----~~l~~~i~~~la~a~~~~~d~aii~G~G~--~-~p~Gi~~~~~~~~~~~~ 274 (419) T protein:vir:94 203 FDTITTTLKTVAHWLPITRQAADDN-----SQLMGYIQGRLTYGLRFLRDRQLLNGNGS--T-EMQGILTTPGIGTYQQP 274 (419) T ss_pred eeeEEeeeeeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHHhccCc--c-cccceeccccccccccc Confidence 9999999999999999999999643 24889999999999999999999999984 2 455665432111 Q ss_pred -ccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 147 -TKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 147 -~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) ..........++++.+++..+...+ ..+++|+||++++..|+++++.+|++.. +++++..+.+++|+|+||++++ T Consensus 275 ~~~~~~t~~~~~~~l~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~~k~~~~~~~~---~~~~~~~~~~~~l~G~pV~~~~ 350 (419) T protein:vir:94 275 KPTAPATDEPPLVDIRRAKTVAEIAG-FPPDGVVVHPQDWESIELDQAPGSGVFR---VIANVQGEATPRIWGLNVVSTV 350 (419) T ss_pred ccccccccchhHHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHHhhcCCCcee---ecCCcccCCCccccceeeEEcC Confidence 1111122345788999998887655 3567899999999999999998776432 3467778888999999999999 Q ss_pred ccCccccccccccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) +||.+ .+++|||++ |++..+++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.+ T Consensus 351 ~~~~~---------~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 351 AIAQG---------TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) T ss_pred CCCCc---------cEEEeeccceEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEeccccEEEEEec Confidence 99843 378899987 567889999999988754 36999999999999999999999999999999 Q ss_pred cCCC Q lcl|NC_018838. 305 AAPK 308 (315) Q Consensus 305 ~a~~ 308 (315) +++. T Consensus 416 aa~~ 419 (419) T protein:vir:94 416 AATT 419 (419) T ss_pred cCCC Confidence 8877 No 84 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=2.6e-52 Score=303.37 Aligned_cols=275 Identities=11% Similarity=-0.010 Sum_probs=215.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) |..+++++||++||+++.++|++.+++.|+||++|++++++ +..++|+.++++.+.|++|.++++ +++++|+++++.+ T Consensus 76 ~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:10 76 INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred HhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC-cceEEEeecCCcceEEeecccccccccCccceeEeecc Confidence 88999999999999999999999999999999999999986 568999999999999999987765 6689999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc--------c Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV--------D 151 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~--------~ 151 (315) ||++++++||+|||+++.. +|+++|++++++++++++|.+|++|+| ++ .|.|+.+.+....... . T Consensus 155 ~kl~a~i~is~elL~Ds~~----~le~~i~~~la~~~a~~~~~afi~GdG--~~-qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:10 155 NKLTAFVVLPKDLNDFGPA----WIERFVRVQIEEAFAVALETAFLKGTG--KD-QPIGLNRQVQKGVSVTDGAYPEKEE 227 (381) T ss_pred eeEEeeccccHHHHhccHH----HHHHHHHHHHHHHHHHHhhceeEeccc--CC-CceeeeecCCccccccccccccccc Confidence 9999999999999976654 489999999999999999999999998 33 4567765332211111 1 Q ss_pred cccchhHHHHHHHHHhhh------------ccccc-ceEEEEeHHHHHHHHHHh---hccCccccccccccccccCCCcc Q lcl|NC_018838. 152 ATDSATTDLVKAVGLIAG------------AGLQV-PNGVALDPAFSFALSTEV---YPKGSPLAGQPMYPAAGFAGLDN 215 (315) Q Consensus 152 ~~~~~~~di~~~~~~~~~------------~~~~~-~~~~~m~~~~~~~L~~l~---d~~g~~~~~~~~~~~~~~~~~~~ 215 (315) .....+.++..++..+.+ ...+. +..|+||+.+...|+.++ +.+|+|+|.. T Consensus 228 ~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~l------------- 294 (381) T protein:vir:10 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL------------- 294 (381) T ss_pred cccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecC------------- Confidence 112223333333222211 01122 235899999999988665 5566655421 Q ss_pred ccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecc Q lcl|NC_018838. 216 WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) Q Consensus 216 l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~ 295 (315) -+|+||+.+++||.+ .++||||++|+|++|++++++++++. +|.+|++.||+..|+|++++|+ T Consensus 295 p~g~~vv~~~~~p~~---------~i~fGDfs~Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dG~~~~~ 357 (381) T protein:vir:10 295 PFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred CCCceeEEcCCCCcC---------cEEEEEcccEEEEEecccEEEeechh--------hhhcCceEEEEEEEEcCEEecC Confidence 157889999999853 27899999999999999999999984 6999999999999999999999 Q ss_pred cceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 296 DSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 296 ~af~~l~~~~a~~~~~~~~~ 315 (315) +||++++.+... +||+-. T Consensus 358 ~A~~v~~l~~~~--~~~~~~ 375 (381) T protein:vir:10 358 KVAAVWKLDLKG--HKPALE 375 (381) T ss_pred CcEEEEEEeecC--Cccccc Confidence 999998765443 444443 No 85 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.8e-52 Score=301.09 Aligned_cols=280 Identities=12% Similarity=0.041 Sum_probs=217.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeeccccc-CCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVK-PSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~-~~s~~~~~~v~l~~ 79 (315) |..+++++||++||++++++|++.+++.|+|+++|++++++ +..++|+.++.+.+.|++|.+++ ++++++|++++|.+ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 164 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAG-IKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQ 164 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccCccccccceeeeece Confidence 88899999999999999999999999999999999999997 56899999999999999987665 56799999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc----cccccc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT----VDATDS 155 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~----~~~~~~ 155 (315) ||++++++||+|||+++..+ |+++|++++++++++++|.++++|+|.+. ..|.|+.+.....+.. ...... T Consensus 165 ~kl~~~~~iS~ell~ds~~~----ie~~i~~~la~~ia~~~~~a~i~G~G~~~-~qP~Gil~~~~~~~~~~~~~~~~~~~ 239 (395) T protein:vir:95 165 YKLTCFVVLPDDLSTFGPAW----IERFVRTQIQEAISVALESAIINGGGAAK-TQPVGLMKDVNTNSGAVTDKASSGTL 239 (395) T ss_pred eeEEEeecccHHHHhcchhH----HHHHHHHHHHHHHHHHHhhheeeccCCCC-cCceeeeecccccccccccccccchh Confidence 99999999999999766544 88999999999999999999999998321 3467776543322211 112223 Q ss_pred hhHHHHHHHHHhhhc-------------ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccc--ee Q lcl|NC_018838. 156 ATTDLVKAVGLIAGA-------------GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRG--LN 220 (315) Q Consensus 156 ~~~di~~~~~~~~~~-------------~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G--~P 220 (315) ++.++..++..+... .......|+||+.++. +.+|+++|.. ..|++.+++| +| T Consensus 240 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~------~~G~~~~~lg~g~~ 307 (395) T protein:vir:95 240 TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLT------ANGGFVTVLPYNVT 307 (395) T ss_pred hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceecc------CCCcceeccCCcce Confidence 344444443333211 1222345999998765 3456666542 3567778864 45 Q ss_pred eEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEE Q lcl|NC_018838. 221 VGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAV 300 (315) Q Consensus 221 v~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~ 300 (315) |+.++.||.+ .++||||++|++++|++++++++++. +|.+|++.||+.+|+|+++++++||++ T Consensus 308 v~~~~~~p~~---------~i~fgdfs~y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dg~~~~~~A~~~ 370 (395) T protein:vir:95 308 IITSEFVPEG---------KLVAFVTDRYNAVRGGGLTVKKFDQT--------LALEDAVLFTAKTFAYGQPDDNKASAV 370 (395) T ss_pred EEEcCCCCCC---------cEEEEecccEEEEEecceEEEeccch--------hhhCCcEEEEEEEEECCEEeccccEEE Confidence 7889999853 27889999999999999999999874 599999999999999999999999999 Q ss_pred EeeccC--CCCCCCCCC Q lcl|NC_018838. 301 VKEKAA--PKPNPPAGN 315 (315) Q Consensus 301 l~~~~a--~~~~~~~~~ 315 (315) |+...+ +..+..++. T Consensus 371 l~i~~~~~~~~~~~~~~ 387 (395) T protein:vir:95 371 YDLKVASAPRRQTSAGG 387 (395) T ss_pred EEeeccCCCCCCCCCCC Confidence 876433 333333443 No 86 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=4.9e-52 Score=301.88 Aligned_cols=296 Identities=13% Similarity=0.082 Sum_probs=227.1 Q ss_pred CCCCccCCCceEcchh-HHHHHHHHHHhccchhhhcceeecCC--CceEEEEEeCCc-eeEEeecccc-----cCCCccc Q lcl|NC_018838. 1 MADDFLSAGKLELPGS-MIGAVRDRAIDSGVLAKLSPEQPTIF--GPVKGAVFSGVP-RAKIVGEGEV-----KPSASVD 71 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~-~~~~ii~~~~~~s~i~~l~~~~~~~~--~~~~ip~~~~~~-~a~wv~Eg~~-----~~~s~~~ 71 (315) ...++.+.||++||++ +.++|++.+++.++|++++++++++. ++++||+...++ .++|++|++. +|+++++ T Consensus 156 ~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 156 DLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLT 235 (477) T ss_pred cccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccc Confidence 1234456678888777 57889999999999999999998764 568999976654 5789999864 5788899 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD 151 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~ 151 (315) |+++++.+||++++++||+|||+++..+ |+++|.++|++++++++|.++++|+| ++..|.|+.+.......... T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d~~~l~G~G--t~~~p~Gi~~~~~~~~~~~~ 309 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAAVS----VDEFVFRDLAADYANKLNVQVISGTG--SNNQVVGVRATAGITQVTAT 309 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccchh----HHHHHHHHHHHHHHHHHHHHHhccCC--CCCccceeeecccccccccc Confidence 9999999999999999999999776544 78999999999999999999999987 44467777765333221211 Q ss_pred ccc-------chhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccc---------cccccCCCcc Q lcl|NC_018838. 152 ATD-------SATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY---------PAAGFAGLDN 215 (315) Q Consensus 152 ~~~-------~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~---------~~~~~~~~~~ 215 (315) .+. ..+.++.+++..+..++....++|+|||+++..|++++|++|+|+|.+... ..+..+.+++ T Consensus 310 ~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (477) T protein:vir:84 310 SAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQ 389 (477) T ss_pred ccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccch Confidence 111 123445555555555554455689999999999999999999999864311 1133445679 Q ss_pred ccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEecc-Eeec Q lcl|NC_018838. 216 WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV-AIES 294 (315) Q Consensus 216 l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~-~v~~ 294 (315) |+|+||++++.||.+.+...+ ...++||||++++++. .+++++++++.. +.++++.||...++++ .++| T Consensus 390 l~G~pVv~s~~~p~~~~~~~d-~~~i~~gd~~~~~i~~-~~~~~~~~~~~~--------~~~~~~~~~v~~~~~~~~~r~ 459 (477) T protein:vir:84 390 MHGLPVVTDPTLPTTLGTGTD-QDVIHVLRASDLALFE-SSVRMRALQETR--------AENLSVLLQVYGYLAFTAARF 459 (477) T ss_pred hcccceEecCcccccccccCC-cceEEEEEeceEEEEe-eceeEEeccccc--------cccceeeeeehhhhhhhhhcc Confidence 999999999999987554433 3467889999999986 578898888753 4567788888888887 4456 Q ss_pred ccceEEEeeccCCCCCCC Q lcl|NC_018838. 295 LDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 295 ~~af~~l~~~~a~~~~~~ 312 (315) |+||++++..+..+||=. T Consensus 460 ~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 460 PQSVVEIGGTALTAPTFA 477 (477) T ss_pred ccceEEeecccccccccC Confidence 999999999988888877 No 87 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.3e-51 Score=299.58 Aligned_cols=276 Identities=13% Similarity=0.043 Sum_probs=226.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCce--eEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPR--AKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~--a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) .+-.+.++||++||++++.+|++.+++.++|+++|++++|+++..++|+...... ++|++|++.+++++++|+++++. T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~ 193 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYD 193 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEee Confidence 3335557799999999999999999999999999999999999999999877654 57799999999999999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|+|+++.. +|+++|.+++++++..++|.++++. +.|+.+. .+...|+ T Consensus 194 ~~k~~~~v~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~~i~~~--------~~g~~~~---------~~~~~~d 252 (421) T protein:vir:13 194 IDDYGLLAPIDNSLLEDSEI----NFLEFVNEEFAEFAVNTENAEIVKQ--------AKAVLAE---------ETINDYA 252 (421) T ss_pred eeeeEeehhhhHHHHhhhHH----HHHHHHHHHHHHHHHHHhhhhHhhh--------hhhcccc---------ccccchH Confidence 99999999999999977654 4788999999999999998876632 2233221 1223478 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+..++ ..+++|+||+.++..|++++|++|+|++. ++..+++++|+|+||++++++|... +.. T Consensus 253 ~i~~~~~~l~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~-----~~~~~~~~tl~G~pV~~~~~~~~~~----~~~ 322 (421) T protein:vir:13 253 GLVKTINSLVPNA-RKRAIIVTNSDGRAYLDGLMDKQGRPLLK-----ELSDGGDLVFKGRPVIELEESIFDV----GDE 322 (421) T ss_pred HHHHHHHHhhhhh-cCCCEEEEcHHHHHHHHHhhcCCCceeec-----CcCCCCCceecceeeEEeccccccC----CCc Confidence 9999999886554 44568999999999999999999988763 4556778899999999999988432 235 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec---------cCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK---------AAPK 308 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~---------~a~~ 308 (315) ..+++|||++ |+++++++++++++++. .|++|++.||++.|+|+++.+++||+.++.. -+|+ T Consensus 323 ~~~~~gd~~~~~~~~~~~~~~v~~~~~~--------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~ 394 (421) T protein:vir:13 323 TKFIVSDFKTLIKFMDRKQYLIDQSKEA--------GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLK 394 (421) T ss_pred eEEEEEeccccEEEEEecceEEEeeccc--------ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccC Confidence 6789999986 78999999999999874 5999999999999999999999998766533 2344 Q ss_pred CCCCCCC Q lcl|NC_018838. 309 PNPPAGN 315 (315) Q Consensus 309 ~~~~~~~ 315 (315) ++++.+. T Consensus 395 ~~~~~~~ 401 (421) T protein:vir:13 395 SSPRSGK 401 (421) T ss_pred CCCcCCC Confidence 4444444 No 88 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2e-51 Score=298.55 Aligned_cols=264 Identities=14% Similarity=-0.003 Sum_probs=217.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l~ 78 (315) ....+.++||++||++++..|++.+++.++++++|++++++++...+|+... +..++|++|++.+|+ ++++|++|++. T Consensus 128 ~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~ 207 (394) T protein:vir:97 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWN 207 (394) T ss_pred ccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEee Confidence 3344556799999999999999999999999999999999999999999764 567899999999997 56999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|||+++..+ |+++|.+++++++++++|.++++|.+..+. .+...++ T Consensus 208 ~~k~~~~i~is~ell~ds~~~----~~~~i~~~la~~~~~~~~~~i~~g~~~~~~------------------~~~~~~~ 265 (394) T protein:vir:97 208 IDTYRGAIPLSQESIDDADVD----LVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------KTVKNLD 265 (394) T ss_pred hhheeeehhhHHHHHhhhhHH----HHHHHHHHHHHHHHHHHHHHHhhccccccc------------------cccccHH Confidence 999999999999999776544 788899999999999999999988752211 1122367 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++....+.. ....|+|||+++.+|++++|++|+|+|.+ ++..+.+++|+|+||++++.+.. .. T Consensus 266 ~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~~i~~~----~~~~~~~~~l~G~pv~~~~~~~~-------~~ 332 (394) T protein:vir:97 266 EIKALLNGGFDPA--YNVSLIVSQSFYQTLDTLKDGNGRYLLQD----DITAVSGKVLLGKPVFVLSDEVL-------GA 332 (394) T ss_pred HHHHHHHhhhhhh--hCCEEEEcHHHHHHHHHhhccCCCeeeec----CcCCCCCceeccceeEEeccccc-------CC Confidence 7888777654432 24579999999999999999999988643 55667778999999998664321 12 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..+++|||++ |.++.+++++++.+++. ++...+|+++|+|+++.+|+||++|+..+++.|- T Consensus 333 ~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 333 NKAFIGDFKRGVLFADRKDLGLRWADNE-----------IYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred ccEEEeeccccEEEEEecceEEEEeccc-----------ccceeEEEEEEEccEEecccceEEEEecccccCC Confidence 3478999986 77889999999987652 3346789999999999999999999998877777 No 89 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=6.8e-51 Score=295.60 Aligned_cols=264 Identities=13% Similarity=0.028 Sum_probs=218.0 Q ss_pred CCCC-ccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEE Q lcl|NC_018838. 1 MADD-FLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTA 77 (315) Q Consensus 1 m~~~-~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l 77 (315) +..+ +.++||++||++++.+|++.+++.++|++++++++++++..++|+... .+.+.|++|++..++ ++++|+++++ T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~ 212 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNW 212 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEe Confidence 4443 567789999999999999999999999999999999999999999874 467899999999986 6899999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchh Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSAT 157 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) .+||++++++||+|||+++.. .|+++|.++++++++.++|.++++|+|..+. . +...+ T Consensus 213 ~~~k~~~~~~is~ell~ds~~----~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~---~---------------~~~~~ 270 (400) T protein:vir:38 213 SVETYRQALPVSQESIDDSAI----DLVGLIAQNGQQIKVNTTNGAVATLLKGFTA---K---------------TISSV 270 (400) T ss_pred ehhheeeehhhHHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhhhhccccccc---c---------------ccccH Confidence 999999999999999976654 4788999999999999999999999863211 1 11236 Q ss_pred HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 158 TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 158 ~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) +++.+++....... .++.|+|||+++.+|++++|++|+|+|. |++..+.+++|+|+||++++++|.. ..+ T Consensus 271 ~~~~~~~~~~~~~~--~~a~~v~~~~~~~~l~~lkd~~G~~i~~----~~~~~~~~~~l~G~pv~~~~~~~~~----~~g 340 (400) T protein:vir:38 271 DDLKHINNVDLDPA--YSRVIIASQSFYNFLDTVKDGNGRYLLQ----DSILTPSGKSVLGMPIAVVSDDTLG----AAG 340 (400) T ss_pred HHHHHHHHhhhhhh--hCcEEEEcHHHHHHHHHhhccCCCeeee----cCcCCCCccccccceeEEecccccC----CCC Confidence 67777766544332 3467999999999999999999998763 4567777889999999999998742 233 Q ss_pred cceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 238 ~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) +..+++|||++ |++..+++++++++++.+ | ...+|+.+|+|+++.+|+||++|+.+++. T Consensus 341 ~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~--------~---~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 341 EAHAFLGDIKRAILFANRADFMVRWVDDQI--------Y---GQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ceEEEEEeccccEEEEeecceEEEEecccc--------c---ceeEEEEEEeccEEecccceEEEEeecCC Confidence 55789999996 677789999999987632 3 35789999999999999999999986655 No 90 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=2.3e-51 Score=298.17 Aligned_cols=268 Identities=12% Similarity=0.033 Sum_probs=212.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEe-CCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFS-GVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~-~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |.+++.++||++||++++++|++.++++++|+++++++++++ .++|+.. ....++|++|++.+++++++|+++++.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 195 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTT 195 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeech Confidence 889999999999999999999999999999999999988763 5678765 4578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh-hhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI-AFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a-~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) ||++++++||+|||+++.. +|+++|.++++++++++++.. +..|+| ++.+ .++.... ....+ .+...++ T Consensus 196 ~k~~~~i~iS~ell~ds~~----~l~~~i~~~la~~~~~~e~~~~~~~g~g--~g~~-~g~~~~~--~~~~~-~~~~~~d 265 (387) T protein:vir:96 196 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKDALAVSPK--SGLE-HMSFYNG--SVKEV-EGADMYD 265 (387) T ss_pred heeeeechhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhHhhcCCC--cccc-ceeeecc--ccccc-cccchHH Confidence 9999999999999976544 478889999999999887654 333433 4333 3333221 11222 2344588 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+. .+..|+||+.++..+.++++..|++++ .+.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~---------~~~~~~llG~PV~~~~~~~---------- 325 (387) T protein:vir:96 266 AIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF---------DTPAEKVFGKPVVFTDAAV---------- 325 (387) T ss_pred HHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc---------ccCCccccccceEEecCCC---------- Confidence 99999988866543 455799999998887777776666543 2456789999999988654 Q ss_pred ceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||++++++ +.++.++.+++ ..+|++.||+..|+|+++++|+||++++.+++..|+|- T Consensus 326 -~~~~GDf~~~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 326 -KPIVGDFNYFGIN-YDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred -ceeeechhhhhhh-hhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3678999988765 45666655554 23688999999999999999999999999999999988 No 91 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=2.3e-51 Score=298.17 Aligned_cols=268 Identities=12% Similarity=0.033 Sum_probs=212.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEe-CCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFS-GVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~-~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |.+++.++||++||++++++|++.++++++|+++++++++++ .++|+.. ....++|++|++.+++++++|+++++.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 195 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTT 195 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeech Confidence 889999999999999999999999999999999999988763 5678765 4578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh-hhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI-AFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a-~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) ||++++++||+|||+++.. +|+++|.++++++++++++.. +..|+| ++.+ .++.... ....+ .+...++ T Consensus 196 ~k~~~~i~iS~ell~ds~~----~l~~~i~~~la~~~~~~e~~~~~~~g~g--~g~~-~g~~~~~--~~~~~-~~~~~~d 265 (387) T protein:vir:94 196 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKDALAVSPK--SGLE-HMSFYNG--SVKEV-EGADMYD 265 (387) T ss_pred heeeeechhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhHhhcCCC--cccc-ceeeecc--ccccc-cccchHH Confidence 9999999999999976544 478889999999999887654 333433 4333 3333221 11222 2344588 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+. .+..|+||+.++..+.++++..|++++ .+.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~---------~~~~~~llG~PV~~~~~~~---------- 325 (387) T protein:vir:94 266 AIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF---------DTPAEKVFGKPVVFTDAAV---------- 325 (387) T ss_pred HHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc---------ccCCccccccceEEecCCC---------- Confidence 99999988866543 455799999998887777776666543 2456789999999988654 Q ss_pred ceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||++++++ +.++.++.+++ ..+|++.||+..|+|+++++|+||++++.+++..|+|- T Consensus 326 -~~~~GDf~~~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 326 -KPIVGDFNYFGIN-YDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred -ceeeechhhhhhh-hhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3678999988765 45666655554 23688999999999999999999999999999999988 No 92 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=2.3e-51 Score=298.17 Aligned_cols=268 Identities=12% Similarity=0.033 Sum_probs=212.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEe-CCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFS-GVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~-~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |.+++.++||++||++++++|++.++++++|+++++++++++ .++|+.. ....++|++|++.+++++++|+++++.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 195 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTT 195 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeech Confidence 889999999999999999999999999999999999988763 5678765 4578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh-hhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI-AFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a-~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) ||++++++||+|||+++.. +|+++|.++++++++++++.. +..|+| ++.+ .++.... ....+ .+...++ T Consensus 196 ~k~~~~i~iS~ell~ds~~----~l~~~i~~~la~~~~~~e~~~~~~~g~g--~g~~-~g~~~~~--~~~~~-~~~~~~d 265 (387) T protein:vir:26 196 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKDALAVSPK--SGLE-HMSFYNG--SVKEV-EGADMYD 265 (387) T ss_pred heeeeechhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhHhhcCCC--cccc-ceeeecc--ccccc-cccchHH Confidence 9999999999999976544 478889999999999887654 333433 4333 3333221 11222 2344588 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+. .+..|+||+.++..+.++++..|++++ .+.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~---------~~~~~~llG~PV~~~~~~~---------- 325 (387) T protein:vir:26 266 AIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF---------DTPAEKVFGKPVVFTDAAV---------- 325 (387) T ss_pred HHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc---------ccCCccccccceEEecCCC---------- Confidence 99999988866543 455799999998887777776666543 2456789999999988654 Q ss_pred ceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||++++++ +.++.++.+++ ..+|++.||+..|+|+++++|+||++++.+++..|+|- T Consensus 326 -~~~~GDf~~~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 326 -KPIVGDFNYFGIN-YDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred -ceeeechhhhhhh-hhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3678999988765 45666655554 23688999999999999999999999999999999988 No 93 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=5.3e-51 Score=296.19 Aligned_cols=267 Identities=12% Similarity=0.023 Sum_probs=208.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEe-CCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFS-GVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~-~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..++.++||++||+++.++|++.++++++|+++|+++++++ ..+|+.. +...++|++|++..++++++|+++++.+ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 195 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTT 195 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccccccccccceeeeeh Confidence 899999999999999999999999999999999999998763 5678765 4578999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhh-hcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIA-FHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~-~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) ||++++++||+|||+++.. +|+++|.++++++++++++..+ ..|+| ++.+ .++..... ...+ .+...|+ T Consensus 196 ~k~~~~~~iS~ell~Ds~~----~l~~~i~~~la~~~~~~e~~~~~~~g~g--~g~p-~g~l~~~~--~~~v-~~~~~~d 265 (387) T protein:vir:93 196 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKDALAVSPK--SGLD-HMSFYNGS--VKEV-EGADMYD 265 (387) T ss_pred eeeeeechhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhHhhcCCC--cccc-ceeeeccc--cccc-cccchHH Confidence 9999999999999966544 4788899999999998876643 33443 4333 33332211 1222 2344578 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHH-HHhhccCccccccccccccccCCCccccceeeEeecccCccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALS-TEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~-~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 237 (315) ++++++..+...+. .+..|+||+.++..+. .+++.+| +++ .+.+++|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~~~-~~a~~~mn~~t~~~~~~~~~d~~~-~~~---------~~~~~~llG~PV~~~~~~~--------- 325 (387) T protein:vir:93 266 AIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTT-NFF---------DTPAEKVFGKPVVFTDAAV--------- 325 (387) T ss_pred HHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCC-ccc---------ccCCccccccceEEecCCC--------- Confidence 89999988866543 4557999999987765 5555443 322 2455789999999988654 Q ss_pred cceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 238 GVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 238 ~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||++++++ +.++.++.+.+ +.++++.|++..|+|+++++|+||+.++.+++..|+|- T Consensus 326 --~~~~GDf~~~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 326 --KPIVGDFNYFGIN-YDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred --ceeeeehhhhhee-hhhheeeeccc----------ccCCceeEEEEeeeCceeechhheEEEEeecCCCCCCC Confidence 3678999998775 55666665544 45788999999999999999999999998888888887 No 94 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.3e-50 Score=294.14 Aligned_cols=278 Identities=12% Similarity=0.004 Sum_probs=217.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l~ 78 (315) ++..+.+.||++||+++.+.| ..++..+.++.++++++++++..++|+... .+.++|++|++.+++ ++++|+++++. T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~ 234 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPE-KEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWD 234 (437) T ss_pred hhhcccccccccchHHHHHHH-HHhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeee Confidence 677788899999999998865 556889999999999999999999998854 468999999999996 56999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||++++++||+|+|+++.. +|+++|.++++++++.++|.++++|+|.+. +. ..+...++ T Consensus 235 ~~k~~~~~~is~ell~ds~~----~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~--~~--------------~~~~~~~~ 294 (437) T protein:vir:10 235 LKTYTGGYVFSQELISDSSY----DWQAELQSRLIELRDNTDDSLIITALTDGI--KK--------------TTSTYLLG 294 (437) T ss_pred hhheeeehhhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cc--------------cccccchh Confidence 99999999999999977654 478889999999999999999999986321 10 01122345 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++.......+..+..|+||++++..|++++|++|+|+|. |++..+.+++|+|+||++++++.-. .+...+ T Consensus 295 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~----~~~~~~~~~~l~G~pv~~~~~~~~~--~~~~~~ 368 (437) T protein:vir:10 295 DLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQ----PNVTAATGYTLLGKTVVIVDDKLFP--SASAGD 368 (437) T ss_pred hHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeec----cCccCCCCcccccceeEEecccccC--CcCCCc Confidence 6666665322333334457999999999999999999998763 4566777889999999998776311 122345 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC-CCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK-PNPPAGN 315 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~-~~~~~~~ 315 (315) ..++||||++ |.+.++++++++.++. |..+...+|+..|+||+++||+||++|+...... .++|+.- T Consensus 369 ~~~~~gd~~~~~~~~~r~~~~~~~~~~----------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 369 VNIVVAPLKKAVINFKLTEITGQFQDT----------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred eEEEEeeccccEEEEeeeceEEEEecc----------cccccceeeEEEEEccEEecccceEEEEeeccccccCCCCCC Confidence 6789999986 6688899999987764 4455578899999999999999999999664333 3344444 No 95 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=6.5e-51 Score=295.72 Aligned_cols=268 Identities=12% Similarity=0.040 Sum_probs=209.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEe-CCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFS-GVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~-~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) |..+++++||++||++++.+|++.++++++|+++|++++++ +..+|+.. ....+.|++|++.+++++++|+++++.+ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 210 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTT 210 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecC--CceeeeeeccCCccccccccccccccccccceeeecc Confidence 88999999999999999999999999999999999998876 36678765 4568999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh-hhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI-AFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a-~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) ||++++++||+|||.++.. +|+++|.++|+++++++++.. +..|+| ++. +.++..... ...+ .+...++ T Consensus 211 ~k~~~~i~iS~ell~Ds~~----~l~~~i~~~la~~~~~~e~~~~~~~g~g--~g~-p~g~~~~~~--~~~~-~~~~~~d 280 (402) T protein:vir:93 211 NKFKVFAAISDTVIHGSDV----DLVNWVENALQSGLAAKERKDALAVSPK--SGL-EHMSFYNGS--VKEV-EGADMYD 280 (402) T ss_pred eeeeeechhhHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHhHhhcCCC--ccc-cceeeeccc--cccc-cccchHH Confidence 9999999999999976544 478889999999999987654 444443 333 233333211 1112 2344578 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++..+...+ ..+..|+||+.++..++++++..|++++ .+.+++|+|+||++++.++ T Consensus 281 ~l~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~d~~~~~~---------~~~~~~llG~PV~~t~~~~---------- 340 (402) T protein:vir:93 281 AIINALADLHEDY-RDNATIYMRYADYVKIISVLSNGTTNFF---------DTPAEKVFGKPVVFTDAAV---------- 340 (402) T ss_pred HHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcCCCccc---------ccCCccccccceEEecCCC---------- Confidence 8999998886553 3355799999998887767666665543 2456799999999988654 Q ss_pred ceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 239 ~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) .++||||+++++. +.++.++...+. .+|++.||+..|+|++|++|+||+.|+.+++..|||- T Consensus 341 -~i~~GDf~~~~~~-~~~~~~~~~~~~----------~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 341 -KPIVGDFNYFGIN-YDGTTYDTDKDV----------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred -ceeeechhhhhhh-hhhhhhhhhhcc----------cCCceEEEEEEEeCcEEechhheEEEEeecCCCCCCC Confidence 2678999987665 445555544432 2589999999999999999999999998888888887 No 96 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=7.5e-50 Score=289.90 Aligned_cols=269 Identities=13% Similarity=0.014 Sum_probs=210.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccC-CCccceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~~~~v~l~~ 79 (315) +..++.++||++||+++.++|++.+++.|+++++|++++++ +..++|+.++++.++|++|+++++ +++++|++++|.+ T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~ 157 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceeEeecccccccccCccceeEeeee Confidence 77888899999999999999999999999999999999986 568999999999999999998876 5689999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc--------- Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV--------- 150 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~--------- 150 (315) ||++++++||+|||+++..+ |+++|++++++++++++|.++++|+|. + .|.|+.+......... T Consensus 158 ~kl~~~~~is~~ll~ds~~~----le~~i~~~l~~~~~~~~~~a~i~G~G~--~-~P~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) T protein:vir:96 158 FKLTAFVVIPKDALKFGPKW----LKQFITEQLKEAIAVALELAIVKGNGL--L-QPVGLLKDLSQPTVDQSTGRDITTY 230 (377) T ss_pred eeEEeechhhHHHhhcchhh----HHHHHHHHHHHHHHHHHhhceEeccCC--C-cceeeeeccccccccccccccccce Confidence 99999999999999776654 889999999999999999999999983 3 4667765432211100 Q ss_pred --------ccccchhHHHHHHHHHhhhcc----------cccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 151 --------DATDSATTDLVKAVGLIAGAG----------LQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 151 --------~~~~~~~~di~~~~~~~~~~~----------~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) .......+.+.+++..+.... ...+..|+||+.++..+ .+++.|. + ..|+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~------~~~~~~~-----~-~~G~ 298 (377) T protein:vir:96 231 KTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL------EAKFTSR-----N-QFGE 298 (377) T ss_pred eeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc------ccccccc-----C-CCCC Confidence 001112233444444332211 11123599999987755 2333332 1 3466 Q ss_pred Cccccceee--EeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEecc Q lcl|NC_018838. 213 LDNWRGLNV--GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV 290 (315) Q Consensus 213 ~~~l~G~Pv--~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~ 290 (315) +.+++|+|+ +.++.||.. .++||||++|.++++++++++.+++. +|.+|++.||+.+|+|+ T Consensus 299 ~~~~l~~p~~v~~s~~~p~~---------~i~fgdf~~Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dG 361 (377) T protein:vir:96 299 YVTVLPHGITILESLAVETG---------KAIAFVANRYDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYG 361 (377) T ss_pred ceeccCCCceEEecCCCCcc---------cEEEEEcCcEEEEEecccEEEeehhh--------hhhcCCeEEEEEEEEcC Confidence 778898885 456777742 37899999999999999999999874 69999999999999999 Q ss_pred EeecccceEEEeeccC Q lcl|NC_018838. 291 AIESLDSFAVVKEKAA 306 (315) Q Consensus 291 ~v~~~~af~~l~~~~a 306 (315) ++++++||++|+.+-. T Consensus 362 ~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 362 KAKDNHTAALLTLAGG 377 (377) T ss_pred EEecCCcEEEEEEecC Confidence 9999999999987766 No 97 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2.8e-49 Score=286.77 Aligned_cols=282 Identities=16% Similarity=0.096 Sum_probs=213.2 Q ss_pred CCCCcc-CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEee Q lcl|NC_018838. 1 MADDFL-SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~-s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~ 79 (315) +.+..+ ++|+++||+++++.|++.+++.+++++++++++++ +..++|+....+.+.|++|++.+++++++|+++++.+ T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~-g~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~ 226 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLK-GTARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDG 226 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecC-ceeEeeeecCCcceeecccccccccccccccceeecc Confidence 333333 45568999999999999999999999999999987 5689999988899999999999999999999999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc------- Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA------- 152 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~------- 152 (315) ||++++++||+|||.++.. +|+++|..+++++++.++|.++++|+| ++. |.|+.+.....+..... T Consensus 227 ~k~~~~~~iS~ell~ds~~----~l~~~i~~~la~~~~~~~~~ail~G~G--~~~-P~Gil~~~~~~~~~~~~~~~~~~~ 299 (466) T protein:vir:80 227 YKVGGFIPIPNSTLEDSDL----NLADEILDAIGQAIGFALDKAILYGTG--TKM-PVGIVTRLAQTTQPPNWGTKAPAW 299 (466) T ss_pred eeeeeehhhhHHHHhcchH----HHHHHHHHHHHHHHHHHHhhheeeccC--CCC-cceeeecccccccccccccccccc Confidence 9999999999999976654 488999999999999999999999998 333 45766543221111000 Q ss_pred ccchh-----------------HHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhh---ccCccccccccccccccCC Q lcl|NC_018838. 153 TDSAT-----------------TDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVY---PKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 153 ~~~~~-----------------~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d---~~g~~~~~~~~~~~~~~~~ 212 (315) ..... .++...+..+..........|+||+.+...|.++++ .+|.+++. .. . T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~------~~--~ 371 (466) T protein:vir:80 300 TNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVAS------LN--N 371 (466) T ss_pred cccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCcccccc------CC--C Confidence 00111 111112222222222223359999999999988874 34433321 11 1 Q ss_pred CccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEe Q lcl|NC_018838. 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI 292 (315) Q Consensus 213 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v 292 (315) ...|+|+||+++++||.. .+++|||++|++++|++++++++++. .|.+|++.||+.+|+|+++ T Consensus 372 ~~~i~G~pvv~s~~~~~~---------~~~~g~~~~y~i~~r~~~~i~~~~~~--------~f~~d~~~~r~~~r~dg~~ 434 (466) T protein:vir:80 372 TMPIVGGDIVILDFIPDN---------DIIGGYGSLYLLAERADIKLAQSEHV--------RFIEDQTVFKGTARYDGKP 434 (466) T ss_pred cccccccceeecCccCcc---------ceeeeccccEEEEeecceEEEechhh--------hhhcCcEEEEEEEEEccEE Confidence 235999999999999853 27899999999999999999999874 5999999999999999999 Q ss_pred ecccceEEEeec-----cCCCCCCCCCC Q lcl|NC_018838. 293 ESLDSFAVVKEK-----AAPKPNPPAGN 315 (315) Q Consensus 293 ~~~~af~~l~~~-----~a~~~~~~~~~ 315 (315) ++++||++++.+ +++..+|--++ T Consensus 435 ~~~~afv~~~~~~~~~~~~~~~~~~~~~ 462 (466) T protein:vir:80 435 VFGEGFVAVNIANANPTTSITFAPDEAN 462 (466) T ss_pred eccCceEEEEecCCCcccceeeecCcCc Confidence 999999999754 34444444444 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1.9e-48 Score=282.24 Aligned_cols=263 Identities=12% Similarity=0.007 Sum_probs=216.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeC-CceeEEeecccccCC-CccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPS-ASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~-s~~~~~~v~l~ 78 (315) ++..+..+|+++||+++.+.|++ ++..+.++++|++++++++...+|+... +..++|++|++..++ ++++|+++++. T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~ 210 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYS 210 (397) T ss_pred hhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeec Confidence 67777889999999999999997 5788889999999999988888888654 467899999999996 68999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhH Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATT 158 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) +||+++++++|+|+|+++.. +|+++|.+++++++++++|.++++|+|... +. +...|+ T Consensus 211 ~~~~~~~~~~s~ell~ds~~----~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~---~~---------------~~~~~d 268 (397) T protein:vir:96 211 VATRRGYIPISQEMIDDASY----DVTGLIADEIQDQSLNTKNADIAAVLKTAT---AK---------------SVVGVD 268 (397) T ss_pred HhHhhcchhhHHHHHhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccc---cc---------------cccchH Confidence 99999999999999977654 478889999999999999999999986322 11 123477 Q ss_pred HHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccccc Q lcl|NC_018838. 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) Q Consensus 159 di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 238 (315) ++.+++....... .++.|+|||+++..|++++|++|+|++. |++..+.+++|+|+||++++...... ...+ T Consensus 269 ~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~~~~~----~~~~~~~~~~l~G~pv~~~~~~~~~~---~~~~ 339 (397) T protein:vir:96 269 GLKDLINKEIKKV--YDVKLFISASMYSELDKLKDKNGRYLLQ----DSITAASGKQLLGKEVVVLDDDVIGK---SVGN 339 (397) T ss_pred HHHHHHHHhhhhh--cCcEEEEcHHHHHHHHHhhccCCCeEec----cCccCCCcccccccceEEecccccCC---CCCc Confidence 8888877654443 3567999999999999999999998764 45667778899999999876543222 2334 Q ss_pred ceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 239 ~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ..+++|||++ |+++++++++++.+++. .| ...+|+.+|+|++++||+||++|+..+| T Consensus 340 ~~~~~gd~~~~~~~~~~~~~~~~~~~~~--------~~---~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 340 VVGFIGDAKAFASFFDRKQVSVSWVDNN--------IY---GQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred eEEEEeehhcceEeEeecceEEEEeccc--------cc---ceeEEEEEEEccEEecccceEEEEeecC Confidence 5788999996 67899999999988763 23 4578999999999999999999998888 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=3.5e-41 Score=242.41 Aligned_cols=287 Identities=11% Similarity=0.022 Sum_probs=219.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee-cCCCceEEEEEeCC----ceeEEeecccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP-TIFGPVKGAVFSGV----PRAKIVGEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~-~~~~~~~ip~~~~~----~~a~wv~Eg~~~~~s~~~~~~v 75 (315) |.. +..+|||++|+++ +++++.+++.|++++++++++ +++....||+...+ +.+.|.+|.++.++++++|+++ T Consensus 14 it~-~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~ 91 (314) T protein:vir:41 14 IDV-PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTN 91 (314) T ss_pred ccc-ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCccCCcccccccce Confidence 654 4567999999887 579999999999999999985 57778899987543 3356778888899999999999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-----cccccccccccccccccc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT-----GKPAAAVKVSLDKTTKTV 150 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~-----~~~~~~~~~~~~~~~~~~ 150 (315) +|.+||+...++||+|+|+++.. ..+|+++|...|++++++.++.++++|+|... ...+.|+...+....... T Consensus 92 ~l~~~kl~~~v~is~e~L~D~a~--~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~ 169 (314) T protein:vir:41 92 TLEMKELVTKVVLEDEALEDNIE--QSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDA 169 (314) T ss_pred eeeeEEEEEeecccHHHHHhhhc--hhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeec Confidence 99999999999999999977642 24699999999999999999999999987432 124566665433222211 Q ss_pred cc--ccchhHHHHHHHHHhhhccccc--ceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 151 DA--TDSATTDLVKAVGLIAGAGLQV--PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 151 ~~--~~~~~~di~~~~~~~~~~~~~~--~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) +. .....+.+.+++.++...++.. ..+|+||+.+..+++++++.+++++|.+. ...+++.+|+|+||+.++. T Consensus 170 ~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~----~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 170 EPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSA----LIGATGLQYDGIPIQYVPA 245 (314) T ss_pred CccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchh----hhCCCCceecceeeEeccc Confidence 11 1233455778888886654322 34699999999999999999999887543 4567778999999999998 Q ss_pred cCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ||... ..+..++||||++++++++..++++..++ -+++++.+.+.+|+|+.+...+|.++....-+ T Consensus 246 ~~~~~----~~~~~i~fgd~~nlv~~~~~~ir~~~~~~----------a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 246 LDALG----DDKARALLTVPTNLVYGFWRNIRIEPKRD----------AAMRRTEYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred ccccC----CCCceEEEechhheEEEeeceeEEeeccc----------CcCCeEEEEEEEEeceEEEEcCcEEEEEeecc Confidence 87532 23467889999999999999888776665 35788999999999999988877766542221 Q ss_pred CCC Q lcl|NC_018838. 307 PKP 309 (315) Q Consensus 307 ~~~ 309 (315) ..- T Consensus 312 ~~~ 314 (314) T protein:vir:41 312 SGG 314 (314) T ss_pred CCC Confidence 111 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=1.3e-40 Score=239.31 Aligned_cols=281 Identities=13% Similarity=0.026 Sum_probs=211.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee-cCCCceEEEEEeCC----ceeEEeecccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP-TIFGPVKGAVFSGV----PRAKIVGEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~-~~~~~~~ip~~~~~----~~a~wv~Eg~~~~~s~~~~~~v 75 (315) |. .+..+||+++|++. +++|+.+++.|+++++|++++ +.+....+++..-+ ....|.+|.++.++++++|+++ T Consensus 19 ~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~ 96 (315) T protein:vir:41 19 ID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTN 96 (315) T ss_pred cC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcCCCCCCcccccee Confidence 54 45567889888776 569999999999999999864 55544555543211 2356889999999999999999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccc---cccccccccccccccc--cc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT---GKPAAAVKVSLDKTTK--TV 150 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~---~~~~~~~~~~~~~~~~--~~ 150 (315) ++.+||+.+.+.||+|+|.++.. ..+++++|...+++++++.++.++++|++... .+.+.|+...+..... .. T Consensus 97 ~l~~~~l~~~~~it~elL~D~~~--~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~ 174 (315) T protein:vir:41 97 TLYMREMVTKVVIHEDAIEDNIE--GKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESDV 174 (315) T ss_pred eeceeeeeeeccccHHHHHhhhc--cccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceeccccccccccc Confidence 99999999999999999976542 23589999999999999999999999986321 1344566553322211 11 Q ss_pred --ccccchhHHHHHHHHHhhhccccc--ceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 151 --DATDSATTDLVKAVGLIAGAGLQV--PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 151 --~~~~~~~~di~~~~~~~~~~~~~~--~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) ++.....+.+.+++..+...++.. +.+|+||+++..++|++++.+|+++|.+ .+..+++.+|+|+||+.++. T Consensus 175 ~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~----~~~~g~~~tl~G~PV~~~~~ 250 (315) T protein:vir:41 175 DPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQ----ALTGANSILYDGRPVQYVPA 250 (315) T ss_pred ccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccc----hhhcCCCceecccceEeccc Confidence 122233456778888886654322 3469999999999999999999998754 45678889999999999999 Q ss_pred cCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccc--eEEEee Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDS--FAVVKE 303 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a--f~~l~~ 303 (315) ||.... .+..++||||++++++++++++++..+++ ..+.+.|.+.+|+|+.+...++ .+.+|. T Consensus 251 m~~~~~----~~~~ilf~d~~nl~~~~~~~i~i~~~~~a----------~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 251 LEALND----GKSRALFVVPTQLVYGFWRNIKVVPDYDA----------EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ccccCC----CCccEEEecccceEEEeccccEEEeeecC----------CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 876432 23458899999999999999999888764 3456788889999998776665 333433 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=2.7e-37 Score=221.09 Aligned_cols=295 Identities=11% Similarity=0.025 Sum_probs=217.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeec-c-cccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGE-G-EVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E-g-~~~~~s~~~~~~v~l~ 78 (315) +...+..++|++||+++.++|++.+++.|+++++++++++.+....+|....++.+.|+++ + ...+.++++|+++++. T Consensus 18 ~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 97 (321) T protein:vir:31 18 ALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDIS 97 (321) T ss_pred cccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccccccccccccccceeeeeeee Confidence 4444567788999999999999999999999999999999998899999877777788873 3 3456788999999999 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc---cccccccccccccccc--cccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG---KPAAAVKVSLDKTTKT--VDAT 153 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~---~~~~~~~~~~~~~~~~--~~~~ 153 (315) +||+.+.++||+|+|.++.. ..+++++|.+.+++++++.++.++++|++.... ....|+.+.+...... .... T Consensus 98 ~~k~~~~~~it~e~L~d~a~--~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~~ 175 (321) T protein:vir:31 98 TEKATVAWDLPREVVQENPE--GEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAADD 175 (321) T ss_pred eEEEEeehhccHHHHHhhhc--chhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccccccccc Confidence 99999999999999976532 235899999999999999999999999874221 1123554432222222 1222 Q ss_pred cchhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc Q lcl|NC_018838. 154 DSATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 154 ~~~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) ...++.+.+++..|.+.++..++ +|+||+.+..+++......+.+++. +.+..+.+.+|+|+||+.+++||.. T Consensus 176 ~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~----~~l~~~~~~tl~G~pvv~~~~mP~~-- 249 (321) T protein:vir:31 176 ILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGD----NVIMGEADVNPFSFPIIGSGLWPDD-- 249 (321) T ss_pred ccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCcccc----chhhccccccccceeEEEcCCCCCC-- Confidence 34467788888888765544333 6999999988776544444445543 2345566779999999999999853 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC-CCCCC Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA-PKPNP 311 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a-~~~~~ 311 (315) .++++||+++.+++++++++++..+.... ..+.+.+......++|+.|.+.+|++.+++... .++.. T Consensus 250 -------~il~t~~~nl~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~~~ 317 (321) T protein:vir:31 250 -------KAMFTDPQNLIYALYRDLEIDVLTESDKV-----SERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEHLE 317 (321) T ss_pred -------cEEEeccccEEEEEeeccEEEEeecCccc-----cccceeeEeeeeeecceeEeccccEEEEecCCcchhccc Confidence 37889999999999999999888764321 123344444556679999999999999997543 22222 Q ss_pred CCCC Q lcl|NC_018838. 312 PAGN 315 (315) Q Consensus 312 ~~~~ 315 (315) |.-. T Consensus 318 ~~~~ 321 (321) T protein:vir:31 318 EETS 321 (321) T ss_pred CCCC Confidence 2222 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=9.9e-37 Score=217.98 Aligned_cols=278 Identities=13% Similarity=0.031 Sum_probs=198.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~ 80 (315) +......-|++++|+.+...|+..++..++++.++++.+. ....+|..+....+.|+.||+.+|+++++|+++++.+| T Consensus 239 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i--~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~ 316 (517) T protein:vir:97 239 AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL--PTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQ 316 (517) T ss_pred eecccccccccccchHHHHHHHHhhhhhccceeeeeeccc--cceeeecccccceeeeeecCCcccccccceeeEEeeHh Confidence 1112234478999999999999999999999888876544 34667777777789999999999999999999999999 Q ss_pred EEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 81 kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) ++++++++|+|+|+++..|....|+++|.++|++.++++++.++++|+| ++.+..++............... +.+ T Consensus 317 ~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdG--tg~~~~gi~~~a~~~~~~~~~~~---~~~ 391 (517) T protein:vir:97 317 YVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV--TGVSETQIYPVVGDAWATNVTGT---TNI 391 (517) T ss_pred hhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccC--CCccccccccccccccccccccc---chH Confidence 9999999999999999888888899999999999999999999999987 34444444332211111111111 222 Q ss_pred HHHHHHhhhccc-ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccccccc Q lcl|NC_018838. 161 VKAVGLIAGAGL-QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV 239 (315) Q Consensus 161 ~~~~~~~~~~~~-~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~ 239 (315) .+++..+..... ..+..|+||+.++..|+++||++|+|+|.. ....+.+.+++|.. ..+|... .+. T Consensus 392 ~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~----~~~~~~~~~l~G~~----~~~~~~~---~~~-- 458 (517) T protein:vir:97 392 QELLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV----GVSNQTIATHFGFN----RLVQSVA---VDE-- 458 (517) T ss_pred HHHHHHHHHHhhhccCCEEEECHHHHHHHHHhhcCCCCeeccC----cCCcccccccCCcc----ccccccc---cCc-- Confidence 233333332222 224569999999999999999999998743 23445567888842 2233221 111 Q ss_pred eEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 240 KAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 240 ~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) ..+++++.|.++.+.++++..+-+ +.+|+..|+.++|+++.|+.+++|++....+. -+| T Consensus 459 -~~~~~~~~y~i~~~~g~~~~~~fd----------~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~-----~~~ 517 (517) T protein:vir:97 459 -KTAVSLSGYVTNGSRGMEFEQGTI----------LVENNKEYLFEMPISGSLEYKGTTAYGTYTPP-----VAG 517 (517) T ss_pred -eeEeeccccEEEeecceeeeeeee----------cccCceeEeeeeeeccccccccceEEEEEcCC-----CCC Confidence 123346778887777766432211 45788999999999999999999998765431 122 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=2.1e-35 Score=210.68 Aligned_cols=268 Identities=13% Similarity=0.044 Sum_probs=210.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee----ecCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||+++++.+..++|+.+++.|++.+++.+.+.+++.+. ..++..++||++...+.+.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999998887653 23345699999988889999999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+|+++..+++|+|++.++..| +.+.+.+++++++++++|..++.... +. ... . .+... T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d----~~~~~~~~~~~~~a~~~d~~i~~~~~---~a-----~~~-------~-~~~~t 140 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGD----PVGQAAKQIVEAIDHKVDADVLDALS---KS-----TQT-------V-EATAT 140 (272) T ss_pred EEeeeeeeeeeecHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHhc---cc-----ccc-------c-ccccC Confidence 99999999999999998776554 67788999999999999998875321 10 000 0 12234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++..+.+.+ .....|+|||.++..|++.+.......+.. ....+..|..++|+|+||+++++||.. T Consensus 141 ~d~i~da~~~l~~~~-~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~~------ 212 (272) T protein:vir:30 141 VDGVSKALDIFNDED-DAETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPKG------ 212 (272) T ss_pred HHHHHHHHHHHhccC-CCccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCcc------ Confidence 778888888876544 445679999999999987653332111111 112344566689999999999999843 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) .+|+.+.+.+.++.+++++++..++. .++...++..+|+++++.+|++|+++|.+++-+- T Consensus 213 ---t~~~~~~~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 213 ---TAYMVRKGALRIMLKRNTMVETDRDI----------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ---eEEEEcCCeEEEEecCCceeeecccc----------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 25666778888988999999888764 2456788999999999999999999998877666 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=2.1e-35 Score=210.68 Aligned_cols=268 Identities=13% Similarity=0.044 Sum_probs=210.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee----ecCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||+++++.+..++|+.+++.|++.+++.+.+.+++.+. ..++..++||++...+.+.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999998887653 23345699999988889999999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +.+|+++..+++|+|++.++..| +.+.+.+++++++++++|..++.... +. ... . .+... T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d----~~~~~~~~~~~~~a~~~d~~i~~~~~---~a-----~~~-------~-~~~~t 140 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGD----PVGQAAKQIVEAIDHKVDADVLDALS---KS-----TQT-------V-EATAT 140 (272) T ss_pred EEeeeeeeeeeecHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHhc---cc-----ccc-------c-ccccC Confidence 99999999999999998776554 67788999999999999998875321 10 000 0 12234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++++.+++..+.+.+ .....|+|||.++..|++.+.......+.. ....+..|..++|+|+||+++++||.. T Consensus 141 ~d~i~da~~~l~~~~-~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~~------ 212 (272) T protein:vir:98 141 VDGVSKALDIFNDED-DAETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPKG------ 212 (272) T ss_pred HHHHHHHHHHHhccC-CCccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCcc------ Confidence 778888888876544 445679999999999987653332111111 112344566689999999999999843 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) .+|+.+.+.+.++.+++++++..++. .++...++..+|+++++.+|++|+++|.+++-+- T Consensus 213 ---t~~~~~~~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 213 ---TAYMVRKGALRIMLKRNTMVETDRDI----------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ---eEEEEcCCeEEEEecCCceeeecccc----------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 25666778888988999999888764 2456788999999999999999999998877666 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.95 E-value=3e-31 Score=187.89 Aligned_cols=264 Identities=10% Similarity=-0.057 Sum_probs=165.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCc--cceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSAS--VDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~--~~~~~v~l~ 78 (315) ...+...+++. +|+.+...+.......+++...+... ..+.....|++|+...+++. .++.+.++. T Consensus 211 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 278 (480) T protein:vir:40 211 ADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDKNKSQTATKRSLR 278 (480) T ss_pred ccccccccccc-cccchhhheeechhhhhhhhhcceee-----------eccccceeeeeeeecccccccccccccchhh Confidence 11111122233 33344333333333333333332211 22344567888876665542 234455554 Q ss_pred ---eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 79 ---PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 79 ---~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) .++++.+.++|.++| +|.. +|+++|.++|++.++++++.+|++|+|.+. ..+.++.......+ .... T Consensus 279 ~~~v~~l~~~~k~t~~lL----DDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~-~~~~g~~~~~~~~~----~~~~ 348 (480) T protein:vir:40 279 PQMAEAYLQMDKATVRGV----NDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGS-NGFYGLKTATDGWT----KQIE 348 (480) T ss_pred HHHHHHHHHhHHHHHHHh----hhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCc-cccccceeeccccc----ccch Confidence 478888899999998 3333 699999999999999999999999965332 23444443322211 1122 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc-cCcccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST-VSGAPEMS 234 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~-v~~~~~~~ 234 (315) ..+.+..++.++.+.+....+.|+||+.++.+|++|||++|+|+|+ |.++.+++.+|+|+||++++. +|..- T Consensus 349 ~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q----~~~~~~~~~~llG~pvv~~~~~~~~~~--- 421 (480) T protein:vir:40 349 YTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFN----ELATKEQIAQSFGAVNLETRVWMPKDE--- 421 (480) T ss_pred hHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeecc----CcccccCcceecccceeeeeccccCCc--- Confidence 2344556777776555443336999999999999999999999885 356788899999999987654 34221 Q ss_pred ccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 235 PASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 235 ~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) +.+..+..++.+++++ ++ ..+.- -++.++..|+++.|+++.+.+|+||..+|.+..=-+ T Consensus 422 -----~~~~~~~~~~~~~d~~-~~--~~~~~--------~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 422 -----VAVYNHDEYVLIGDLN-VE--NYNDF--------DLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred -----ceeeeCCccEEEEecc-cc--eeccc--------ccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 1222233445565543 22 22221 256888899999999999999999999998765555 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.92 E-value=3.5e-26 Score=160.11 Aligned_cols=269 Identities=13% Similarity=0.097 Sum_probs=201.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-+..++|+.++..+.+.+++...+.+++.+... ++..++||+++..+.+.|+.||+.++.++.+.++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 9999999999999999999999999999888888865432 234689999987778999999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+....++++...++..| +.+.+.+++++++++.+|+.++.... +.. ......... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~~d~~~~~~~~---~a~------------~~~~~~~~~ 141 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALM---GAK------------LTVNADITK 141 (274) T ss_pred EEeeeecccccccHHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------ccccccccC Confidence 99999999999999988665544 45667888899999999988775432 111 111122334 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccc-ccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPM-YPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~-~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++..++.+++ .....++|||..+..|++.... +.+..... -+-+..|.-++++|+||++++.+|.. T Consensus 142 ~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~~L~k~~~~--~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~----- 213 (274) T protein:vir:93 142 LNGLQSAIDKFNDED-LEPMVLFINPLDAGKLRGDAST--NFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG----- 213 (274) T ss_pred HHHHHHHHHHhhhcc-CCccEEEeCHHHHHHHHhhhhh--cccccccccccceeecccceecCeeEEEcCCCCcc----- Confidence 788888888876543 3456789999999999754211 11111000 01133456689999999999999843 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..++.+.+.+.++..+++.++..++.. +..-.+++..++++++.+|+++++++.+.+.--- T Consensus 214 ----t~~l~~~gai~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 214 ----TAILAKKGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----eEEEEeCCeEEEEecCCcccccccchh----------hcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 255666788888888888888777643 2345788999999999999999999987655444 No 107 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.90 E-value=3e-25 Score=155.04 Aligned_cols=274 Identities=14% Similarity=0.052 Sum_probs=197.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||+.++.-+..++|+.++..+.+.+++...+.+++..... ++..++||++...+.+.++.|++.++..+.+.++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 9998888899999999999999999998888888754432 234589999987678899999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+..+.++++...++..| +.+.+.+++++++++.+|+.++.... +.. ...... ......... T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~l~~~l~---~a~-----~~~~~~-~t~~~~~~~ 147 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGD----PVEEAQKQIRMAIASKVDNDILEEAL---TTT-----LEVKGA-INIGLIDKI 147 (278) T ss_pred EeeehhhccccccHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHHh---ccc-----cccccc-cccchhhhH Confidence 99999988999999987666555 55678888899999999987765431 111 001111 112222334 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) +..+.++..++..++......++|||..+..|++....+..... ...-+-+..|.-++++|++|+++++||.. T Consensus 148 ~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~-~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~------ 220 (278) T protein:vir:80 148 ENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKAS-QLGDDLLVKGAFGELLGWEIVRTKKLADG------ 220 (278) T ss_pred HHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccc-cccccceeeccceeecceeEEEcCCCCcc------ Confidence 66677777777665555444588999999999876432211000 00001234556689999999999999843 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) ..|+..-+.+.++..++++++..++.. +..-.+++.++++.++.||++++++++.+.. T Consensus 221 ---t~~l~~~gAi~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 221 ---NALAVKAGALKTFLKRNLLAESGRDMD----------HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ---eEEEEeccceeeeecCCcccccccchh----------hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 123334456667777888887777642 2345778889999999999999999988876 No 108 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.89 E-value=7.3e-25 Score=152.92 Aligned_cols=269 Identities=15% Similarity=0.102 Sum_probs=199.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||++.+.-+..++|+.++..+++.++....+.+++..-.. ++..++||++...+.+..+.||+.++.++.+.++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 9999998899999999999999999988888888755322 244689999987678888999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+..+.++++...++..| +...+.+++++++++.+|..++.-.. +.. ......... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~~d~~i~~~l~---~a~------------~~~~~~~~~ 141 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGD----PQGEAVRQHGLAIANKVDNDVLEALK---GAT------------LTVEADITK 141 (274) T ss_pred EEEEeeeceeeecHHHHHhhcch----HHHHHHHHHHHHHHHHHHHHHHHHHh---cCC------------CCcCccccc Confidence 99999998999999987665444 45667788888888889887664322 100 011122234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccc-ccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPM-YPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~-~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++...+.+++ .....++|||..+..|++....+ .+..... -..+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~~L~k~~~~~--f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~----- 213 (274) T protein:vir:96 142 LDGLQTAIDKFNDED-LEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG----- 213 (274) T ss_pred HHHHHHHHHHhcccC-CCceEEEeCHHHHHHHHhccccc--ccccccccccceeecccceecCeeEEEcCCCCcc----- Confidence 778888888876544 34567899999999997764211 1100000 01233456689999999999999853 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..|+...+.+.++..+++.++..++.. +..-.+++.++++.++.+|+++++++.+++-++- T Consensus 214 ----t~~l~~~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 214 ----EALLAKKGAVKLITKRDFFLEKDRDAS----------RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred ----eEEEEeCcceeeeecCCcccccccchh----------hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 234445567777777887887776542 2345778889999999999999999999999988 No 109 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.89 E-value=4.4e-25 Score=154.12 Aligned_cols=267 Identities=13% Similarity=0.062 Sum_probs=194.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||++.+.-...++|+.++..+.+.+.+...+.+++..-.. ++..++||++.....+.++.||++++.++.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 9999999999999999999999999998888888855432 234589999988778899999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+..+.++++...++..| +.+.+.+++++.+++.+|+.++.... +... . ...... T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~i~~~l~---~~~~--------~-----~~~~~~ 140 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGD----PIGESNKQLGLSLANKVDDDLLSAAK---TTSQ--------T-----VSTKAN 140 (272) T ss_pred EeeehhhccccccHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHhc---cccc--------c-----cccccc Confidence 99999999999999887665554 45667888888888888887664321 1000 0 012334 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccc-cccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPL-AGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~-~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++...+.+.+. ....++|||..+..|++.......+. .+. +-+..|.-++++|++|++++.||.... T Consensus 141 ~d~i~~A~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~---~~~~~G~ig~~~G~~Vv~s~~~p~~~~--- 213 (272) T protein:vir:36 141 VDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSEVGA---NALINGTYADVLGAQIVRSKKLAEGSA--- 213 (272) T ss_pred HHHHHHHHHHhhhcCC-CceEEEEcHHHHHHHhcccccccccccccc---cceeeeccceecCeeEEEeCCCCCCce--- Confidence 6788888888865543 45678999999999976543222111 010 112345557899999999999995422 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ....++++ -..+.++..+++++|..++.. +..-.+++.++++.++.+|+++++++.+-. T Consensus 214 -~~~~~~~~-~gA~~~~~~~~~~vE~~R~~~----------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 214 -LMFKIVSN-SPALKLVLKRGVQVETDRDIV----------TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred -eEEEEEec-ccceeeeecCCcccccccchh----------hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 11223333 244555667788888777642 223468888999999999999999998876 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.89 E-value=2e-24 Score=150.53 Aligned_cols=272 Identities=14% Similarity=0.080 Sum_probs=198.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee----cCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP----TIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||++.+.-...++|+.++.-+.+.+++...+.+++.+-. .++..++||++...+.+.++.||++++..+.+.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 999888889999999999999999999999988886543 2455699999988788999999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) ...++.+..+.++++....+..|. ...+.+.++.++++.+|+.++.-.. + ........... T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp----~~~~~~~~~~~~a~~~d~~~~~~l~---~------------~~~~~~~~~~t 141 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDP----QGEAVRQHGLAIANKVDNDVLEALR---G------------TKLTVSADIGT 141 (276) T ss_pred EEeehccccccccHHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHh---c------------ccccccccccC Confidence 999999999999999987776664 3446667777777777776553110 0 00011122345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) ++.+.++...+.+.. .....++|||.....|+++.+..-...... --+-+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~~------ 213 (276) T protein:vir:10 142 LAGLEAAIDTFDDED-LEPMVLFINPKDAGKLRSSASDNFTRATEL-GDNIIVKGAFGEALGAVIVRSKKLDEG------ 213 (276) T ss_pred HHHHHHHHHHhcccc-CcccEEEEcHHHHHHHHHhccccccccccc-cccceeccccceecceeEEEcCCCCcc------ Confidence 788888888886543 355678999999999987653332111100 001233455679999999999999843 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~ 314 (315) ..|+..-..+.++..+++.+|.+++.. +..-.+++.++++.++.+|+.+++++.++-..|+ +- T Consensus 214 ---t~~l~~~gAi~~~~~~~~~vE~dRd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~--~~ 276 (276) T protein:vir:10 214 ---EAILAKRGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS--GA 276 (276) T ss_pred ---eEEEEeccceeeeecCCceeecccchh----------hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcC--CC Confidence 234444566777778888888888753 2345677889999999999999999977633333 22 No 111 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.86 E-value=5.6e-23 Score=142.59 Aligned_cols=269 Identities=13% Similarity=0.087 Sum_probs=196.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-...++|+.++..+.+.+++...+.+++.+-.. ++..++||++...+.+..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 9999999999999999999999999888777777755432 345689999987678889999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+....++++....+..| +...+.+++++++++.+|..++.-... .. ....+.... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d----p~~~~~~~~a~a~a~~vd~~~~~~l~~---a~------------~~~~~~~~~ 141 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMG---AK------------LTVNADITK 141 (274) T ss_pred EEeeeecceecccHHHHHhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhc---cC------------ccccccccC Confidence 99999998999999987665555 445677888888888888876643210 00 011122335 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccc-cccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~-~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++..++.+.. .....++|||..+..|++.. .-+.+.....- +-+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~----- 213 (274) T protein:vir:97 142 LNGLQSAIDKFNDED-LEPMVLFVNPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG----- 213 (274) T ss_pred HHHHHHHHHHhhccC-CCceEEEeCHHHHHHHHhhh--hhhccccCcccccceeccccceecCeeEEEcCCCCcc----- Confidence 788888888876543 34567899999999997532 11111110000 1133455679999999999999842 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..++...+.+.++..+++.++..++.. . ..-.+++.+++++++.+|+++++++++.+.--- T Consensus 214 ----t~~l~~~gA~~~~~~~~~~vE~~Rd~~--------~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 214 ----TAILAKKGAVKLILKRDFFLEVARDAS--------T--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----eEEEEeCcceEeeecCCceeccccchh--------h--cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 244555677888888888888887643 2 234677889999999999999999976554433 No 112 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.86 E-value=5.6e-23 Score=142.59 Aligned_cols=269 Identities=13% Similarity=0.087 Sum_probs=196.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-...++|+.++..+.+.+++...+.+++.+-.. ++..++||++...+.+..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 9999999999999999999999999888777777755432 345689999987678889999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+....++++....+..| +...+.+++++++++.+|..++.-... .. ....+.... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d----p~~~~~~~~a~a~a~~vd~~~~~~l~~---a~------------~~~~~~~~~ 141 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMG---AK------------LTVNADITK 141 (274) T ss_pred EEeeeecceecccHHHHHhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhc---cC------------ccccccccC Confidence 99999998999999987665555 445677888888888888876643210 00 011122335 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccc-cccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~-~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++..++.+.. .....++|||..+..|++.. .-+.+.....- +-+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~----- 213 (274) T protein:vir:94 142 LNGLQSAIDKFNDED-LEPMVLFVNPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG----- 213 (274) T ss_pred HHHHHHHHHHhhccC-CCceEEEeCHHHHHHHHhhh--hhhccccCcccccceeccccceecCeeEEEcCCCCcc----- Confidence 788888888876543 34567899999999997532 11111110000 1133455679999999999999842 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..++...+.+.++..+++.++..++.. . ..-.+++.+++++++.+|+++++++++.+.--- T Consensus 214 ----t~~l~~~gA~~~~~~~~~~vE~~Rd~~--------~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 214 ----TAILAKKGAVKLILKRDFFLEVARDAS--------T--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----eEEEEeCcceEeeecCCceeccccchh--------h--cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 244555677888888888888887643 2 234677889999999999999999976554433 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.86 E-value=3.8e-23 Score=143.47 Aligned_cols=270 Identities=14% Similarity=0.055 Sum_probs=193.9 Q ss_pred CCCCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeE Q lcl|NC_018838. 1 MADDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v 75 (315) ||... +.-...++|+.++..+.+.+++...+.+++.+-.. ++..++||++...+.+.++.||++++..+.+.++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 66643 45567899999999999999999999888865433 34468999998777889999999999999999999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) ++..++.+..+.++++....+..|. ...+.++++.++++.+|..++.-.+..+ ........ T Consensus 81 ~~~i~~~~~~~~i~D~~~~~~~~d~----~~~~~~~~a~~~a~~~d~~ll~~l~~a~---------------~~~~~~~~ 141 (275) T protein:vir:96 81 QATIRKIGKGTVLTDEALLSGYGDP----KGEAVRQHGLAIANKVDNDVLEALQGAT---------------LKVEADIT 141 (275) T ss_pred eEEeehhcccccccHHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHhccc---------------cccccccc Confidence 9999999999999999876665553 4456777888888888887664322100 11112234 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .++.+.++...+.+.. .....++|||.....|++....+-..... .--+.+..|.-++++|++|++++.+|... T Consensus 142 ~~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~~L~k~~~~~f~~~~~-~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t---- 215 (275) T protein:vir:96 142 KLAGLQTAIDKFNDED-LEPMVLFVNPLDAGKLRASATDNFTRATL-LGDNVIVKGAFGEALGAIIVRSNKIKEGE---- 215 (275) T ss_pred CHHHHHHHHHHhcccc-CCccEEEeCHHHHHHHHhccccccccccc-ccccceeccccceecCeeEEEeCCCCcce---- Confidence 5788888888885443 34567899999999998764221100000 00012345566899999999999998431 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) .+++ .-..+.++..+++.+|..++.. +..-.+++.++++.++.+|+++++++..++---. T Consensus 216 ----~~i~-~~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 216 ----AILA-KRGAVKLITKRDFFLETERHAS----------HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred ----EEEE-eccceeeeecCCcccccccchh----------hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 2334 3466777778888888887643 2345778889999999999999999886655544 No 114 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.83 E-value=6.9e-22 Score=136.61 Aligned_cols=269 Identities=13% Similarity=0.074 Sum_probs=194.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee----cCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP----TIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-...++|+.++..+.+.+++...+.+++.+-. .++..++||++...+.+..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 999999999999999999999999988877777775532 2345689999987678889999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+....++++....+..| +...+.++++.++++.+|..++.-.... . ......... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~q~~~~~a~~vd~~~l~~~~~a---~------------~~~~~~a~~ 141 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGA---K------------LTVNADITK 141 (274) T ss_pred EEeeeecceeeecHHHHHhcccc----hHHHHHHHHHHHHHHHHHHHHHHHHhcc---c------------ccccccccC Confidence 99999999999999877665555 3455778888888888888766432210 0 011122345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccc-cccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQP-MYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~-~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++...+.+.+ ......+|||.....|++.... +.+.... --+-+..|.-++++|++|++++.||... T Consensus 142 ~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~~L~k~~~~--~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t---- 214 (274) T protein:vir:12 142 LNGLQSAIDKFNDED-LEPMVLFINPLDAGKLRGDAST--NFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGT---- 214 (274) T ss_pred HHHHHHHHHHhcccc-ccccEEEeCHHHHHHHHhhhhh--hccccccccccceecccceeecCeeEEEeCCCCcce---- Confidence 788888888876543 3456789999999998764210 1111000 0011234556789999999999998532 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) .+++| ...+.++..+++++|..++.. +..-.+++.+++++++.+|+++++++++.+.--- T Consensus 215 ----~~l~~-~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 215 ----AILAK-KGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----EEEEe-ccceeeeecCCceeccccchh----------hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 23444 466667778888888888753 2234778889999999999999999976654433 No 115 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.83 E-value=1.2e-21 Score=135.30 Aligned_cols=290 Identities=10% Similarity=0.054 Sum_probs=208.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCc-cceeeEEEee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSAS-VDVSAFTAQP 79 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~-~~~~~v~l~~ 79 (315) |+.-|....+.+.|..+...||+.+.+.|.+++.....++.++..++++.+.-+.+.|...++..+++. .+|.+++... T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l 104 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSEL 104 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeeccccccccCcceeeeeeech Confidence 887788888999999999999999999999999988888888889999999999999999998888765 5899999999 Q ss_pred EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc----ccccc Q lcl|NC_018838. 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV----DATDS 155 (315) Q Consensus 80 ~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~----~~~~~ 155 (315) +.+++.+.|.+++..... +. .+...+..+.+.+++.++++.+++||+.. .....|+....... +.+ ..+.. T Consensus 105 ~~l~~~~~Vd~~iadl~g-~~-~d~~~~q~~~~ieal~~~~e~~linGDs~--~~~F~GL~~~~~~~-q~i~tg~~gg~~ 179 (330) T protein:vir:94 105 TTLIGDAEVNGLIQATRS-DF-MDQTSVQVASKAKSIGRQYQASMITGDGT--GNSFQGMMGLVAAS-QTISAGANGGTL 179 (330) T ss_pred hhhhhhHHHHHHHHHhcC-CH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC--CccccchhhcCCcc-cEEecCCCCCCC Confidence 999999999999853221 11 12445566778899999999999999753 23444665544332 222 22445 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCC-ccccceeeEeecccCcccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DNWRGLNVGASSTVSGAPEMS 234 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~l~G~Pv~~s~~v~~~~~~~ 234 (315) +.++++.++.++.... ..+..|+||++...+++.+....|++.-.+. .....|.+ .++.|.|++.++.+|.....+ T Consensus 180 T~d~LDeLl~~v~~~~-g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~--~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~ 256 (330) T protein:vir:94 180 TFELLDQLLDLVKDKD-GQVDYLMSSFAMRRKYFSLLRALGGAAIGEV--MTLPSGRQIPTYRGVPWFVNDFIPSNMTQG 256 (330) T ss_pred CHHHHHHHHHHhcCCC-CCCcEEEechhHHHHHHHHHHhccCCCCCCc--ccccCCCEEeeeCCeEEEecccccCCCCcc Confidence 5678888888875433 2466799999999999999988876542111 11123433 578899999999998754321 Q ss_pred c-cccceEEEeccc-----ceEEEee----ccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 235 P-ASGVKAIVGDFS-----RVHWGFQ----RNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 235 ~-~~~~~~~~gDf~-----~~~i~~~----~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) . +....+|+..|. +.++|.. .++.++..-. .-+++.+.+|.+++++.++.+++|+++|+.. T Consensus 257 ~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~---------~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 257 TATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA---------KENADETITRVKMYCGFANFSQLGLAAIKGL 327 (330) T ss_pred cCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCC---------ccccceeeEEEEEeeeeEEechhheeeeccc Confidence 1 222345554443 4566653 2444433221 1234567789999999999999999999875 Q ss_pred cCC Q lcl|NC_018838. 305 AAP 307 (315) Q Consensus 305 ~a~ 307 (315) .-= T Consensus 328 ~~g 330 (330) T protein:vir:94 328 IPG 330 (330) T ss_pred cCC Confidence 422 No 116 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.83 E-value=1.4e-21 Score=134.88 Aligned_cols=269 Identities=13% Similarity=0.067 Sum_probs=193.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee----cCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP----TIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-..+++|+.++..+.+.++....+.+++..-. .++..++||++...+.+..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 999999899999999999999999998888888764432 2355789999987778889999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+..+.++++-...+..| +...+.++++.++++.+|..++.-.. + . .......... T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~vd~~i~~~l~--~-a------------~~~~~~~~~~ 141 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGD----PQGEQVRQHGLAHANKVDDDVLEALK--S-A------------KLTVEADITK 141 (274) T ss_pred EEeeeeecceeehHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHh--c-c------------cccccccccC Confidence 99999999999999877665445 44557777888888888887653221 0 0 0111123345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccc-ccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPM-YPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~-~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++...+.+.. ......+|||..+..|++.. .-+.+..... -+-+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~----- 213 (274) T protein:vir:96 142 LTGLQTAIDKFNDED-LEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG----- 213 (274) T ss_pred HHHHHHHHHHhcccc-ccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCc----- Confidence 788888888876443 34556899999999997642 1111110000 01223455678999999999998743 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..+++| ...+.++..+++++|..++.. +..-.+++.+++++++.+|++++++++.+-.--- T Consensus 214 ---t~~l~~-~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 214 ---TAILAK-KGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ---eEEEEe-ccceeeeecCCcccccccccc----------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 124444 456667778888888887643 3445778889999999999999999866533322 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.83 E-value=1.4e-21 Score=134.88 Aligned_cols=269 Identities=13% Similarity=0.067 Sum_probs=193.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee----cCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP----TIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+++.+.-..+++|+.++..+.+.++....+.+++..-. .++..++||++...+.+..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 999999899999999999999999998888888764432 2355789999987778889999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) +..++.+..+.++++-...+..| +...+.++++.++++.+|..++.-.. + . .......... T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~vd~~i~~~l~--~-a------------~~~~~~~~~~ 141 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGD----PQGEQVRQHGLAHANKVDDDVLEALK--S-A------------KLTVEADITK 141 (274) T ss_pred EEeeeeecceeehHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHh--c-c------------cccccccccC Confidence 99999999999999877665445 44557777888888888887653221 0 0 0111123345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccc-ccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPM-YPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~-~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++.+.++...+.+.. ......+|||..+..|++.. .-+.+..... -+-+..|.-++++|++|++++.+|.. T Consensus 142 ~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~----- 213 (274) T protein:vir:95 142 LTGLQTAIDKFNDED-LEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG----- 213 (274) T ss_pred HHHHHHHHHHhcccc-ccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCc----- Confidence 788888888876443 34556899999999997642 1111110000 01223455678999999999998743 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) ..+++| ...+.++..+++++|..++.. +..-.+++.+++++++.+|++++++++.+-.--- T Consensus 214 ---t~~l~~-~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 214 ---TAILAK-KGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ---eEEEEe-ccceeeeecCCcccccccccc----------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 124444 456667778888888887643 3445778889999999999999999866533322 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.77 E-value=1.1e-19 Score=124.46 Aligned_cols=265 Identities=11% Similarity=0.045 Sum_probs=188.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----CCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) |+.|..+ ..++|+.+..-|.+.+.+...+.+++..-.. ++..+++|.++..+.+.-+.||++++..+.+.++.. T Consensus 1 Ma~T~~~--d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~ 78 (270) T protein:vir:95 1 MTQTKKA--NLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTK 78 (270) T ss_pred CCceehh--hhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchhe Confidence 9998665 7899999999999999998888888865332 345689999988788888999999999999999999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) ...++.+..+.++++....+..|... .+.+.++..+++.+|+-++.-. .+. .... ..... T Consensus 79 a~i~~~gk~~~itD~a~~~~~~dp~~----~~~~q~a~~~a~~~d~~li~~l---~~a-----~~~~--------~~~~t 138 (270) T protein:vir:95 79 VTVKETGKAVEVTQTAIITNVNGTLQ----EASRQLAMSLADKVEIDYIAEL---NKS-----KQTA--------TVSAD 138 (270) T ss_pred eeeehhhCcceecHHHHhhhccchHH----HHHHHHHHHHHHHHHHHHHHHh---ccc-----cccc--------ccccC Confidence 99999999999999987665555443 4566667777777776554211 110 1000 11234 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhc-cCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYP-KGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~-~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) ++++.+++..+.+. ......++|||+++..|++...- ..++.+ ..+..|.-++++|++|++++.++... T Consensus 139 ~~~~~dA~~~lgd~-~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~-----~~~~~G~ig~~~G~~Viv~s~~~~~~---- 208 (270) T protein:vir:95 139 ATGILDAIEVFNSE-NDEDYVLYVNPKDYNKLVKSLFKVGGNVQD-----RAISKGDLVEIVGVSDIVKSKRVSEN---- 208 (270) T ss_pred HHHHHHHHHHhccc-cCCCcEEEEcHHHHHHHHhhhccccccccc-----chhcccccceecceeEEEeCCCCCce---- Confidence 67788888777544 45567899999999999864311 111111 12334566899999999988776431 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNP 311 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~ 311 (315) ..|+.....+.++..+++.+|..++.. +..-.++..+++++++.++..+++++.+.+..-.- T Consensus 209 ----~~~l~~~gAi~~~~~~~~~vEtdRd~~----------~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 209 ----TAFLQRYGAMEIVNKKKPEAYTDFDIL----------KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred ----eEEEEeccceeeeecCCceeeeccchh----------hcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 234444567778888888888888753 22346777799999999999999998753322222 No 119 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.65 E-value=1.1e-17 Score=113.57 Aligned_cols=293 Identities=16% Similarity=0.142 Sum_probs=190.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCc---cceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSAS---VDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~---~~~~~v~ 76 (315) |+. .+|.++||+.+++-|++..++....-++...+....|. ..+|-. +--.++-|+||++.|+.. .+++.|+ T Consensus 74 mtt---~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~-g~~Ra~~IgEGgE~~~~sld~~T~dsv~ 149 (393) T protein:vir:79 74 MAT---PSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI-GIMRAYDVAEGQEIPEDSIDWQTHESPE 149 (393) T ss_pred hcC---CCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch-heeeeccccccccccccchhhhcCCcee Confidence 443 35789999999999999998888888888777774443 444433 345778899999999765 5688999 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccc-----ccccc Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKT-----TKTVD 151 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~-----~~~~~ 151 (315) +..+|.+..+.+|+|++.+|.-| |-.+....+.+++++..|.-++++.-..+-...-++.+..... .+... T Consensus 150 ~~~gK~G~~Ia~SqEmIsDSg~D----vin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~q 225 (393) T protein:vir:79 150 IRVGKSGIRLRFTDEMISDSQWD----LMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQ 225 (393) T ss_pred EEechhhhhhhhHHHHhhcchHH----HHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccc Confidence 99999999999999999888776 4555667778888888999999886432221111122211111 11233 Q ss_pred cccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccc-----ccccccCCCccccc-----eee Q lcl|NC_018838. 152 ATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPM-----YPAAGFAGLDNWRG-----LNV 221 (315) Q Consensus 152 ~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~-----~~~~~~~~~~~l~G-----~Pv 221 (315) .++....|++++.-++.. +.+.++.++|||-+|..+.+-..-.+.+.+...- |+.-..-+|..|.| +.| T Consensus 226 NGTlSleDllDm~~av~~-~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv 304 (393) T protein:vir:79 226 NDTFSAEDFLDLIIAVMA-NEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNV 304 (393) T ss_pred cccccHHHHHHHHHHHhc-ccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeE Confidence 344556789988888864 4567889999999999987765444443332222 22222333444444 588 Q ss_pred EeecccCccccccccccceEEEecccceEEEe-eccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecc-cceE Q lcl|NC_018838. 222 GASSTVSGAPEMSPASGVKAIVGDFSRVHWGF-QRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESL-DSFA 299 (315) Q Consensus 222 ~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~-~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~-~af~ 299 (315) +++..+|-... +++-.++.-|-+.+-+.. +.++..+..++ ..+|...++...|+|++|++. +|++ T Consensus 305 ~~sPfvp~d~k---~~rFd~~~Vd~NnvgvlLV~D~i~tdq~dd----------k~rdiq~iKl~ERYG~gvLn~gkaia 371 (393) T protein:vir:79 305 NLSPFIPLDKK---SRRFDVYAVDRNNVGVLLVRDDLKTDQWDE----------KARGLQNIKMIERYGIGILNEGKAIA 371 (393) T ss_pred EEecccccccc---cceeeEEEeecCCceEEEEecCcceecccc----------ccccceeeeeeeeeceeeeeCCceEE Confidence 89999985432 333355566665554433 34444433333 346777889999999988774 5666 Q ss_pred EEeeccCCCC------CCCCCC Q lcl|NC_018838. 300 VVKEKAAPKP------NPPAGN 315 (315) Q Consensus 300 ~l~~~~a~~~------~~~~~~ 315 (315) ..+..+-.+. ..--|| T Consensus 372 vakNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 372 VAKNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred EEecceeecccccchhhhccCC Confidence 5554333222 233344 No 120 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.61 E-value=4.7e-16 Score=104.61 Aligned_cols=288 Identities=10% Similarity=0.035 Sum_probs=189.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeec-----ccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGE-----GEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E-----g~~~~~s~~~~~~v 75 (315) |..=+.+..+.+.+..+...||+.+.+.|.+.+.....++.++..++.+...-+.+.+.+. .+..+++..+|.++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 8866666667899999999999999999999999888888888888888876544443322 23445678899999 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc----c Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV----D 151 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~----~ 151 (315) +...+-+++.+.|.+.+...-..+.-..+..+ .+...+++.++++..++||+.. .....|+...+... +.+ . T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Q-l~~~iea~~~~~e~~lINGD~a--~n~F~GL~~~~~~~-q~i~~~~~ 156 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQ-IASKAKSAGRKYQDQLINGNGA--GNEFAGLIQLCASG-QKATTGAT 156 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHH-HHHHHHHHHHHHHHHhhccccC--CCcccchhhcCCcc-ceeecCCC Confidence 99999999999999765431101111223233 3445688899999999999853 23344666554432 222 2 Q ss_pred cccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccc--cccCCC-ccccceeeEeecccC Q lcl|NC_018838. 152 ATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPA--AGFAGL-DNWRGLNVGASSTVS 228 (315) Q Consensus 152 ~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~--~~~~~~-~~l~G~Pv~~s~~v~ 228 (315) ++..+.++++.++.++.... ..+..++|||++..+++.+.-..+ ++.+||. ...|.+ .++.|.|++.++.+| T Consensus 157 gg~~t~d~LDeLl~~v~~~~-g~p~~~l~~~~~~r~i~A~~R~~~----~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip 231 (310) T protein:vir:97 157 GSAISFAILDELMDLVVDKD-GQVDYLTMHARTLRSYKALLRALG----GASINEVVELPSGAEVPAYSGTPIFRNDYIP 231 (310) T ss_pred CCCCCHHHHHHHHHHHhcCC-CCCCEEEecHHHHHHHHHHHHHhc----CCCCCCccccCCCCEEeeeCCeEEEEeCccC Confidence 24455688888888875432 245679999998777765543322 2223432 223433 589999999999998 Q ss_pred cccccc-ccccceEEEeccc-----ceEEEee----ccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccce Q lcl|NC_018838. 229 GAPEMS-PASGVKAIVGDFS-----RVHWGFQ----RNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSF 298 (315) Q Consensus 229 ~~~~~~-~~~~~~~~~gDf~-----~~~i~~~----~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af 298 (315) ..-... ......+|+.-|. +-++|.. .++.++.... .=+++...+|.+++++.++.+|+|+ T Consensus 232 ~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~---------~~~~~v~~~~V~~Y~~~av~~~~A~ 302 (310) T protein:vir:97 232 TNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE---------SEDSDEHIWRVKWYCGLALFSEKGL 302 (310) T ss_pred CCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCc---------ccCCcceeEEEEEeeeEEEecccce Confidence 653221 1122345544443 2344432 2344333221 1234567789999999999999999 Q ss_pred EEEeeccC Q lcl|NC_018838. 299 AVVKEKAA 306 (315) Q Consensus 299 ~~l~~~~a 306 (315) ++|+...- T Consensus 303 a~L~~V~~ 310 (310) T protein:vir:97 303 ACADGITN 310 (310) T ss_pred eeeccccC Confidence 99998886 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.57 E-value=1.7e-16 Score=106.99 Aligned_cols=231 Identities=11% Similarity=0.048 Sum_probs=161.9 Q ss_pred cceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHH Q lcl|NC_018838. 35 SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGA 114 (315) Q Consensus 35 ~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~ 114 (315) -+-+.++ ..+++|.+ .+.+.-++||++++..+.++++.+...|+.+..+.|++|-......|.. ....+.++. T Consensus 1 ~~~~~~G-dtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~----~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGINLA-NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPI----GESNKQLGL 73 (231) T ss_pred CccccCC-ceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchH----HHHHHHHHH Confidence 2333333 56899987 3477889999999999999999999999999999999998877666643 446777788 Q ss_pred HHHHHHHHhhhcccccccccccccccccccccccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhc Q lcl|NC_018838. 115 SIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYP 194 (315) Q Consensus 115 ~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~ 194 (315) +|++++|..++.-. ..... . . .....++.+.+++..+.+.. ..+...+|||+....||+..+. T Consensus 74 ~iA~kvD~di~~~~---~~a~l----~--------~-~~~~t~d~i~~A~~~fgde~-~~~~vivv~p~~~~~Lrk~~~~ 136 (231) T protein:vir:73 74 SLANKVDDDLLKAA---KTTSQ----T--------V-STKANVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANA 136 (231) T ss_pred HHHHhhhHHHHHhh---ccccc----c--------c-cccccHHHHHHHHHHhcccc-ccceEEEEcchHHHhhhhccch Confidence 88888887655311 11010 0 0 12245788888888886543 4556789999999999986644 Q ss_pred cCccccccccccccccCCCccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhh Q lcl|NC_018838. 195 KGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDL 274 (315) Q Consensus 195 ~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~ 274 (315) ...... ..-+-+..|.-+++.|+||++|+++|..... .. -++.-...+.+...++++++..++.. T Consensus 137 ~~~~~~--~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~----~~-~~i~~~gAl~~~~k~~~~vEtdRd~~-------- 201 (231) T protein:vir:73 137 KNIGSE--VGANALINGTYADVLGAQIVRSKKLAEGSAL----MF-KIVSNSPALKLVLKRGVQVETDRDIV-------- 201 (231) T ss_pred hhhhhh--hccceeeecccceEcceEEEEcCCCCCCcee----ee-eEEeeccceeeeecccceeecccccc-------- Confidence 322111 1112234566689999999999999853211 11 12233456777788899999888753 Q ss_pred hhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 275 KGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 275 f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +....+++.++++..+.+|+.+++++.+=. T Consensus 202 --~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 202 --TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred --ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 334577888999999999999999987765 No 122 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.54 E-value=1.2e-15 Score=102.48 Aligned_cols=266 Identities=13% Similarity=0.040 Sum_probs=160.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee----ecCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||.. .++|+.+++++++.+++.+++.+++..- ...+..++||+......+..+.++..++..+.+.++++ T Consensus 1 MA~~------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEE Confidence 9985 3689999999999999999988887432 22345699999876666778899988888888888888 Q ss_pred EeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 77 AQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 77 l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) +...+. +.-+.|++.-...+..+ +++ +.+.++.++++++|.-++.-.. ...... . .+... .... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~ala~~vD~~i~~~~~-~a~~~~---~-----~~~~~-~~~~ 139 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADMLV-DNGTAL---T-----GSAPS-DADD 139 (273) T ss_pred EEEeeecccceeeccHHHHhhccc----HHH-HHHHHHHHHHHHHHHHHHHHHh-hccccc---c-----ccccc-chhh Confidence 877653 45566666333333333 444 4455677888888874432110 000000 0 00010 1123 Q ss_pred hhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhcc-CccccccccccccccCCCccccceeeEeecccCccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPK-GSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~-g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 233 (315) .++.+..+..++.+++..... .++++|.....|.+..... .....+. -..+..|..++|.|++|+.++++|...+ T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~--~~~l~~G~ig~~~G~~i~~s~~lp~~~~- 216 (273) T protein:vir:79 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD--AAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) T ss_pred HHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhccc--ccceeeeEeeEEeceEEEecccccccCc- Confidence 466777777777665543323 4678999988886543211 1111110 0123456678999999999999985422 Q ss_pred cccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 234 SPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 234 ~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ..++.+--+.+... .+-..++..+... .| ...+++.+.+|++++||++++.|+..-+ T Consensus 217 -----~~~~a~~~~A~~~a-~~~~~~e~~r~~~-------~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 217 -----EQFVAFHPSAAAYV-SQIDTVEALRDQD-------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -----eEEEEEeccceeee-eehhhhhcccCcc-------cc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 11222222222221 1222333332221 12 3568889999999999999999987766 No 123 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.53 E-value=1.5e-15 Score=101.80 Aligned_cols=266 Identities=14% Similarity=0.045 Sum_probs=159.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee----ecCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||.. .++|+.+++++++.+++.+++..++..- ...+..++||+......+....++..++..+.+-++++ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 8884 5689999999999999999998887441 12235689999876666778888887777777777777 Q ss_pred EeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 77 AQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 77 l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) +...+. +.-+.|++.-..++..+ +++ +.+.++.+++.++|.-++.-.. ..+.. . ..+... .... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~alA~~vD~~i~~~~~-~a~~~-----~---~~~~~~-~~~~ 139 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADMLV-DNGTA-----L---TGSAPT-DADD 139 (273) T ss_pred EEEeeeeecceEeecHHHhhhhcc----HHH-HHHHHHHHHHHHHHHHHHHHHh-ccccc-----c---cccccc-chhH Confidence 766543 44455665332222222 444 4456678888888875542110 00000 0 000111 1123 Q ss_pred hhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCc-cccccccccccccCCCccccceeeEeecccCccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGS-PLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~-~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 233 (315) .++.|.++...+.+++..... .++++|.....|.+...-..+ ...+. ...+..|..++|.|++|+.++++|...+ T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~--~~~l~~G~ig~i~G~~v~~s~~lp~~~~- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD--AAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc--ccceeeeeeeEEeceEEEEecccccCCc- Confidence 467788888888666543333 367899999988654211110 00010 0123456678999999999999985421 Q ss_pred cccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 234 SPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 234 ~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ..++.+--+.+.+.. +-..++..+... .| ...+++.+.+|++++||++++.|+..-+ T Consensus 217 -----~~~~~~~~~A~~~a~-q~~~~e~~r~~~-------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 -----EQFVAFHPSAAAYVS-QIDTVEALRDQD-------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -----cEEEEEeccceeeee-eeehhhcccCCC-------cc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 123333333332221 111233222221 12 3468889999999999999999987766 No 124 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.53 E-value=1.5e-15 Score=101.80 Aligned_cols=266 Identities=14% Similarity=0.045 Sum_probs=159.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee----ecCCCceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||.. .++|+.+++++++.+++.+++..++..- ...+..++||+......+....++..++..+.+-++++ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 8884 5689999999999999999998887441 12235689999876666778888887777777777777 Q ss_pred EeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 77 AQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 77 l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) +...+. +.-+.|++.-..++..+ +++ +.+.++.+++.++|.-++.-.. ..+.. . ..+... .... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~alA~~vD~~i~~~~~-~a~~~-----~---~~~~~~-~~~~ 139 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADMLV-DNGTA-----L---TGSAPT-DADD 139 (273) T ss_pred EEEeeeeecceEeecHHHhhhhcc----HHH-HHHHHHHHHHHHHHHHHHHHHh-ccccc-----c---cccccc-chhH Confidence 766543 44455665332222222 444 4456678888888875542110 00000 0 000111 1123 Q ss_pred hhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCc-cccccccccccccCCCccccceeeEeecccCccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGS-PLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~-~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 233 (315) .++.|.++...+.+++..... .++++|.....|.+...-..+ ...+. ...+..|..++|.|++|+.++++|...+ T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~--~~~l~~G~ig~i~G~~v~~s~~lp~~~~- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD--AAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc--ccceeeeeeeEEeceEEEEecccccCCc- Confidence 467788888888666543333 367899999988654211110 00010 0123456678999999999999985421 Q ss_pred cccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 234 SPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 234 ~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ..++.+--+.+.+.. +-..++..+... .| ...+++.+.+|++++||++++.|+..-+ T Consensus 217 -----~~~~~~~~~A~~~a~-q~~~~e~~r~~~-------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 -----EQFVAFHPSAAAYVS-QIDTVEALRDQD-------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -----cEEEEEeccceeeee-eeehhhcccCCC-------cc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 123333333332221 111233222221 12 3468889999999999999999987766 No 125 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.53 E-value=3.3e-15 Score=100.00 Aligned_cols=288 Identities=8% Similarity=0.045 Sum_probs=169.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEe-eccccc-CCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIV-GEGEVK-PSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~-~~s~~~~~~v~l~ 78 (315) |... +-|+.+++++..+++++.+++.+++.+.+++++|.+....|++..-+..-.-- .|+... ...+++..++.+. T Consensus 23 it~~--~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~ 100 (360) T protein:vir:99 23 IGLA--ELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFN 100 (360) T ss_pred cccc--ccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCCcCCcCccccCccc Confidence 3333 33578889999999999999999999999999999888888765443211111 122121 1233444455553 Q ss_pred -eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc-----------ccccccccccccc Q lcl|NC_018838. 79 -PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG-----------KPAAAVKVSLDKT 146 (315) Q Consensus 79 -~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~-----------~~~~~~~~~~~~~ 146 (315) .+++-....++.+-++++........++.|.+.++.++++-+..-.++|+..... ...-|+....... T Consensus 101 ~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~ 180 (360) T protein:vir:99 101 ATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGD 180 (360) T ss_pred cccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcc Confidence 3455555667777776553322223456777888888887777767777532110 0111111111100 Q ss_pred ccccc----------------------------ccc----chhHHHHHHHHHhhhcccccc--e-EEEEeHHHHHHHHHH Q lcl|NC_018838. 147 TKTVD----------------------------ATD----SATTDLVKAVGLIAGAGLQVP--N-GVALDPAFSFALSTE 191 (315) Q Consensus 147 ~~~~~----------------------------~~~----~~~~di~~~~~~~~~~~~~~~--~-~~~m~~~~~~~L~~l 191 (315) .+.++ ... ....-+.+++..|...+...+ + .|+|++......+.. T Consensus 181 ~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~ 260 (360) T protein:vir:99 181 AQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMS 260 (360) T ss_pred cchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHH Confidence 00000 000 011225677777765543221 2 699998876655443 Q ss_pred hhccCccccccccccccccCCCccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccc Q lcl|NC_018838. 192 VYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTG 271 (315) Q Consensus 192 ~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~ 271 (315) -..-..++...-+ ..++.-+.+|+|++..+.+|.. .+++-+++++.+|.+++++++.+.+... T Consensus 261 L~~R~t~LGd~~l----~g~~~~~~~Gipi~~v~~~pd~---------~~mlT~p~NLi~g~~~~iri~~~~e~~~---- 323 (360) T protein:vir:99 261 LTEREDPLGSAVI----FGDSDITPFSYDLVGVNGFPDE---------YMMFTDPNNLAFGLYEEMELDQSTDTDK---- 323 (360) T ss_pred HhccCcccchhhe----ecccccccceeeeEEcCCCCCC---------ceEEeccCceeEEeeeeeEEeecccchh---- Confidence 3222222221111 1122236789999988888743 3778899999999999999987655321 Q ss_pred hhhhhcC-cEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 272 RDLKGHN-EVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 272 ~~~f~~~-~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) +-++. .+.+-...++|+.+.+++|.+.+++...|+. T Consensus 324 --~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 324 --VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred --hhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 11111 1334446779999999999999999887777 No 126 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.52 E-value=1.6e-15 Score=101.70 Aligned_cols=283 Identities=14% Similarity=0.007 Sum_probs=162.0 Q ss_pred CCCCcc-----CCCceEc------chhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCC---ceeEEeeccccc Q lcl|NC_018838. 1 MADDFL-----SAGKLEL------PGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGV---PRAKIVGEGEVK 65 (315) Q Consensus 1 m~~~~~-----s~Gg~~v------P~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~---~~a~wv~Eg~~~ 65 (315) |.+-+. .++.++| |+.+...|++.+++.-+--.|.+.+... ++.+.+-..... ..+.-|+|++++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 665433 2344433 6677778888887776666666665443 343444333322 466789999999 Q ss_pred CCCccceeeEEE-eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccc Q lcl|NC_018838. 66 PSASVDVSAFTA-QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLD 144 (315) Q Consensus 66 ~~s~~~~~~v~l-~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~ 144 (315) |.+.+.+++..+ ..+|.+..+.||+|++..+..+.+... ..+++..+.+..|+.++.-.-+... ..+..+.. T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~----~~~l~Nti~r~~d~~a~dal~sa~t---~~~~~s~~ 153 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQ----MLQLRNTFIRANDRSAKALLQSPIV---PTLAVPTA 153 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHH----HHHHHHHHHHHHHHHHHHHHhcccc---ccccCCcC Confidence 999999987776 558999999999999988877755443 4455555666666655433211110 00111111 Q ss_pred ccc--cccccccchhHHHHH-------HH-HHhhhcccccceEEEEeHHHHHHHHHHhh------ccCcccccccccccc Q lcl|NC_018838. 145 KTT--KTVDATDSATTDLVK-------AV-GLIAGAGLQVPNGVALDPAFSFALSTEVY------PKGSPLAGQPMYPAA 208 (315) Q Consensus 145 ~~~--~~~~~~~~~~~di~~-------~~-~~~~~~~~~~~~~~~m~~~~~~~L~~l~d------~~g~~~~~~~~~~~~ 208 (315) ... +.......+.+.+.. +. .....+..+.++.++|||.++..|++-++ .++.+.+- .+.. T Consensus 154 w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~---~~~~ 230 (318) T protein:vir:10 154 WDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVST---APDW 230 (318) T ss_pred CCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhh---cccc Confidence 110 000000001111111 00 11123456778899999999999944433 23322211 1222 Q ss_pred ccCCCccccceeeEeecccCccccccccccceEEEecccce-EEEeeccceEEEecc--CCccccchhhhhcCcEEEEEE Q lcl|NC_018838. 209 GFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRV-HWGFQRNFPIELIEY--GDPDQTGRDLKGHNEVMVRAE 285 (315) Q Consensus 209 ~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~-~i~~~~~~~v~~~~~--~~~~~~~~~~f~~~~v~~r~~ 285 (315) ...-+++++|+.|+.+.++|... +++.+-+.+ .+.+.++++.+-... ..+++. .+....+|+. T Consensus 231 tg~~~g~~lGl~vi~s~~~p~~~---------alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~-----~~~s~~~~~~ 296 (318) T protein:vir:10 231 TGNFPGSVMGLNVIRSRTFPIDR---------VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGG-----PTESYRADAS 296 (318) T ss_pred cccccceeeceEEeecCccCCCe---------eEEEecCCcceeeccccceeeecccCCCCCCCC-----cchhhheehh Confidence 22235688999999999999643 343332211 123455555444431 122222 2233567788 Q ss_pred EEeccEeecccceEEEeeccCC Q lcl|NC_018838. 286 AVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 286 ~r~~~~v~~~~af~~l~~~~a~ 307 (315) .+-...|.+|+|+++||+.-+| T Consensus 297 ~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 297 HKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeeeeeeCcceeEEEeeccCC Confidence 8888999999999999999888 No 127 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.44 E-value=6.9e-15 Score=98.22 Aligned_cols=298 Identities=10% Similarity=-0.037 Sum_probs=162.5 Q ss_pred CCCCccCCC--------ceEcchhHHHHHHHHHHhccchhhhcceee---cCCCceEEEEEeCCceeEEeecccccCCCc Q lcl|NC_018838. 1 MADDFLSAG--------KLELPGSMIGAVRDRAIDSGVLAKLSPEQP---TIFGPVKGAVFSGVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~~~~s~G--------g~~vP~~~~~~ii~~~~~~s~i~~l~~~~~---~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~ 69 (315) |+.+-+-+| .-+||+.++.+|++.+++.++++++++-.+ ..+..++||+.. .+.+.-..++..++..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 777766665 236899999999999999999888876443 224568999864 56677778888888777 Q ss_pred cceeeEEEee-EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc Q lcl|NC_018838. 70 VDVSAFTAQP-IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK 148 (315) Q Consensus 70 ~~~~~v~l~~-~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~ 148 (315) .+-.++++.. +..+.-+.|+++-..++..| +.+.+.++.++++++++|+.++.-............... ..... T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d----~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~-~~~~~ 154 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEIQASYD----LRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSS-SNGAI 154 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccC-ccccc Confidence 7777777766 33345566776444333333 556667777888888888765532111111111000110 01111 Q ss_pred ccccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeeccc Q lcl|NC_018838. 149 TVDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTV 227 (315) Q Consensus 149 ~~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v 227 (315) ........|+.+.++...+.+++...... .+++|.....|.+...-......+. ..+..|..++|+|++|+.++++ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~---~~l~~G~ig~i~G~~V~~Sn~l 231 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINN---APIAQGQIGSLMGVRVIRTSLI 231 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhcccc---chhheeeeeeEeceEEEEeccc Confidence 11223345777877777776655433333 5679999999864321111111111 1245666789999999999999 Q ss_pred CccccccccccceE------------------EEecccceE--EEeeccc-eEEEec---------cCCccccchhhhhc Q lcl|NC_018838. 228 SGAPEMSPASGVKA------------------IVGDFSRVH--WGFQRNF-PIELIE---------YGDPDQTGRDLKGH 277 (315) Q Consensus 228 ~~~~~~~~~~~~~~------------------~~gDf~~~~--i~~~~~~-~v~~~~---------~~~~~~~~~~~f~~ 277 (315) |............. .-+|++.+. ++-++.+ .+++.+ +....... ..-++ T Consensus 232 p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~-~~~~~ 310 (341) T protein:vir:94 232 GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQS-FENRE 310 (341) T ss_pred cccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhcccccccccccc-chhhh Confidence 86543321110000 011222111 1111111 111111 00000000 00011 Q ss_pred CcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 278 NEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 278 ~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) -.-.+++.+-+|.+++||++.+.|+...+.- T Consensus 311 ~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 311 QVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhhhhhhhhhcccccCcceeEEEecCcCCC Confidence 1234556677899999999988776544333 No 128 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.42 E-value=2.9e-14 Score=94.82 Aligned_cols=289 Identities=11% Similarity=-0.000 Sum_probs=161.6 Q ss_pred CCCC----------ccCCCceEcc-hhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccccCCC Q lcl|NC_018838. 1 MADD----------FLSAGKLELP-GSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~~~----------~~s~Gg~~vP-~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) |++- .++.+...+. ++++.+|.......++++++.++..+.+ +.++||+. +..+++...-|+++..+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~~ 79 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVVQ 79 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCCC Confidence 6655 1222223444 8999999999999999999999988774 45799976 66788888888888887 Q ss_pred ccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc------cccccccc---cc Q lcl|NC_018838. 69 SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI------DPATGKPA---AA 138 (315) Q Consensus 69 ~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~------g~~~~~~~---~~ 138 (315) .++-++++|..-.+ ..-..|-+-=-.++.. .+++.+.+++++++++++|++++.-. .++....+ .| T Consensus 80 ~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~----D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 80 KNVSDKLNLTVDTVLYARHFFDKFDEWTSNL----DVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CcccCceEEEEeeeeehhhhHhhHHHHhcCc----chHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 77777777766553 2222222211111111 27888899999999999999775321 01111011 11 Q ss_pred cccccccccccccc---ccchhHHHHHHHHHhhhccccc----ceEEEEeHHHHHHHHHHhhccCccccccccc----cc Q lcl|NC_018838. 139 VKVSLDKTTKTVDA---TDSATTDLVKAVGLIAGAGLQV----PNGVALDPAFSFALSTEVYPKGSPLAGQPMY----PA 207 (315) Q Consensus 139 ~~~~~~~~~~~~~~---~~~~~~di~~~~~~~~~~~~~~----~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~----~~ 207 (315) +........+..+. .+....-+..+...+.+.+... .-..+++|+.+..|..-. +-++..+.- .. T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~----r~~n~d~~~s~~~~~ 231 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHD----RLMNVEFGAKEGGNS 231 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhccc----ccccceecccccccc Confidence 11111111111111 1111233445555555444331 123678999999885432 112211100 11 Q ss_pred cccCCCccccceeeEeecccCccccccc--cccceEEEecccceEEEeec----------cceEEEeccCCccccchhhh Q lcl|NC_018838. 208 AGFAGLDNWRGLNVGASSTVSGAPEMSP--ASGVKAIVGDFSRVHWGFQR----------NFPIELIEYGDPDQTGRDLK 275 (315) Q Consensus 208 ~~~~~~~~l~G~Pv~~s~~v~~~~~~~~--~~~~~~~~gDf~~~~i~~~~----------~~~v~~~~~~~~~~~~~~~f 275 (315) ...+...+++|.||+.|+++|....+.. +.....+-|||+....-... ++..++.++.. .| T Consensus 232 ~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-------~~ 304 (334) T protein:vir:80 232 FVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK-------DF 304 (334) T ss_pred ccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh-------hH Confidence 2334457899999999999997654433 22223456677654422222 22222222211 11 Q ss_pred hcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 276 GHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 276 ~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) . -.+.+.+-+|.+++||++.+.++....-. T Consensus 305 ~---d~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 305 G---HYLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred H---HHHHHHHHcCCceeccceEEEEEEeeecC Confidence 1 12334466899999999999988654222 No 129 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.40 E-value=3.4e-14 Score=94.44 Aligned_cols=284 Identities=14% Similarity=0.043 Sum_probs=162.7 Q ss_pred CCCCcc-----------CCC-----ceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeeccc Q lcl|NC_018838. 1 MADDFL-----------SAG-----KLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGE 63 (315) Q Consensus 1 m~~~~~-----------s~G-----g~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~ 63 (315) |+.-++ .+| .+.| +.++.+|+......|+++++.++..+.+ +.+++|+. +..+++....|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCC Confidence 554433 111 1234 7899999999999999999999887774 45788876 777888888888 Q ss_pred ccCCC--ccceeeEEEeeEEE--EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc-------cccc Q lcl|NC_018838. 64 VKPSA--SVDVSAFTAQPIKV--VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-------DPAT 132 (315) Q Consensus 64 ~~~~s--~~~~~~v~l~~~kl--~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~-------g~~~ 132 (315) +...+ +++.++.+|..-++ .. ..|-+-=-.++.. .+.+.+.+++++++++.+|++++.-. .+.+ T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~-~~VddiD~~q~~~----D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~ 153 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTAD-VLIYDIEDAMNHY----DVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYN 153 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhh-hhHhhHHHHhcCc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 87654 46677755544332 22 1222110011111 36777889999999999998776311 1111 Q ss_pred ccccccccccc----cccc----cccccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccc Q lcl|NC_018838. 133 GKPAAAVKVSL----DKTT----KTVDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQP 203 (315) Q Consensus 133 ~~~~~~~~~~~----~~~~----~~~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~ 203 (315) + .+.++.... .... .........++.|..+...+.+.+...... .+++|..+..|..-+.-. ...+ T Consensus 154 ~-~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~----~~~~ 228 (345) T protein:vir:22 154 E-NIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPN----AANY 228 (345) T ss_pred c-cccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccc----cccc Confidence 1 111111111 1111 111112234666777766776655544444 568999999885432211 1111 Q ss_pred cc-cccccCCCccccceeeEeecccCcccccc-----------------------ccccceEEEecccceEEEeeccceE Q lcl|NC_018838. 204 MY-PAAGFAGLDNWRGLNVGASSTVSGAPEMS-----------------------PASGVKAIVGDFSRVHWGFQRNFPI 259 (315) Q Consensus 204 ~~-~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~-----------------------~~~~~~~~~gDf~~~~i~~~~~~~v 259 (315) .- .....|..+++.|.+|+.++++|...... ..++..+++.-.+.+......++.+ T Consensus 229 ~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~ 308 (345) T protein:vir:22 229 AALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLAL 308 (345) T ss_pred ccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeeccee Confidence 10 11234556789999999999988432210 1112233333333333444445555 Q ss_pred EEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 260 ELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 260 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +..++.. .|. -.+++.+-+|.+++||++.+.|+.+.. T Consensus 309 e~~r~~~-------~~~---d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 309 ERARRAN-------FQA---DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeechh-------HHH---HHHHHHHhcCCcccccceeEEEEEeeC Confidence 5555431 222 256778889999999999999988775 No 130 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.39 E-value=3.2e-13 Score=89.07 Aligned_cols=280 Identities=8% Similarity=0.009 Sum_probs=162.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcce---------e--ecCCCceEEEEEeCC-ceeEEeecccccCCC Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPE---------Q--PTIFGPVKGAVFSGV-PRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~---------~--~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s 68 (315) ||.+.. ...++|+.+..-+.+.+.+.+.+.+-+-+ . ..++..+++|.+..- ..+.-+.|+.+++.+ T Consensus 1 MA~T~l--sd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MAYTKI--SDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CCceee--eceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 996654 48899999988777777776655332211 1 134556899999763 577888999999999 Q ss_pred ccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhh---cccccccccccccccccccc Q lcl|NC_018838. 69 SVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAF---HGIDPATGKPAAAVKVSLDK 145 (315) Q Consensus 69 ~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~---~G~g~~~~~~~~~~~~~~~~ 145 (315) +.+-++-.-..++.+....++++-...+..|.... |.++++..+.+..+..++ .|.-..+....+... . T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~----i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~d----v 150 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQA----IGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLD----I 150 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHH----HHHHHHHHHHHHHHHHHHHHHHHhhhccccccceee----e Confidence 98887777777777777778876655555554443 556666666665555443 221100000000000 0 Q ss_pred cccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 146 TTKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 146 ~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) ...+.....+..+.+++.++-+ +.....+|+||+.+...|++...-+- +.+.-....-++++|++|++++ T Consensus 151 --sa~~~~~~s~~~l~~A~~~~GD-~~~~~~~ivmhS~v~~~L~~~~li~~-------~~~s~~~~~i~~~~G~~VivdD 220 (324) T protein:vir:59 151 --SGTADGIYSAETFVDASYKLGD-HESLLTAIGMHSATMASAVKQDLIEF-------VKDSQSGIRFPTYMNKRVIVDD 220 (324) T ss_pred --eccccceecHHHHHHHHHHhCC-cccCcEEEEEchHHHHHHHHhhhhhh-------ccccccCceeeeecccEEEEeC Confidence 1111122345678888877654 44556789999999999987643211 1111122344789999999999 Q ss_pred ccCccccccccccce-EEEecccceEEEe-eccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEee Q lcl|NC_018838. 226 TVSGAPEMSPASGVK-AIVGDFSRVHWGF-QRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKE 303 (315) Q Consensus 226 ~v~~~~~~~~~~~~~-~~~gDf~~~~i~~-~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~ 303 (315) .||.....+..++.. .+++. ..+.++. ...+.+++.++. .++...+....++. +||..|...+. T Consensus 221 ~~p~~~~~~~~~~y~s~l~~~-GAi~~~~~~~~v~vE~dRd~----------~~g~~~l~~r~~~~---~~p~G~s~~~~ 286 (324) T protein:vir:59 221 SMPVETLEDGTKVFTSYLFGA-GALGYAEGQPEVPTETARNA----------LGSQDILINRKHFV---LHPRGVKFTEN 286 (324) T ss_pred CCCccccCCCCceEEEEEEec-CeEEEeecCCCcceecccCc----------cccceEEEEeeEEE---eEeeeEEeccc Confidence 999655444433333 34443 4455554 334566676653 23344555555544 66666655433 Q ss_pred ccCCCCCCCCCC Q lcl|NC_018838. 304 KAAPKPNPPAGN 315 (315) Q Consensus 304 ~~a~~~~~~~~~ 315 (315) +. ....|-..+ T Consensus 287 ~~-~~~sPt~~~ 297 (324) T protein:vir:59 287 AM-AGTTPTDEE 297 (324) T ss_pred cc-CCCCCChhh Confidence 21 112222222 No 131 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.39 E-value=2.7e-14 Score=95.01 Aligned_cols=286 Identities=14% Similarity=0.043 Sum_probs=157.5 Q ss_pred CCCCccC------CC-c---------eEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeeccc Q lcl|NC_018838. 1 MADDFLS------AG-K---------LELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGE 63 (315) Q Consensus 1 m~~~~~s------~G-g---------~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~ 63 (315) |++.++. .+ + +.| +.++.+|+......|+++++.++..+.+ +.+++|+. +..++.....|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCC Confidence 8866332 11 1 144 7899999999999999999999887774 45788977 667888888888 Q ss_pred ccCCC--ccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc----c--ccccc Q lcl|NC_018838. 64 VKPSA--SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI----D--PATGK 134 (315) Q Consensus 64 ~~~~s--~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~----g--~~~~~ 134 (315) +.+.+ ++.-++++|..-++ ..-..|-+-=-.++.. .+.+.+.++.++++++.+|+.++.-. . ++... T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~----D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHY----DVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred CCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCc----chHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 87654 45666666655443 1112222110011111 36777889999999999998775311 0 00011 Q ss_pred cccccccccc--------ccccccccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCccccccccc Q lcl|NC_018838. 135 PAAAVKVSLD--------KTTKTVDATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) Q Consensus 135 ~~~~~~~~~~--------~~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~ 205 (315) .+.+...... ..+.....+...++.+.++...+...+......| +++|..+..|..-+.-. ...+.- T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~----~~~~~~ 230 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPN----AANYAA 230 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccc----cccccc Confidence 1111111100 0011111112235556667777766655433444 56999988874332111 111110 Q ss_pred -cccccCCCccccceeeEeecccCccccccc----c-c-------cceEEEecccce----------EEEeeccceEEEe Q lcl|NC_018838. 206 -PAAGFAGLDNWRGLNVGASSTVSGAPEMSP----A-S-------GVKAIVGDFSRV----------HWGFQRNFPIELI 262 (315) Q Consensus 206 -~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~----~-~-------~~~~~~gDf~~~----------~i~~~~~~~v~~~ 262 (315) .....|..++++|++|+.++++|....... . . ..-.+..||++. ......++.++.. T Consensus 231 ~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~ 310 (344) T protein:vir:10 231 LIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERA 310 (344) T ss_pred ccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecc Confidence 122345567899999999999985321110 0 0 001112244332 1222334444444 Q ss_pred ccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 263 EYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 263 ~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ++. ..|. -.+++.+-+|.+++||++.+.++.++- T Consensus 311 r~~-------~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 311 RRA-------NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred cch-------hHHH---HHHHHHhhcccceecccceEEEEeecC Confidence 432 1232 256788889999999998866665543 No 132 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.39 E-value=3.8e-14 Score=94.18 Aligned_cols=287 Identities=14% Similarity=0.026 Sum_probs=164.1 Q ss_pred CCCCccCC------------Cc---eEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccc Q lcl|NC_018838. 1 MADDFLSA------------GK---LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEV 64 (315) Q Consensus 1 m~~~~~s~------------Gg---~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~ 64 (315) ||+..... |. +.| +.++.+|.......++++.+.++..+. +..++||+. +..+++....|++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcC Confidence 77554322 11 345 899999999999999999999887655 455788865 5578888889988 Q ss_pred cCCC--ccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhc----ccc--cccccc Q lcl|NC_018838. 65 KPSA--SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GID--PATGKP 135 (315) Q Consensus 65 ~~~s--~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~----G~g--~~~~~~ 135 (315) +..+ ++..++.+|..-++ ..-..|-+-=-.++.. .+.+.+.++.++++++.+|+.++. +.+ +....+ T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~----D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHY----DVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNEN 154 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCc----chHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 7654 57788777766544 2222222210011111 367778899999999999987752 111 111111 Q ss_pred ccccccc----c----cccccccccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCcccccccccc Q lcl|NC_018838. 136 AAAVKVS----L----DKTTKTVDATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQPMYP 206 (315) Q Consensus 136 ~~~~~~~----~----~~~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~~ 206 (315) +.+.+.. + .............|+.+.++...|.+++......| +++|+.+..|.+..+..- .+.. ... T Consensus 155 ~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~--~~~~-~~~ 231 (347) T protein:vir:94 155 IAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNA--ANYQ-ALI 231 (347) T ss_pred cccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccc--cccc-ccc Confidence 1121111 0 01111111122335667777777766655433345 457999888865332221 1111 112 Q ss_pred ccccCCCccccceeeEeecccCccccccccccc----------------eEEEecccceE----------EEeeccceEE Q lcl|NC_018838. 207 AAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV----------------KAIVGDFSRVH----------WGFQRNFPIE 260 (315) Q Consensus 207 ~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~----------------~~~~gDf~~~~----------i~~~~~~~v~ 260 (315) ++..|..+++.|++|+.++++|........... .-+-+||++.. .....++.++ T Consensus 232 ~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e 311 (347) T protein:vir:94 232 DPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALE 311 (347) T ss_pred ccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhccccee Confidence 445677789999999999999864322111110 01223443322 2223344444 Q ss_pred EeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 261 ~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +.++. .++. ..+.+.+-+|.+++||++-+.++.+.| T Consensus 312 ~~~~~--------~~~~--~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 312 RARRA--------NFQA--DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred eeech--------hhhh--hhhhhhhhhcCcccccceeEEEEecCC Confidence 44332 1222 356777889999999999998876666 No 133 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.39 E-value=3.5e-14 Score=94.34 Aligned_cols=287 Identities=11% Similarity=0.010 Sum_probs=159.5 Q ss_pred CCCCccCC------------Cc---eEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccc Q lcl|NC_018838. 1 MADDFLSA------------GK---LELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEV 64 (315) Q Consensus 1 m~~~~~s~------------Gg---~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~ 64 (315) ||+.++.. +. +.| ++++.+|+......|.++.+.++....+ ..++||+. +..++.....|+. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGEN 78 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cceeeeeeccccC Confidence 76544321 11 234 8899999999999999999998876554 45788865 5567788788877 Q ss_pred cCCC--ccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccc------ccccc Q lcl|NC_018838. 65 KPSA--SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDP------ATGKP 135 (315) Q Consensus 65 ~~~s--~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~------~~~~~ 135 (315) +..+ ++..++++|..-++ ..-..|.+-=..+... .+.+.+.++.++++++++|+.++.-... ..... T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~----D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHY----DVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcC----CchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 6543 56777777766554 2222333221111112 2566788888999999999877532110 01111 Q ss_pred ccccccccccc-c------cccccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhcc-Ccccccccccc Q lcl|NC_018838. 136 AAAVKVSLDKT-T------KTVDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPK-GSPLAGQPMYP 206 (315) Q Consensus 136 ~~~~~~~~~~~-~------~~~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~-g~~~~~~~~~~ 206 (315) ..|+....... + .........++.|..+...+.+++...... ++++|..+..|.+-...+ ..+. .. . T Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~---~ 230 (347) T protein:vir:88 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA-AL---I 230 (347) T ss_pred cCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhc-cc---c Confidence 12221111000 0 000111123566777777776655433333 678899888875432221 1111 11 1 Q ss_pred ccccCCCccccceeeEeecccCccccccc----c------------ccceEEEecccceEEEe----------eccceEE Q lcl|NC_018838. 207 AAGFAGLDNWRGLNVGASSTVSGAPEMSP----A------------SGVKAIVGDFSRVHWGF----------QRNFPIE 260 (315) Q Consensus 207 ~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~----~------------~~~~~~~gDf~~~~i~~----------~~~~~v~ 260 (315) ++..|..++++|++|+.++++|....... . ....-+.+||++..-.. ..++.++ T Consensus 231 ~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e 310 (347) T protein:vir:88 231 DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) T ss_pred chhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceee Confidence 34566778999999999999984221100 0 00011334554322222 2233344 Q ss_pred EeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 261 ~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) ..+.. ..| . -.+++.+.+|.+++||++.+.|+...+. T Consensus 311 ~~r~~-------~~~-~--d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 311 RARRP-------EFQ-A--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred eeech-------hhH-H--HHhhhhhhhcCceeccceEEEEEeCCCC Confidence 33322 112 2 3677889999999999999888754443 No 134 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.38 E-value=2.5e-13 Score=89.66 Aligned_cols=284 Identities=11% Similarity=0.033 Sum_probs=154.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee---------ecCCCceEEEEEeC-CceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ---------PTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~---------~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~ 70 (315) ||.+.. ...++|+.+..-+.+...+.+.+.+-+-+. .-++..+++|.+.. +.++.-+.|+.+++..+. T Consensus 1 MA~T~l--sd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MAETHL--SDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CCceee--eeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 997654 588999999887767666666554422222 12455689999975 357788899999998888 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhh---ccc-cccccccccccccccccc Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAF---HGI-DPATGKPAAAVKVSLDKT 146 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~---~G~-g~~~~~~~~~~~~~~~~~ 146 (315) +-++-....++.+....++++-...+..|.... |+++++...++..+..++ .|. +.......+ ....+ T Consensus 79 tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~----i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~----~~d~t 150 (351) T protein:vir:15 79 TSGKQQGIKFYQTKAYGYTDLGTMISGAPVQET----IGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSK----VYDQT 150 (351) T ss_pred cccceeEEEEeeccceehhhhhHhhccchHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccc----eeccc Confidence 777666666777777778876555555555444 555555555555554433 222 000000000 00111 Q ss_pred ccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccC-ccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 147 TKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKG-SPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 147 ~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g-~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) ....+.....+..+.+++..+-+.....-.+|+||+.++..|++...-+- ++. -....-++++|++|++++ T Consensus 151 ~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s--------~~~~~i~t~~G~~VivdD 222 (351) T protein:vir:15 151 KVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQ--------NGATPFEAYNGLRIVLDD 222 (351) T ss_pred cccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhcccc--------ccCcccceecceEEEEcC Confidence 11112223445678888888765544445789999999999987652211 111 112234789999999999 Q ss_pred ccCccccccccccc-eEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 226 TVSGAPEMSPASGV-KAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 226 ~v~~~~~~~~~~~~-~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) .||.........+. ..+||. ..+.++.. +..+++.++....+ ++-.+..+.++ +.||..|..-+.. T Consensus 223 ~~p~~~~~~~~~~ytsyl~~~-GAi~~~~~-~~~ve~~rd~~~~~--------g~d~l~~r~~~---~~hp~G~s~~~~~ 289 (351) T protein:vir:15 223 DIEIDLTDKTKPVSTSYIFAP-GAVRYSTN-MRSTETKYDPLING--------GQDVIVQKRVG---TIHVAGTSIKASF 289 (351) T ss_pred CCccccCCCCCceeEEEEEec-ceeeeecC-CcCcceeecccCCC--------CceEEEEeeee---eeeeeeeeecccc Confidence 99965444333322 334433 33434433 23355555543322 22222222222 3555555543211 Q ss_pred cCCCCCCCCCC Q lcl|NC_018838. 305 AAPKPNPPAGN 315 (315) Q Consensus 305 ~a~~~~~~~~~ 315 (315) .....+-|... T Consensus 290 ~~~~~~sPt~~ 300 (351) T protein:vir:15 290 SPSKASFPTID 300 (351) T ss_pred cccCcCCcChH Confidence 11111112111 No 135 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.38 E-value=2e-14 Score=95.68 Aligned_cols=287 Identities=13% Similarity=0.031 Sum_probs=155.0 Q ss_pred CCCCccCCC--------------ceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeeccccc Q lcl|NC_018838. 1 MADDFLSAG--------------KLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVK 65 (315) Q Consensus 1 m~~~~~s~G--------------g~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~ 65 (315) |++...+.= ...| +++..+++......|+++.+.++..+.+ ..++||+. +..+++....|+++ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERL 78 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCc Confidence 777665431 1234 6889999999999999999998887664 45788887 66777877878777 Q ss_pred CCC--ccceeeEEEeeEEEEEeehhhHHHhccCh-hhhHHHHHHHHHHHHHHHHHHHHHHhhhcc----c---ccccccc Q lcl|NC_018838. 66 PSA--SVDVSAFTAQPIKVVTQQRVSDEFMWADA-DYRLGVLQDLISPALGASIGRAVDLIAFHG----I---DPATGKP 135 (315) Q Consensus 66 ~~s--~~~~~~v~l~~~kl~~~~~iS~ell~~~~-~d~~~~l~~~i~~~la~~i~~~~d~a~~~G----~---g~~~~~~ 135 (315) +.+ +.+=.+++|...++- .++.++.+-. .-....+.+.+.++.+++|++.+|+.++.- . ++.. .. T Consensus 79 ~~~~~~~~~~e~~itID~~~----~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~-~~ 153 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLL----TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASN-EN 153 (347) T ss_pred CCCCCCCCcceEEEEecchh----hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cc Confidence 543 234445444443331 1122221100 000113667788899999999999877521 0 1111 11 Q ss_pred ccccc-ccc-cccccccc-----cccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccccc Q lcl|NC_018838. 136 AAAVK-VSL-DKTTKTVD-----ATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYPA 207 (315) Q Consensus 136 ~~~~~-~~~-~~~~~~~~-----~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~ 207 (315) ..++. ... ........ .....++.|.++...+...+...... .+++|..+..|..-+.-......+. .. T Consensus 154 ~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~---~~ 230 (347) T protein:vir:94 154 IAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAAL---ID 230 (347) T ss_pred cCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccc---cc Confidence 11111 111 00011100 11222445556666665554433333 5789999887743222111110010 12 Q ss_pred cccCCCccccceeeEeecccCccccccccc---------cceE--------EEecccceEE----------EeeccceEE Q lcl|NC_018838. 208 AGFAGLDNWRGLNVGASSTVSGAPEMSPAS---------GVKA--------IVGDFSRVHW----------GFQRNFPIE 260 (315) Q Consensus 208 ~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~---------~~~~--------~~gDf~~~~i----------~~~~~~~v~ 260 (315) ...|..++++|.+|+.|+++|......... ..-. +-+||++..- ....+++++ T Consensus 231 ~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e 310 (347) T protein:vir:94 231 PETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALE 310 (347) T ss_pred ccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccccccc Confidence 345666899999999999999533221110 0011 2233332221 112222333 Q ss_pred EeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 261 ~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) ..++. ..|. -.+++.+.+|.+++||++.+.|+..+|+ T Consensus 311 ~~r~~-------~~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 311 RDRDV-------DAQG---DLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred chhch-------hhHH---HHhhhhhhhcCcccccceeEEEEecCCC Confidence 33321 1222 3678889999999999999999988877 No 136 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.37 E-value=5.7e-13 Score=87.71 Aligned_cols=292 Identities=9% Similarity=-0.005 Sum_probs=157.8 Q ss_pred CCCCccCCCc--------eEcc-hhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGK--------LELP-GSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg--------~~vP-~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |+.....+++ .-+. +++..++.......++++.+.++..+.++ ..++|+. +..+++...-|+....+.+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~~~~ 79 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDASPT 79 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCCCCc Confidence 7776554432 2233 78999999999999999999988877654 4788887 5567777766666665666 Q ss_pred ceeeEEEeeEEEE-EeehhhH--HHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc---cc----cccccccccc Q lcl|NC_018838. 71 DVSAFTAQPIKVV-TQQRVSD--EFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI---DP----ATGKPAAAVK 140 (315) Q Consensus 71 ~~~~v~l~~~kl~-~~~~iS~--ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~---g~----~~~~~~~~~~ 140 (315) .-++.+|..-.+- .-..|-+ |.. ++ .| .+++.+.+++++++++.+|+.++.-. ++ +....+.+.. T Consensus 80 ~~~k~~itID~ll~a~~~V~diDe~q-~~-~D---~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~ 154 (364) T protein:vir:10 80 EFDKNRLVVDTTVIARNTVAHFHDVQ-ND-ID---GLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAG 154 (364) T ss_pred ccCcEEEEecceeeechhhhhHHHHh-cC-cc---chhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccC Confidence 6777666665432 1122211 111 11 11 24677888888999999999774210 00 0000111111 Q ss_pred c--cccccccccccc---cchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccc---cccccC Q lcl|NC_018838. 141 V--SLDKTTKTVDAT---DSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMY---PAAGFA 211 (315) Q Consensus 141 ~--~~~~~~~~~~~~---~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~---~~~~~~ 211 (315) . .+.......+.. ....+.+.++...+.+.+...... .+++|..+..|.+- .+-++..+.. .+...| T Consensus 155 ~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~----~~lvn~d~~~~~~~~~~~G 230 (364) T protein:vir:10 155 HGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDA----DRIVDKSYTIAASDNTVDG 230 (364) T ss_pred CcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcC----CccccccccccCCCccccc Confidence 0 111111111000 011122334445555544433333 57889999888552 1222222211 112344 Q ss_pred CCccccceeeEeecccCccccccccc--------------cceEEEecccc----------eEEEeeccceEEEeccCCc Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSPAS--------------GVKAIVGDFSR----------VHWGFQRNFPIELIEYGDP 267 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~~~--------------~~~~~~gDf~~----------~~i~~~~~~~v~~~~~~~~ 267 (315) ...++.|.||+.|+++|........+ ...-..+||+. +..+...++..++.++... T Consensus 231 ~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~ 310 (364) T protein:vir:10 231 FVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKE 310 (364) T ss_pred eeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccce Confidence 45689999999999999643321110 00001244432 2233334444444433211 Q ss_pred cccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 268 DQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 268 ~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) | ...+.+.+-+|.+++||++.+.|+... ...|++| T Consensus 311 -------~---~~~ida~~a~G~g~lRPeaa~~i~~~~---~~~~~~~ 345 (364) T protein:vir:10 311 -------K---TWYIDTFLAEGAIPDRWEAVAVVTAAD---TAELATD 345 (364) T ss_pred -------e---eeeeeeehcccCcccCccceEEEEecC---CCCCccc Confidence 1 223445667999999999999997655 5667777 No 137 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.35 E-value=2.4e-13 Score=89.75 Aligned_cols=293 Identities=12% Similarity=0.007 Sum_probs=162.5 Q ss_pred CCCCc----------cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCc Q lcl|NC_018838. 1 MADDF----------LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~~~----------~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~ 69 (315) |++-. .+.-.+.| ++++.+|.......++++++.++..+.++ ..++|+. +..+++...-|+++..+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCCC Confidence 55443 22223455 89999999999999999999998887754 4788877 667888888888887777 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHH-HHHHHHHHHHHHHHHHHHHHhhhc----ccc--cccccc---cccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLG-VLQDLISPALGASIGRAVDLIAFH----GID--PATGKP---AAAV 139 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~-~l~~~i~~~la~~i~~~~d~a~~~----G~g--~~~~~~---~~~~ 139 (315) +..++.++..-.+- +++.++.+-.+.... .+++.+.+++++++++.+|++++. +.. +..... ..|+ T Consensus 79 ~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred ccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 77777666665442 222222111111111 267788999999999999997752 111 000000 0122 Q ss_pred cccccccccc-cccccchhHHHHHHHHHhhhccccc----ceEEEEeHHHHHHHHHHhhccCccccccccc----ccccc Q lcl|NC_018838. 140 KVSLDKTTKT-VDATDSATTDLVKAVGLIAGAGLQV----PNGVALDPAFSFALSTEVYPKGSPLAGQPMY----PAAGF 210 (315) Q Consensus 140 ~~~~~~~~~~-~~~~~~~~~di~~~~~~~~~~~~~~----~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~----~~~~~ 210 (315) ......++.. .+..+...+-+..+...+.+.+... .-..+++|+.+..|..-. +-++..+.- .+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~----~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHD----KLMNVEYQATGATNDYVK 230 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccc----cccccccccccccccccC Confidence 2221111111 1111111233445555555444321 124678999999885532 223322211 11234 Q ss_pred CCCccccceeeEeecccCcccccccc--ccceEEEecccceE----------EEeeccceEEEeccCCccccchhhhhcC Q lcl|NC_018838. 211 AGLDNWRGLNVGASSTVSGAPEMSPA--SGVKAIVGDFSRVH----------WGFQRNFPIELIEYGDPDQTGRDLKGHN 278 (315) Q Consensus 211 ~~~~~l~G~Pv~~s~~v~~~~~~~~~--~~~~~~~gDf~~~~----------i~~~~~~~v~~~~~~~~~~~~~~~f~~~ 278 (315) +....+.|.||+.++++|....++.. .....+-|||.... .+..+++..++.++.. -|. T Consensus 231 g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-------~~~-- 301 (335) T protein:vir:63 231 SRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-------KFS-- 301 (335) T ss_pred ceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc-------hhh-- Confidence 45678999999999999966544321 11123445664333 2222233333222211 111 Q ss_pred cEEEEEEEEeccEeecccceEEEeeccCCCCCCCC Q lcl|NC_018838. 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPA 313 (315) Q Consensus 279 ~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~ 313 (315) -.+.+.+-+|.+++||++.+.++..-..+.---+ T Consensus 302 -~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 302 -WVLDTFQMYNIGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred -HHhHHHHHcCCcccccceEEEEEEcCCCceeecC Confidence 2344556699999999999999865443333333 No 138 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.35 E-value=2.2e-13 Score=89.92 Aligned_cols=293 Identities=12% Similarity=0.007 Sum_probs=161.8 Q ss_pred CCCCc----------cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCc Q lcl|NC_018838. 1 MADDF----------LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~~~----------~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~ 69 (315) |++-. .+.-.+.| ++++.+|.......++++++.++..+.++ .+++|+. +..+++...-|++...+. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCCC Confidence 55433 23333456 89999999999999999999988887654 4789976 667888888888877776 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhH-HHHHHHHHHHHHHHHHHHHHHhhhc----c--cccccccc---cccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRL-GVLQDLISPALGASIGRAVDLIAFH----G--IDPATGKP---AAAV 139 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~-~~l~~~i~~~la~~i~~~~d~a~~~----G--~g~~~~~~---~~~~ 139 (315) +..++.++..-.+- +++.++.+-.+... -.+++.+.+++++++++.+|++++. + ..+....+ ..|+ T Consensus 79 ~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred cccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 77777666665432 22222211111111 1367888999999999999998752 1 11111100 0121 Q ss_pred ccccccccccc-ccccchhHHHHHHHHHhhhccccc----ceEEEEeHHHHHHHHHHhhccCccccccccc----ccccc Q lcl|NC_018838. 140 KVSLDKTTKTV-DATDSATTDLVKAVGLIAGAGLQV----PNGVALDPAFSFALSTEVYPKGSPLAGQPMY----PAAGF 210 (315) Q Consensus 140 ~~~~~~~~~~~-~~~~~~~~di~~~~~~~~~~~~~~----~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~----~~~~~ 210 (315) ......++... ........-+..+...+.+.+... .-..+++|+.+..|..-. +-++..+.- .+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~----~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHD----KLMSVEYQATGATNDYVK 230 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccc----cccccccccccccccccc Confidence 11111111111 111111223333444444333321 124789999999885532 223322211 11234 Q ss_pred CCCccccceeeEeecccCcccccccc------------ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcC Q lcl|NC_018838. 211 AGLDNWRGLNVGASSTVSGAPEMSPA------------SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHN 278 (315) Q Consensus 211 ~~~~~l~G~Pv~~s~~v~~~~~~~~~------------~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~ 278 (315) +....++|.||+.++++|....++.. ++..++++--+.+..+...++..++.++.. .|. T Consensus 231 g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-------~~~-- 301 (335) T protein:vir:78 231 SRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-------QFS-- 301 (335) T ss_pred ceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-------hhh-- Confidence 45578999999999999965433211 112333333233333333333444433221 121 Q ss_pred cEEEEEEEEeccEeecccceEEEeeccCCCCCCCC Q lcl|NC_018838. 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPA 313 (315) Q Consensus 279 ~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~ 313 (315) -.+.+.+-+|.+++||++.+.|+..-.++.---+ T Consensus 302 -~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 302 -WVLDTFQMYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred -HhhhHHHHcCCcccCcceEEEEEecCCCcccccC Confidence 2344556699999999999999866555544444 No 139 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.33 E-value=1.2e-12 Score=85.92 Aligned_cols=288 Identities=9% Similarity=0.032 Sum_probs=157.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee---------cCCCceEEEEEeC-CceeEEeeccc-ccCCCc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP---------TIFGPVKGAVFSG-VPRAKIVGEGE-VKPSAS 69 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~---------~~~~~~~ip~~~~-~~~a~wv~Eg~-~~~~s~ 69 (315) ||++++.-...++|+.+..-+.+...+.+.+.+-+-+.+ -++..+++|.+.. +..+.-+.||+ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 999888888999999998877777766665544332222 3456689999974 35777788986 688888 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT 149 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~ 149 (315) .+-++-....++.+....++++-...+..|....+..++++...+...+.+. +.+.|.-.................... T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~ll-a~l~gvf~~~~~~~~~~~~~~~~~~~~ 159 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALI-ATLNGIFATGTAGEKGALEETHVSDQS 159 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHH-HHHHhhhhhhhcccchhhhhhheeccc Confidence 8777777777777777777777655555565555444444433333322222 222222100000000000000001111 Q ss_pred cccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCc Q lcl|NC_018838. 150 VDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSG 229 (315) Q Consensus 150 ~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~ 229 (315) .+.....+..+.++..++-+ +.....+|+||+.++..|++...-+- +.+....+.-++++|++|++++.||. T Consensus 160 ~~~a~~s~~~l~~A~~~~GD-~~~~~~~ivmhS~v~~~L~~~~li~~-------~~~s~~~~~i~~~~G~~VivdD~~p~ 231 (330) T protein:vir:10 160 KASTGIDAGMVLDAKQLLGD-SADQVTAIAMHSAVYTKLQKDNLIQY-------IQPTTATINIPTYLGYRVIIDDGIAP 231 (330) T ss_pred ccccccCHHHHHHHHHHhcc-ccccceEEEEcHHHHHHHHHhhhhhh-------hcccccCcccccccceEEEEeCCCCC Confidence 11223345667777777644 34456789999999999987542211 11111223447899999999999985 Q ss_pred cccccccccceEEEecccceEEEe---eccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 230 APEMSPASGVKAIVGDFSRVHWGF---QRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 230 ~~~~~~~~~~~~~~gDf~~~~i~~---~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ... .....+|+. ..+.++. .+.+.+|++++.. +++..+....++ ++||..|..-+.... T Consensus 232 ~~~----~yt~yl~~~-GAi~~~~~~~~~~v~~EtdRd~~----------~g~~~l~~r~~~---~~hp~G~s~~~~~~~ 293 (330) T protein:vir:10 232 TGD----IYTSYLFRT-GSIGLNTGNPSGLTTFETSREAA----------KGNDMIYTRRAL---VMHPYGVKWTGAEVD 293 (330) T ss_pred CCC----ceeEEEEec-CceeeecccCCccccccccCCcc----------ccceEEEEeeEE---Eeeeeeeeecccccc Confidence 321 222334442 3444442 2234566666532 223334334443 466777665543221 Q ss_pred CCCCCCCCC Q lcl|NC_018838. 307 PKPNPPAGN 315 (315) Q Consensus 307 ~~~~~~~~~ 315 (315) ..-+-|... T Consensus 294 ~~~~sPt~~ 302 (330) T protein:vir:10 294 AGNITPSNA 302 (330) T ss_pred cCcCCcChH Confidence 111122222 No 140 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.31 E-value=3.6e-13 Score=88.78 Aligned_cols=288 Identities=13% Similarity=0.041 Sum_probs=155.0 Q ss_pred CCCCccCC--------Cc-------eEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccc Q lcl|NC_018838. 1 MADDFLSA--------GK-------LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEV 64 (315) Q Consensus 1 m~~~~~s~--------Gg-------~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~ 64 (315) ||+..+.. || ..| +.++.+|....+..|+++.+.++.... +..++||+. +..+++....|+. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~i-G~~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeec-cceeeeeecCCCC Confidence 77543322 11 356 899999999999999999999876655 455788876 4466677777777 Q ss_pred cCCC--ccceeeEEEeeEEEE-EeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhc-----cccc---ccc Q lcl|NC_018838. 65 KPSA--SVDVSAFTAQPIKVV-TQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH-----GIDP---ATG 133 (315) Q Consensus 65 ~~~s--~~~~~~v~l~~~kl~-~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~-----G~g~---~~~ 133 (315) ++.+ ++...+.+|..-++- .-..|.+-=-.++.. .+.+.+.++.++++++..|+.++. +... ... T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~----D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHY----DVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCC----chhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 6543 355666555433221 111122110011111 256678888899999999987752 1100 000 Q ss_pred cc-ccc--ccccccc-cc---cccccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccc Q lcl|NC_018838. 134 KP-AAA--VKVSLDK-TT---KTVDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) Q Consensus 134 ~~-~~~--~~~~~~~-~~---~~~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~ 205 (315) .. +.+ ....... +. .....+...|+.+.++...|..++...... .+++|..+..|.+-..-. +..+.- T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~----~~d~~~ 230 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPN----AANYQA 230 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccc----cccccc Confidence 00 000 0000000 10 111112334666777777776665543344 568999988885432211 111111 Q ss_pred -cccccCCCccccceeeEeecccCcccccccccc-------------ceEEEecccce--------EEEe--eccceEEE Q lcl|NC_018838. 206 -PAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG-------------VKAIVGDFSRV--------HWGF--QRNFPIEL 261 (315) Q Consensus 206 -~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~-------------~~~~~gDf~~~--------~i~~--~~~~~v~~ 261 (315) ..+..|..++++|++|+.|+++|.......... ...+-++|+.. .+|. .+++.++. T Consensus 231 ~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~ 310 (347) T protein:vir:33 231 LLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALER 310 (347) T ss_pred ccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeee Confidence 124456678999999999999997543211100 01122333221 1222 22233444 Q ss_pred eccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 262 IEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 262 ~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) .++.. .| .-.+++.+.+|.+++||++.+.|+.+-..| T Consensus 311 ~r~~~-------~~---~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 311 ARRAN-------YQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ccchh-------hh---hHhhhhhhhcCCceecccceEEEecCCCCC Confidence 43321 12 135667788899999999999997543333 No 141 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.27 E-value=1.5e-12 Score=85.43 Aligned_cols=289 Identities=11% Similarity=0.021 Sum_probs=156.4 Q ss_pred CCCCccC-------CCc----eEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccccCCC Q lcl|NC_018838. 1 MADDFLS-------AGK----LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~~~~~s-------~Gg----~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) |+..--+ +|. ++| +.++.+|++.....|+++.+.++.... +..++||+. +..+++....|+.+... T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~~l~~~ 84 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGTPIVGD 84 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCCCCCCC Confidence 4433222 232 455 899999999999999999999877665 455888887 55677776666665432 Q ss_pred -ccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc--cccccccccccccc-- Q lcl|NC_018838. 69 -SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI--DPATGKPAAAVKVS-- 142 (315) Q Consensus 69 -~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~--g~~~~~~~~~~~~~-- 142 (315) +++-++++|..-+. +....|-+ +-+ ......|.+.+.++.++++++.+|+.++.-. +.....+.++...+ T Consensus 85 ~~~~~~~~~l~ID~~ky~~~~Vdd-iD~---~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~ 160 (332) T protein:vir:78 85 AGIKANEKTLVMDDLLVSSQFVYS-LDE---IFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFH 160 (332) T ss_pred CCCCCceEEEEEehhhhhHHHHHh-HHH---HhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccc Confidence 45555555555432 12222221 111 1111236777889999999999998765311 10011111111111 Q ss_pred ccccccccccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCccccccccc-c-ccccC-CCccccc Q lcl|NC_018838. 143 LDKTTKTVDATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQPMY-P-AAGFA-GLDNWRG 218 (315) Q Consensus 143 ~~~~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~-~-~~~~~-~~~~l~G 218 (315) +..+..........|+-|.++...+...+......| +++|..+..|.+.++. +-.+....- . .+..+ .-++++| T Consensus 161 ~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~--~~~n~~~~~~~~~~~~g~~i~~i~G 238 (332) T protein:vir:78 161 VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDT--NILNREIGNSQGDMNSGKGLYSIAG 238 (332) T ss_pred cccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCc--eeeeeeccccccceecceeeeEEee Confidence 001111111123346677788778876665444455 5699999888553332 111110000 0 11222 2468999 Q ss_pred eeeEeecccCccccccccc-----cceEEEecccceEE--------Ee--eccceEEEeccCCccccchhhhhcCcEEEE Q lcl|NC_018838. 219 LNVGASSTVSGAPEMSPAS-----GVKAIVGDFSRVHW--------GF--QRNFPIELIEYGDPDQTGRDLKGHNEVMVR 283 (315) Q Consensus 219 ~Pv~~s~~v~~~~~~~~~~-----~~~~~~gDf~~~~i--------~~--~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r 283 (315) .+|+.|+++|......... ..-.+-|||+.... +. ..++.+++.+.-... ..| .-.++ T Consensus 239 ~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~----~~~---~d~i~ 311 (332) T protein:vir:78 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV----QYQ---GDLIV 311 (332) T ss_pred eEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccch----hhh---Hhhhh Confidence 9999999999654332211 11234555554221 11 122233222211111 122 13567 Q ss_pred EEEEeccEeecccceEEEeec Q lcl|NC_018838. 284 AEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 284 ~~~r~~~~v~~~~af~~l~~~ 304 (315) +.+.+|.+++||++.+.|+.+ T Consensus 312 ~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 312 GKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhhcCceecccceEEEeeC Confidence 778899999999999999888 No 142 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.25 E-value=1.4e-12 Score=85.55 Aligned_cols=289 Identities=12% Similarity=0.030 Sum_probs=150.9 Q ss_pred CCCCccCC---------Cc------eEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccc Q lcl|NC_018838. 1 MADDFLSA---------GK------LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEV 64 (315) Q Consensus 1 m~~~~~s~---------Gg------~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~ 64 (315) |+++.+-. |. ..| +.++.+|+...+..|.++.+.++.... +..++||+.. ..+++....|+. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~i-e~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGEN 78 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHH-HHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCC Confidence 77765422 11 122 578899999999999999999877655 4557888774 467777777877 Q ss_pred cCCC--ccceeeEEEeeEEEE-EeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc------cccc--c Q lcl|NC_018838. 65 KPSA--SVDVSAFTAQPIKVV-TQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI------DPAT--G 133 (315) Q Consensus 65 ~~~s--~~~~~~v~l~~~kl~-~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~------g~~~--~ 133 (315) ++.+ +.+..+.+|..-+.- .-..|-+-=-.++.. .+.+.+.++.++++++.+|+.++.-. .+.. + T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~----D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHY----DVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 6543 456666555543321 111221110011111 36677888889999999998775211 0000 0 Q ss_pred ccc---ccccccccccccc----cccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCccccccccc Q lcl|NC_018838. 134 KPA---AAVKVSLDKTTKT----VDATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) Q Consensus 134 ~~~---~~~~~~~~~~~~~----~~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~ 205 (315) ... .++.......+.. .......++-+..+...|..++......| +++|..+..|.+-.+-....-.+. T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~--- 231 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL--- 231 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccc--- Confidence 000 0111111111100 00111123344444555655554433445 568999998854332221111111 Q ss_pred cccccCCCccccceeeEeecccCccccccc-----cccceE--------EEecccce--------EE--EeeccceEEEe Q lcl|NC_018838. 206 PAAGFAGLDNWRGLNVGASSTVSGAPEMSP-----ASGVKA--------IVGDFSRV--------HW--GFQRNFPIELI 262 (315) Q Consensus 206 ~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~-----~~~~~~--------~~gDf~~~--------~i--~~~~~~~v~~~ 262 (315) ..+..|..++++|++|+.|+++|....+.. ....-. .-++|+.. .+ ...+++.++.. T Consensus 232 ~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~ 311 (347) T protein:vir:15 232 IDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERA 311 (347) T ss_pred ccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeec Confidence 124456667899999999999996433211 000001 11222111 12 22333444444 Q ss_pred ccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 263 EYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 263 ~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) ++.. .| .-.+++.+.+|.+++||++.+.|+.+-..| T Consensus 312 ~~~~-------~~---~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 312 RRAN-------YQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ccch-------hh---hhhhehhhhcCCceeccccEEEEecCCCCC Confidence 4321 12 235667778899999999999987543333 No 143 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.17 E-value=9.1e-12 Score=81.11 Aligned_cols=289 Identities=12% Similarity=-0.038 Sum_probs=150.2 Q ss_pred CCCCc----------cCCC-ceEcchhHHHHHHHHHHhccchhhhcceeec---CCCceEEEEEeCCceeEEeecccccC Q lcl|NC_018838. 1 MADDF----------LSAG-KLELPGSMIGAVRDRAIDSGVLAKLSPEQPT---IFGPVKGAVFSGVPRAKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~----------~s~G-g~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~---~~~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) ||+-- .++. .-+||+.++.++++.+++.+++..+++.... .+..++||+.. .+++....++..++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCccc Confidence 44322 2211 2478999999999999999998888765432 23468899874 56888889999888 Q ss_pred CCccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccc----ccccc---cccc Q lcl|NC_018838. 67 SASVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGID----PATGK---PAAA 138 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g----~~~~~---~~~~ 138 (315) ..+.+..++++...+. ..-..|+++-...+..| +.+.+.+.++.++++++|+.++.-.. ..... .... T Consensus 80 ~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D----~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~ 155 (381) T protein:vir:80 80 LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYT----LRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTT 155 (381) T ss_pred ccccCCceEEEEEeeeeecceeechHHHHhhccC----hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Confidence 8887777776666333 34456666433333333 55667777788888888887653211 00100 0111 Q ss_pred ccccccccccccccccchhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccC-ccccccccccccccCCCccc Q lcl|NC_018838. 139 VKVSLDKTTKTVDATDSATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKG-SPLAGQPMYPAAGFAGLDNW 216 (315) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g-~~~~~~~~~~~~~~~~~~~l 216 (315) +.................++.|.++...+.+++..... .++++|.....|.+...-.. .+... ..+..|..++| T Consensus 156 i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~----~~l~~G~Ig~i 231 (381) T protein:vir:80 156 LGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQV----KPVTSGVVGTI 231 (381) T ss_pred ccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccc----hhhhceeeeEE Confidence 11111111111112234577888888888766543333 46789999998865321111 11111 23556677899 Q ss_pred cceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEee-cc Q lcl|NC_018838. 217 RGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIE-SL 295 (315) Q Consensus 217 ~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~-~~ 295 (315) +|++|+.++++|........ .-.|-.... ... +.-.++. .-|..+..++|....+|..+. +- T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~----~~agap~~~----~~~--~~~~~~~-------g~~s~~a~av~~~k~yd~~~~~~~ 294 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYV----NGQGAPTQP----TPG--VLGSPYL-------PDQAGTANVVNTGSASDLAVSLSY 294 (381) T ss_pred cceEEEeeccccccccccee----eeccccccc----ccc--ccccccc-------cccccceeeeeeeeeeceeeeeee Confidence 99999999999864332111 101100000 000 0000000 013334456666666666552 22 Q ss_pred cceEEEeeccC-------------------CCCC-----CCC---------CC Q lcl|NC_018838. 296 DSFAVVKEKAA-------------------PKPN-----PPA---------GN 315 (315) Q Consensus 296 ~af~~l~~~~a-------------------~~~~-----~~~---------~~ 315 (315) ..+-...++-. .... |-. ++ T Consensus 295 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (381) T protein:vir:80 295 FGLPVFSGAGATAADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESS 347 (381) T ss_pred ccceeeecceeeecCCCceeeeehhhhhhhhhcccccccccccceeEeecccc Confidence 22222221110 0000 000 11 No 144 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.13 E-value=5.6e-11 Score=76.77 Aligned_cols=296 Identities=11% Similarity=0.016 Sum_probs=155.7 Q ss_pred CCCCccCC------------C------ceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeec Q lcl|NC_018838. 1 MADDFLSA------------G------KLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGE 61 (315) Q Consensus 1 m~~~~~s~------------G------g~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E 61 (315) |++..-+. | .+.| +.++.+|.......|+++.+.++..+.+ +.++||+. +..+++...- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecC Confidence 44332221 1 1234 7899999999999999999998877764 45788877 5567776665 Q ss_pred ccccCC---CccceeeEEEeeEEE-EEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc------ccc Q lcl|NC_018838. 62 GEVKPS---ASVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI------DPA 131 (315) Q Consensus 62 g~~~~~---s~~~~~~v~l~~~kl-~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~------g~~ 131 (315) |+++.. .+....+.+|..-++ +.-..|.+-=-.++ ...|.+.+.++.++++++.+|+.++.-. ..+ T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa----~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p 154 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLA----HYELRGEISKKIGYALAEKYDRLIFRSITRGARSASP 154 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhc----CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 655432 244444444444332 11122221100111 1136777889999999999998775211 000 Q ss_pred -ccc--ccccccccccccccc---cccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCcccccccc Q lcl|NC_018838. 132 -TGK--PAAAVKVSLDKTTKT---VDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPM 204 (315) Q Consensus 132 -~~~--~~~~~~~~~~~~~~~---~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~ 204 (315) ++. ...|.......+.+. .......|+.+.++...+.+.+...... .+++|..+..|.+-++.+ +-.+..+. T Consensus 155 ~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~-~~~n~d~~ 233 (375) T protein:vir:10 155 VSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSN-GLVNRDVQ 233 (375) T ss_pred cccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCcc-ceeeeccc Confidence 000 001111111111111 1112344677888887776666544344 568999998885544332 11111111 Q ss_pred ccc-cccCCCccccceeeEeecccCccccccc----------------------------c-------------ccceEE Q lcl|NC_018838. 205 YPA-AGFAGLDNWRGLNVGASSTVSGAPEMSP----------------------------A-------------SGVKAI 242 (315) Q Consensus 205 ~~~-~~~~~~~~l~G~Pv~~s~~v~~~~~~~~----------------------------~-------------~~~~~~ 242 (315) -.. ...+...++.|++|+.++++|....... . ++...+ T Consensus 234 ~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~ 313 (375) T protein:vir:10 234 GSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGL 313 (375) T ss_pred ccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEE Confidence 011 1234456899999999999996543210 0 111222 Q ss_pred EecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec-cCCCCC Q lcl|NC_018838. 243 VGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK-AAPKPN 310 (315) Q Consensus 243 ~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~-~a~~~~ 310 (315) +.-.+.+-.....++.+++.+..+. -.+| ...+.+.+-+|..+.||++-+.|+.. ++|..= T Consensus 314 ~~~~~A~g~v~~~~~~~~~~~~~~~-----~~~q--~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 314 IFQKEAAGVVEAIGPQVQVTNGDVS-----VIYQ--GDVILGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred EEchhheeeeeeeccccccccchhh-----heee--eeeeeeeeeeccCccCceeEEEEecCcCccccC Confidence 2222222222334444444321000 1123 34556677899999999999999754 333333 No 145 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.07 E-value=1.1e-11 Score=80.76 Aligned_cols=273 Identities=12% Similarity=0.011 Sum_probs=144.3 Q ss_pred CCCCccCCCceEcchhHH---HHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCcccee--- Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMI---GAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVS--- 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~---~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~--- 73 (315) ||++-.....=+++.+.. ..+=+-+.....+....|.+||..| .+++|++.-...+.-|+||+++|.++.+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~ 80 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDK 80 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeeee Confidence 999866665555544322 2332223333334444588888865 489999998888899999999999998875 Q ss_pred eEEEeeEEEEEeehhhHHHhc-cChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMW-ADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA 152 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~-~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~ 152 (315) ..+++.+|.+..+ |.|.++ ....+.... -.+.|.++|++++|.-++.-....+. .. ..+. T Consensus 81 t~t~kikK~rK~t--TdEAIqlsGygdpvge----ad~qL~~~ia~kId~D~~~~lktat~-------t~------tg~~ 141 (295) T protein:vir:99 81 DYTVKWFKKRRAT--TAEAIARHGAARAITE----ADKRIMRELQNGIKDAFFTFLKTKPT-------KV------KGVG 141 (295) T ss_pred eeEEEeeeecccc--cHHHHHhcCCCchhHH----HHHHHHHHHHHhhhHHHHHHhccCce-------ee------ehhh Confidence 4677778887754 999984 444554443 45566777777777776643221110 00 0011 Q ss_pred ccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccc--cccccccccccCCCcccccee-eEeecccCc Q lcl|NC_018838. 153 TDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPL--AGQPMYPAAGFAGLDNWRGLN-VGASSTVSG 229 (315) Q Consensus 153 ~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~--~~~~~~~~~~~~~~~~l~G~P-v~~s~~v~~ 229 (315) -...++.+...+....+.+ ..+...++||...+.+|+-..-+.+.. .|.-+ + -.++|.. |+.+..+|. T Consensus 142 lq~a~a~~~~al~~f~Ee~-~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~---L-----~nfLG~q~II~S~kv~~ 212 (295) T protein:vir:99 142 LQKALSASWAKLATFNEFE-GSPLVSFVSPLDVANYLGDTKVGADASNVFGMTL---L-----KNFLGMQNVIVMPSVPE 212 (295) T ss_pred HHHHHHHhhhhhhhccccc-CCceEEEEehHHHHHHHhccccccchhhhhhhhh---h-----hhhhccceEEEcccCCC Confidence 1223445555554444333 234568899999999876543322211 11111 1 1389997 889999987 Q ss_pred cccccccccceEE--E----ecccceEEEee---ccceEEEeccCCccccchhhhhcCcEEEEEEEEeccE--eecccce Q lcl|NC_018838. 230 APEMSPASGVKAI--V----GDFSRVHWGFQ---RNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVA--IESLDSF 298 (315) Q Consensus 230 ~~~~~~~~~~~~~--~----gDf~~~~i~~~---~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~--v~~~~af 298 (315) .....+..+.+.+ + ||+++...... +=|.+..+... +...+......++. .-+++++ T Consensus 213 G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~------------~~~t~et~~~~~~~lfpE~~dgi 280 (295) T protein:vir:99 213 GKIYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQL------------SNLTYESVFFGANVLFAEIPEGV 280 (295) T ss_pred ceEEEeeccceEEEEecCCchhhhhhhhhccCcccceEEEecccc------------ceeeehhhhHhHHHhcccccceE Confidence 6655554444333 1 33322211000 00001111100 00011111111111 2456777 Q ss_pred EEEeeccCCCCCCCCC Q lcl|NC_018838. 299 AVVKEKAAPKPNPPAG 314 (315) Q Consensus 299 ~~l~~~~a~~~~~~~~ 314 (315) ++.+..+ +++.--+| T Consensus 281 v~~tI~~-~~~~~~~~ 295 (295) T protein:vir:99 281 VEATIEA-AAVPGIGG 295 (295) T ss_pred EEEEEec-CcCCCCCC Confidence 7777643 22222223 No 146 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.06 E-value=1.4e-11 Score=80.02 Aligned_cols=262 Identities=11% Similarity=0.085 Sum_probs=131.1 Q ss_pred hcceeecCCCceEEEEEeCCceeEEeecccccCC--CccceeeEEE--eeEEEEEeehhhHHHhccChhhhHHHHHHHHH Q lcl|NC_018838. 34 LSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPS--ASVDVSAFTA--QPIKVVTQQRVSDEFMWADADYRLGVLQDLIS 109 (315) Q Consensus 34 l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~--s~~~~~~v~l--~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~ 109 (315) +.|.+. +++..++|+. +..+++...-|+++.. .++.-.+.+| .-.++... .|-+-=-.++.. .+.+.+. T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~----Dlr~e~s 73 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHY----DVRSEYS 73 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhh-hhhhHHHHhcCc----cchhHHH Confidence 444433 3466899987 6677777777776643 3344444333 33333221 111110011112 3677888 Q ss_pred HHHHHHHHHHHHHhhhccc------cccc-cccc--cccccccccccccccc---ccchhHHHHHHHHHhhhcccccceE Q lcl|NC_018838. 110 PALGASIGRAVDLIAFHGI------DPAT-GKPA--AAVKVSLDKTTKTVDA---TDSATTDLVKAVGLIAGAGLQVPNG 177 (315) Q Consensus 110 ~~la~~i~~~~d~a~~~G~------g~~~-~~~~--~~~~~~~~~~~~~~~~---~~~~~~di~~~~~~~~~~~~~~~~~ 177 (315) ++.++++++.+|+.++.-. .... ..+. .+........+...+. ....++.|.++...|...+...... T Consensus 74 ~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR 153 (324) T protein:vir:99 74 TQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDR 153 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCC Confidence 9999999999998764210 0000 1111 1111111111111111 1123556666666776655543334 Q ss_pred -EEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc----------------ccce Q lcl|NC_018838. 178 -VALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA----------------SGVK 240 (315) Q Consensus 178 -~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~----------------~~~~ 240 (315) .+++|..+..|..-+.-+-....+. .....|..++++|++|+.|+++|........ +... T Consensus 154 ~~vv~P~~y~~Ll~~~~~~~~~~~~~---~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ 230 (324) T protein:vir:99 154 TFYTDPDTYSAILAALMPNAANYAAL---IDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTG 230 (324) T ss_pred EEEeChHHHHHHhhcccccccccccc---cceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccc Confidence 5689999887743322211111111 1344566688999999999999965332110 0001 Q ss_pred EEEecccc----------eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEe--eccCCC Q lcl|NC_018838. 241 AIVGDFSR----------VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVK--EKAAPK 308 (315) Q Consensus 241 ~~~gDf~~----------~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~--~~~a~~ 308 (315) -+.+||+. +......++.++..++. ..| .-.+++.+-+|.+++||++.+.++ .-++|. T Consensus 231 ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~-------~~~---~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~ 300 (324) T protein:vir:99 231 KMTVGADNVVGLFVHRSAVATLKLKDMALERARRP-------EYQ---ADQIIAKYAMGHGGLRPEAVGAIIFEDGETPA 300 (324) T ss_pred ccccccCceeEEEEehhheEEEeeecceecceech-------hhH---HHhhhhhhhhcCcccccceEEEEEEccCcccc Confidence 12333332 22223333344444332 112 245677788999999999887655 444444 Q ss_pred CCCCCCC Q lcl|NC_018838. 309 PNPPAGN 315 (315) Q Consensus 309 ~~~~~~~ 315 (315) ++|.--. T Consensus 301 ~~~~~~~ 307 (324) T protein:vir:99 301 VAPDVIT 307 (324) T ss_pred ccchhhh Confidence 5554322 No 147 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.06 E-value=7.9e-11 Score=75.97 Aligned_cols=295 Identities=10% Similarity=0.016 Sum_probs=150.6 Q ss_pred CCCCccCCCc--------eEcc-hhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGK--------LELP-GSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg--------~~vP-~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |+.....+++ .-+. +++.+++.......++++++.++..+.++ ..++|+. +..+++...-|+....+.+ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg~~~ 79 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCCCCc Confidence 7776554432 2233 78999999999999999999988877654 4788877 6567777766666655666 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccCh--hhhHHHHHHHHHHHHHHHHHHHHHHhhhc-----c---ccccccccccccc Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADA--DYRLGVLQDLISPALGASIGRAVDLIAFH-----G---IDPATGKPAAAVK 140 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~--~d~~~~l~~~i~~~la~~i~~~~d~a~~~-----G---~g~~~~~~~~~~~ 140 (315) .-++.+|..-.+- +++.++.+-. -+....+++.+.+++++++++.+|+.++. + ..+... .+.+.. T Consensus 80 ~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~-~~~~~~ 154 (402) T protein:vir:97 80 QADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN-KPRVKG 154 (402) T ss_pred ccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-cCcccc Confidence 6666666654432 2222221000 00011146778888999999999997643 1 111010 111111 Q ss_pred c--cccccccccccc---cchhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCccccccccc---cccccC Q lcl|NC_018838. 141 V--SLDKTTKTVDAT---DSATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGSPLAGQPMY---PAAGFA 211 (315) Q Consensus 141 ~--~~~~~~~~~~~~---~~~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~~~~~~~~~---~~~~~~ 211 (315) . +.........+. ....+-+..+...+.+.+..... .++++|..+..|.+-. +-.+..+.. .....| T Consensus 155 ~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~----rl~n~d~~~~~~g~~~~G 230 (402) T protein:vir:97 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD----RIVDKTYTISQSGATING 230 (402) T ss_pred cccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcc----cccchhhccccCCccccc Confidence 1 101111111010 11122333444455444433333 3678899998885421 112221111 012344 Q ss_pred CCccccceeeEeecccCcccccccc--------ccceEEEecccceEE----------EeeccceEEEeccCCccccchh Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSPA--------SGVKAIVGDFSRVHW----------GFQRNFPIELIEYGDPDQTGRD 273 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~~--------~~~~~~~gDf~~~~i----------~~~~~~~v~~~~~~~~~~~~~~ 273 (315) ....+.|.||+.|+++|........ ....-+-|||+.... ....+++-++-++... T Consensus 231 ~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~------ 304 (402) T protein:vir:97 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE------ 304 (402) T ss_pred eeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhH------ Confidence 5568999999999999964322111 011112356542221 1111111111111100 Q ss_pred hhhcCcEEEEEEEEeccEeecccceEEEe--eccCCCCCCCCCC Q lcl|NC_018838. 274 LKGHNEVMVRAEAVLYVAIESLDSFAVVK--EKAAPKPNPPAGN 315 (315) Q Consensus 274 ~f~~~~v~~r~~~r~~~~v~~~~af~~l~--~~~a~~~~~~~~~ 315 (315) |. ..+-+.+-+|..++||++..+++ ....+..++.-+. T Consensus 305 -~~---~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~~~ 344 (402) T protein:vir:97 305 -KT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGD 344 (402) T ss_pred -HH---HHHHHHHHhCCcccCccceEEEEEecccccccCCcccc Confidence 11 11223456888999999998874 3334444444333 No 148 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.01 E-value=5.2e-11 Score=76.96 Aligned_cols=295 Identities=10% Similarity=-0.006 Sum_probs=148.0 Q ss_pred CCCCccCCCc---------eEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGK---------LELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg---------~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |++-...+.. -+.=+++.+++.......++++++.++..+.+++ .++|+. +..+++...-|+....+.+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~~~~ 79 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCCCCc Confidence 6655443211 1333688999999999999999999998887554 788877 6678888877777766667 Q ss_pred ceeeEEEeeEEEE-EeehhhH--HHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc-----cc--cccccccccc Q lcl|NC_018838. 71 DVSAFTAQPIKVV-TQQRVSD--EFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-----DP--ATGKPAAAVK 140 (315) Q Consensus 71 ~~~~v~l~~~kl~-~~~~iS~--ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~-----g~--~~~~~~~~~~ 140 (315) .-++..|..-.+- .-..|-+ |.. .+ ...+++.+.+++++++++++|+.++.-. .. +-...+.+.. T Consensus 80 ~~dK~~ItID~lL~a~~~V~dlDe~q----~~-yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~ 154 (401) T protein:vir:70 80 QADKNQLVIDATVIARNTVAHLHDVQ----GD-IDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKG 154 (401) T ss_pred ccccEEEEeCceeehhhhhhhHHHHH----hc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCC Confidence 7777666665441 1111211 111 01 1114677888899999999998663321 00 0001111111 Q ss_pred --cccccccccccccc---chhHHHHHHHHHhhhcccccceEEEEeHHHHH-HHHHHhhccCcccccccccc---ccccC Q lcl|NC_018838. 141 --VSLDKTTKTVDATD---SATTDLVKAVGLIAGAGLQVPNGVALDPAFSF-ALSTEVYPKGSPLAGQPMYP---AAGFA 211 (315) Q Consensus 141 --~~~~~~~~~~~~~~---~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~-~L~~l~d~~g~~~~~~~~~~---~~~~~ 211 (315) .............. .-.+.+.++...+.+.+.......+++|..++ .|.. .+ +-.+..+.+. ....| T Consensus 155 ~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~-~d---~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 155 HGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRD-AD---RIVDKTYTISQSGATIQG 230 (401) T ss_pred CceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHh-cC---cccchhhccccCCccccc Confidence 11111111111111 11234555555665554433223444444444 4422 11 1111111111 12233 Q ss_pred CCccccceeeEeecccCccccccc--------cccceEEEecccceEEEe----------eccceEEEeccCCccccchh Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSP--------ASGVKAIVGDFSRVHWGF----------QRNFPIELIEYGDPDQTGRD 273 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~--------~~~~~~~~gDf~~~~i~~----------~~~~~v~~~~~~~~~~~~~~ 273 (315) ...++.|.||+.++++|....... .....-+-|||+...--. ..+++-++.++.. T Consensus 231 ~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r------- 303 (401) T protein:vir:70 231 FTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKK------- 303 (401) T ss_pred eEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhh------- Confidence 445799999999999996432210 001111235664332111 1111111111110 Q ss_pred hhhcCcEEEEEEEEeccEeecccceEEEeec-cCCCCCCCCCC Q lcl|NC_018838. 274 LKGHNEVMVRAEAVLYVAIESLDSFAVVKEK-AAPKPNPPAGN 315 (315) Q Consensus 274 ~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~-~a~~~~~~~~~ 315 (315) -|. ..+-+.+-+|..++||++.+.++.+ +...+.+.+.+ T Consensus 304 ~~~---~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~ 343 (401) T protein:vir:70 304 EKT---YYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTD 343 (401) T ss_pred hhH---HHHHHHHHhCCcccchhheEEEeecCcccccccccCC Confidence 011 1222456789999999999998643 33333333333 No 149 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.99 E-value=4.4e-11 Score=77.34 Aligned_cols=288 Identities=10% Similarity=0.037 Sum_probs=152.9 Q ss_pred CCCCccCCCc--eEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccccCCCccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGK--LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg--~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l 77 (315) |+.|-.++.+ +++|+.++.+|+.-+++..+...+.++...+ +..++||.... ++..=-.+++.+.-.+.+-.+++| T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~-~tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGT-PVVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccc-cccccccCCCCcccccCCCceEEE Confidence 8887765543 5669999999998888887777776655544 35588987744 444444455555444444444444 Q ss_pred --eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhc--ccccc--ccc-cccccccccccccccc Q lcl|NC_018838. 78 --QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH--GIDPA--TGK-PAAAVKVSLDKTTKTV 150 (315) Q Consensus 78 --~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~--G~g~~--~~~-~~~~~~~~~~~~~~~~ 150 (315) ...|..+ ..|+++.. +. ...|.....++.+++++...|.-... -+|.. ... .+..+......-.... T Consensus 80 ~IDq~KYfa-f~VdDD~~-Qa----~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~g 153 (322) T protein:vir:31 80 ILRDEVYAG-NAISKKLR-QD----SRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTG 153 (322) T ss_pred EEehhhhhc-cccchhHH-Hh----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccC Confidence 4444433 34777443 32 34477778888888888877774311 11211 000 1111111100000111 Q ss_pred ccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHh-----hccCccccccccccc-cccCC--Cccccceee Q lcl|NC_018838. 151 DATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEV-----YPKGSPLAGQPMYPA-AGFAG--LDNWRGLNV 221 (315) Q Consensus 151 ~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~-----d~~g~~~~~~~~~~~-~~~~~--~~~l~G~Pv 221 (315) +.....|+.++++..++..++......| +++|.....|..+. ..++|... +-.. ...|+ .++++|+.| T Consensus 154 t~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~---i~~sG~a~g~~~Vg~~~GF~V 230 (322) T protein:vir:31 154 TDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEG---IVESGIAPDMQFVRSVYGIDL 230 (322) T ss_pred CCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccc---cccccchhhHHHHHHHhceee Confidence 1234568899999888877766555565 56799887774431 11222110 0001 11111 478999999 Q ss_pred EeecccCccccccccccc--eEEEecccceE----------EEeeccc---eEEEeccCCccccchhhhhcCcEEEEEEE Q lcl|NC_018838. 222 GASSTVSGAPEMSPASGV--KAIVGDFSRVH----------WGFQRNF---PIELIEYGDPDQTGRDLKGHNEVMVRAEA 286 (315) Q Consensus 222 ~~s~~v~~~~~~~~~~~~--~~~~gDf~~~~----------i~~~~~~---~v~~~~~~~~~~~~~~~f~~~~v~~r~~~ 286 (315) ++|+.++...-...+... ....|=++.++ ++.++.+ +=.++++ +..-.+|+.+ T Consensus 231 ~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~------------~~~d~~~~~~ 298 (322) T protein:vir:31 231 FVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDY------------NDDLNTATTA 298 (322) T ss_pred eeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCcc------------ccccceeeee Confidence 999998632211100000 00111111111 1112222 1111111 2234678999 Q ss_pred EeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 287 VLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 287 r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) |+|.+++|||..+.|..-++|--- T Consensus 299 ~~g~g~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 299 RWGNGLVRDENLVCVLANADKVTF 322 (322) T ss_pred eecceeecccceEEEEeccccccC Confidence 999999999999988654433222 No 150 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.86 E-value=1.2e-09 Score=69.59 Aligned_cols=279 Identities=11% Similarity=-0.007 Sum_probs=161.5 Q ss_pred CCCCccCCCceEcch---hHHHHHHHHHHhccchhhhcceee-cCCC--ceEEEEEeCCceeEEeecc-cccCCCcccee Q lcl|NC_018838. 1 MADDFLSAGKLELPG---SMIGAVRDRAIDSGVLAKLSPEQP-TIFG--PVKGAVFSGVPRAKIVGEG-EVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP~---~~~~~ii~~~~~~s~i~~l~~~~~-~~~~--~~~ip~~~~~~~a~wv~Eg-~~~~~s~~~~~ 73 (315) |...-..++|.++-. .+.+.|++...+.-..+++..+.. .+.+ .+.+.+....+.+.|++.+ .++|..+..++ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 777755665555543 356777777777777777766543 2222 3566667677788998865 45888888888 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA- 152 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~- 152 (315) ......+.++.-+.++.+=|+.+. ..-..|...-+...++++++.+|+.+++|+.. ....|+.+........... T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~-~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~---~g~~GLlN~p~v~~~~~~~~ 156 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQ-ATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA---HGIPSVFDYPNINNVVSGGS 156 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHH-HhCCChHHHHHHHHHHHHHHhhceEEEeeccc---ccceeEeecCCCccccccCC Confidence 888888888888888766554432 22234666677888899999999999999753 2334555432211111110 Q ss_pred ---ccchhHHHHHHHHHhhh--cccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCccccceeeEeecc Q lcl|NC_018838. 153 ---TDSATTDLVKAVGLIAG--AGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWRGLNVGASST 226 (315) Q Consensus 153 ---~~~~~~di~~~~~~~~~--~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~G~Pv~~s~~ 226 (315) ....++|+.+++..+.. .....+..++|+|+....|.......|.-+. .-+... ...+|.+.|.. T Consensus 157 W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l-----~~ik~~~~~l~i~~~~~l---- 227 (296) T protein:vir:10 157 WSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYG-----EFFRQNNSGVTVEFVQYL---- 227 (296) T ss_pred ccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHH-----HHHHHhcCCceEEEeeee---- Confidence 12347789999887754 3455567899999999888655433322111 011110 11223333332 Q ss_pred cCccccccccccceEEEec--ccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec-cEeecccceEEEee Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGD--FSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY-VAIESLDSFAVVKE 303 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gD--f~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~-~~v~~~~af~~l~~ 303 (315) .. .+...+..+++-+ -..+.+...+.++..-.+. +.=...+++..|++ ..+.+|.|++++++ T Consensus 228 -~~---a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~-----------~~l~~~~~~~~~~~Gv~i~~P~ai~~~dG 292 (296) T protein:vir:10 228 -ND---YNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQP-----------KDLHFKIPVTSKATGLIVYRPLTMAVMKG 292 (296) T ss_pred -cc---CCCCcceEEEEEEcCCceEEEEcCcceeeecccc-----------cCceEEEeeEeeEEEEEEECCceeEEEee Confidence 22 1222233333333 3333333334333221111 11124566778885 78899999999987 Q ss_pred ccCC Q lcl|NC_018838. 304 KAAP 307 (315) Q Consensus 304 ~~a~ 307 (315) .+=. T Consensus 293 I~~~ 296 (296) T protein:vir:10 293 ITFA 296 (296) T ss_pred eecC Confidence 6544 No 151 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.85 E-value=1e-09 Score=69.93 Aligned_cols=295 Identities=9% Similarity=0.001 Sum_probs=149.9 Q ss_pred CCCCccCCC----c-----eEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAG----K-----LELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~G----g-----~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |++-...+- | -+-=+++.+++.......++++++.++..+.+++ .++|+. +..+++...-|+++..+.+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~~~ 79 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCCCc Confidence 665443321 1 1233688999999999999999999998887654 788877 7778888888888766666 Q ss_pred ceeeEEEeeEEE-EEeehhhH--HHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc--------cccccc-cccc Q lcl|NC_018838. 71 DVSAFTAQPIKV-VTQQRVSD--EFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI--------DPATGK-PAAA 138 (315) Q Consensus 71 ~~~~v~l~~~kl-~~~~~iS~--ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~--------g~~~~~-~~~~ 138 (315) .-++..+..-.+ ..-..|-. |.+ +..| .+++.+.+.+++++++.+|+.++.-. ....+. .... T Consensus 80 ~~dk~~ItIDtLL~a~~~V~dlDd~q--~~yD---~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~ 154 (400) T protein:vir:10 80 QADKNQLVIDATVIARNTVAHLHDVQ--GDID---SLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKG 154 (400) T ss_pred ccCcEEEEeCceeeecchhhhHHHHh--hccc---cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccc Confidence 667666665444 22222211 211 1111 15677888889999999998775311 111100 0111 Q ss_pred ccccccccccccccccc---hhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCcccccccccc---ccccC Q lcl|NC_018838. 139 VKVSLDKTTKTVDATDS---ATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYP---AAGFA 211 (315) Q Consensus 139 ~~~~~~~~~~~~~~~~~---~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~---~~~~~ 211 (315) ...+....+...+.... --..+..+...+.+.+...... +++.|..+..|.. .+ +-++..+.+. +...| T Consensus 155 ~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~-~d---kLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 155 HGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRD-AD---RIVDKSYTISQSGATIQG 230 (400) T ss_pred cccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHh-CC---cccchhccccCCCccccc Confidence 11111111111111111 0122344444454444332223 4444554444432 11 2222222111 12233 Q ss_pred CCccccceeeEeecccCcccccc--------ccccceEEEecccceEE----------EeeccceEEEeccCCccccchh Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMS--------PASGVKAIVGDFSRVHW----------GFQRNFPIELIEYGDPDQTGRD 273 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~--------~~~~~~~~~gDf~~~~i----------~~~~~~~v~~~~~~~~~~~~~~ 273 (315) ...++.|.||+.++++|...... ......-+-|||+...- ....+++-++-++.. T Consensus 231 ~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r------- 303 (400) T protein:vir:10 231 FVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKK------- 303 (400) T ss_pred eEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchh------- Confidence 34579999999999998532110 00001112356643321 111122211111110 Q ss_pred hhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 274 LKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 274 ~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) -| ...+-+.+-+|..++||++.+.++.+-...+.--.|+ T Consensus 304 ~~---~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~ 342 (400) T protein:vir:10 304 EK---TYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGN 342 (400) T ss_pred hH---HHHHHHHHHhCCcccchhheEEEEecCCcccccccCc Confidence 01 1223345678999999999999987655444444444 No 152 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.85 E-value=1.4e-09 Score=69.10 Aligned_cols=284 Identities=8% Similarity=-0.009 Sum_probs=147.2 Q ss_pred CCCCccCCCce----Ecc----hhHHHHHHHHHHh-ccchhhhcceeecCCCceEEEEEeCCceeEEeeccc-------- Q lcl|NC_018838. 1 MADDFLSAGKL----ELP----GSMIGAVRDRAID-SGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGE-------- 63 (315) Q Consensus 1 m~~~~~s~Gg~----~vP----~~~~~~ii~~~~~-~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~-------- 63 (315) |+-+..-+|=. .|+ +++.+++...+.+ .|.+++-++...-..+...+-.+ +...++-++++. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~ 79 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETL-ASMDPDAVKRKRSRQQSADG 79 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeec-ccccccccccccccccccCc Confidence 55544444411 233 4555555555443 44566655433222221111111 111222222221 Q ss_pred --ccCCCcc--ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc-cccccccccc Q lcl|NC_018838. 64 --VKPSASV--DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-DPATGKPAAA 138 (315) Q Consensus 64 --~~~~s~~--~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~-g~~~~~~~~~ 138 (315) ..|.... ....+.+..+..+ ..|.+.-+.+...| +.+...+..+.+++++.|..++.+. |..+ . .. T Consensus 80 ~~dtp~~~~~~~~r~~~~~d~~~~--~~VDd~D~~k~~~D----~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~--~~ 150 (322) T protein:vir:10 80 TYPTPVNNKPFAKRRTNVDTYDTG--HVVEQEDISQMLLD----PNSALITSQAYAMARKTDDLIIAGAWKPAS-I--KG 150 (322) T ss_pred ccCCCccccccceEEEeecccccc--eecchHHHHHhhcC----chHHHHHHHHHHhhhHHHHHHHhhhhcccc-c--cc Confidence 2333332 3334555555443 45555443332233 4566677888888888888666532 1110 0 01 Q ss_pred cccccc-cccc--ccccccchhHHHHHHHHHhhhcccccce-E-EEEeHHHHHHHHHHhhccCc-ccccccccccc-ccC Q lcl|NC_018838. 139 VKVSLD-KTTK--TVDATDSATTDLVKAVGLIAGAGLQVPN-G-VALDPAFSFALSTEVYPKGS-PLAGQPMYPAA-GFA 211 (315) Q Consensus 139 ~~~~~~-~~~~--~~~~~~~~~~di~~~~~~~~~~~~~~~~-~-~~m~~~~~~~L~~l~d~~g~-~~~~~~~~~~~-~~~ 211 (315) ..+.+. .... .......+++.++++...+..++..... . ++++|..+..|-....-... +... ..+ ..| T Consensus 151 ~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~----~~l~~~G 226 (322) T protein:vir:10 151 TGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSA----MDLQSKG 226 (322) T ss_pred cccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccc----hhhhhcC Confidence 111111 1111 1112244577788887777766554322 4 56789988887433222211 1111 123 235 Q ss_pred CCccccceeeEeecccCccccc---------cccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEE Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEM---------SPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMV 282 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~---------~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~ 282 (315) ..++++|+.++.++.+|.+... ...+....+++--+.+.++...++..++....+.. +...+ T Consensus 227 ~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~---------~a~~I 297 (322) T protein:vir:10 227 IITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSAS---------FAWRI 297 (322) T ss_pred eeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcc---------hhhhh Confidence 6789999999999999844321 11222335566667788888888888776544321 12446 Q ss_pred EEEEEeccEeecccceEEEeeccCC Q lcl|NC_018838. 283 RAEAVLYVAIESLDSFAVVKEKAAP 307 (315) Q Consensus 283 r~~~r~~~~v~~~~af~~l~~~~a~ 307 (315) ++.+-+|..+++|+.++.|.-..+= T Consensus 298 ~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 298 YSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhhhhhCceEeccCcEEEEEEeccC Confidence 6678899999999999999875554 No 153 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.76 E-value=3.6e-09 Score=66.90 Aligned_cols=276 Identities=12% Similarity=0.062 Sum_probs=152.4 Q ss_pred CCCCccCCCceEcc---hhHHHHHHHHHHhccchhhhccee-ecCCC--ceEEEEEeCCceeEEeecc-cccCCCcccee Q lcl|NC_018838. 1 MADDFLSAGKLELP---GSMIGAVRDRAIDSGVLAKLSPEQ-PTIFG--PVKGAVFSGVPRAKIVGEG-EVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP---~~~~~~ii~~~~~~s~i~~l~~~~-~~~~~--~~~ip~~~~~~~a~wv~Eg-~~~~~s~~~~~ 73 (315) |-+....+.|++.- +.+...|++...+.-..+++..+. +.+-+ .+.+......+.+.|++.+ ..+|..+..++ T Consensus 23 ~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~ 102 (319) T protein:vir:10 23 VKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGT 102 (319) T ss_pred chhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccccccceeccce Confidence 22222223344433 245567788877777777777654 23322 3456666667788999875 44788888888 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc---- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT---- 149 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~---- 149 (315) ......+.++....++.+=|.... ..-..|...-+...++++++.+|+-+++|.... ...|+.+........ T Consensus 103 ~~~~~i~~~~~~~~~~~~El~~a~-~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~---g~~GLlN~p~~~~~~~~~~ 178 (319) T protein:vir:10 103 SEFGKVFRLGNAYLISIDEIKAGQ-ATGRPLSTRKASACQLAHDQLVNRLVFKGSAPH---KIVSVFNHPNITKITSGKW 178 (319) T ss_pred eeEEEEEEEEeeeeecHHHHHHHH-HhCCChHHHHHHHHHHHHHHhhceEEEeecccc---cceeEEeCCCceeeecCCC Confidence 888888888888777755444332 122335566678888999999999999997532 234444432211100 Q ss_pred ---cc-cccchhHHHHHHHHHhhh--cccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCccccceeeE Q lcl|NC_018838. 150 ---VD-ATDSATTDLVKAVGLIAG--AGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWRGLNVG 222 (315) Q Consensus 150 ---~~-~~~~~~~di~~~~~~~~~--~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~G~Pv~ 222 (315) .+ .....++|+.+++.++.. .....+..++|+|+.+..|.......|.-+. .-+... ...+|.+.|.. T Consensus 179 ~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l-----~~lk~~~~~l~I~~~pel 253 (319) T protein:vir:10 179 IDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYL-----DYFKSQNSGIEIDSIAEL 253 (319) T ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHH-----HHHHHhcCCceEEEeeee Confidence 00 112345788888888763 3455677899999999988654433332111 111111 11123333332 Q ss_pred eecccCccccccccccceEEEecc--cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec-cEeecccceE Q lcl|NC_018838. 223 ASSTVSGAPEMSPASGVKAIVGDF--SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY-VAIESLDSFA 299 (315) Q Consensus 223 ~s~~v~~~~~~~~~~~~~~~~gDf--~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~-~~v~~~~af~ 299 (315) .. .+.+.+..+++-.. ..+.+.....++. ..--.. ++ ...+.+..|++ ..+.+|.||+ T Consensus 254 -----~~---ag~~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~e~~-----~l----~~~~~~~~r~~Gv~i~~P~ai~ 314 (319) T protein:vir:10 254 -----ED---IDGAGTKGVLVYEKNPMNMSIEIPEAFNM--LPAQPK-----DL----HFKVPCTSKCTGLTIYRPMTIV 314 (319) T ss_pred -----cc---cCCCcceEEEEEecCCceEEEecCcceee--eeeeec-----Cc----eEEEeeeeeeEEEEEEccceeE Confidence 22 12222333333332 2333333333332 221000 00 12344556665 5679999999 Q ss_pred EEeec Q lcl|NC_018838. 300 VVKEK 304 (315) Q Consensus 300 ~l~~~ 304 (315) ++++. T Consensus 315 ~~dGI 319 (319) T protein:vir:10 315 LITGV 319 (319) T ss_pred eeecC Confidence 99999 No 154 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.75 E-value=6.9e-09 Score=65.33 Aligned_cols=275 Identities=11% Similarity=0.005 Sum_probs=154.7 Q ss_pred CCCCccCCCceEcch--hHHHHHHHHHHhccchhhhccee-ecCCC--ceEEEEEeCCceeEEeecccc-cCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPG--SMIGAVRDRAIDSGVLAKLSPEQ-PTIFG--PVKGAVFSGVPRAKIVGEGEV-KPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~--~~~~~ii~~~~~~s~i~~l~~~~-~~~~~--~~~ip~~~~~~~a~wv~Eg~~-~~~s~~~~~~ 74 (315) |-+. +.|.+++-. .+.++|++.+++.-..|++..+. +.+-+ .+.+........+.|.+.++. +|..+..++. T Consensus 1 ~~~~--~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQGK--ITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CCcc--ccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 5444 666665432 46678888888888888876553 33333 345566666678899887654 7888888888 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc------ Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK------ 148 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~------ 148 (315) .....+.++.-..++.+=|..... .-..|...-+...++++++.+|+.+++|... ....|+.+....... T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~-~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~---~g~~GLlN~p~~~~~~~~~~~ 154 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARM-QGTTVDAAKATTVRRAIAEKENSIAFRGEKK---YAIKGAFEATGIQIDVSPTTG 154 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHH-hCCChHHHHHHHHHHHHHHhhceEEeeeccc---ccceeeecCCCcccccccCcc Confidence 888888888877777655544321 2233556667888999999999999999753 233444443221110 Q ss_pred ---ccc----cccchhHHHHHHHHHhhhc--ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCC-ccccc Q lcl|NC_018838. 149 ---TVD----ATDSATTDLVKAVGLIAGA--GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DNWRG 218 (315) Q Consensus 149 ---~~~----~~~~~~~di~~~~~~~~~~--~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~l~G 218 (315) ... .....++|+.+++.++... +...+..++|+|+.+..|......+..... +..-+....+ .+|.. T Consensus 155 ~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~t---vl~~l~~~~~~~~I~~ 231 (301) T protein:vir:80 155 VGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRS---VLKVLQDNAWFSAIVR 231 (301) T ss_pred cccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCee---HHHHHHHHcCcceEEE Confidence 000 1112367888888887543 344567799999999998654432221111 1000110000 12333 Q ss_pred eeeEeecccCccccccccccceEEEe--cccceEEEeeccceEEEeccCCccccchhhhhcCc-EEEEEEEEe-ccEeec Q lcl|NC_018838. 219 LNVGASSTVSGAPEMSPASGVKAIVG--DFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNE-VMVRAEAVL-YVAIES 294 (315) Q Consensus 219 ~Pv~~s~~v~~~~~~~~~~~~~~~~g--Df~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~-v~~r~~~r~-~~~v~~ 294 (315) .|-. . +.+.+.+..+++- +-..+.+...+.++. ..- -.+++ ....+..|+ |..+.+ T Consensus 232 ~p~L-----~---~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~--~~~----------e~~~~~~~~~~~~r~~Gv~i~~ 291 (301) T protein:vir:80 232 VPDL-----A---GMGTAGSDSFAVIHDSNETAELIIPMDITR--HPE----------EYSFPRTKVPFEERTAGVVVRF 291 (301) T ss_pred ccee-----c---cCCCCcccEEEEEecCCcEEEEEecCceee--ecc----------eecCceeEeeeeeeeEEEEEEc Confidence 3322 2 1222222333322 222233333333322 211 11221 223345666 458899 Q ss_pred ccceEEEeec Q lcl|NC_018838. 295 LDSFAVVKEK 304 (315) Q Consensus 295 ~~af~~l~~~ 304 (315) |.||+++++. T Consensus 292 P~ai~~~~GI 301 (301) T protein:vir:80 292 PAAIVRVDGI 301 (301) T ss_pred cceEEEEecC Confidence 9999999999 No 155 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.74 E-value=3.3e-09 Score=67.10 Aligned_cols=282 Identities=12% Similarity=0.039 Sum_probs=141.7 Q ss_pred CCCCccCCCce----EcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEE---EeCCceeEEeecccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKL----ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAV---FSGVPRAKIVGEGEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~----~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~---~~~~~~a~wv~Eg~~~~~s~~~~ 72 (315) |+....-...- .+--++.+++=+-+....-+....|.+||..|. +++++ ++....++-|+||+.||.++.+- T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 65544332211 112234444434444444444555788888654 55444 45556788999999999999875 Q ss_pred e---eEEEeeEEEEEeehhhHHHh-ccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc Q lcl|NC_018838. 73 S---AFTAQPIKVVTQQRVSDEFM-WADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK 148 (315) Q Consensus 73 ~---~v~l~~~kl~~~~~iS~ell-~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~ 148 (315) . ..++..+|.+..+ |.|.+ +....+....- .+.|.++|.+++|+.|+.-.-..++.. ..+. T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVget----d~qL~~~Iq~kIdnd~~~~lktaT~t~---------~~t~ 145 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQT----DNEMIKYVQKKFRAKFFETLKSAIENG---------KRTN 145 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHH----HHHHHHHHHhhhhHHHHHHHhhccccc---------cccc Confidence 3 5788889988855 99998 45555655544 445566666666666553221111000 0000 Q ss_pred ccccccchhHHHHHHHHHhh----h-cccccceEEEEeHHHHHHHHHHhhccCcc-ccccccccccccCCCccccceeeE Q lcl|NC_018838. 149 TVDATDSATTDLVKAVGLIA----G-AGLQVPNGVALDPAFSFALSTEVYPKGSP-LAGQPMYPAAGFAGLDNWRGLNVG 222 (315) Q Consensus 149 ~~~~~~~~~~di~~~~~~~~----~-~~~~~~~~~~m~~~~~~~L~~l~d~~g~~-~~~~~~~~~~~~~~~~~l~G~Pv~ 222 (315) ......+.+.+++.... . .+.......++||...+.+++-..-..+. ..|.-+ + -.++|..|+ T Consensus 146 ---~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~---L-----~nfLG~~II 214 (303) T protein:vir:10 146 ---KTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNL---L-----TPYVGVKIV 214 (303) T ss_pred ---ceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhh---h-----hhhhcceEE Confidence 01112334444444221 1 11122346889999999986532221111 011111 1 138999999 Q ss_pred eecccCccccccccccceEE-EecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccE--eecccceE Q lcl|NC_018838. 223 ASSTVSGAPEMSPASGVKAI-VGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVA--IESLDSFA 299 (315) Q Consensus 223 ~s~~v~~~~~~~~~~~~~~~-~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~--v~~~~af~ 299 (315) .+..+|......+..+.+.+ ..|.+. - ..+...+.+++.+.. +..++- ..+...+-.....++. .-+.++++ T Consensus 215 ~S~kv~~G~~~~T~~~Ni~~ay~~~~g-~--l~~~f~~t~D~tglI-Gv~h~~-~~~~~t~eT~~~~~~~lfpE~~dgiv 289 (303) T protein:vir:10 215 EFADVPQGEVWMTVAENLNVAYANPRG-E--LSRAFAFATDATGFV-GVLHDI-QPQRLTSDTIYASAISMFPENIDAVI 289 (303) T ss_pred EeccCCCceEEEeeccceEEEEecCch-h--hhhhhhhccccccce-EEEecc-ccceeeehhHhHhHHHhcccccceEE Confidence 99999976655555444333 233211 0 011222222211100 000000 0000111111111111 35678899 Q ss_pred EEeeccCCCCCCCC Q lcl|NC_018838. 300 VVKEKAAPKPNPPA 313 (315) Q Consensus 300 ~l~~~~a~~~~~~~ 313 (315) +.+....+.+.-|+ T Consensus 290 ~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 290 KVTIKKDEAGELPS 303 (303) T ss_pred EEEEeccccCCCCC Confidence 99988888888888 No 156 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.70 E-value=2.7e-09 Score=67.58 Aligned_cols=265 Identities=13% Similarity=0.052 Sum_probs=156.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCcee-------EEeecccccCCCcccee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRA-------KIVGEGEVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a-------~wv~Eg~~~~~s~~~~~ 73 (315) ..-+++.+-...||+.+....|+.+....++..+....|..+..+.+|+.+..... +.-.||...+..+.+|+ T Consensus 131 ~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~ 210 (410) T protein:vir:83 131 ADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVID 210 (410) T ss_pred hccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeee Confidence 22232222345788889999999999999999999889988888999888766543 23458999999999999 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHH---Hhhhccccccccccccccccccccccccc Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVD---LIAFHGIDPATGKPAAAVKVSLDKTTKTV 150 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d---~a~~~G~g~~~~~~~~~~~~~~~~~~~~~ 150 (315) ..+...|.++++..+|++.+.-+....... .-+.|+.+.+++-+ +++|+.+- ++ .. . .... T Consensus 211 t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~----~lraL~~AYA~atea~vra~L~~t~--t~--------~~-a-~~~~ 274 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDFSSPSALDL----VVNGLGQQYAIETEALVGAALASTS--TG--------AV-G-YGNA 274 (410) T ss_pred eccceeehhcCcccccceeeecCChhhHHH----HHHHHHHHHHHHHHHHHHHHHHHhh--hh--------hh-h-hhhc Confidence 999999999999999999998776554332 33444333333333 34454431 11 00 0 0011 Q ss_pred ccccchhH-HHHHHHHHhhhcccc-cceEEEEeHHHHHHHHHHh-hcc--CccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 151 DATDSATT-DLVKAVGLIAGAGLQ-VPNGVALDPAFSFALSTEV-YPK--GSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 151 ~~~~~~~~-di~~~~~~~~~~~~~-~~~~~~m~~~~~~~L~~l~-d~~--g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) +...|. -+.++..++.++... ....+.++|.++..+-.+. +-+ |....|..+ ..+..+-.+.|+|.||+... T Consensus 275 --Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~-~~lg~gi~G~~~~ipVvm~~ 351 (410) T protein:vir:83 275 --TADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEA-GRFGQGVMGSISGIPVVMSA 351 (410) T ss_pred --cHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccc-cccccchhhhhcccceEEec Confidence 111222 233444455443211 1224788998876554332 222 222222111 11223445789999999876 Q ss_pred ccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) ..++. .++|.|...+..-....-.+.+.+. +..+..++| - .++.+.+..++++.-|.+. T Consensus 352 ~a~Ag---------TA~f~~~~Ai~~~eS~~gp~qL~d~-~i~nLt~~y--------S--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 352 ALGSG---------DAYLFSTAAIECFEQRVGTLQVVEP-SVFGLQVAY--------A--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred CCCcC---------eeeEeccceeeeeecCCceeEeeCC-chhhhhhhh--------e--eeeeeccccccceeeeccC Confidence 65533 2555576655554444323444432 222222221 1 5678889999998888776 No 157 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=98.63 E-value=1.4e-09 Score=69.09 Aligned_cols=274 Identities=15% Similarity=0.104 Sum_probs=161.5 Q ss_pred CCCCcc--CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEE-eecccccCCCccceeeEEE Q lcl|NC_018838. 1 MADDFL--SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKI-VGEGEVKPSASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~--s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~w-v~Eg~~~~~s~~~~~~v~l 77 (315) +++.-. ..-...+|.-+...|-+.++.+.++.++.++...+.--+.-+ .....-+| ..-|+.+.++..+|..-+| T Consensus 117 l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~--~dt~~qa~gHk~G~~K~eq~~tl~~rtL 194 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS--FDSANEAQVHKDGQTKTEQAATLTIDTL 194 (400) T ss_pred hhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeecc--hhhhcccceeccCCcccceeeeeeeecc Confidence 444333 233457899999999999999999999998887742212212 22233456 6678999999999999999 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHH-HHHHhhhccccccccc---ccccccccccccccccccc Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGR-AVDLIAFHGIDPATGK---PAAAVKVSLDKTTKTVDAT 153 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~-~~d~a~~~G~g~~~~~---~~~~~~~~~~~~~~~~~~~ 153 (315) .|+-++.+.++.+-.. ++..+ .+.|-.++.++|.+.+-. +.++|++-|+|.-+-. ..+-+......+.+.-.+. T Consensus 195 ~P~~VYk~~~la~~~~-~~~~t-ygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~ 272 (400) T protein:vir:93 195 EPVMVYKLQSLAERVK-RLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAG 272 (400) T ss_pred CHHHHHHHhhhhhhhh-hcccc-HHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcC Confidence 9988877777744333 33333 477899999999999885 5699999997632110 1111111112222222234 Q ss_pred cchhHHHHHH-HHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCccccce-eeEeecccCcc Q lcl|NC_018838. 154 DSATTDLVKA-VGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWRGL-NVGASSTVSGA 230 (315) Q Consensus 154 ~~~~~di~~~-~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~G~-Pv~~s~~v~~~ 230 (315) ...+.++..- +.-+.+.. ...--++|.|..++.|+.|++++|.+ .|+..... +-.+=+|+ .+++...++.. T Consensus 273 ~~~~qdl~E~~~d~~~~~a-ad~~~Iv~s~d~~A~L~~lk~a~~~a-----~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~ 346 (400) T protein:vir:93 273 KTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANA-----NVRIKNDDTEIASEVGVDEIIVYTGSKAL 346 (400) T ss_pred CccHHHHHHHHHhhhhhcc-CCceeEEeccchHHHHHHhcCCccee-----eeeeccccchhhhhcccceeeeeccCCCC Confidence 4445555443 33222222 22334889999999999999987654 44433222 22333444 22223333322 Q ss_pred ccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 231 PEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 231 ~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) . ..+ +-|- ++++++ ++ ++....- -+++|+=.+..+..+++.+.-+++=++++-+ T Consensus 347 k-------p~V-~VDe-k~~i~~-~~--~~t~~sf--------~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 347 K-------PTV-LVDQ-KYHIDM-QD--LTKVDAF--------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred C-------cee-eeeh-hhhccc-cC--ceeccce--------eeeeccceEEeeeeeccceecccceeeEeeC Confidence 1 112 2242 233322 11 2111110 1555666666778899999999998888877 No 158 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.55 E-value=2.7e-08 Score=62.05 Aligned_cols=279 Identities=12% Similarity=0.050 Sum_probs=153.1 Q ss_pred CC-CCccCCCceEcc--hhHHHHHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeeccc-ccCCCcccee Q lcl|NC_018838. 1 MA-DDFLSAGKLELP--GSMIGAVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGE-VKPSASVDVS 73 (315) Q Consensus 1 m~-~~~~s~Gg~~vP--~~~~~~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~-~~~~s~~~~~ 73 (315) |- .+..++|.|++. +.+..+|++...+.-..+++..+....+ -.+.+......+.+.|++..+ .+|..+..++ T Consensus 19 ~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~ 98 (314) T protein:vir:10 19 MGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMT 98 (314) T ss_pred hcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCcccccceeecccc Confidence 33 223344555554 3456677777777666666665542221 235666777778889998754 4888888888 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccc-- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVD-- 151 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~-- 151 (315) +.....+.++..+.++.+=|..... .-..|...-+...++++++.+|+.+++|+.. ....|+.+.........+ T Consensus 99 ~~~~~i~~~~~~~~~~~~El~~a~~-~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~---~g~~GLlN~p~v~~~~~~~~ 174 (314) T protein:vir:10 99 EKQGKVFRFGNAFLISTDEIKAGAA-TGQSLSARKQALAFEAHDNLLDKLVWSGSAP---HGIVSVFDQPNINNVVATPN 174 (314) T ss_pred eeEEEEEEEEeeEEecHHHHHHHHH-hCCChHHHHHHHHHHHHHHhhceEEEeeccc---ccceeEeecCCCccccCCCC Confidence 8888888888888886554443322 2234666677888999999999999999742 233445443221111111 Q ss_pred --cccchhHHHHHHHHHhhhc--ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC-CccccceeeEeecc Q lcl|NC_018838. 152 --ATDSATTDLVKAVGLIAGA--GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG-LDNWRGLNVGASST 226 (315) Q Consensus 152 --~~~~~~~di~~~~~~~~~~--~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~-~~~l~G~Pv~~s~~ 226 (315) .....++|+.+++.++... ....++.++|+|.....|....+..+.-+. .-+.... .-+|.+.|-. T Consensus 175 WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl-----~~l~~n~~~l~I~~~~el---- 245 (314) T protein:vir:10 175 WSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYG-----ELFTRNNPGLTIRFLQFL---- 245 (314) T ss_pred cccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHH-----HHHHHhCCCcEEEEcccc---- Confidence 1123367888888888643 445567799999988777543332222111 0011101 1123333332 Q ss_pred cCccccccccccceEE--EecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEe-ccEeecccceEEEee Q lcl|NC_018838. 227 VSGAPEMSPASGVKAI--VGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVL-YVAIESLDSFAVVKE 303 (315) Q Consensus 227 v~~~~~~~~~~~~~~~--~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~-~~~v~~~~af~~l~~ 303 (315) . ..+.+.+..++ .-|-..+.+.....++. ...-.. .=...+.+..|+ |..+.+|.||+++++ T Consensus 246 -~---~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~--l~~e~~---------~~~~~~~~~~r~~Gv~i~~P~ai~~~dG 310 (314) T protein:vir:10 246 -D---NYDGAGGKAALAFEKSPLNMSIEIPEVTNV--LPAQPK---------DLHFRYPVTSKATGLIVYRPLTMAVIKG 310 (314) T ss_pred -c---ccCCCcceEEEEEecCCcEEEEecCcccee--ecceec---------CceEEEcceeeeEEEEEECcceeEeeee Confidence 2 22222233332 22323333333333322 221000 001233345666 467899999999887 Q ss_pred ccCC Q lcl|NC_018838. 304 KAAP 307 (315) Q Consensus 304 ~~a~ 307 (315) .+=. T Consensus 311 I~~~ 314 (314) T protein:vir:10 311 ITFA 314 (314) T ss_pred eecC Confidence 7654 No 159 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.53 E-value=4e-08 Score=61.17 Aligned_cols=285 Identities=13% Similarity=0.063 Sum_probs=127.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhccee---ec---CCCceEEEEEeCCceeEEe-----ecccccCCCc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ---PT---IFGPVKGAVFSGVPRAKIV-----GEGEVKPSAS 69 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~---~~---~~~~~~ip~~~~~~~a~wv-----~Eg~~~~~s~ 69 (315) ||+. +++|+.++.++++.+++..++..++.+- .. .+..++||+... ..+.+. +++..+...+ T Consensus 1 Ma~~------~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Cccc------cccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccc Confidence 9874 5999999999999999999998887442 11 133488987654 333332 3445555555 Q ss_pred cceeeEEEee-EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc Q lcl|NC_018838. 70 VDVSAFTAQP-IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK 148 (315) Q Consensus 70 ~~~~~v~l~~-~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~ 148 (315) .+-+.+++.. +..+.-+.++++-..+...+ +...+.+...+++++++|.-++.-- .+.... . ..... T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~----~~~~~~~~a~~ala~~vd~~i~~~~---~~a~~~-~----~~~~~ 141 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLES----FATQILPRQVRGVADILEEGVRDMI---VGAPYE-A----AGAVH 141 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhh----hHHHHHHHHHHHHHHHHHHHHHHHH---hccccc-c----ccccc Confidence 5555565555 33345566776654443333 3444556667777777776544211 000000 0 00111 Q ss_pred ccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCc-c-ccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 149 TVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGS-P-LAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 149 ~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~-~-~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) .. .....|.++.++...|.+++......++++|.....|.+. ..-. . -.+......+..|..+++.|++|+.+.. T Consensus 142 ~~-~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~--~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~ 218 (392) T protein:vir:99 142 EV-APDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILND--DRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) T ss_pred cc-ChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcc--cceeecccccchhhhhhhcceeeeeeeeEEEeecc Confidence 11 1233577888888888766544333467899888877533 2110 0 0010001123456668999999999998 Q ss_pred cCcccccccc-------ccceEEEecccc-eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEee---cc Q lcl|NC_018838. 227 VSGAPEMSPA-------SGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIE---SL 295 (315) Q Consensus 227 v~~~~~~~~~-------~~~~~~~gDf~~-~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~---~~ 295 (315) +|........ +......-+-.+ ..+--...+......... . -+..+...+.. ..+.... .. T Consensus 219 ~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~--~----t~~s~~~~v~~--~~g~~~v~~~~~ 290 (392) T protein:vir:99 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYD--S----TITSNRSLIDT--YFGLKVVEDPNG 290 (392) T ss_pred cccccceeeeccccccccccccccccccceeEEecccceecceeeccc--c----eeeccccccce--eEEEEEEeeccc Confidence 8754321110 000000000000 000000001110000000 0 00011111100 0111111 11 Q ss_pred cceEE---EeeccCC-CCCCC--C--------CC Q lcl|NC_018838. 296 DSFAV---VKEKAAP-KPNPP--A--------GN 315 (315) Q Consensus 296 ~af~~---l~~~~a~-~~~~~--~--------~~ 315 (315) .+|.. ++....+ +.+|- . +. T Consensus 291 ~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~ 324 (392) T protein:vir:99 291 VGFVRARKIHLIPGSIEVAPEAGANATITAAAGE 324 (392) T ss_pred cceeeeeeeeeecceeeeeeeecccceeEeeecc Confidence 11111 1100000 01111 1 11 No 160 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.46 E-value=1.1e-07 Score=58.68 Aligned_cols=279 Identities=13% Similarity=0.085 Sum_probs=153.4 Q ss_pred CCCCccCCCceEcch--hHHHHHHHHHHhccchhhhcceee-cCC--CceEEEEEeCCceeEEeecc-cccCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPG--SMIGAVRDRAIDSGVLAKLSPEQP-TIF--GPVKGAVFSGVPRAKIVGEG-EVKPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~--~~~~~ii~~~~~~s~i~~l~~~~~-~~~--~~~~ip~~~~~~~a~wv~Eg-~~~~~s~~~~~~ 74 (315) |..+..+.|.|++.+ .+...|++..++.-..+++..+.. .+- -.+.+......+.+.|.+.+ ..+|..+..+.. T Consensus 29 ~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~ 108 (329) T protein:vir:79 29 AKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTS 108 (329) T ss_pred ceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCcccccceeecccce Confidence 333323334444432 356778888887777777766542 222 23566667777788998864 568888877777 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccc----- Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKT----- 149 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~----- 149 (315) .....+.++....++.+=|..... .--.|...-+...++++++.+|+-+++|+.. ....|+++........ T Consensus 109 ~~~~i~~~~~~~~~~~~El~~a~~-~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~---~g~~GLlN~p~v~~~~~~~~~ 184 (329) T protein:vir:79 109 EFGKVFRLGNAFLISIDEIKAGQR-TGKSLSTRKANAAQNAHDQLVNHLVFKGSKP---HKIISVFEHPNLTTINSAGWN 184 (329) T ss_pred eEEEEEEEEEEEEecHHHHHHHHH-hCCChHHHHHHHHHHHHHHhhccEEEeeccc---ccceeeecCCCccccccCCCC Confidence 777777777777776554433321 2234666677888899999999999999753 2234444432221100 Q ss_pred -ccc----ccchhHHHHHHHHHhhhc--ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCC-ccccceee Q lcl|NC_018838. 150 -VDA----TDSATTDLVKAVGLIAGA--GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DNWRGLNV 221 (315) Q Consensus 150 -~~~----~~~~~~di~~~~~~~~~~--~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~l~G~Pv 221 (315) ... ....++|+.+++.++... +...+..++|+|+.+..|.......|.-+.. -+....+ -+|.+ T Consensus 185 ~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~-----~lk~~~~~l~I~~--- 256 (329) T protein:vir:79 185 NAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLD-----YFKQQNGGITIES--- 256 (329) T ss_pred CccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHH-----HHHHhCCCcEEEE--- Confidence 001 112357888888888653 3344667999999888885433333321110 0111001 12222 Q ss_pred EeecccCccccccccccceEEEecc--cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec-cEeecccce Q lcl|NC_018838. 222 GASSTVSGAPEMSPASGVKAIVGDF--SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY-VAIESLDSF 298 (315) Q Consensus 222 ~~s~~v~~~~~~~~~~~~~~~~gDf--~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~-~~v~~~~af 298 (315) +|.....+.+.+..+++-+. ..+.+.....++ ...--.. .=...+.+..|++ ..+.+|.|| T Consensus 257 -----~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~q~~---------~~~~~v~~~~r~~Gv~i~~P~ai 320 (329) T protein:vir:79 257 -----ISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFN--MLTAQPK---------DLHFKVPCTSKCTGLTIYRPLTL 320 (329) T ss_pred -----cccccccCCCCceEEEEEecCCceEEEecCccee--eeeceec---------CceEEEceeeeEEEEEEECccee Confidence 23222233333444444333 333232233332 2221100 0012334456665 577999999 Q ss_pred EEEeeccCC Q lcl|NC_018838. 299 AVVKEKAAP 307 (315) Q Consensus 299 ~~l~~~~a~ 307 (315) +++++...- T Consensus 321 ~~~dGI~~~ 329 (329) T protein:vir:79 321 VLIKGLVVG 329 (329) T ss_pred eeeeeeeeC Confidence 999988765 No 161 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=98.39 E-value=4.4e-07 Score=55.42 Aligned_cols=289 Identities=12% Similarity=0.040 Sum_probs=127.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee-----c--CCCceEEEEEeCCceeEEe-ecccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP-----T--IFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~-----~--~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~ 72 (315) |+++-.+ .+|+.++.+.++.+++..++.+++.+-. . .+..++|++........+. .++..+...+..- T Consensus 1 MaN~llT----~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MPNNLDS----NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 9977444 3899999999999999999988875521 1 1345677765433222232 2332333333333 Q ss_pred ee--EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc Q lcl|NC_018838. 73 SA--FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV 150 (315) Q Consensus 73 ~~--v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~ 150 (315) ++ +++..+|...+ .++.+=+..+..+ ++++++.. .+++++.+|..++.-. ......... ... T Consensus 77 ~~v~l~id~~k~va~-~v~d~E~~~~i~~----~~~~l~~A-~~aLA~~vd~~ia~~~---~~~~~~~~g---t~~---- 140 (423) T protein:vir:10 77 GKATGRVGNYITVAV-EYQQLEEAIKLNQ----LEEILAPV-RQRIVTDLETELAHFM---MNNGALSLG---SPN---- 140 (423) T ss_pred ceeEEEeeceeeeee-eechHHHhcChhh----HHHHHHHH-HHHHHHHHHHHHHHHH---hhccccccc---cCC---- Confidence 33 55555655444 3444333333333 44545444 5778888888765321 000111110 010 Q ss_pred ccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccccccccCCC-ccccceeeEeecccC Q lcl|NC_018838. 151 DATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DNWRGLNVGASSTVS 228 (315) Q Consensus 151 ~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~l~G~Pv~~s~~v~ 228 (315) .....|.++.++-.+|...+...... .+++|.....|.+-. ..........-..+..++- +++.|+.|+.|+++| T Consensus 141 -t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~--~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip 217 (423) T protein:vir:10 141 -TPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQ--TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLA 217 (423) T ss_pred -cccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccc--cceecccccchhhhhhccceeeecceEEEEeCCCc Confidence 01234788888877776665544344 578898877764311 1000000001112334443 789999999999999 Q ss_pred ccccccccccceEEEecc-cceEEEeeccceEEEeccC-CccccchhhhhcCcEEEEE---EEEeccE------eecccc Q lcl|NC_018838. 229 GAPEMSPASGVKAIVGDF-SRVHWGFQRNFPIELIEYG-DPDQTGRDLKGHNEVMVRA---EAVLYVA------IESLDS 297 (315) Q Consensus 229 ~~~~~~~~~~~~~~~gDf-~~~~i~~~~~~~v~~~~~~-~~~~~~~~~f~~~~v~~r~---~~r~~~~------v~~~~a 297 (315) ............+-.+-. .+..........+.+.... +..+. +..-|.+.|-+ ..+.... -.++.- T Consensus 218 ~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~---l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~ 294 (423) T protein:vir:10 218 SRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGF---LKAGDQVKFTNTYWLQQQTKQALYNGATPISFT 294 (423) T ss_pred cccccccccceeeeecceeccccccccceeeeeeeeccccccCc---eeecceEEecceeeecccccccccccccCcceE Confidence 643222111000000000 0000000111111111000 00000 01111111111 0111111 112223 Q ss_pred eEEEee-----------ccCCCCCCCCCC Q lcl|NC_018838. 298 FAVVKE-----------KAAPKPNPPAGN 315 (315) Q Consensus 298 f~~l~~-----------~~a~~~~~~~~~ 315 (315) |+++.. .-.|++.+|+.+ T Consensus 295 ~~v~a~~~~~~~g~~tv~i~p~~i~~~~~ 323 (423) T protein:vir:10 295 ATVTADANSDSGGDVTVTLSGVPIYDTTN 323 (423) T ss_pred EEEEeeeeeccCCceeeeccCccccccCC Confidence 333321 122566666554 No 162 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.39 E-value=4.6e-08 Score=60.79 Aligned_cols=275 Identities=12% Similarity=0.042 Sum_probs=127.8 Q ss_pred CCC------CccCCCceE---cchhHHHHHHHHHHhccchhhhcceeecCCCc-e-EEEEEeCCceeEEeecccccCCCc Q lcl|NC_018838. 1 MAD------DFLSAGKLE---LPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-V-KGAVFSGVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~------~~~s~Gg~~---vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~-~ip~~~~~~~a~wv~Eg~~~~~s~ 69 (315) |-. .-.....-+ .--.+.+++=+-+....-+....|.+||..|. + .+|.++-...++-|+||++||.++ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplsk 80 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccchhh Confidence 321 111111111 11233444433344444445556888998765 5 456688888899999999999999 Q ss_pred cceee---EEEeeEEEEEeehhhHHHhc-cChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccc Q lcl|NC_018838. 70 VDVSA---FTAQPIKVVTQQRVSDEFMW-ADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDK 145 (315) Q Consensus 70 ~~~~~---v~l~~~kl~~~~~iS~ell~-~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~ 145 (315) .+-.. .++..+|.+.-+ |.|.++ ....+....- .+.|.++|++++|+.++.-....++. . .. T Consensus 81 vt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVget----d~qL~~~iq~kId~d~~t~LktaT~t-----~---~~ 146 (296) T protein:vir:98 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNT----DNALVRQLQKKIRTDFVTALKTGTGT-----Q---DA 146 (296) T ss_pred heeeecceEEEEeecccccc--CHHHHHhhcCCchhHHH----HHHHHHHHHHhhhHHHHHHHhcccce-----e---ee Confidence 88753 677788887774 999984 5555555444 45566666666776665432111100 0 00 Q ss_pred cccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 146 TTKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 146 ~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) .... .....+..+.++.....+ +.......++||...+.+++ +++ +.-+..| .++.. -.++|.-|+.|. T Consensus 147 t~~~--lQ~Ala~~~~~l~~~fed-ed~~~~V~FVnP~D~a~ylg--~a~---it~qt~f-G~tyl--~nfLG~~II~S~ 215 (296) T protein:vir:98 147 LGAG--LQGALASAWGKLQVLFED-YGSERAIVFANSLDVAEYIA--KAG---ITTQTAF-GLTYL--VDFTGTVIISTN 215 (296) T ss_pred chhh--HHHHHHHHhhhhhhhccc-cCCCceEEEEehHHHHHHhc--CCc---cchhhee-chhhh--hhccccEEEEcC Confidence 0000 000001112222233322 22334568899999888743 221 1111111 11111 028899999999 Q ss_pred ccCccccccccccceEEE-ecccceEEEeeccceEEEeccCCcccc---chhhhhcCcEEEEEEEEeccE--eecccceE Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIV-GDFSRVHWGFQRNFPIELIEYGDPDQT---GRDLKGHNEVMVRAEAVLYVA--IESLDSFA 299 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~-gDf~~~~i~~~~~~~v~~~~~~~~~~~---~~~~f~~~~v~~r~~~r~~~~--v~~~~af~ 299 (315) .+|.........+.+.++ .|.+. +++.-...-+.+..+. .++- ..+...+......++. .-++++++ T Consensus 216 kV~~G~~~~T~~~Ni~~ay~~~~~------~~l~~~f~~~~d~tglIGv~h~~-~~~~~t~eT~~~~~~~lfpE~~dgiv 288 (296) T protein:vir:98 216 DVTKGEIWATVPENIIFAYINPNN------SELAKEFNLYGDPTGYIGMNHFQ-ENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) T ss_pred cCCCceEEEeeecceEEEeecccc------cchhhhhccccccccceEEEecc-ccceeeehhHhHhHHHhcccccceEE Confidence 999665544444333321 12110 1111100111100000 0000 0000011111111111 24567777 Q ss_pred EEeeccCC Q lcl|NC_018838. 300 VVKEKAAP 307 (315) Q Consensus 300 ~l~~~~a~ 307 (315) +.+..++- T Consensus 289 ~~tI~~~~ 296 (296) T protein:vir:98 289 KVTLTPGV 296 (296) T ss_pred EEEecCCC Confidence 77664333 No 163 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.37 E-value=9.9e-07 Score=53.51 Aligned_cols=285 Identities=8% Similarity=0.055 Sum_probs=129.8 Q ss_pred CCCCccCCCceEcch--hHHHHHHHHHHhccchhhhccee---------ecCCCceEEEEEeC-Cc--eeEEeecc--cc Q lcl|NC_018838. 1 MADDFLSAGKLELPG--SMIGAVRDRAIDSGVLAKLSPEQ---------PTIFGPVKGAVFSG-VP--RAKIVGEG--EV 64 (315) Q Consensus 1 m~~~~~s~Gg~~vP~--~~~~~ii~~~~~~s~i~~l~~~~---------~~~~~~~~ip~~~~-~~--~a~wv~Eg--~~ 64 (315) ||.+..+ ...+|+ .+..-+.+.-.+.+.+.+-+-+. ..++..+.+|.+.. +. +..+-..+ +. T Consensus 1 Ma~T~l~--D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTIG--DIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEe--eeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 9987665 677887 46665655555555544433222 23345589999864 22 22222222 22 Q ss_pred cCCCccc-eeeEEEeeEEEE--EeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHH---Hhhhccc---ccccccc Q lcl|NC_018838. 65 KPSASVD-VSAFTAQPIKVV--TQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVD---LIAFHGI---DPATGKP 135 (315) Q Consensus 65 ~~~s~~~-~~~v~l~~~kl~--~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d---~a~~~G~---g~~~~~~ 135 (315) .+..+.+ ..++-...++-. ..-.++.++- -.|... .|+++++....+... .+++.|. +...... T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~ls---G~dpm~----~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~ 151 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT---SQNPLQ----SVASRLDNFWQRQAQRRLIATALGLYNDNVSATDA 151 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh---CchHHH----HHHHHHHHHHhhHHHHHHHHHHHHhhcccccccch Confidence 3333332 333332222222 2233455542 124333 344444443333222 2333332 1111111 Q ss_pred cccccccccccccccccccchhHHHHHHHHHhhhc----ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC Q lcl|NC_018838. 136 AAAVKVSLDKTTKTVDATDSATTDLVKAVGLIAGA----GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA 211 (315) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~di~~~~~~~~~~----~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~ 211 (315) ...... .... ..+.+......+.++...+.+. +...-++++||+.+...|+++..-.- +.+.-... T Consensus 152 ~~~~~~-~t~d--~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~-------i~~s~~~~ 221 (349) T protein:vir:78 152 YHEQND-MVVD--VSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF-------IRDAENNT 221 (349) T ss_pred hhhccc-ceee--eccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhh-------ccCcccCc Confidence 000000 0000 0011112334455555444332 34455689999999999987753211 10111122 Q ss_pred CCccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhcCcEEEEEEEEecc Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV 290 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~ 290 (315) .-++++|++|++++.||-....+.......+||. ..+.++.-.. ..+++.++.... -..++-.+....++ T Consensus 222 ~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~et~rd~~~g------~~~G~d~l~~R~~~-- 292 (349) T protein:vir:78 222 MFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQ-GAIGYGEGNPVMPLEYEREASRA------NGGGVETLWTRKTW-- 292 (349) T ss_pred ccceecCeEEEEeCCCccccCCCCceEEEEEeec-ceEEEccCCCccceeeecccccC------CcceeEEEEEeeEE-- Confidence 3468999999999999965433333334456664 4454554332 235555554321 01233344444443 Q ss_pred EeecccceEEEeeccCCCC------CCCCC-C Q lcl|NC_018838. 291 AIESLDSFAVVKEKAAPKP------NPPAG-N 315 (315) Q Consensus 291 ~v~~~~af~~l~~~~a~~~------~~~~~-~ 315 (315) +.||..|...+...+ .+ .-|.. + T Consensus 293 -~~hp~G~s~~~a~v~-~~~~~~~~~sPt~ae 322 (349) T protein:vir:78 293 -LLHPFGYRFTSAVIT-GNGTETIARSASWQD 322 (349) T ss_pred -Eeeeeeeeecccccc-CCccccccCCCChHH Confidence 578888776654322 11 11111 1 No 164 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=98.35 E-value=1.1e-06 Score=53.18 Aligned_cols=286 Identities=8% Similarity=0.062 Sum_probs=131.5 Q ss_pred CCCCccCCCceEcch--hHHHHHHHHHHhccchhhhccee---------ecCCCceEEEEEeC-Cce--eEEeecc--cc Q lcl|NC_018838. 1 MADDFLSAGKLELPG--SMIGAVRDRAIDSGVLAKLSPEQ---------PTIFGPVKGAVFSG-VPR--AKIVGEG--EV 64 (315) Q Consensus 1 m~~~~~s~Gg~~vP~--~~~~~ii~~~~~~s~i~~l~~~~---------~~~~~~~~ip~~~~-~~~--a~wv~Eg--~~ 64 (315) ||.+..+ ...||+ .+..-+.+.-.+.+.+.+-+-++ ..++..+.+|.+.. ... ..+-+.. .. T Consensus 1 Ma~T~l~--D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITTIG--NIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEe--eeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 9987665 567887 46666655555555555443332 23345579998864 222 2222222 12 Q ss_pred cCCCccc-eeeEEEeeEE--EEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHH---hhhccc---ccccccc Q lcl|NC_018838. 65 KPSASVD-VSAFTAQPIK--VVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDL---IAFHGI---DPATGKP 135 (315) Q Consensus 65 ~~~s~~~-~~~v~l~~~k--l~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~---a~~~G~---g~~~~~~ 135 (315) .+..+.+ ..++-...++ --..-.++.++- -.|.... |+++++....+...+ +++.|. +...... T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~ls---G~dpm~~----Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~ 151 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT---SQNPLQS----VASRLDNFWQRQAQRRLIATALGLYNDNVSATDA 151 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh---CchHHHH----HHHHHHHHHhhHHHHHHHHHHHhhhccccccccc Confidence 3333433 2333222222 223344555542 1233333 444444433333222 333332 1000000 Q ss_pred cccccccccccccccccccchhHHHHHHHHHhhhc----ccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC Q lcl|NC_018838. 136 AAAVKVSLDKTTKTVDATDSATTDLVKAVGLIAGA----GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA 211 (315) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~di~~~~~~~~~~----~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~ 211 (315) ... ....... ..+.+......+.++...+-+. +...-++++||+.+...|++++.-.- +.+.-... T Consensus 152 ~~~-~~~~~~d--~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~-------i~~s~~~~ 221 (349) T protein:vir:94 152 YHE-QNDMVVD--VSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF-------IRDAENNT 221 (349) T ss_pred ccc-cCceeEE--ecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhh-------ccCcccCc Confidence 000 0000000 0011122234455555544332 33445679999999999987753211 10111122 Q ss_pred CCccccceeeEeecccCccccccccccceEEEecccceEEEeec-cceEEEeccCCccccchhhhhcCcEEEEEEEEecc Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQR-NFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV 290 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~-~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~ 290 (315) +-++++|++|++++.||-....+.......+||. ..+.++.-. ...+++.++..... ..++-.+....|+ T Consensus 222 ~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~E~~rd~~~g~------~~G~d~L~~R~~~-- 292 (349) T protein:vir:94 222 MFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQ-GAIGYGEGNPEMPLEYEREASRAN------GGGVETLWTRKTW-- 292 (349) T ss_pred ccceecCcEEEEeCCCccccCCCCceEEEEEeec-ceEEeecCCCCcceeeecccccCC------cceeEEEEEeeEE-- Confidence 3468999999999999965444444444456664 445555433 22355565543211 1222333333333 Q ss_pred EeecccceEEEeeccCCCC------CCCCCC Q lcl|NC_018838. 291 AIESLDSFAVVKEKAAPKP------NPPAGN 315 (315) Q Consensus 291 ~v~~~~af~~l~~~~a~~~------~~~~~~ 315 (315) +.||..|...+...+..+ .|-..+ T Consensus 293 -~~hp~G~s~~~a~v~~~~~~~~~~sPt~ae 322 (349) T protein:vir:94 293 -LLHPFGYSFTSAVITGNGTETIARSASWQD 322 (349) T ss_pred -EeeeeeeeecccccCCCccccccCCCChHH Confidence 578888876654332111 121111 No 165 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.29 E-value=1.3e-06 Score=52.93 Aligned_cols=293 Identities=9% Similarity=0.008 Sum_probs=135.1 Q ss_pred CCCCccCC--CceEcchhHHHHHHHHHHhccchhhhcceee---------cCCCceEEEEEeCC-ceeEEeecccc---c Q lcl|NC_018838. 1 MADDFLSA--GKLELPGSMIGAVRDRAIDSGVLAKLSPEQP---------TIFGPVKGAVFSGV-PRAKIVGEGEV---K 65 (315) Q Consensus 1 m~~~~~s~--Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~---------~~~~~~~ip~~~~~-~~a~wv~Eg~~---~ 65 (315) |+.-...+ ...++|+.+..-+.+...+.+.+.+-+-+.+ .++..+.+|.+..- ....-+.|... . T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 88644322 4579999887777766666666554443332 34455899998543 33333333332 3 Q ss_pred CCCcccee-e--EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc-c------ Q lcl|NC_018838. 66 PSASVDVS-A--FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK-P------ 135 (315) Q Consensus 66 ~~s~~~~~-~--v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~-~------ 135 (315) +..+.+-+ + +.+..-|--....++.++- -.|....+.+++++.-.+.. .+.-.+++.|.=..+.. . T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~ls---G~dpm~~Ia~qva~yW~r~~-q~~Lla~L~Gvf~~~~a~~~~~~~~ 156 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRNRFGVYWTRQW-QRRIIAMAVGVYKSNLAGNFATIKT 156 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhh---CchHHHHHHHHHHHHhhhhh-HHHHHHHHHHhhccccccchhhhhh Confidence 33333322 2 2222233333345555542 23443333333332222221 11222333332100000 0 Q ss_pred -------cccccccc--ccccccc-ccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccc Q lcl|NC_018838. 136 -------AAAVKVSL--DKTTKTV-DATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) Q Consensus 136 -------~~~~~~~~--~~~~~~~-~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~ 205 (315) ..+..+.. ...+... +........+.++..++-+ +...-.+++||+.++..|++++.-. ++- T Consensus 157 ~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD-~~~~l~~i~mHS~V~~~L~~~~li~-------~i~ 228 (367) T protein:vir:80 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGD-HVGSIAAIAVHSMVYKRMTNNDEIE-------FIP 228 (367) T ss_pred hhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcc-ccccccEEEEchHHHHHHHhccccc-------ccc Confidence 00000100 0011110 1122334566777666644 4445678999999999998875311 111 Q ss_pred cccccCCCccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhcCcEEEEE Q lcl|NC_018838. 206 PAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHNEVMVRA 284 (315) Q Consensus 206 ~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~~~v~~r~ 284 (315) +.-...+-++++|++|++++.||-....+.......+||. ..+.++.... ..+++.++..... ..++-.+.. T Consensus 229 ~sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~-GAi~~~~~~~~~~~E~~Rd~~~~~------~gG~d~L~~ 301 (367) T protein:vir:80 229 DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------GSGLEYILE 301 (367) T ss_pred CCCCccccceecceeEEEeCCCcccccCCCceEEEEEEec-ceeeecccCCccceecccchhhhc------CCceEEEEe Confidence 1111234578999999999999965444444444556664 3344443322 2245555542210 012222222 Q ss_pred EEEeccEeecccceEEEeecc-CCCC-CCCCC-----------C Q lcl|NC_018838. 285 EAVLYVAIESLDSFAVVKEKA-APKP-NPPAG-----------N 315 (315) Q Consensus 285 ~~r~~~~v~~~~af~~l~~~~-a~~~-~~~~~-----------~ 315 (315) ..| .+.||-.|...+... +|.. .+|+| + T Consensus 302 Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~e 342 (367) T protein:vir:80 302 RKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) T ss_pred eee---EEeecceeeecccccccccccccccccccccCCCChHH Confidence 233 478898887654432 2211 11111 1 No 166 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=98.28 E-value=9.2e-07 Score=53.67 Aligned_cols=285 Identities=12% Similarity=0.025 Sum_probs=126.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee-----c--CCCceEEEEEeCCceeEEe-ecccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP-----T--IFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~-----~--~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~ 72 (315) |+++-.+ .+|+.++.+.++.+++..++.+++.+-. . .+..++||+........+- ..+..+..++..- T Consensus 1 MaN~llT----~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPNNLDS----NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 9987553 3899999999999999999988875522 1 1345777763322111221 1222233333332 Q ss_pred e--eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc Q lcl|NC_018838. 73 S--AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV 150 (315) Q Consensus 73 ~--~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~ 150 (315) + .+++..+|...+ .++.+=+..+..+ ++++++.. .+++++.+|..++.-. .......... .. . T Consensus 77 ~~v~l~id~~k~va~-~v~d~E~~~~i~~----~~~~l~~A-~~aLA~~vd~~ia~~~---~~~a~~~~gt---~~-t-- 141 (423) T protein:vir:17 77 GKATGRVGNYITVAV-EYQQLEEAIKLNQ----LEEILAPV-RQRIVTDLETELAHFM---MNNGALSLGS---PN-T-- 141 (423) T ss_pred ceeEEEeeceeeeee-eecHHHHhcChhH----HHHHHHHH-HHHHHHHHHHHHHHHH---hhcccccccc---CC-c-- Confidence 3 355555655444 4444433333333 44444444 5778888887654321 0000111110 00 0 Q ss_pred ccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccccccccCCC-ccccceeeEeecccC Q lcl|NC_018838. 151 DATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DNWRGLNVGASSTVS 228 (315) Q Consensus 151 ~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~l~G~Pv~~s~~v~ 228 (315) ....|.++.++-..|...+...... .+++|.....|.+-. ..........-..+..++- +++.|+.|+.|+++| T Consensus 142 --~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~--~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip 217 (423) T protein:vir:17 142 --PITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQ--TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLA 217 (423) T ss_pred --ccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccc--cceecccccchHHHhhccceeeecceEEEEeCCCc Confidence 1134788888888886665544444 578898877764311 1000000000112334443 789999999999999 Q ss_pred ccccccccccceEEEecc-cceEEEee--ccceEEEe---ccCCccccchhhhhcCcEEEEE---EEEecc------Eee Q lcl|NC_018838. 229 GAPEMSPASGVKAIVGDF-SRVHWGFQ--RNFPIELI---EYGDPDQTGRDLKGHNEVMVRA---EAVLYV------AIE 293 (315) Q Consensus 229 ~~~~~~~~~~~~~~~gDf-~~~~i~~~--~~~~v~~~---~~~~~~~~~~~~f~~~~v~~r~---~~r~~~------~v~ 293 (315) .............-.+.. ........ ..+.+... .+.+ +...|.+.|-+ ..+... ... T Consensus 218 ~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~-------l~~GD~~t~aGv~~v~~~tk~v~~~~~t~ 290 (423) T protein:vir:17 218 SRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGF-------LKAGDQVKFTNTYWLQQQTKQALYNGATP 290 (423) T ss_pred cccccceeceeeecccccccccccccccceeeeeeeeeeeccCc-------eeecceEEecceeeecccccccccccccc Confidence 543222111000000100 00000000 00001100 0110 01111111111 001111 111 Q ss_pred cccceEEEe-----------eccCCCCCCCCCC Q lcl|NC_018838. 294 SLDSFAVVK-----------EKAAPKPNPPAGN 315 (315) Q Consensus 294 ~~~af~~l~-----------~~~a~~~~~~~~~ 315 (315) ++.-|.+.. ..-.|++.||+.+ T Consensus 291 ~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~ 323 (423) T protein:vir:17 291 ISFTATVTADANSDSSGDVTVTLSGVPIYDTTN 323 (423) T ss_pred cceEEEEEecccccccCceEEEecCccccccCC Confidence 222333321 1122566666655 No 167 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.25 E-value=4e-07 Score=55.65 Aligned_cols=230 Identities=13% Similarity=0.047 Sum_probs=139.1 Q ss_pred CCCCcc-----CC-CceEcchhHHHHHHHHHHhccchhhhcceeecC-CCceEEEEEeCCceeEEeecccccCCCcccee Q lcl|NC_018838. 1 MADDFL-----SA-GKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~-----s~-Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~ 73 (315) |++=.. .. ...+-|......|||.+.+.+.|....++.... .....+.+.++-|+++|..=++..++++.++. T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt~ 80 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTTV 80 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccceeE Confidence 544321 11 233456667778999999999999999888775 34478889999999999999999999999999 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---------------------- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA---------------------- 131 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~---------------------- 131 (315) +++-..+-+++.+.|.+.+..... +.. .+...-...+.+++...+...+|||+... T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~G-n~~-~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~q 158 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNG-NTA-EFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQN 158 (328) T ss_pred EEEEEEEEEecceeechHHHhhcC-CHH-HHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccc Confidence 999999999999999998875543 222 23333445677888888888889884210 Q ss_pred ------ccccccccccc---------cc--cc------------------cc---------------------------c Q lcl|NC_018838. 132 ------TGKPAAAVKVS---------LD--KT------------------TK---------------------------T 149 (315) Q Consensus 132 ------~~~~~~~~~~~---------~~--~~------------------~~---------------------------~ 149 (315) ++...+++.-. .. .. .. - T Consensus 159 iidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 159 IIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred eeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 00010000000 00 00 00 0 Q ss_pred ccc----ccchhHHHHHH-HHHhh--hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeE Q lcl|NC_018838. 150 VDA----TDSATTDLVKA-VGLIA--GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVG 222 (315) Q Consensus 150 ~~~----~~~~~~di~~~-~~~~~--~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~ 222 (315) ++. .....+++.++ +.++. ++.......|.||.+....|++.....+.. +....+.....+-.++|.||. T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~---~~~~~~~~g~~~t~~~gipir 315 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSL---AISVKETEGEWWTSFRGVPIR 315 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcce---eeeeeccCCcceeEECCeEEE Confidence 000 00112233333 22221 223333456999999999998875444321 111122222234579999999 Q ss_pred eecccCccccccccccceEE Q lcl|NC_018838. 223 ASSTVSGAPEMSPASGVKAI 242 (315) Q Consensus 223 ~s~~v~~~~~~~~~~~~~~~ 242 (315) .++.+-.. ...++ T Consensus 316 ~~dai~~t-------E~~vv 328 (328) T protein:vir:95 316 ETDALLET-------EARVV 328 (328) T ss_pred EEeeeecC-------ccccC Confidence 88877521 12222 No 168 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.11 E-value=4.4e-06 Score=49.93 Aligned_cols=280 Identities=14% Similarity=0.057 Sum_probs=127.0 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec-----C--CCceEEEEEeCCceeEEe-ecccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT-----I--FGPVKGAVFSGVPRAKIV-GEGEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~-----~--~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~ 72 (315) ||++-.+ .||+.++.+.++.+++..++.+++.+-.- . +..++||+........+- +.+..+...+..- T Consensus 1 MAN~llT----~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MANNLES----NISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 9977543 38999999999999999999988755221 1 245678865432222221 1122223333333 Q ss_pred ee--EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc Q lcl|NC_018838. 73 SA--FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV 150 (315) Q Consensus 73 ~~--v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~ 150 (315) .+ +++..+|. ..+.++.+=+.++..+ +++++... ++++++++|..++...- ... .+.+...+ T Consensus 77 ~~v~l~id~~k~-~a~~v~d~e~~l~i~~----~~~~l~~a-~~ala~~vd~~l~~~l~--~~a-----~~~vgt~~--- 140 (423) T protein:vir:35 77 AKATGKVGKYIT-VAVEWTQIEEALKLNQ----LDQILSPI-HERMVTDLETELAHFMM--NNG-----ALSLGSPN--- 140 (423) T ss_pred ceeeEEecccee-ccceeCHHHHHhhHHH----HHHHHHHH-HHHHHHHHHHHHHHHHh--hcc-----cccccccc--- Confidence 33 44555554 3445555544333333 44445444 45667777776543210 000 01100000 Q ss_pred ccccchhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCccccccc-cccccccCCC-ccccceeeEeeccc Q lcl|NC_018838. 151 DATDSATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQP-MYPAAGFAGL-DNWRGLNVGASSTV 227 (315) Q Consensus 151 ~~~~~~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~-~~~~~~~~~~-~~l~G~Pv~~s~~v 227 (315) .....|+++.++-..|...+......| +++|.....|.+- ..+-..... .-..+..++. +++.|+.|+.|+++ T Consensus 141 -t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~---~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnv 216 (423) T protein:vir:35 141 -TAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADA---QSGLHAADQLVRTAWENAQISGNFGGIRALMSNGL 216 (423) T ss_pred -CCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhcc---ccceeccccchhHHHhhccceeeecceEEEEcCCC Confidence 112347888888888876665544455 7889887776421 111000000 0012334443 78999999999999 Q ss_pred CccccccccccceEE----------EecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEe---ccE--- Q lcl|NC_018838. 228 SGAPEMSPASGVKAI----------VGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVL---YVA--- 291 (315) Q Consensus 228 ~~~~~~~~~~~~~~~----------~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~---~~~--- 291 (315) |......... .... ..+-+...++... .. +..+.+ +...|.+.|-+..-+ ... T Consensus 217 p~~T~gt~~~-~~~v~~a~~v~~~a~~~~~~~~~~~~~-~~--~~~~g~-------l~~GD~~t~aGv~~v~~~t~~~~~ 285 (423) T protein:vir:35 217 ASRKQGDFDG-AITVKTAPNVDYLSVKDSYQFTVALTG-AT--PSKTGF-------LKAGDQLKFTSTHWLNQQSKQTLY 285 (423) T ss_pred cccccccccc-ceeeccccccccccccccccceeeeee-ee--eccCCc-------EEecceEEeeeeeeccccccceee Confidence 9532222111 0110 0111111111100 00 011110 111121221111000 000 Q ss_pred ---eecccceEEE-----------eeccCCCCCCCCCC Q lcl|NC_018838. 292 ---IESLDSFAVV-----------KEKAAPKPNPPAGN 315 (315) Q Consensus 292 ---v~~~~af~~l-----------~~~~a~~~~~~~~~ 315 (315) -.++.=|+++ +-.-.|+++||+.+ T Consensus 286 ~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~ 323 (423) T protein:vir:35 286 NGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKN 323 (423) T ss_pred cccCCceeEEEEeccccccccCceeEEccccccccCCC Confidence 0011122221 12233567777776 No 169 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.09 E-value=2.4e-06 Score=51.43 Aligned_cols=286 Identities=11% Similarity=-0.025 Sum_probs=123.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhc---c----eeecCCCceEEEEEeCCc----eeEEeecccccCCCc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLS---P----EQPTIFGPVKGAVFSGVP----RAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~---~----~~~~~~~~~~ip~~~~~~----~a~wv~Eg~~~~~s~ 69 (315) |+-....- -.+......++.+.+.......+ . ..+..+.-+.+|.+.+-. +..-+.+....+.++ T Consensus 1 m~lsD~~v----fN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDLAV----YSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhhhh----hhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 65543331 23333444555544432222211 1 112222334677765321 223344444454444 Q ss_pred cc-eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc-cccc Q lcl|NC_018838. 70 VD-VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL-DKTT 147 (315) Q Consensus 70 ~~-~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~-~~~~ 147 (315) .+ ..++....+.=.+......+.+.....+ -..+...|.+.+++...+.+-+.++.+.... .++....+ ...+ T Consensus 77 itt~~~~av~~~r~~g~~~~d~~~~~~g~~~-~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a----~~~~~~~v~dis~ 151 (325) T protein:vir:95 77 LKHLVDTSVKVAAGTPPVRLDPGQFRWIQQN-PEVAGAAMGQQLAVDTMADMLNVGLGSVYSA----LSQVSDVVYDATA 151 (325) T ss_pred eccccceeeEEecccCcccccHHHHhhcCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcccccceeeeec Confidence 33 3344333332223222222222222222 2223345666666655444444443322100 00000100 0010 Q ss_pred ccc-ccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecc Q lcl|NC_018838. 148 KTV-DATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) Q Consensus 148 ~~~-~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~ 226 (315) ... .........+.++..++ ..+...-..|+||+.++..|+++...+.. .++.......-.+++|++|+++|. T Consensus 152 ~~~~~~~~~s~~~l~~A~~kl-GD~~~~l~~~~MHS~v~~~L~~~~L~~~~-----~~~~~~g~~~i~t~~G~~VIVdD~ 225 (325) T protein:vir:95 152 NTDAADKLPTWNNLNNGQAKF-GDQSSQIAAWIMHSTPMHKLYGSNLTNGE-----RLFTYGTVNVVRDPFGKLLVMTDS 225 (325) T ss_pred ccCcccccccHHHHHHHHHHh-cccccceeEEEEchHHHHHHHHhhccccc-----cccccCCcccccccCCcEEEEeCC Confidence 100 00112345677777776 44556667899999999999886654321 122111112235789999999999 Q ss_pred cCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 227 VSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 227 v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) ||.............+||. ..+.++...+......+.. .-++-...+|.+.. -++||..+..-+. . T Consensus 226 ~p~~~~g~~~~ytty~lg~-GAi~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~t---f~lhp~G~sw~~s--~ 291 (325) T protein:vir:95 226 PNLFAAGTPNVYHILGLVP-GGVLIGQNNDFDANEETKN--------GDENIIRTYQAEWS---YNIGVKGFAWDKA--N 291 (325) T ss_pred CCCCCccCceeEEEEEEec-CeEEecCCCCccccccccC--------cccceeeeeeeeee---EEeecceeeeecc--c Confidence 9865433333333344443 4444544444332222111 11222233333221 3578998887332 1 Q ss_pred CCCCCCCCC Q lcl|NC_018838. 307 PKPNPPAGN 315 (315) Q Consensus 307 ~~~~~~~~~ 315 (315) ....|-..+ T Consensus 292 ~g~sPt~ae 300 (325) T protein:vir:95 292 GGKSPTDAA 300 (325) T ss_pred ccCCcChHh Confidence 112322222 No 170 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.05 E-value=2.4e-06 Score=51.43 Aligned_cols=284 Identities=11% Similarity=-0.026 Sum_probs=150.9 Q ss_pred CCCCccC---CCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCcee-EEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLS---AGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRA-KIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s---~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s~~~~~~v~ 76 (315) ||.-+.. .-....-..++++|...-....|+.++.......+...++...+-...+ .-..||++.+.....-.... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 7665443 2333455678888888877888887776554444444566654433322 23458877665432221111 Q ss_pred EeeEE-EEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cc-----cccccccccccc---- Q lcl|NC_018838. 77 AQPIK-VVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA-TG-----KPAAAVKVSLDK---- 145 (315) Q Consensus 77 l~~~k-l~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~-~~-----~~~~~~~~~~~~---- 145 (315) =+.-. +...+.||.-....+.......+..++.. -..++.+-++.++++|.-.. ++ -...|+...+.. T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~k-k~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~ 159 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAK-KSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSL 159 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHH-HHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCcee Confidence 11111 12223333322211111111123333333 34567788888999986311 11 122233222111 Q ss_pred -----------cccc--cccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 146 -----------TTKT--VDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 146 -----------~~~~--~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) .... .+......+++.+++.++..+.. .++.+++++.....|.++....+..+.... .+-..+. T Consensus 160 ~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg-~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~--~~~~~g~ 236 (317) T protein:vir:88 160 GANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG-QANSIQTSSSIKKAISKNMKGRATEITLDA--SDNRIAQ 236 (317) T ss_pred ccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC-CCCEEEeChHHHHHHHHHhcCCceeEEEcc--cCeEEEE Confidence 0011 11123456788999999988764 456788999999999888543332221000 0000000 Q ss_pred -Cc---cccc-eeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEE Q lcl|NC_018838. 213 -LD---NWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAV 287 (315) Q Consensus 213 -~~---~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r 287 (315) .. +=+| +.++.+.+||.. .+++.|++++.+..-+++..+-+... -+......... T Consensus 237 ~v~~~~tdfG~v~ii~~r~lp~~---------~~~~~D~~~~~l~~Lr~~~~e~laKt-----------Gd~~k~~i~~E 296 (317) T protein:vir:88 237 TVDVYESDFGKYTIRANRWFHEN---------TLFVFDPKMHSLCYLRPFFQHELAKT-----------GDSEKRQLLVE 296 (317) T ss_pred EEEEEEeCCeEEEEEeCCCCCCC---------eEEEEcccccceeecccceeeccCCC-----------cccceeEEEEE Confidence 00 1112 377788888742 47788999888777666544433221 12233455677 Q ss_pred eccEeecccceEEEeeccCCC Q lcl|NC_018838. 288 LYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 288 ~~~~v~~~~af~~l~~~~a~~ 308 (315) ++..+.+++|.++|...+++- T Consensus 297 ~tLe~~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 297 YTFRVNNEKSGALIRDVVAQL 317 (317) T ss_pred EEEEEcCccceeEEEEecccC Confidence 999999999999999988877 No 171 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.94 E-value=1.4e-06 Score=52.62 Aligned_cols=279 Identities=10% Similarity=-0.042 Sum_probs=145.1 Q ss_pred cCCCceEcchh--HHHHHHHHHHhccchhhhcceeec---CCCceEEEEEeCCceeE--Eeecc-cccCCCccceeeEEE Q lcl|NC_018838. 6 LSAGKLELPGS--MIGAVRDRAIDSGVLAKLSPEQPT---IFGPVKGAVFSGVPRAK--IVGEG-EVKPSASVDVSAFTA 77 (315) Q Consensus 6 ~s~Gg~~vP~~--~~~~ii~~~~~~s~i~~l~~~~~~---~~~~~~ip~~~~~~~a~--wv~Eg-~~~~~s~~~~~~v~l 77 (315) -|+..|++.+- +.++|.+.-.+.-..+++..+... ....+.+...+..+.+. |++.+ .++|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 56666775531 234444433344444555444322 22235555555555666 98754 668888888888777 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccc------- Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTV------- 150 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~------- 150 (315) ..+.++.-..+|.+=|+..... -..|.+.=++...+++...+|+..++|..+. ....|+++......... T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~-g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~--~g~~GllN~p~v~~~~~~~~~a~~ 157 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLL-GLALNTAKIMALNKNAQQTLQKVAFLGHAKD--SRLTGLLNNKSVEVYAIKGAAQNT 157 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHh-CCCcHHHHHHHHHHHHHhhhceEEEEeeccc--cceEEEEeCCCcceeeecCCccCC Confidence 7777777766665544443221 2235555566777788999999999996431 12334444322211100 Q ss_pred c----cccchhHHHHHHHHHhhhccc--ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEee Q lcl|NC_018838. 151 D----ATDSATTDLVKAVGLIAGAGL--QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGAS 224 (315) Q Consensus 151 ~----~~~~~~~di~~~~~~~~~~~~--~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s 224 (315) . ......+||.+++.++..... ..++.++|.|+.+..|......++......++ ....+ -..|+|+-+ T Consensus 158 ~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l----~~n~~-~~~g~~l~I- 231 (304) T protein:vir:52 158 KVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFL----TKHLS-AAAGRQVAI- 231 (304) T ss_pred ccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHH----HHhcc-cccCCcceE- Confidence 0 011234577788887754333 44567999999888885433222221111111 01011 123555432 Q ss_pred cccCc-cccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcE--EEEEEEEecc-EeecccceEE Q lcl|NC_018838. 225 STVSG-APEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEV--MVRAEAVLYV-AIESLDSFAV 300 (315) Q Consensus 225 ~~v~~-~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v--~~r~~~r~~~-~v~~~~af~~ 300 (315) ..++. ....+.+.+..+++-+.+.=.+.+.-.+.+..+.. ..+|.. .+=+..|+|+ .+.+|.++++ T Consensus 232 ~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~----------q~~~~~~~~vp~~~r~gGv~v~~P~a~~y 301 (304) T protein:vir:52 232 KALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDA----------QPKGLLAFESGLRMAFGGVTFMEPDSALY 301 (304) T ss_pred EEecccccccCCCCceEEEEEecChhheEEecCccccccch----------hhcCCceEEecceeeeeeEEEEccceeee Confidence 11221 12233344444555454432333333333333332 334432 3335677766 6789999999 Q ss_pred Eee Q lcl|NC_018838. 301 VKE 303 (315) Q Consensus 301 l~~ 303 (315) +.. T Consensus 302 ~D~ 304 (304) T protein:vir:52 302 VDY 304 (304) T ss_pred ecC Confidence 999 No 172 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=280 Identities=8% Similarity=-0.052 Sum_probs=134.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhh--hc--ceeecCCCceEEEEEeCCceeEEe-ecccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAK--LS--PEQPTIFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~--l~--~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~~~v 75 (315) .|+-+.+...+..-+-++.. ++.+.....+-. ++ .....+++.++||+.....-..+- ..+-....-+.+.... T Consensus 19 ~~~~~~~~nt~~l~~k~~~~-LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~ 97 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGI-LERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTY 97 (319) T ss_pred hhccCCCcchHHHHHHHHHH-HHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEE Confidence 66777777777777777664 444444333221 12 245566788999998764333331 1221111222333344 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) +|.-.|.-.+. |-. +...+.+........+.+.....+.-.+|.-.+...-...+.. . . . ...... T Consensus 98 tidqdR~~~F~-VD~--~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------~-~---~-~~t~~n 163 (319) T protein:vir:97 98 FLDQEKYWGRF-VDA--LDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------L-T---V-GTGSDA 163 (319) T ss_pred Eeecccccccc-cch--hhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------c-c---c-ccCHHH Confidence 44444332221 110 0000000000011223333444444455542221110000000 0 0 0 011234 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .|+.|+++...+.+++....-..+++|.....|.+-..-......+. ..+..+..++|.|.+|+.+ |+... T Consensus 164 ~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~v---ps~~~--- 234 (319) T protein:vir:97 164 QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKV---PTKLL--- 234 (319) T ss_pred HHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEe---ccccc--- Confidence 58889999988877665432235688988888755432222111110 1234566789999999853 33221 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) .+..+++|..+ ..+...+--.+++.+... . .| --.++...++|..|.++++...+..+.+++++.+.+- T Consensus 235 -k~in~i~~h~~-A~~~~~k~~~~~~~~p~~-~-----~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:97 235 -QGLQAIAVVGE-VLASPIQADLAKTNSNIP-G-----MF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred -ccceEEEEcCC-eeeeeeeeeeeeccCCCc-c-----cc---ceeeeeeeeeeeEEeccccceEEEeecCCcccCCCcc Confidence 22335556543 333333433344432110 0 11 2467888999999999998877776655555544443 No 173 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=280 Identities=8% Similarity=-0.052 Sum_probs=134.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhh--hc--ceeecCCCceEEEEEeCCceeEEe-ecccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAK--LS--PEQPTIFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~--l~--~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~~~v 75 (315) .|+-+.+...+..-+-++.. ++.+.....+-. ++ .....+++.++||+.....-..+- ..+-....-+.+.... T Consensus 19 ~~~~~~~~nt~~l~~k~~~~-LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~ 97 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGI-LERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTY 97 (319) T ss_pred hhccCCCcchHHHHHHHHHH-HHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEE Confidence 66777777777777777664 444444333221 12 245566788999998764333331 1221111222333344 Q ss_pred EEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccc Q lcl|NC_018838. 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDS 155 (315) Q Consensus 76 ~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) +|.-.|.-.+. |-. +...+.+........+.+.....+.-.+|.-.+...-...+.. . . . ...... T Consensus 98 tidqdR~~~F~-VD~--~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------~-~---~-~~t~~n 163 (319) T protein:vir:94 98 FLDQEKYWGRF-VDA--LDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------L-T---V-GTGSDA 163 (319) T ss_pred Eeecccccccc-cch--hhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------c-c---c-ccCHHH Confidence 44444332221 110 0000000000011223333444444455542221110000000 0 0 0 011234 Q ss_pred hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCccccccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 235 (315) .|+.|+++...+.+++....-..+++|.....|.+-..-......+. ..+..+..++|.|.+|+.+ |+... T Consensus 164 ~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~v---ps~~~--- 234 (319) T protein:vir:94 164 QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKV---PTKLL--- 234 (319) T ss_pred HHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEe---ccccc--- Confidence 58889999988877665432235688988888755432222111110 1234566789999999853 33221 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) .+..+++|..+ ..+...+--.+++.+... . .| --.++...++|..|.++++...+..+.+++++.+.+- T Consensus 235 -k~in~i~~h~~-A~~~~~k~~~~~~~~p~~-~-----~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:94 235 -QGLQAIAVVGE-VLASPIQADLAKTNSNIP-G-----MF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred -ccceEEEEcCC-eeeeeeeeeeeeccCCCc-c-----cc---ceeeeeeeeeeeEEeccccceEEEeecCCcccCCCcc Confidence 22335556543 333333433344432110 0 11 2467888999999999998877776655555544443 No 174 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.82 E-value=1.6e-05 Score=46.92 Aligned_cols=228 Identities=11% Similarity=0.025 Sum_probs=133.6 Q ss_pred CCCCccCCCceEc--------chh-HHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGKLEL--------PGS-MIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~v--------P~~-~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |++- ..+-.++ |.. +...|+|.+.+.+.|....+++.-..+. -...+.++-|+++|..=++..++++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 6653 2222222 322 4567999999999998888887643222 34567788899999999999999999 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccc-------------------- Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDP-------------------- 130 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~-------------------- 130 (315) ++.+++-..+-+++.+.|.+.+..... + ...++....+.+.+++...+...+|||+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~G-n-~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNG-N-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcC-C-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998876643 2 233445556677888888888889988521 Q ss_pred --------cccccccccccc---------cc--cc------------------c-------------------------- Q lcl|NC_018838. 131 --------ATGKPAAAVKVS---------LD--KT------------------T-------------------------- 147 (315) Q Consensus 131 --------~~~~~~~~~~~~---------~~--~~------------------~-------------------------- 147 (315) +++...+++.-. .. .. + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:10 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 011111010000 00 00 0 Q ss_pred -ccccc-----ccchhHHHHHHH-HHhh--hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCcccc Q lcl|NC_018838. 148 -KTVDA-----TDSATTDLVKAV-GLIA--GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWR 217 (315) Q Consensus 148 -~~~~~-----~~~~~~di~~~~-~~~~--~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~ 217 (315) .-++. ...+-.|+.+++ .+.. ++.......|.||.+....|++.....++. +.+..+-..| ..-.+. T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~---~~~~~~~~~g~~~t~~~ 313 (331) T protein:vir:10 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA---STLTMEEIAGKKVVAFD 313 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce---eeeeeeecCCcceeEEC Confidence 00000 011112232222 2221 222333456999999999998875444321 1111111122 234689 Q ss_pred ceeeEeecccCccccccccccceEE Q lcl|NC_018838. 218 GLNVGASSTVSGAPEMSPASGVKAI 242 (315) Q Consensus 218 G~Pv~~s~~v~~~~~~~~~~~~~~~ 242 (315) |.||..++.+-.. ...++ T Consensus 314 gipir~~dai~~t-------E~~Vv 331 (331) T protein:vir:10 314 GIPCRRTDALLLT-------EARVV 331 (331) T ss_pred CeeEEEeeeeecC-------ccccC Confidence 9999988877531 11122 No 175 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.82 E-value=1.6e-05 Score=46.92 Aligned_cols=228 Identities=11% Similarity=0.025 Sum_probs=133.6 Q ss_pred CCCCccCCCceEc--------chh-HHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGKLEL--------PGS-MIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~v--------P~~-~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |++- ..+-.++ |.. +...|+|.+.+.+.|....+++.-..+. -...+.++-|+++|..=++..++++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:98 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 6653 2222222 322 4567999999999998888887643222 34567788899999999999999999 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccc-------------------- Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDP-------------------- 130 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~-------------------- 130 (315) ++.+++-..+-+++.+.|.+.+..... + ...++....+.+.+++...+...+|||+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~G-n-~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNG-N-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcC-C-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998876643 2 233445556677888888888889988521 Q ss_pred --------cccccccccccc---------cc--cc------------------c-------------------------- Q lcl|NC_018838. 131 --------ATGKPAAAVKVS---------LD--KT------------------T-------------------------- 147 (315) Q Consensus 131 --------~~~~~~~~~~~~---------~~--~~------------------~-------------------------- 147 (315) +++...+++.-. .. .. + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:98 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 011111010000 00 00 0 Q ss_pred -ccccc-----ccchhHHHHHHH-HHhh--hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCcccc Q lcl|NC_018838. 148 -KTVDA-----TDSATTDLVKAV-GLIA--GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWR 217 (315) Q Consensus 148 -~~~~~-----~~~~~~di~~~~-~~~~--~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~ 217 (315) .-++. ...+-.|+.+++ .+.. ++.......|.||.+....|++.....++. +.+..+-..| ..-.+. T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~---~~~~~~~~~g~~~t~~~ 313 (331) T protein:vir:98 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA---STLTMEEIAGKKVVAFD 313 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce---eeeeeeecCCcceeEEC Confidence 00000 011112232222 2221 222333456999999999998875444321 1111111122 234689 Q ss_pred ceeeEeecccCccccccccccceEE Q lcl|NC_018838. 218 GLNVGASSTVSGAPEMSPASGVKAI 242 (315) Q Consensus 218 G~Pv~~s~~v~~~~~~~~~~~~~~~ 242 (315) |.||..++.+-.. ...++ T Consensus 314 gipir~~dai~~t-------E~~Vv 331 (331) T protein:vir:98 314 GIPCRRTDALLLT-------EARVV 331 (331) T ss_pred CeeEEEeeeeecC-------ccccC Confidence 9999988877531 11122 No 176 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.82 E-value=1.6e-05 Score=46.92 Aligned_cols=228 Identities=11% Similarity=0.025 Sum_probs=133.6 Q ss_pred CCCCccCCCceEc--------chh-HHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGKLEL--------PGS-MIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~v--------P~~-~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) |++- ..+-.++ |.. +...|+|.+.+.+.|....+++.-..+. -...+.++-|+++|..=++..++++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 6653 2222222 322 4567999999999998888887643222 34567788899999999999999999 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccc-------------------- Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDP-------------------- 130 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~-------------------- 130 (315) ++.+++-..+-+++.+.|.+.+..... + ...++....+.+.+++...+...+|||+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~G-n-~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNG-N-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcC-C-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998876643 2 233445556677888888888889988521 Q ss_pred --------cccccccccccc---------cc--cc------------------c-------------------------- Q lcl|NC_018838. 131 --------ATGKPAAAVKVS---------LD--KT------------------T-------------------------- 147 (315) Q Consensus 131 --------~~~~~~~~~~~~---------~~--~~------------------~-------------------------- 147 (315) +++...+++.-. .. .. + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:10 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 011111010000 00 00 0 Q ss_pred -ccccc-----ccchhHHHHHHH-HHhh--hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccC-CCcccc Q lcl|NC_018838. 148 -KTVDA-----TDSATTDLVKAV-GLIA--GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNWR 217 (315) Q Consensus 148 -~~~~~-----~~~~~~di~~~~-~~~~--~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~-~~~~l~ 217 (315) .-++. ...+-.|+.+++ .+.. ++.......|.||.+....|++.....++. +.+..+-..| ..-.+. T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~---~~~~~~~~~g~~~t~~~ 313 (331) T protein:vir:10 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA---STLTMEEIAGKKVVAFD 313 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce---eeeeeeecCCcceeEEC Confidence 00000 011112232222 2221 222333456999999999998875444321 1111111122 234689 Q ss_pred ceeeEeecccCccccccccccceEE Q lcl|NC_018838. 218 GLNVGASSTVSGAPEMSPASGVKAI 242 (315) Q Consensus 218 G~Pv~~s~~v~~~~~~~~~~~~~~~ 242 (315) |.||..++.+-.. ...++ T Consensus 314 gipir~~dai~~t-------E~~Vv 331 (331) T protein:vir:10 314 GIPCRRTDALLLT-------EARVV 331 (331) T ss_pred CeeEEEeeeeecC-------ccccC Confidence 9999988877531 11122 No 177 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.78 E-value=2e-05 Score=46.40 Aligned_cols=281 Identities=8% Similarity=-0.049 Sum_probs=128.8 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhh-hc--ceeecCCCceEEEEEeCCceeEEe-ecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAK-LS--PEQPTIFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~-l~--~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~~~v~ 76 (315) .++-+..=+.+..-+-++..+-+.+...+.-.. ++ .....+++.++||+.....-..+- ..+-....-+.++...+ T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~t 109 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNATNEFDHPQIQETTYF 109 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCCCccccccccceeEEE Confidence 455555555555555666655555544332221 12 245567788999998654333332 22211112223344445 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccccch Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSA 156 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) |.-.|.-.+. |-. +...+.+........+.+.....+.-.+|.-.+.-.-...+. .. . .. ...... T Consensus 110 idqdR~~~F~-VD~--~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~------~~-~---~~-~t~~na 175 (329) T protein:vir:10 110 LDQEKYWGRF-VDA--LDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK------HL-T---VG-SGADAQ 175 (329) T ss_pred eecccceeee-cch--hhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc------cc-c---cc-cCHHHH Confidence 5544433222 110 000000100011222334444455555564322111000000 00 0 01 112345 Q ss_pred hHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccccccc Q lcl|NC_018838. 157 TTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPA 236 (315) Q Consensus 157 ~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 236 (315) |+.++++...+.+++....-..+++|.....|.+...-....... -.....+..++|.|.+|+.++ +... T Consensus 176 y~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~---~~~~~~g~Vg~idG~~Ii~vp---s~~~---- 245 (329) T protein:vir:10 176 YDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNR---QQVLGKGVQGELDGFTIVKVP---SKML---- 245 (329) T ss_pred HHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhcccccc---ccceeeeeeeeecCeEEEEec---CCcc---- Confidence 888888888887665432223567898888876532111111000 012345566889999999543 2221 Q ss_pred ccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec-cCCCCCCCCCC Q lcl|NC_018838. 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK-AAPKPNPPAGN 315 (315) Q Consensus 237 ~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~-~a~~~~~~~~~ 315 (315) .+.-++++..+ ..+...+--.+++.+... . ++--.++...++|..|.++++...+... .+++..+.+++ T Consensus 246 k~in~ii~~~~-A~~~~~K~~~~~~~~p~~-~--------~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~~~ 315 (329) T protein:vir:10 246 QGVEAMAVIGE-VMASPIQANEAKLNSNVP-G--------MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETNRDGVD 315 (329) T ss_pred cceeEEEEcCC-ceeeeeeeeeeeeeCCCC-c--------cchheeeeeeeeeeEEEccccCEEEEecccCcccCCCCCC Confidence 12234555543 333333333444433210 0 1124678889999999999977665533 23333333333 No 178 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.64 E-value=3.4e-05 Score=45.07 Aligned_cols=278 Identities=12% Similarity=-0.029 Sum_probs=128.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceeec----C-CCceEEEEEeCCceeEEeecccccCCCccceeeE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----I-FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAF 75 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~----~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v 75 (315) |+. ....++-|+.++.++++.+++..++.+++.+-.- . +..++||+... .-+.++..+...+.+-..+ T Consensus 1 m~~---~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~dg~~~~~~~~te~~v 73 (418) T protein:vir:10 1 MAV---QDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSASGRTLVKQPMVDQTI 73 (418) T ss_pred CCc---cccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecccCCccccccccceE Confidence 765 2346677999999999999999999888755221 2 24688887432 2233455555555554555 Q ss_pred EEee-EEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccccc Q lcl|NC_018838. 76 TAQP-IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATD 154 (315) Q Consensus 76 ~l~~-~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (315) ++.- +..+..+.++.+=+.++..+ +.+.+.+...+++++.+|..++.-. .+. ...... .. ... T Consensus 74 ~l~id~~k~~~~~itD~e~a~~~~d----~~~~~l~~A~~aLA~~vD~~ia~l~---~~a-----~~~~gt--~g--t~~ 137 (418) T protein:vir:10 74 PFKIAYQEHVGLEYTVKDKTLDIMQ----FSERYLKSGMVQIANQIDRSLALTL---KKA-----FHSSGT--PG--VRP 137 (418) T ss_pred EEEEecccccceeechHHHhhhhhH----HHHHHHHHHHHHHHHHHHHHHHHHH---hhc-----cccccc--CC--cCc Confidence 5443 22234555666544333333 4455566678888888887654211 000 011000 00 122 Q ss_pred chhHHHHHHHHHhhhcccccc-eEE-EEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc Q lcl|NC_018838. 155 SATTDLVKAVGLIAGAGLQVP-NGV-ALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 155 ~~~~di~~~~~~~~~~~~~~~-~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) ..|.++.++-..+..++.... ..| +++|.....|.+ +.... ......-..+..|..++|.|+.|+.++++|.... T Consensus 138 ~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~~~~-~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta 214 (418) T protein:vir:10 138 GAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSD--EVTKL-FKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV 214 (418) T ss_pred chHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhh--hcccc-ccccccchhhheeeeeeeeceEEEEecCCCcccc Confidence 358888888877766654433 244 689988776643 22211 1111111124466778999999999999995332 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCc-cccchhhhhcCcEEEEE---EEEeccE-eecccceEEEeecc-- Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP-DQTGRDLKGHNEVMVRA---EAVLYVA-IESLDSFAVVKEKA-- 305 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~-~~~~~~~f~~~~v~~r~---~~r~~~~-v~~~~af~~l~~~~-- 305 (315) +.......+.|=. ..+..+.+.-.... .+. +-.-+.+.|-. ..++... ..++.-|++..... T Consensus 215 -g~~~~t~~v~ga~-------~~~~~~~~~~~t~s~~g~---l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~ 283 (418) T protein:vir:10 215 -GDHGGTPLVNGTV-------VNGDTVGFDGGTASTTGF---LKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTD 283 (418) T ss_pred -cccccceeeeccc-------ccceeEEEeecceeeccc---eeeccEEEECceeecccccccccccceEEEEEeecccc Confidence 2111112222221 11111111100000 000 00000011100 0000000 01222222221110 Q ss_pred ---------CCC----------------CCCCCCC Q lcl|NC_018838. 306 ---------APK----------------PNPPAGN 315 (315) Q Consensus 306 ---------a~~----------------~~~~~~~ 315 (315) .|+ +..+..+ T Consensus 284 ~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~ 318 (418) T protein:vir:10 284 AGGAGSIKISPSLNDGTATINNENGDPVSLTAYQN 318 (418) T ss_pred ccCcceeEeccccccccccccccccccccccCCCc Confidence 000 0001111 No 179 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.56 E-value=4.5e-05 Score=44.40 Aligned_cols=272 Identities=11% Similarity=0.050 Sum_probs=132.6 Q ss_pred CCCCccCCCceEcchhHHHHH----HHHHHh-ccchhhhcceeecC-CCceEEEEEeCCceeEEeecccccCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAV----RDRAID-SGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~i----i~~~~~-~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~ 74 (315) ++.+ -+++. -|..+.+-+ ++.-+. ....+++|+....+ ....+..+..+.++..-|.|+.+.......=+. T Consensus 359 ~A~~-hsTsD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~ 435 (652) T protein:vir:79 359 AAFT-HSTSD--FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQ 435 (652) T ss_pred HHhh-cCcch--HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCcc Confidence 2221 01111 233332222 222221 22467777765544 233455666677888889999998776554456 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh---hhcccccccccccccccccccccccccc Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI---AFHGIDPATGKPAAAVKVSLDKTTKTVD 151 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a---~~~G~g~~~~~~~~~~~~~~~~~~~~~~ 151 (315) .++...++|.++.||+|.+-.+..+. + ..|-..++++.++.++.. +|.++ |.-......+.. .....+..+ T Consensus 436 e~~~l~tyG~~~~iTRqaiINDDL~a---~-~~ip~~~g~aA~~~~~~~vy~~l~~N-p~~~~DGk~LF~-hA~H~Nl~~ 509 (652) T protein:vir:79 436 ATIALATYGELFSITRQAIINDDLNM---L-TDVPMKLGRAAKSTIADLVYAILTSN-PKISTDNVSLFD-KAKHANVLE 509 (652) T ss_pred ceeeeecccCeeeeehheeeccchhH---H-HHHHHHHHHHHHHHHHHHHHHHHhcC-cccccCCceeec-ccccccccc Confidence 68889999999999999997654443 3 335566666666666553 33332 210000001110 001111111 Q ss_pred cccchhHHHHHHHHHh---hhccc---ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccce-eeEee Q lcl|NC_018838. 152 ATDSATTDLVKAVGLI---AGAGL---QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGL-NVGAS 224 (315) Q Consensus 152 ~~~~~~~di~~~~~~~---~~~~~---~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~-Pv~~s 224 (315) .+...-+.+.++..+. .+.+. -.|.-|+..+......+.+.-+... .+ .+.+.+..+.+.|+ .++++ T Consensus 510 ~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v--~~----a~~~~~~~Np~~~~~~~i~e 583 (652) T protein:vir:79 510 SAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSV--KG----ADINAGIINPVKDFATVIAE 583 (652) T ss_pred cccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCC--cc----cccccccccccccccccccc Confidence 1112222333333322 22111 1122255555555555444322111 00 01122233445554 55556 Q ss_pred cccCccccccccccceEEEeccc---ceEEEeeccc---eEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccce Q lcl|NC_018838. 225 STVSGAPEMSPASGVKAIVGDFS---RVHWGFQRNF---PIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSF 298 (315) Q Consensus 225 ~~v~~~~~~~~~~~~~~~~gDf~---~~~i~~~~~~---~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af 298 (315) ..+..+.. . ..|+++-. -+.+++-.|. .++..+ -|..+-+.+|+...+|.+++|=-.+ T Consensus 584 prL~~~s~-----~-~wylaa~~~~dtiev~yL~G~~~P~ie~~~----------gf~~dG~~~kvrlD~G~~~iD~RG~ 647 (652) T protein:vir:79 584 PRLDDNSQ-----T-TFYLAASKGSDTIEVAYLNGVDTPYIDQME----------GFSVDGVTTKVRIDAGVAPVDHRGL 647 (652) T ss_pred cccCCCCc-----c-cEEEecCCCCCeEEEEEecCCCCCeeeecC----------CCCcceEEEEEEEeccCceeeccce Confidence 65543211 1 12233211 1233332222 222211 2889999999999999999999998 Q ss_pred EEEee Q lcl|NC_018838. 299 AVVKE 303 (315) Q Consensus 299 ~~l~~ 303 (315) +|.+. T Consensus 648 ~k~t~ 652 (652) T protein:vir:79 648 VKCTA 652 (652) T ss_pred eeecC Confidence 88876 No 180 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.56 E-value=2.6e-05 Score=45.72 Aligned_cols=230 Identities=10% Similarity=-0.017 Sum_probs=131.9 Q ss_pred CCCCccC------CCceEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcccee Q lcl|NC_018838. 1 MADDFLS------AGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~s------~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~ 73 (315) |++=... ....+-|......|||.+.+.+.|.+..++..-.... -...+.++-|+++|..=++..++++.++. T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQTV 80 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceEE Confidence 5443211 1222345566677999999999998887776422111 22345677789999998999999999999 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---------------------- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA---------------------- 131 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~---------------------- 131 (315) +++...+-+++.+.|-+.+..... + ...++......+.+++...+...+|||+-.. T Consensus 81 qvt~~l~ilgg~~eVDr~La~~~G-n-~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~ 158 (335) T protein:vir:73 81 PVTDTTGMLYDLGFVDKALADRSN-N-AAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAAS 158 (335) T ss_pred EEEEEEEEecchhhhhHHHHhhcC-C-HHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCc Confidence 999999999999999987765543 3 2334555566678888888999999984210 Q ss_pred ---------ccccccccccc---------ccc-c-------------------cc------------------------- Q lcl|NC_018838. 132 ---------TGKPAAAVKVS---------LDK-T-------------------TK------------------------- 148 (315) Q Consensus 132 ---------~~~~~~~~~~~---------~~~-~-------------------~~------------------------- 148 (315) ++...+++--. ... . +. T Consensus 159 a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 159 AENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 00000000000 000 0 00 Q ss_pred --ccccc-----cchhHHHHHH-HHHhh----hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCC-cc Q lcl|NC_018838. 149 --TVDAT-----DSATTDLVKA-VGLIA----GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL-DN 215 (315) Q Consensus 149 --~~~~~-----~~~~~di~~~-~~~~~----~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~-~~ 215 (315) -++.+ ...-.+|.++ +.++. +.-......|.||......|++....... .+.-..+ ..+.. -. T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n---~~l~~~~-~~g~~~t~ 314 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKN---VNLTIEE-YGGKKIVS 314 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCc---eeeeeec-cCCceeEE Confidence 00000 0111233333 23332 22222234699999999999887654432 1111112 12222 46 Q ss_pred ccceeeEeecccCccccccccccceEEE Q lcl|NC_018838. 216 WRGLNVGASSTVSGAPEMSPASGVKAIV 243 (315) Q Consensus 216 l~G~Pv~~s~~v~~~~~~~~~~~~~~~~ 243 (315) ++|.||..++.+-..=. .+.. T Consensus 315 ~~gipir~~Dail~tE~-------~v~~ 335 (335) T protein:vir:73 315 FLGIPIRRVDAILNTES-------AVTA 335 (335) T ss_pred ECCeEEEEEeeeecCcc-------cccC Confidence 88999999888753311 1111 No 181 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.49 E-value=3.6e-05 Score=44.93 Aligned_cols=229 Identities=11% Similarity=0.031 Sum_probs=133.1 Q ss_pred CC---CCccC--C-CceEcchhHHHHHHHHHHhccchhhhcceeecCCCc-eEEEEEeCCceeEEeecccccCCCcccee Q lcl|NC_018838. 1 MA---DDFLS--A-GKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVFSGVPRAKIVGEGEVKPSASVDVS 73 (315) Q Consensus 1 m~---~~~~s--~-Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~ 73 (315) |+ .+..+ . ...+-|......|||.+.+.+.|.+..++..-.... -...+.++-|+++|..=++..++++.++. T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceEE Confidence 44 33222 1 234556677778999999999998887776422111 22345677789999998999999999999 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---------------------- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA---------------------- 131 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~---------------------- 131 (315) +++...+-+++.+.|-+.+..... +. ..++....+.+.+++...+...+|||+-.. T Consensus 81 qvt~~l~ilgg~~eVDr~la~~~G-n~-a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~q 158 (330) T protein:vir:10 81 QVTDNCGMLEAYAEVDKALADLNG-NT-AAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) T ss_pred EEEEEeEEecchhhhhhHHHhhcC-CH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhh Confidence 999999999999999998875543 32 334555667788888888999999984210 Q ss_pred ------ccccccccccc-----------ccc----------c--c-cccccc---------------------------- Q lcl|NC_018838. 132 ------TGKPAAAVKVS-----------LDK----------T--T-KTVDAT---------------------------- 153 (315) Q Consensus 132 ------~~~~~~~~~~~-----------~~~----------~--~-~~~~~~---------------------------- 153 (315) +|...+++--. .-. . + ...++. T Consensus 159 vIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence 00010000000 000 0 0 000000 Q ss_pred ---------cc-hhHHHHHHH-HHhh--hcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC-Cccccce Q lcl|NC_018838. 154 ---------DS-ATTDLVKAV-GLIA--GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG-LDNWRGL 219 (315) Q Consensus 154 ---------~~-~~~di~~~~-~~~~--~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~-~~~l~G~ 219 (315) .. ...++++++ .+.. ++.......|.||......|++....... .+.-+.+. .+. .-.+.|. T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n---~~l~~~~~-~g~~~t~~~gi 314 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIA---NNLTWETV-SGERVMTFDGI 314 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhccc---ceeeeeec-CCeeeEEECCe Confidence 00 111333332 1211 22223345699999999999987433321 11111121 222 2579999 Q ss_pred eeEeecccCccccccccccceEE Q lcl|NC_018838. 220 NVGASSTVSGAPEMSPASGVKAI 242 (315) Q Consensus 220 Pv~~s~~v~~~~~~~~~~~~~~~ 242 (315) ||..++.+-.. ...++ T Consensus 315 pir~~Dail~t-------E~~vv 330 (330) T protein:vir:10 315 PVQRTDALLNT-------ESRVV 330 (330) T ss_pred EEEEEeeeecC-------ccccC Confidence 99998877532 11222 No 182 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.42 E-value=6.8e-05 Score=43.43 Aligned_cols=272 Identities=11% Similarity=0.055 Sum_probs=129.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHH-----hccchhhhcceeecC-CCceEEEEEeCCceeEEeecccccCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAI-----DSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~-----~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~ 74 (315) ++.+- +++. -|-.+.+-+-+.++ .....+..|+....+ ....+..+..+.++..-|.|+.+.......=+. T Consensus 394 ~a~~h-tTSD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~ 470 (693) T protein:vir:95 394 LAFTH-TSSD--FGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERG 470 (693) T ss_pred HHHhc-Ccch--hHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCcc Confidence 22210 1111 22222222212222 123456666654433 233344445555666778888887654443334 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHh---hhcccccccccccccccccccccccccc Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLI---AFHGIDPATGKPAAAVKVSLDKTTKTVD 151 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a---~~~G~g~~~~~~~~~~~~~~~~~~~~~~ 151 (315) -++...++|.++.||+|.+-.+..+ .+ ..|-..++++.++.++.- +|.++ |. -.....+. .....+-.+ T Consensus 471 e~~~l~tyG~~~~iTRqaiINDDLg---a~-~~ip~~~g~aA~~~~~~~vy~~L~~N-p~-m~DGk~LF--hadH~Nl~t 542 (693) T protein:vir:95 471 EQIILATYGELFSITRQAIINDDLQ---ML-SDIPFKLGQAAKATIGDLVYAVLTGN-PA-MSDGKTLF--HADHSNLLT 542 (693) T ss_pred ceeehhhcCCeeeecHHhhhccchH---HH-HHHHHHHHHHHHHHHHHHHHHHHhcC-cc-ccCCccee--ecccccccc Confidence 5677888999999999999764433 33 335566777776666653 33332 11 00000111 011111111 Q ss_pred --cccchhHHHHHHHHHhhhc-------ccc----cceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccc Q lcl|NC_018838. 152 --ATDSATTDLVKAVGLIAGA-------GLQ----VPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRG 218 (315) Q Consensus 152 --~~~~~~~di~~~~~~~~~~-------~~~----~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G 218 (315) +.....+.+.++..++... ... .|.-|+..+......+.+..+...+. . +.+.+..+-+.| T Consensus 543 ga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~--a----~~~~~~~NP~~~ 616 (693) T protein:vir:95 543 GAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPG--A----DVNSGIVNPIRA 616 (693) T ss_pred ccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccc--c----ccccccccchhc Confidence 1112223333333332111 111 22235555555555555543322110 0 112222344555 Q ss_pred e-eeEeecccCccccccccccceEEEeccc--ceEEEeeccce---EEEeccCCccccchhhhhcCcEEEEEEEEeccEe Q lcl|NC_018838. 219 L-NVGASSTVSGAPEMSPASGVKAIVGDFS--RVHWGFQRNFP---IELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI 292 (315) Q Consensus 219 ~-Pv~~s~~v~~~~~~~~~~~~~~~~gDf~--~~~i~~~~~~~---v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v 292 (315) + .|+++..+.... .....++.|.. -+.+++-.|.+ ++..+ -|..|-+.+|+...+|.++ T Consensus 617 ~~~vi~~prL~~~s-----~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~----------gf~~dG~~~kvr~D~G~~~ 681 (693) T protein:vir:95 617 FAQVIGEPRLDDAS-----ATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQE----------GFTVDGVASKVRIDAGVAP 681 (693) T ss_pred cccccccceecCCC-----CCceEEecCCCCCeEEEEEecCCCCCeEeecC----------CCCcceEEEEEEEeccCce Confidence 4 555555553211 11234445532 13333333322 22222 2889999999999999999 Q ss_pred ecccceEEEeec Q lcl|NC_018838. 293 ESLDSFAVVKEK 304 (315) Q Consensus 293 ~~~~af~~l~~~ 304 (315) +|=-++.|-.++ T Consensus 682 iD~Rg~~kn~GA 693 (693) T protein:vir:95 682 LDFRGLQKSNGA 693 (693) T ss_pred eeccccccCCCC Confidence 999998888777 No 183 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.38 E-value=1.2e-05 Score=47.60 Aligned_cols=274 Identities=12% Similarity=0.003 Sum_probs=138.1 Q ss_pred CCCCccCCCce-------EcchhHHH----HHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccC Q lcl|NC_018838. 1 MADDFLSAGKL-------ELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~~s~Gg~-------~vP~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) |+......++. -||..+.+ .+++.+.+......|..+...+. ..+.+++....+.+.+.+-+++.| T Consensus 31 ~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P 110 (336) T protein:vir:36 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) T ss_pred hhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCc Confidence 44433222222 25554443 23334444444444544443332 234667766677888999999999 Q ss_pred CCccceeeEEEeeEEEEEeehhh-HHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc-- Q lcl|NC_018838. 67 SASVDVSAFTAQPIKVVTQQRVS-DEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL-- 143 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl~~~~~iS-~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~-- 143 (315) ..+......+-..+.++....++ .|+-+.. ..-.+|.+.-+...++++.+.+++-.++|+... ...|+.+.. T Consensus 111 ~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa--~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~---~~yGllNdP~l 185 (336) T protein:vir:36 111 DSGANINYPQRQSYFFQTWTRWGERELEMAG--AGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPSL 185 (336) T ss_pred eeecccceeeeeEEEEEeeeeeCHHHHHHHH--HhCCCcHHHHHHHHHHHHHHhhCcEEEEecccc---ceEEEEecCCC Confidence 98866666666677777777787 5554432 223345566777888889999998888887532 223443321 Q ss_pred ----cccccc--cccccchhHHHHHHHHHhhhccc-----ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 144 ----DKTTKT--VDATDSATTDLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 144 ----~~~~~~--~~~~~~~~~di~~~~~~~~~~~~-----~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) +..+.- .+.....++|+.+++.++..... ..+..++|.+.....|..- ...|.-+. .-+.. T Consensus 186 ~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl-----~~lk~-- 257 (336) T protein:vir:36 186 SAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAA-----AKLKD-- 257 (336) T ss_pred ccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCC-CccCccHH-----HHHHH-- Confidence 111111 11113346789999888865332 2355688888877777432 11121110 00110 Q ss_pred CccccceeeEeecccCccccccccccceEEEecccc---eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec Q lcl|NC_018838. 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSR---VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY 289 (315) Q Consensus 213 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~---~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~ 289 (315) .+-++.++- +|...+.+ +.....++-+... ..+...+.++. ... . .+.-....-+..|.+ T Consensus 258 --n~Pnl~i~t---~pEl~~a~-g~~~~l~~~~~~~~~t~~~~~p~~~~~--l~v--q-------~~~~~~~v~~~~rt~ 320 (336) T protein:vir:36 258 --IFPKLEFVT---IPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA--HSI--E-------RYSSYFRQKKSAGTW 320 (336) T ss_pred --hcCccEEEE---ccccccCC-CceEEEEEEecCCCcceeeecchhhhc--cce--e-------ecCceeEecccccee Confidence 111222332 33322222 2222222211111 11111111110 000 0 011123445566666 Q ss_pred c-EeecccceEEEeec Q lcl|NC_018838. 290 V-AIESLDSFAVVKEK 304 (315) Q Consensus 290 ~-~v~~~~af~~l~~~ 304 (315) + .+.+|-||+++++. T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 6 56899999999999 No 184 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.30 E-value=0.0001 Score=42.49 Aligned_cols=281 Identities=12% Similarity=0.032 Sum_probs=119.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcceee-----c--CCCceEEEEEeCCce---eEEeecccccCCCcc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQP-----T--IFGPVKGAVFSGVPR---AKIVGEGEVKPSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~-----~--~~~~~~ip~~~~~~~---a~wv~Eg~~~~~s~~ 70 (315) ||++..+ ++|+.++.++++.+++..++.+++.+-. . .+..++||+-..... ..+-..+. ...+. T Consensus 1 MANsl~~----l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~--~~~~l 74 (423) T protein:vir:10 1 MANNLDA----NVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGK--SKNSL 74 (423) T ss_pred Ccccccc----ccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcc--ccccc Confidence 9866332 8999999999999999999998875522 1 134567776432211 11111111 11122 Q ss_pred cee--eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccc Q lcl|NC_018838. 71 DVS--AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTK 148 (315) Q Consensus 71 ~~~--~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~ 148 (315) .-+ .+++..+|... +.++.+=+..+..+ ++++++.. .++++..+|..+..... ......+. .... T Consensus 75 ~e~~v~l~id~~k~~a-~~v~d~E~~l~i~~----~~~~l~~A-~~aLA~~vd~~ia~~~~---~~~~~~vg----t~~t 141 (423) T protein:vir:10 75 ISAKATGEVGNYITVA-VEYRQIEEALKLNQ----LDQILVPI-NERMVTDLETELALFMM---KHGALSLG----SPNT 141 (423) T ss_pred ccceEEEEecceeeee-eeeChHHHhcChhH----HHHHHHHH-HHHHHHHHHHHHHHHhh---hccccccc----cccc Confidence 222 34555555533 44544433344444 34444333 57788888876642221 00111111 1001 Q ss_pred ccccccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHH-H---hhccCccccccccccccccCC-CccccceeeE Q lcl|NC_018838. 149 TVDATDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALST-E---VYPKGSPLAGQPMYPAAGFAG-LDNWRGLNVG 222 (315) Q Consensus 149 ~~~~~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~-l---~d~~g~~~~~~~~~~~~~~~~-~~~l~G~Pv~ 222 (315) ....|.++.++-..|...+...... .+++|.....|.+ + ...+... -..+..++ .+++.|+.++ T Consensus 142 ----~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~------~~alr~~~i~G~~~GFdi~ 211 (423) T protein:vir:10 142 ----PIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLV------RTAWENAQISGNFGGIRAL 211 (423) T ss_pred ----ccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccc------hHHHHhcccceeecceEEE Confidence 1124778887777776655444344 5788988777642 2 2211110 01123343 3789999999 Q ss_pred eecccCcccccccc-----ccceEEEec-------ccceEEEeecc--ceEEEeccCCcccc-chhhhhcC--------- Q lcl|NC_018838. 223 ASSTVSGAPEMSPA-----SGVKAIVGD-------FSRVHWGFQRN--FPIELIEYGDPDQT-GRDLKGHN--------- 278 (315) Q Consensus 223 ~s~~v~~~~~~~~~-----~~~~~~~gD-------f~~~~i~~~~~--~~v~~~~~~~~~~~-~~~~f~~~--------- 278 (315) .|+.+|.-.....+ +....+-|+ +....++.-.. -.+..-+.-+..+. .++-..+. T Consensus 212 ~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~ 291 (423) T protein:vir:10 212 MSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASAL 291 (423) T ss_pred EecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCc Confidence 99999843211100 000111111 00000000000 00111111111110 00100000 Q ss_pred cEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 279 ~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) .-.|++.. +....-..++. | +-.|++.++.++ T Consensus 292 ~~~~~V~~--~~~~~a~~~~t-v--~i~p~~~~~~~~ 323 (423) T protein:vir:10 292 SFTATVME--DANAHSSGDVT-V--KISGVPIFDAGY 323 (423) T ss_pred ceEEEEEe--cccccccCceE-E--EeccccccccCc Confidence 00011100 00001112221 1 222334433333 No 185 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.23 E-value=5.5e-05 Score=43.93 Aligned_cols=273 Identities=12% Similarity=-0.016 Sum_probs=142.4 Q ss_pred CCCCccCCC-------ceEcc----hhHHHHHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccC Q lcl|NC_018838. 1 MADDFLSAG-------KLELP----GSMIGAVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~~s~G-------g~~vP----~~~~~~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) |+......+ -..|| +-+..+|++...+.-..+.+..+.+.+. ..+.++..+..+.+.|.+.+++.| T Consensus 35 ~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~P 114 (339) T protein:vir:94 35 YAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANG 114 (339) T ss_pred hhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCC Confidence 333322111 12233 2334666677777778888887776653 346888888888999999999998 Q ss_pred CCc--cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccc Q lcl|NC_018838. 67 SAS--VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLD 144 (315) Q Consensus 67 ~s~--~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~ 144 (315) ..+ .++.+.++..+.++-... ..|+-+.. ..-..|.+.-++..++++.+.+|+-.++|... ....|+.+... T Consensus 115 l~~~~v~~~~~~v~~~~~g~~y~-~~E~~~A~--~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~---~~~~GLlN~P~ 188 (339) T protein:vir:94 115 MSKANVNFESRQNYRYQTWTEYG-DLEMATYG--EAGIDYVARQEISASLVMAKFANSSYLLGVAG---IANYGLMNDPS 188 (339) T ss_pred cccccceeeEEeEEEEEEEEeec-HHHHHHHH--hhCCChHHHHHHHHHHHHHHhhceEEeeeecc---cceEEEEeCCC Confidence 776 567777766666655443 34443322 11233556667888889999999999999743 23344444321 Q ss_pred c------ccccccc-ccchhHHHHHHHHHhhhccc-----ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 145 K------TTKTVDA-TDSATTDLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 145 ~------~~~~~~~-~~~~~~di~~~~~~~~~~~~-----~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) . +++=.+. ....++|+.+++.++...-. ..+..++|.++....|... ...|.- +..-+.. T Consensus 189 l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~T-----vl~~lk~-- 260 (339) T protein:vir:94 189 LPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLS-----AGAKIAQ-- 260 (339) T ss_pred ccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCcc-----HHHHHHH-- Confidence 1 1100011 11235788888888754322 1233588888888877532 221211 1111111 Q ss_pred CccccceeeEeecccCccccccccccceEEEecc---cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec Q lcl|NC_018838. 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDF---SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY 289 (315) Q Consensus 213 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf---~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~ 289 (315) .+-++.++- +|...+.+ +.....+.-.. +...+...+.++ ..+- . .+.-...+-+..|.+ T Consensus 261 --n~pnl~i~~---~~el~~a~-g~~~~~~~~~~~~~~~~~~~~p~~~~--~lpv-q--------~~~~~~~v~~~~rt~ 323 (339) T protein:vir:94 261 --TYPNIQFVA---VPEFDTAS-GRLVQLWVPEVNGQPTGEVAFAEKLR--SHSI-E--------RYSTTTRQKHSGATF 323 (339) T ss_pred --hcCCcEEEE---ccccccCC-CceEEEEEEeccCCcceEEEcchhhh--cccc-E--------EcCceEEecceeeee Confidence 111233442 33332222 22222221111 111222222111 1110 0 011123445666744 Q ss_pred -cEeecccceEEEeec Q lcl|NC_018838. 290 -VAIESLDSFAVVKEK 304 (315) Q Consensus 290 -~~v~~~~af~~l~~~ 304 (315) ..+.+|.||+++++. T Consensus 324 Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 324 GAVIYQPWAVTQELGV 339 (339) T ss_pred eEEEEccceeeeeecC Confidence 467999999999999 No 186 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.07 E-value=4.7e-05 Score=44.29 Aligned_cols=274 Identities=13% Similarity=0.014 Sum_probs=136.3 Q ss_pred CCCCccC-------CCceEcchhHHH----HHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccC Q lcl|NC_018838. 1 MADDFLS-------AGKLELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~~s-------~Gg~~vP~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) |+-.... .+...||..+.+ .+++.+.+-.....|..+...+. ..+.+++....+.+.+.+-+++.| T Consensus 31 ~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P 110 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) T ss_pred hhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCc Confidence 3332211 112235543322 22343444444444544444332 234667766677888999999999 Q ss_pred CCccceeeEEEeeEEEEEeehhh-HHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc-- Q lcl|NC_018838. 67 SASVDVSAFTAQPIKVVTQQRVS-DEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL-- 143 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl~~~~~iS-~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~-- 143 (315) ..+......+-..+.++....++ .|+-+.. ..-.+|.+.-+...++++.+.+++-.++|+... ...|+.+.. T Consensus 111 ~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~--~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~---~~yGllN~P~l 185 (336) T protein:vir:10 111 DSGANINYPQRQSYFFQTWTRWGERELEMAG--AGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPSL 185 (336) T ss_pred eeecccceeeeeEEEEEeeeeeCHHHHHHHH--HhCCCcHHHHHHHHHHHHHHhhCcEEEEecccc---ceEEEEeCCCC Confidence 98866666666677777777788 4443332 223345666778888889999998888887532 223333321 Q ss_pred ----ccccccc--ccccchhHHHHHHHHHhhhccc-----ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 144 ----DKTTKTV--DATDSATTDLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 144 ----~~~~~~~--~~~~~~~~di~~~~~~~~~~~~-----~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) +..+.-- +.....++|+.+++..+..... ..+..++|.+.....|..- ...|.-+. .-+.. T Consensus 186 ~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl-----~~lk~-- 257 (336) T protein:vir:10 186 SAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAA-----AKLKD-- 257 (336) T ss_pred ccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCC-CccCccHH-----HHHHH-- Confidence 1111111 1113346789988888865332 2356788888877776432 11121110 00110 Q ss_pred CccccceeeEeecccCccccccccccceEEEecccc---eEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec Q lcl|NC_018838. 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSR---VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY 289 (315) Q Consensus 213 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~---~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~ 289 (315) .+-++.++- +|...+.+ +.....++-+... ..+...+.++. ... . .+.-....-+..|.+ T Consensus 258 --n~Pnl~i~t---~pEl~~a~-G~~~~l~~~~~~~~~t~~~~~p~~~~~--l~v--q-------~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 258 --IFPKLEFVT---IPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA--HSI--E-------RYSSYFRQKKSAGTW 320 (336) T ss_pred --hcCccEEEE---ccccccCC-CceEEEEEEecCCCcceeeecchhhhc--cce--e-------ecCceeEecccccee Confidence 111222332 33222222 2222222211111 11111111110 100 0 011123445566666 Q ss_pred c-EeecccceEEEeec Q lcl|NC_018838. 290 V-AIESLDSFAVVKEK 304 (315) Q Consensus 290 ~-~v~~~~af~~l~~~ 304 (315) + .+.+|-||+++++. T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 6 56899999999999 No 187 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=97.02 E-value=0.00014 Score=41.65 Aligned_cols=278 Identities=10% Similarity=-0.038 Sum_probs=129.0 Q ss_pred CCCCccCCCc------e-------EcchhH---HHHHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeec Q lcl|NC_018838. 1 MADDFLSAGK------L-------ELPGSM---IGAVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGE 61 (315) Q Consensus 1 m~~~~~s~Gg------~-------~vP~~~---~~~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~E 61 (315) +|..+...|+ - .+|..+ ...+++.+-.-..+..|..+...+. ..+.+++....+.+.+.+- T Consensus 54 ~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd 133 (379) T protein:vir:10 54 FAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTD 133 (379) T ss_pred hhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEecc Confidence 2222211111 0 123322 2344555544444444444443332 2356677777788899999 Q ss_pred ccccCCCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccc Q lcl|NC_018838. 62 GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKV 141 (315) Q Consensus 62 g~~~~~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~ 141 (315) +++.|..+...+...-..+.++..+.++.+=+.... ..-..|.+.-+...++++.+.+|+-.++|.+.. +....|+++ T Consensus 134 ~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa-~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~-~~~~yGllN 211 (379) T protein:vir:10 134 GGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSS-RVQVSSADEKRAMVGEALEVQRNRVAFYGYNDG-SGRTFGFLN 211 (379) T ss_pred ccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHH-HhCCChHHHHHHHHHHHHHHhhceEEEEeecCC-CcceEEEEe Confidence 888888775555444444555555555543232221 223446667788888999999999999996422 222233333 Q ss_pred ccc------ccc---ccccc----ccchhHHHHHHHHHhhhcccc------cceEEEEeHHHHHHHHHHhhccCcccccc Q lcl|NC_018838. 142 SLD------KTT---KTVDA----TDSATTDLVKAVGLIAGAGLQ------VPNGVALDPAFSFALSTEVYPKGSPLAGQ 202 (315) Q Consensus 142 ~~~------~~~---~~~~~----~~~~~~di~~~~~~~~~~~~~------~~~~~~m~~~~~~~L~~l~d~~g~~~~~~ 202 (315) ... ..+ ..... ....++||..++..+...... .+...+|.+.....|..- ...|.-++ T Consensus 212 dP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl-- 288 (379) T protein:vir:10 212 DPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVA-- 288 (379) T ss_pred CCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHH-- Confidence 211 110 01101 112356788888876533221 122577888877777532 11111111 Q ss_pred ccccccccCCCccccceeeEeecccCccccccccccceEEEecc-cce--------EEEeeccceEEEeccCCccccchh Q lcl|NC_018838. 203 PMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDF-SRV--------HWGFQRNFPIELIEYGDPDQTGRD 273 (315) Q Consensus 203 ~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf-~~~--------~i~~~~~~~v~~~~~~~~~~~~~~ 273 (315) .. +.. .+-++.++- +|...+.+.+.+...++.|- ... .......++ ....- T Consensus 289 ~~---lk~----n~Pnl~i~t---~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~--~l~ve-------- 348 (379) T protein:vir:10 289 QY---MRE----SYPNVTFVS---APELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMF--TLGVE-------- 348 (379) T ss_pred HH---HHH----hcCCcEEEE---cccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhh--hccce-------- Confidence 00 110 111233442 23222223333333333331 110 000111110 00000 Q ss_pred hhhcCcEEEEEEEEecc-EeecccceEEEeec Q lcl|NC_018838. 274 LKGHNEVMVRAEAVLYV-AIESLDSFAVVKEK 304 (315) Q Consensus 274 ~f~~~~v~~r~~~r~~~-~v~~~~af~~l~~~ 304 (315) .+.-....-+..|.++ .+.+|.||+++.++ T Consensus 349 -~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 349 -KKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred -ecCceeEeccccceeeeeeecchhhheecCC Confidence 0001122334455555 66899999999999 No 188 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.73 E-value=0.00013 Score=41.81 Aligned_cols=200 Identities=13% Similarity=0.060 Sum_probs=87.7 Q ss_pred EEEeehhhHHHhccC-hhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc--ccccccccc---cccccccccccccccccc Q lcl|NC_018838. 82 VVTQQRVSDEFMWAD-ADYRLGVLQDLISPALGASIGRAVDLIAFHGI--DPATGKPAA---AVKVSLDKTTKTVDATDS 155 (315) Q Consensus 82 l~~~~~iS~ell~~~-~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~--g~~~~~~~~---~~~~~~~~~~~~~~~~~~ 155 (315) +=. .-+|+-++.+= ..-..-.+.+...+++++++++.+|+.++.-. +..+..+.+ +........+.. ..... T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t-~~~~~ 78 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNT-NNAQA 78 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceecccccc-CCHHH Confidence 111 22333333211 01112236677889999999999998875311 100111111 111110011111 11122 Q ss_pred hhHHHHHHHHHhhhcccccceEE-EEeHHHHHHHHHHhhccCccccccccc--cccccC-CCccccceeeEeecccCccc Q lcl|NC_018838. 156 ATTDLVKAVGLIAGAGLQVPNGV-ALDPAFSFALSTEVYPKGSPLAGQPMY--PAAGFA-GLDNWRGLNVGASSTVSGAP 231 (315) Q Consensus 156 ~~~di~~~~~~~~~~~~~~~~~~-~m~~~~~~~L~~l~d~~g~~~~~~~~~--~~~~~~-~~~~l~G~Pv~~s~~v~~~~ 231 (315) .++-+.++...+.+.+......| +++|+.+..|-+..+. +-.+....- -....+ ....+.|++|+.|+++|... T Consensus 79 l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~--~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~ 156 (221) T protein:vir:17 79 IVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDT--NILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLY 156 (221) T ss_pred HHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCc--ceeeeecccccccccccceeeeecCcEEEEeccCCccc Confidence 35667777777766665544555 4589888877432111 111110000 012222 34679999999999999643 Q ss_pred cccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCC Q lcl|NC_018838. 232 EMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNP 311 (315) Q Consensus 232 ~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~ 311 (315) ++ +....-|+|. ........ ++-+ ... .-+.+.|++|+..+|--. |..-| T Consensus 157 gt----~~~~~ag~~~-~~~~~~~~------------------yr~~-----fs~-~~glv~~~~Avgtvkl~~-~~~~~ 206 (221) T protein:vir:17 157 GT----NLVTDPGDAT-TSGENNGS------------------YRPA-----ITD-RAGLVFHKEAADTVEVLL-PPSRP 206 (221) T ss_pred cc----ccccCCcccc-cccccccc------------------cccc-----ccc-eEEEEEcchheeeeeeec-CCCCC Confidence 22 1112223331 00000000 0000 001 114578888888776543 33334 Q ss_pred CCCC Q lcl|NC_018838. 312 PAGN 315 (315) Q Consensus 312 ~~~~ 315 (315) |--- T Consensus 207 ~~~~ 210 (221) T protein:vir:17 207 PLVI 210 (221) T ss_pred ceee Confidence 3222 No 189 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=96.70 E-value=0.00038 Score=39.32 Aligned_cols=281 Identities=10% Similarity=-0.043 Sum_probs=126.8 Q ss_pred CCCCcc-----CCCceEcchhHHHHH----HHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccCCC Q lcl|NC_018838. 1 MADDFL-----SAGKLELPGSMIGAV----RDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~~~~~-----s~Gg~~vP~~~~~~i----i~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) +|..+. +.++.-||-.+.+-| ++.+.+-.....|..+...+. ..+.+++....+.|.+.+-+++.|.. T Consensus 61 ~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~ 140 (382) T protein:vir:96 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) T ss_pred cccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCcc Confidence 222222 122333566655544 444555555555655544332 24577777777889999999988877 Q ss_pred cc--ceeeEEEeeEEEEEeehh-hHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccc Q lcl|NC_018838. 69 SV--DVSAFTAQPIKVVTQQRV-SDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDK 145 (315) Q Consensus 69 ~~--~~~~v~l~~~kl~~~~~i-S~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~ 145 (315) +. ++.+.+..-..++ ..+ ..|+.+.... -..+.+.-+...++++.+.+++-.++|...+......|+.+.... T Consensus 141 d~~~~~~~r~v~~~~~g--~~yg~lE~~rAa~~--~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l 216 (382) T protein:vir:96 141 SWNANFERRTIVRGELG--LLVGTLEEGRASAI--RLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNL 216 (382) T ss_pred ccccceeEEEEEEEEEe--eeecHHHHHHHHhh--CCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCc Confidence 64 4555555444444 444 5666654322 223444557788889999999999999643323333455443221 Q ss_pred c------ccc-cc-cccchhHHHHHHHHHhhhcccc---c---ceEEEEeHHHHHHHHHHhhccCccccccccccccccC Q lcl|NC_018838. 146 T------TKT-VD-ATDSATTDLVKAVGLIAGAGLQ---V---PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA 211 (315) Q Consensus 146 ~------~~~-~~-~~~~~~~di~~~~~~~~~~~~~---~---~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~ 211 (315) . +.. .. .....++|+.+++.++...-.. . +...+|-++....|..- ...|.-+. .-+.. T Consensus 217 ~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl-----~~lk~- 289 (382) T protein:vir:96 217 PPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVS-----DWIEQ- 289 (382) T ss_pred ccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHH-----HHHHH- Confidence 1 000 00 1112356888888887543221 1 22366777766666321 11111000 00000 Q ss_pred CCccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEE--Ee---ccCCccccch-----hhhhcC-cE Q lcl|NC_018838. 212 GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIE--LI---EYGDPDQTGR-----DLKGHN-EV 280 (315) Q Consensus 212 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~--~~---~~~~~~~~~~-----~~f~~~-~v 280 (315) .+-++.++-...+......+.+....+++ ....+... .+ +.++...... ....+. .. T Consensus 290 ---n~Pnl~i~t~peL~~a~~~g~g~~~~~~~---------~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~ 357 (382) T protein:vir:96 290 ---TYPKMRIVSAPELSGVQMQGKTPEDALVL---------FVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSY 357 (382) T ss_pred ---hcCCcEEEEccccccccCCCccceeEEEE---------ecchhhhhcccccccCcceeccccceeeeccceeeccee Confidence 01122232211221111111111111111 00110000 00 0000000000 000000 00 Q ss_pred EEEEEEE-eccEeecccceEEEeec Q lcl|NC_018838. 281 MVRAEAV-LYVAIESLDSFAVVKEK 304 (315) Q Consensus 281 ~~r~~~r-~~~~v~~~~af~~l~~~ 304 (315) ..-+..| .|..+.+|.||+++++. T Consensus 358 ~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 358 VEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred EeccccceeeeEEEcchhhhhccCC Confidence 1111222 56678999999999999 No 190 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.65 E-value=0.00044 Score=38.99 Aligned_cols=286 Identities=10% Similarity=-0.005 Sum_probs=150.4 Q ss_pred CC--CCc---cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeec--c-cccCCCccc Q lcl|NC_018838. 1 MA--DDF---LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGE--G-EVKPSASVD 71 (315) Q Consensus 1 m~--~~~---~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E--g-~~~~~s~~~ 71 (315) +| +++ ..+-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-+.- + +..|..... T Consensus 16 ~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~ 95 (355) T protein:vir:18 16 LAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) T ss_pred HHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeeccccCCCCCcccccccc Confidence 22 222 2345788999999999999999999999988888773 2234444444455554321 1 223333344 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------ccc Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VKV 141 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~~ 141 (315) ++.-.+..++.-.-..|+-+.|.... -+..++..+++.+.+.++.-...-.++|+--...+ +|.+ ... T Consensus 96 l~~~~Y~c~qtn~dt~i~y~~LD~WA--~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:18 96 LESNKYECNQINFDFHLTYKRLDLWA--RFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQ 173 (355) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHh--cChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 56666777777666778888886653 23557788888888887765566678886311111 1111 110 Q ss_pred ------------ccc-c----ccccc-ccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHH--HHHHHhhcc Q lcl|NC_018838. 142 ------------SLD-K----TTKTV-DATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSF--ALSTEVYPK 195 (315) Q Consensus 142 ------------~~~-~----~~~~~-~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~--~L~~l~d~~ 195 (315) ... . ....+ -+..-.|.++.+++..+. +.....+. ++++...... ++..+. .. T Consensus 174 ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~ 252 (355) T protein:vir:18 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVN-KQ 252 (355) T ss_pred HHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhh-cc Confidence 000 0 00011 112334777777665432 22222222 4566655433 222222 22 Q ss_pred CccccccccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCcccc Q lcl|NC_018838. 196 GSPLAGQPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQT 270 (315) Q Consensus 196 g~~~~~~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~ 270 (315) ..|+ +...++ ..+|-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. T Consensus 253 ~~pt-------E~~Aa~~i~s~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----- 311 (355) T protein:vir:18 253 QENT-------ESLAADIIISQKRIGNLPAVRVPYFPANA---------VFVTTLENLSIYFMDESHRRSIDENP----- 311 (355) T ss_pred CChH-------HHHHHHHHHHHHhhCCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc----- Confidence 2222 111111 2479999999999999764 344455555554433322 2222221 Q ss_pred chhhhhcCcEEEEEEEEeccEeecccceEEEeec----cCCCCCCCCCC Q lcl|NC_018838. 271 GRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK----AAPKPNPPAGN 315 (315) Q Consensus 271 ~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~----~a~~~~~~~~~ 315 (315) +++++.-.=..--|+.|-+.++++.+... +.++++|++|. T Consensus 312 -----~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 312 -----KKDRVENYESMNIDYVVEAYAAGCLLENITLGDFTAPAAPEGGE 355 (355) T ss_pred -----ccccccchhhhcceeeeeccccEEEEeeeeecCCCCcccccCCC Confidence 12223222233456677777777776532 22344555555 No 191 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.60 E-value=0.0002 Score=40.82 Aligned_cols=275 Identities=12% Similarity=-0.003 Sum_probs=140.1 Q ss_pred CCCCccCCCce-------EcchhHHH----HHHHHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccC Q lcl|NC_018838. 1 MADDFLSAGKL-------ELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKP 66 (315) Q Consensus 1 m~~~~~s~Gg~-------~vP~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) |+......++. .||..+.+ ++++.+........|..+...+. ..+.++.....+.+.+.+-+.+.| T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P 110 (336) T protein:vir:78 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCC Confidence 33332222221 24544432 23344444444445544444332 245777777778899999999999 Q ss_pred CCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc--- Q lcl|NC_018838. 67 SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL--- 143 (315) Q Consensus 67 ~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~--- 143 (315) ..+...+..+-..+.++....++.+=++... ..-.+|.+.-+...++++.+.++.-.++|+.. ....|+.+.. T Consensus 111 ~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~-~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~---~~~~GllN~P~l~ 186 (336) T protein:vir:78 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAG-AGRVDLASELNYSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLS 186 (336) T ss_pred eeecceeeEEEEEEEEEeeeeecHHHHHHHH-HhCCCcHHHHHHHHHHHHHHhhCeEEEEeccc---cceEEEEeCCCCC Confidence 9988888887788888888888844443332 22334566667778888888888888888742 2334444421 Q ss_pred ---cccccc--cccccchhHHHHHHHHHhhhccc-----ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCC Q lcl|NC_018838. 144 ---DKTTKT--VDATDSATTDLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL 213 (315) Q Consensus 144 ---~~~~~~--~~~~~~~~~di~~~~~~~~~~~~-----~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~ 213 (315) +..+.. .......++|+.+++..+...-. ..+..++|.+.....|..- ...|. ..... +... T Consensus 187 a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~g~--tv~~~---lk~n-- 258 (336) T protein:vir:78 187 APITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGL--SAAAK---LKEI-- 258 (336) T ss_pred cccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-CccCc--cHHHH---HHHh-- Confidence 111110 01112346788888888754322 2234588888888777432 11111 10000 1100 Q ss_pred ccccceeeEeecccCccccccccccceEEEecc---cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEecc Q lcl|NC_018838. 214 DNWRGLNVGASSTVSGAPEMSPASGVKAIVGDF---SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV 290 (315) Q Consensus 214 ~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf---~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~ 290 (315) +-++.++ .+|...+.+ +....++.-+. +...+...+.++. .+.- .+.-....-+..|.++ T Consensus 259 --~Pnl~i~---t~pel~~Ag-g~~~~~~~~~~~~~~t~~~~~p~~f~~--lpvq---------~~~~~~~v~~~~rt~G 321 (336) T protein:vir:78 259 --FPKLEFV---TIPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA--HSIE---------RYSSYFRQKKSAGTWG 321 (336) T ss_pred --cCccEEE---EcccccccC-cceEEEEEeeccCCcceeeecchhhhc--ccee---------ecCceeEeccccceee Confidence 1122333 234332222 22222222221 1122222221111 1100 0111233455566666 Q ss_pred -EeecccceEEEeec Q lcl|NC_018838. 291 -AIESLDSFAVVKEK 304 (315) Q Consensus 291 -~v~~~~af~~l~~~ 304 (315) .+.+|-||+++++. T Consensus 322 v~i~~P~ai~~~~GI 336 (336) T protein:vir:78 322 AVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeccchheeeccC Confidence 56899999999999 No 192 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=96.58 E-value=0.00024 Score=40.44 Aligned_cols=281 Identities=10% Similarity=-0.077 Sum_probs=126.1 Q ss_pred CCCCcc-----CCCceEcchhHHHHHH----HHHHhccchhhhcceeecCC---CceEEEEEeCCceeEEeecccccCCC Q lcl|NC_018838. 1 MADDFL-----SAGKLELPGSMIGAVR----DRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~~~~~-----s~Gg~~vP~~~~~~ii----~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) +|.... +.++.-||-.+.+-|. +.+..-.....|..+...+. ..+.+++....+.+.+.+-+++.|.. T Consensus 65 ~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~ 144 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) T ss_pred cccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCce Confidence 333322 2334446776666443 33333333333444433322 24567777777888899999999887 Q ss_pred ccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc----- Q lcl|NC_018838. 69 SVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL----- 143 (315) Q Consensus 69 ~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~----- 143 (315) +...+..+-..+.++....++.+=++... ..-.+|.+.-+...++++.+.+++-.|+|..........|+.+.. T Consensus 145 d~~~~~~~r~v~~~~~g~~yg~~El~~A~-~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~ 223 (388) T protein:vir:99 145 SWNVNFERRTIVRGEMGIQVGLLEEGRAS-AMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) T ss_pred eccceeeeeeEEEEEeeeeecHHHHHHHH-hhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccc Confidence 75444433333444444455543332221 122335566677888888888899899996532222333444321 Q ss_pred -ccccccc------ccccchhHHHHHHHHHhhhccccc------ceEEEEeHHHHHHHHHHhhccCcccccccccccccc Q lcl|NC_018838. 144 -DKTTKTV------DATDSATTDLVKAVGLIAGAGLQV------PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGF 210 (315) Q Consensus 144 -~~~~~~~------~~~~~~~~di~~~~~~~~~~~~~~------~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~ 210 (315) ..++... ......++||.+++..+...-... +...+|-+.....|..- ...|.-+ ..-+.. T Consensus 224 v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tv-----l~~lk~ 297 (388) T protein:vir:99 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISV-----RDWLKQ 297 (388) T ss_pred cccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccH-----HHHHHH Confidence 1111000 011223568888888875433211 12466777777767422 1111100 000000 Q ss_pred CCCccccceeeEeecccCccccc-cccccceEE-Eec-ccceEEEee---------ccceEEEeccCCccccchhhhhcC Q lcl|NC_018838. 211 AGLDNWRGLNVGASSTVSGAPEM-SPASGVKAI-VGD-FSRVHWGFQ---------RNFPIELIEYGDPDQTGRDLKGHN 278 (315) Q Consensus 211 ~~~~~l~G~Pv~~s~~v~~~~~~-~~~~~~~~~-~gD-f~~~~i~~~---------~~~~v~~~~~~~~~~~~~~~f~~~ 278 (315) .+-++.++- +|...+. +.+....++ +.+ +.....+.. -...+....-- .+.- T Consensus 298 ----n~Pnl~i~t---~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq---------~~~~ 361 (388) T protein:vir:99 298 ----TYPRVRVMS---APELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVE---------KRVK 361 (388) T ss_pred ----hcCCcEEEE---ecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccce---------ecCc Confidence 111223332 2222111 111222222 211 110000000 00001111000 0000 Q ss_pred cEEEEEEEEe-ccEeecccceEEEeec Q lcl|NC_018838. 279 EVMVRAEAVL-YVAIESLDSFAVVKEK 304 (315) Q Consensus 279 ~v~~r~~~r~-~~~v~~~~af~~l~~~ 304 (315) ....-+..|. |..+.+|.||+++++. T Consensus 362 ~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 362 NYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred eeEeccccceeeeEEeccchhheeccC Confidence 1222333444 4567999999999999 No 193 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=96.37 E-value=0.00069 Score=37.92 Aligned_cols=291 Identities=12% Similarity=0.040 Sum_probs=145.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcc-eeecCCC-ceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSP-EQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~-~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) |.-++-+ -.+.+-++++.+|...+.+.-.--.+.| +.-.++| .+.||.. +.+...--.|..+......+-+++++. T Consensus 1 ~~~TSNT-~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNT-RAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQ 78 (313) T ss_pred Ccccccc-hheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEE Confidence 6665432 3456667778887776666543333444 4444444 4778755 556666666666666667778889988 Q ss_pred eEEEEEe-ehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhc-ccccccc----ccccccccccccccccccc Q lcl|NC_018838. 79 PIKVVTQ-QRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH-GIDPATG----KPAAAVKVSLDKTTKTVDA 152 (315) Q Consensus 79 ~~kl~~~-~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~-G~g~~~~----~~~~~~~~~~~~~~~~~~~ 152 (315) ..++++- -.||+.|-+++ -+ ..+|-.....+-+|+|.+.+..-++. |..-..+ ....|.+-....+.+ - T Consensus 79 i~~Y~G~A~~vt~~LR~D~-~~-I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T---~ 153 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDG-TD-IDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAET---N 153 (313) T ss_pred EEeecCChhhhhhhhhhcc-hh-HHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccC---C Confidence 8887655 36999986654 33 44566667777788888887765543 2111111 111222221111111 1 Q ss_pred ccchhHHHHHHHHHhhhcccccc-eEEEEeHHHHHHHHHHhhccCccc-ccccccccccc-CC--CccccceeeEeeccc Q lcl|NC_018838. 153 TDSATTDLVKAVGLIAGAGLQVP-NGVALDPAFSFALSTEVYPKGSPL-AGQPMYPAAGF-AG--LDNWRGLNVGASSTV 227 (315) Q Consensus 153 ~~~~~~di~~~~~~~~~~~~~~~-~~~~m~~~~~~~L~~l~d~~g~~~-~~~~~~~~~~~-~~--~~~l~G~Pv~~s~~v 227 (315) ......++..+-......+.... ..+++.|.....|..+..-...-. ++..+..+.-+ ++ ...+.|..+.+|+-+ T Consensus 154 ~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L 233 (313) T protein:vir:95 154 GVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRL 233 (313) T ss_pred ceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhh Confidence 12233455555444444443322 248999999999988863322111 11112111111 11 135778888877755 Q ss_pred CccccccccccceEEEecc----c----ceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceE Q lcl|NC_018838. 228 SGAPEMSPASGVKAIVGDF----S----RVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFA 299 (315) Q Consensus 228 ~~~~~~~~~~~~~~~~gDf----~----~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~ 299 (315) ........-+..-.++|+. + .=..+-|+.+-- + +...+ .+-..+..+. .+|+|++++|.+-.+ T Consensus 234 ~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~--s-~~~~~----~~~~~~~~~~--~~R~G~Gi~R~~~L~ 304 (313) T protein:vir:95 234 HVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPK--S-EGERN----KDRARDEHVV--RCRYGFGIQRLDTLG 304 (313) T ss_pred hhccccccccccCceeeeeeeeeecccccceeeeeccccc--c-ccccc----ccccccccee--eeeecccceeeccee Confidence 3211110001111233331 0 011222332210 0 00000 1111233444 468999999988877 Q ss_pred EEe-eccCC Q lcl|NC_018838. 300 VVK-EKAAP 307 (315) Q Consensus 300 ~l~-~~~a~ 307 (315) .+- .+++= T Consensus 305 ~~~~~A~~~ 313 (313) T protein:vir:95 305 LLATSATAY 313 (313) T ss_pred EEEeccccC Confidence 654 34433 No 194 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=269 Identities=14% Similarity=0.046 Sum_probs=105.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhc---ceee--cC-CCceEE-EEEe-CCcee-EEeecccccCCCccc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLS---PEQP--TI-FGPVKG-AVFS-GVPRA-KIVGEGEVKPSASVD 71 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~---~~~~--~~-~~~~~i-p~~~-~~~~a-~wv~Eg~~~~~s~~~ 71 (315) |+.+..|+=-+.= +.+....++.+.+...+...+ ..+. .+ .|.+.. +.+. ++... .-+.....+...+.+ T Consensus 1 ~~~t~~sdl~vfn-~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDLVIYN-DTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeecceeeeh-hhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 9999988744432 233444555555433332221 1110 00 122211 1111 11111 011112222222221 Q ss_pred -eeeEEEeeEEEE-Eeeh--hhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccc Q lcl|NC_018838. 72 -VSAFTAQPIKVV-TQQR--VSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTT 147 (315) Q Consensus 72 -~~~v~l~~~kl~-~~~~--iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~ 147 (315) ..++. -|++ +.-+ .+.+.+.....+....+. .|.++++.++.+.+-...+.+.-. ...+...... T Consensus 80 ~~~dva---Vk~~~~~~~~~~~~~~~a~~g~dp~~~~~-~i~~~~~~~~l~~~l~~~l~~~~a----ai~~~t~~~~--- 148 (315) T protein:vir:96 80 ADEMVS---VKVPWKYGPYETTEEAFKRRARSPEEFSM-LIGQDMADATMAGWIGYALNALQG----AIGSNAGMNV--- 148 (315) T ss_pred ccccee---EEEeecCCchhccHHHHHHhhcCHHHHHH-HHHHHHHHHHHHHHHHHHHhhhhh----hhcccccccc--- Confidence 11221 1222 2222 333333322222222222 244444444444443333332210 0001111110 Q ss_pred cccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCcccccccccccccc---CCCccccceeeEee Q lcl|NC_018838. 148 KTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGF---AGLDNWRGLNVGAS 224 (315) Q Consensus 148 ~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~---~~~~~l~G~Pv~~s 224 (315) ..+.+......+.++..++ ..+...-..|+||..++..|.+.-..+ .+|.+... +...+.+|+||+|+ T Consensus 149 -~~~~a~~~~~~l~dA~~kl-GD~~~~l~~~vMHS~v~~~L~~q~L~~-------~~~~~~~~~~~~~~~~~lGkrViVd 219 (315) T protein:vir:96 149 -SGELATEGKKVLTKGLRTM-GDKASSIAIWVMDSTSYFDIVDEAIDN-------KLYEEAGVVVYGGTPGTLGKPVLVT 219 (315) T ss_pred -cccccccCHHHHHHHHHHh-cccccCeeEEEEchHHHHHHHHhhhhh-------hcccccceeEecCcCcccccEEEEE Confidence 1111223345667777776 445556678999999999997632221 12322221 11233459999999 Q ss_pred cccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEecc-EeecccceEEEee Q lcl|NC_018838. 225 STVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYV-AIESLDSFAVVKE 303 (315) Q Consensus 225 ~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~-~v~~~~af~~l~~ 303 (315) +.||... ...++. ..+.++.... +....+.. + ++=.+....|..| -.+||..|..-+ T Consensus 220 D~~P~~~--------~~gl~~-GAi~~~~~~~--~~~~~~~~--~--------g~e~l~~~~r~e~tf~l~p~G~sw~~- 277 (315) T protein:vir:96 220 DQCPATK--------IFGLVA-GAVMITESQA--PGMRSYQI--D--------DQENLAIGFRAEGTANVEVLGYKWKT- 277 (315) T ss_pred CCCCcce--------eeeeec-ceeeecCCCc--cccccccC--C--------CcceeEEEEeeeeEeeeeeeeEEeec- Confidence 9999521 122222 2222322222 11111111 0 1111222334334 357888877632 Q ss_pred ccCCCCCCCCCC Q lcl|NC_018838. 304 KAAPKPNPPAGN 315 (315) Q Consensus 304 ~~a~~~~~~~~~ 315 (315) .... . |... T Consensus 278 ~~~~--s-Pt~a 286 (315) T protein:vir:96 278 KTNV--N-PASA 286 (315) T ss_pred CCCc--C-CChH Confidence 2111 2 2222 No 195 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=290 Identities=10% Similarity=-0.003 Sum_probs=147.2 Q ss_pred CC--CCcc---CCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeec--c-cccCCCccc Q lcl|NC_018838. 1 MA--DDFL---SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGE--G-EVKPSASVD 71 (315) Q Consensus 1 m~--~~~~---s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E--g-~~~~~s~~~ 71 (315) +| +++. .+..|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-+.- + +..|..-.. T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~ 95 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) T ss_pred HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccccCCCCCCcccccccc Confidence 22 2222 335788999999999999999999999988888763 2234444444444444321 1 222333344 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc- Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK- 140 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~- 140 (315) ++.-.+..++.-.-..|+-+.|.... -+..++..+++.+.+.++.-...-.++|+--...+ +|.+ .. T Consensus 96 l~~~~Y~c~qtn~dt~i~y~~LD~WA--~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:98 96 LESSKYECNQINFDFHLKYKTLDLWA--RFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHh--cChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 55566667777666778888886653 23557788888888887765556678886311111 1111 11 Q ss_pred -----------cccc-cc----cccc-ccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHH--HHHHHhhcc Q lcl|NC_018838. 141 -----------VSLD-KT----TKTV-DATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSF--ALSTEVYPK 195 (315) Q Consensus 141 -----------~~~~-~~----~~~~-~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~--~L~~l~d~~ 195 (315) .... .. ...+ .+..-.|.++.+++..+. +.....+. ++++...... ++..+. .. T Consensus 174 ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~ 252 (355) T protein:vir:98 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVN-KQ 252 (355) T ss_pred HHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhh-cc Confidence 0000 00 0011 112344777777666432 22222222 4566655433 222222 22 Q ss_pred CccccccccccccccCCCccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhh Q lcl|NC_018838. 196 GSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDL 274 (315) Q Consensus 196 g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~ 274 (315) ..|.-. +-.++. ....+|-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. T Consensus 253 ~~ptE~--~Aa~~i-~s~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p--------- 311 (355) T protein:vir:98 253 QENSES--LAADII-ISQKRIGNLPAVRVPYFPANA---------VLVTTLENLSIYFMDESHRRSIDENP--------- 311 (355) T ss_pred CCcHHH--HHHHHH-HHhhhhCCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc--------- Confidence 222110 000000 012479999999999999764 344455555554433322 2222221 Q ss_pred hhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC----CCCCC Q lcl|NC_018838. 275 KGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN----PPAGN 315 (315) Q Consensus 275 f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~----~~~~~ 315 (315) +++++.-.=..--|+.|-+.+.++.+....--++. |+.|- T Consensus 312 -~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 312 -KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) T ss_pred -ccccccchhhhcceeeeeccccEEEeeceeeeCCCCCcccccCC Confidence 12223222233456677777777776543222222 22222 No 196 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.07 E-value=0.00047 Score=38.83 Aligned_cols=274 Identities=10% Similarity=-0.037 Sum_probs=135.1 Q ss_pred CCCCccCCCce-------EcchhHHHHHHH--HHHhccchhhhcceeecCC------CceEEEEEeCCceeEEeeccccc Q lcl|NC_018838. 1 MADDFLSAGKL-------ELPGSMIGAVRD--RAIDSGVLAKLSPEQPTIF------GPVKGAVFSGVPRAKIVGEGEVK 65 (315) Q Consensus 1 m~~~~~s~Gg~-------~vP~~~~~~ii~--~~~~~s~i~~l~~~~~~~~------~~~~ip~~~~~~~a~wv~Eg~~~ 65 (315) |+......++. .||..+.+ +++ ..+-.-.-++....+|+.. ..+.++.....+.+.+.+...+. T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~-~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~ 109 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTT-YVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD 109 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHh-hcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCC Confidence 33332222222 25544433 442 2222333333444444332 23456666666778888888899 Q ss_pred CCCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccc-- Q lcl|NC_018838. 66 PSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSL-- 143 (315) Q Consensus 66 ~~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~-- 143 (315) |..+...+...-..+.++....++.+=++... ..-..|.+.-+...++++.+.++.-.++|+.. ....|+.+.. T Consensus 110 P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~-~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~---~~~~GllN~P~l 185 (336) T protein:vir:10 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAG-AGRVDLASELNYSSALGLAKFLNGSYLFGVAG---LENYGLINDPSL 185 (336) T ss_pred cceeeeeeeeeeeEEEEEEEEeeCHHHHHHHH-HhCCCcHHHHHHHHHHHHHHhhCeEEEEeecc---cceEEEeecCCC Confidence 99887766666677778777888854443332 22334566677778888888888888888753 2233444421 Q ss_pred ----cccccc-c-ccccchhHHHHHHHHHhhhccc-----ccceEEEEeHHHHHHHHHHhhccCccccccccccccccCC Q lcl|NC_018838. 144 ----DKTTKT-V-DATDSATTDLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 144 ----~~~~~~-~-~~~~~~~~di~~~~~~~~~~~~-----~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) +..+.. . ......++|+.+++..+...-. ..+..++|.+.....|..- ...|. + +..-+... T Consensus 186 ~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~-n~~g~--t---v~~~lk~n- 258 (336) T protein:vir:10 186 SAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGL--S---AAAKLKEI- 258 (336) T ss_pred CcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-CccCc--c---HHHHHHHh- Confidence 111110 0 1112346788888888754322 1234588888888877432 11111 1 00001110 Q ss_pred CccccceeeEeecccCccccccccccceEEEecc---cceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEec Q lcl|NC_018838. 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDF---SRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLY 289 (315) Q Consensus 213 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf---~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~ 289 (315) +-++.++- +|...+.+ +....++.-+. +...+.+.+.++. .+- . .+.-....-+..|.+ T Consensus 259 ---~Pnl~i~t---~pel~~Ag-g~~~~~~~~~~~~~~t~~~~~P~~f~~--lpv--q-------~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 259 ---FPKLEFVT---IPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA--HSI--E-------RYSSYFRQKKSAGTW 320 (336) T ss_pred ---CCccEEEE---cccccccC-CceEEEEEecccCCcceeeecChhhhc--cce--e-------ecCceeEecccccee Confidence 11223432 34332222 22222222221 1122222221111 100 0 011123344556666 Q ss_pred c-EeecccceEEEeec Q lcl|NC_018838. 290 V-AIESLDSFAVVKEK 304 (315) Q Consensus 290 ~-~v~~~~af~~l~~~ 304 (315) + .+.+|-||+++++. T Consensus 321 Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMLGV 336 (336) T ss_pred eeeeeccchheeeccC Confidence 5 56899999999999 No 197 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=279 Identities=13% Similarity=-0.005 Sum_probs=149.0 Q ss_pred CC--CCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEee--cccccCCCcc-cee Q lcl|NC_018838. 1 MA--DDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVG--EGEVKPSASV-DVS 73 (315) Q Consensus 1 m~--~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~--Eg~~~~~s~~-~~~ 73 (315) +| +++ ..+..|.|.+...+.+.+.+++.|-+.+.-+++++..- +-.+-....++-++-+. .+......++ .++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~l~ 95 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSALD 95 (338) T ss_pred HHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccCCCCCccccccccccC Confidence 22 333 23557889999999999999999999999888887732 23444444444444432 1122222333 455 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc----ccccc------c---- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG----KPAAA------V---- 139 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~----~~~~~------~---- 139 (315) .-.+..++.-.-..|+-+.|.... .+..++..+++.+.+.++.-...-.++|+--... .+|.+ . T Consensus 96 ~~~Y~c~qtn~dt~i~y~~LD~WA--~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 173 (338) T protein:vir:11 96 NQRYECKHTDFDTAITYAMLDAWA--KFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQY 173 (338) T ss_pred CCccEEEEeeeeeeecHHHHHHHh--cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 556666666666778888886553 3445778888888888776555667888631111 11111 0 Q ss_pred --------cccccccc-cccc-cccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH--HHHHhhccCccccc Q lcl|NC_018838. 140 --------KVSLDKTT-KTVD-ATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPLAG 201 (315) Q Consensus 140 --------~~~~~~~~-~~~~-~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~~~ 201 (315) .+...... ..+. +..-.|.++.+++..+. +.....+. ++++....... +..+. ....|. T Consensus 174 Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n-~~~~pt-- 250 (338) T protein:vir:11 174 RNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVN-KDQPAT-- 250 (338) T ss_pred HhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHh-cCCChH-- Confidence 00000000 0111 22234777777666432 22222222 45666554332 11221 111111 Q ss_pred cccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhhhh Q lcl|NC_018838. 202 QPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDLKG 276 (315) Q Consensus 202 ~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~f~ 276 (315) +...++ ..+|-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. + T Consensus 251 -----E~~Aa~~~~s~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----------~ 306 (338) T protein:vir:11 251 -----EKIATDLILSQKRMGGLPPVEVPYVPEKG---------LMVTTLKNLSLYWQIGGRRRYLKEVP----------E 306 (338) T ss_pred -----HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc----------c Confidence 111111 2479999999999999764 344455555554433322 2222221 2 Q ss_pred cCcEEEEEEEEeccEeecccceEEEeeccCCC Q lcl|NC_018838. 277 HNEVMVRAEAVLYVAIESLDSFAVVKEKAAPK 308 (315) Q Consensus 277 ~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~ 308 (315) ++++.-.=..--|+.|-+.++++.+....--+ T Consensus 307 r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 307 KNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred cccccchhhhccceeeeccccEEEeecceecC Confidence 23333333344677888888888887655444 No 198 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=95.77 E-value=0.0015 Score=36.10 Aligned_cols=286 Identities=10% Similarity=-0.022 Sum_probs=148.2 Q ss_pred CC--CCcc---CCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCc-cc Q lcl|NC_018838. 1 MA--DDFL---SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSAS-VD 71 (315) Q Consensus 1 m~--~~~~---s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~-~~ 71 (315) +| +++. .+-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-.-..++-++-+. -+.+....+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcccccccccCCCCCcccccccc Confidence 22 2222 245788999999999999999999999888888763 223444444444444431 111222222 34 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc- Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK- 140 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~- 140 (315) ++.-....++.-.-..|+-+.|.... -+..++..+++.+.+.++.-...-.++|+--...+ +|.+ .. T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:60 96 LASNKYECDQINFDFYIRYKTLDLWA--RYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQ 173 (357) T ss_pred cCCCccEEEEeeeeccccHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 55666666666666778888886553 23457777888888877655555678886311111 1100 00 Q ss_pred -----------cc-c----ccccccc-ccccchhHHHHHHHHHh-----hhcccccce-EEEEeHHHHHH--HHHHhhcc Q lcl|NC_018838. 141 -----------VS-L----DKTTKTV-DATDSATTDLVKAVGLI-----AGAGLQVPN-GVALDPAFSFA--LSTEVYPK 195 (315) Q Consensus 141 -----------~~-~----~~~~~~~-~~~~~~~~di~~~~~~~-----~~~~~~~~~-~~~m~~~~~~~--L~~l~d~~ 195 (315) .. . ...+..+ .+..-.|.++.+++.-+ .+.....+. ++++-...... +..+. .. T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~ 252 (357) T protein:vir:60 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVN-RE 252 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhh-cC Confidence 00 0 0001111 11233577777766532 222222222 45555544331 22221 22 Q ss_pred CccccccccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCcccc Q lcl|NC_018838. 196 GSPLAGQPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQT 270 (315) Q Consensus 196 g~~~~~~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~ 270 (315) ..|. +....+ ..+|-|+|.+...++|.+. +++--|+++.|-...+ .+-.+.+.. T Consensus 253 ~~pT-------E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------llVT~L~NLsIY~Q~gs~RR~~~d~p----- 311 (357) T protein:vir:60 253 QDNS-------EMLAADVIISQKRIGNLPAVRVPYFPADA---------MLITKLENLSIYYMDDSHRRVIEENP----- 311 (357) T ss_pred CChH-------HHHHHHHHHHhhhhcCcceEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc----- Confidence 2221 111111 2478999999999999764 3334455555433332 222222222 Q ss_pred chhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 271 GRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 271 ~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) +++++.-.=..--|+.|-+.++++.+....-.++..|++. T Consensus 312 -----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~ 351 (357) T protein:vir:60 312 -----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred -----ccccccchhhhcceeeeeccccEEEeeeeeeccCcccccC Confidence 1223333223446777888888888775544444444433 No 199 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=95.28 E-value=0.0024 Score=34.96 Aligned_cols=286 Identities=12% Similarity=-0.003 Sum_probs=145.2 Q ss_pred CCCCc---cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccccCCCccceeeEE Q lcl|NC_018838. 1 MADDF---LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~---~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~ 76 (315) =.+++ ..+-.|.|.+...+.+.+.+++.|-+.+.-+++++.. .+-.+-.-..++-++-+.. ..+.....++.-. T Consensus 22 ~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt~t--r~~~~~~~l~~~~ 99 (358) T protein:vir:78 22 KAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRKKG--GRFKGKVGVDGNT 99 (358) T ss_pred HHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceecCC--CccccccccCCCc Confidence 12232 2356799999999999999999999988888888763 2233444344444444332 2233334455556 Q ss_pred EeeEEEEEeehhhHHHhccChhh-hHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc--ccccc----------- Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADY-RLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAA--AVKVS----------- 142 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d-~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~--~~~~~----------- 142 (315) +..++.-.-..|+-+.|....-. ....++..+++.+.+.++.-...-.++|+--...+.+. .++.- T Consensus 100 Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re 179 (358) T protein:vir:78 100 YELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQLARE 179 (358) T ss_pred cEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHh Confidence 66666666677888887554310 01135666777777776655555677886311111000 00000 Q ss_pred ---------cc-ccccccc-cccchhHHHHHHHHHh-----hhcccccce-EEEEeHHHHHH--HHHHhhccCccccccc Q lcl|NC_018838. 143 ---------LD-KTTKTVD-ATDSATTDLVKAVGLI-----AGAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPLAGQP 203 (315) Q Consensus 143 ---------~~-~~~~~~~-~~~~~~~di~~~~~~~-----~~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~~~~~ 203 (315) .. .....+. .+.-.|.++.+++.-+ .+.....+. ++++-...... +..+. ....|. T Consensus 180 ~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pT---- 254 (358) T protein:vir:78 180 WKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYS-EATKPS---- 254 (358) T ss_pred hchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhh-cCCCcH---- Confidence 00 0001111 1223577777666532 222222232 45555554331 22222 222221 Q ss_pred cccccccCC--CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhcCcE Q lcl|NC_018838. 204 MYPAAGFAG--LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHNEV 280 (315) Q Consensus 204 ~~~~~~~~~--~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~~~v 280 (315) +....+ ..+|-|+|.+...++|.+. +++--|+++.|-...+ .+-.+.+.. +++++ T Consensus 255 ---E~~Aa~~i~k~iGGlpa~~~PfFP~~~---------ilVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~ri 312 (358) T protein:vir:78 255 ---EQIAAQQLAKSIAGRKAYIPPFFPGKR---------MVVTTLDNLHCYTQRGTRKRKADDNQ----------DSKSF 312 (358) T ss_pred ---HHHHHHHHHHHhCCCeEEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc----------ccccc Confidence 111111 1578999999999999764 3334455555433222 222222222 12223 Q ss_pred EEEEEEEeccEeecccceEEEeec---cCCCCCCCCCC Q lcl|NC_018838. 281 MVRAEAVLYVAIESLDSFAVVKEK---AAPKPNPPAGN 315 (315) Q Consensus 281 ~~r~~~r~~~~v~~~~af~~l~~~---~a~~~~~~~~~ 315 (315) .-.=..--|+.|-+.+.++.+... -.++|.||+.. T Consensus 313 E~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~ 350 (358) T protein:vir:78 313 DNQYWRMEGYALGEHKAYGGFEEADIEIGADPAVLAVE 350 (358) T ss_pred cchhhhcceeeeeccccEEEEeeeeeeeCCCCCccccC Confidence 322233456778888888877643 23444444443 No 200 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=287 Identities=10% Similarity=-0.019 Sum_probs=149.7 Q ss_pred CC--CCcc---CCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCc-cc Q lcl|NC_018838. 1 MA--DDFL---SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSAS-VD 71 (315) Q Consensus 1 m~--~~~~---s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~-~~ 71 (315) +| +++. .+-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-.-..++-++-+. -+......+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 22 2222 245788999999999999999999999888888763 223444444444444432 112222223 34 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc- Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK- 140 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~- 140 (315) ++.-....++.-.-..|+-+.|.... -+..++..+++.+.+.++.-...-.++|+--...+ +|.+ .. T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:20 96 LASNKYECDQINFDFYIRYKTLDLWA--RYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cCCCccEEEEeeecccccHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 55566666666666778888886553 23457777888888877655555678886311111 1110 00 Q ss_pred -----------cccc-----cccccc-ccccchhHHHHHHHHHh-----hhcccccce-EEEEeHHHHHH-HHHHhhccC Q lcl|NC_018838. 141 -----------VSLD-----KTTKTV-DATDSATTDLVKAVGLI-----AGAGLQVPN-GVALDPAFSFA-LSTEVYPKG 196 (315) Q Consensus 141 -----------~~~~-----~~~~~~-~~~~~~~~di~~~~~~~-----~~~~~~~~~-~~~m~~~~~~~-L~~l~d~~g 196 (315) +..+ ..+..+ .+..-.|.++.+++.-+ .+.....+. ++++-...... --.|..... T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:20 174 KYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 0000 000111 11223577777666532 222222222 45555544331 111211222 Q ss_pred ccccccccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccc Q lcl|NC_018838. 197 SPLAGQPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTG 271 (315) Q Consensus 197 ~~~~~~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~ 271 (315) .|. +....+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+ .+-.+.+.. T Consensus 254 ~pt-------E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------ilVT~L~NLsIY~Q~gs~RR~~~d~p------ 311 (357) T protein:vir:20 254 DNS-------EMLAADVIISQKRIGNLPAVRVPYFPADA---------MLITKLENLSIYYMDDSHRRVIEENP------ 311 (357) T ss_pred ChH-------HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc------ Confidence 222 111111 2478999999999999764 3344455555433332 222222221 Q ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 272 RDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 272 ~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) +++++.-.=..--|+.|-+.++++.+....-.++..|++. T Consensus 312 ----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~ 351 (357) T protein:vir:20 312 ----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ----ccccccchhhhcceeeeeccccEEEeeeeeeccccCCccC Confidence 1233333333446778888888888876555544444444 No 201 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=95.22 E-value=0.0025 Score=34.84 Aligned_cols=281 Identities=10% Similarity=0.011 Sum_probs=150.1 Q ss_pred CC--CCcc----C-CCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee---cccccCCCc Q lcl|NC_018838. 1 MA--DDFL----S-AGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG---EGEVKPSAS 69 (315) Q Consensus 1 m~--~~~~----s-~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~---Eg~~~~~s~ 69 (315) +| +++. + +-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-+. -++..|..- T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 95 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVASTTDTSGDGERKTTSI 95 (342) T ss_pred HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCcccccccccCCCCCcccccc Confidence 32 2332 3 23688999999999999999999998888888763 223444444445454432 112223332 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------c Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------V 139 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~ 139 (315) ..++.-.+..++.-.-..|+-+.|.... .+..++..+++.+.+.++.-...-.++|+--...+ +|.+ . T Consensus 96 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GW 173 (342) T protein:vir:10 96 AKLVKQTYHCQQINFDTHINYKQLDMWA--KFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGW 173 (342) T ss_pred cccCCCccEEEEeeecccccHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHH Confidence 4556666667777666778888875543 34457777888888877655555678886311111 1100 0 Q ss_pred c------------ccccccccccccccchhHHHHHHHHHh-----hhcccccce-EEEEeHHHHHH--HHHHhhccCccc Q lcl|NC_018838. 140 K------------VSLDKTTKTVDATDSATTDLVKAVGLI-----AGAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPL 199 (315) Q Consensus 140 ~------------~~~~~~~~~~~~~~~~~~di~~~~~~~-----~~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~ 199 (315) . ..........-+..-.|.++.+++..+ .+.....+. ++++-...... +..+. ....|. T Consensus 174 lQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n-~~~~pt 252 (342) T protein:vir:10 174 LQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVN-QQNAPT 252 (342) T ss_pred HHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHh-cCCChH Confidence 0 000001111112233577777666532 222222222 45565554432 11121 111221 Q ss_pred cccccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhh Q lcl|NC_018838. 200 AGQPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDL 274 (315) Q Consensus 200 ~~~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~ 274 (315) +..+.+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+ .+-.+.+.. T Consensus 253 -------E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------ilVT~L~NLsIY~Q~gs~RR~~~d~p--------- 307 (342) T protein:vir:10 253 -------EELAADIVISQKRIGGLKAVRVPFFPANA---------ILITKLENLAIYVQEGTTRKHIENVP--------- 307 (342) T ss_pred -------HHHHHHHHHhhhhhcCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc--------- Confidence 111111 2478999999999999764 3334455555433222 222222222 Q ss_pred hhcCcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 275 KGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 275 f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) +++++.-.=..--|+.|-+.++++.+....-.+|. T Consensus 308 -~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 308 -KKDRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred -ccccccchhhhccceeeeccccEEEeecceecCCC Confidence 22333333334567788999999999877666666 No 202 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=95.01 E-value=0.003 Score=34.44 Aligned_cols=287 Identities=10% Similarity=-0.023 Sum_probs=147.5 Q ss_pred CC--CCcc---CCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCc-cc Q lcl|NC_018838. 1 MA--DDFL---SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSAS-VD 71 (315) Q Consensus 1 m~--~~~~---s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~-~~ 71 (315) +| +++. .+-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-.-..++-++-+. -+......+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 22 2222 245788999999999999999999999888888763 223444433444444432 112222223 34 Q ss_pred eeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc- Q lcl|NC_018838. 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK- 140 (315) Q Consensus 72 ~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~- 140 (315) ++.-....++.-.-..|+-+.|.... -+..++..+++.+.+.++.-...-.++|+--...+ +|.+ .. T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:56 96 LASNKYECDQINFDFYIRYKTLDLWA--RYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cCCCccEEEEeeecccccHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 55566666666666778888886553 23457777888888877655555678886311111 1100 00 Q ss_pred -----------cc-c----ccccccc-ccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH-HHHHhhccC Q lcl|NC_018838. 141 -----------VS-L----DKTTKTV-DATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA-LSTEVYPKG 196 (315) Q Consensus 141 -----------~~-~----~~~~~~~-~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~-L~~l~d~~g 196 (315) .. . ...+..+ .+..-.|.++.+++.-+. +.....+. ++++-...... --.|..... T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:56 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 00 0 0000111 112335777777665322 22222222 45555544331 111211222 Q ss_pred ccccccccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccc Q lcl|NC_018838. 197 SPLAGQPMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTG 271 (315) Q Consensus 197 ~~~~~~~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~ 271 (315) .|. +....+ ..+|-|+|.+...++|.+. +++--|+++.|-...+ .+-.+.+.. T Consensus 254 ~pT-------E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------llVT~L~NLsIY~Q~gs~RR~~~d~p------ 311 (357) T protein:vir:56 254 DNS-------EMLAADVIISQKRIGNLPAVRVPYFPADA---------MLITKLENLSIYYMDDSHRRVIEENP------ 311 (357) T ss_pred ChH-------HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc------ Confidence 222 111111 2478999999999999764 3334455555433332 222222222 Q ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 272 RDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 272 ~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) +++++.-.=..--|+.|-+.++++.+....-.++.+|+.. T Consensus 312 ----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 312 ----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ----ccccccchhhhcceeeeeccccEEEeeeeeeccCCCCccc Confidence 1222332223345677888888888775544444444333 No 203 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=94.86 E-value=0.0033 Score=34.17 Aligned_cols=299 Identities=11% Similarity=-0.019 Sum_probs=139.7 Q ss_pred CC-----CCccCCC----ceEcchhH-HHHHHHHHHhccchhhhcceeecCCCc-eEEEEE--eCCcee------EEeec Q lcl|NC_018838. 1 MA-----DDFLSAG----KLELPGSM-IGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVF--SGVPRA------KIVGE 61 (315) Q Consensus 1 m~-----~~~~s~G----g~~vP~~~-~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~--~~~~~a------~wv~E 61 (315) |- .+..++. |--+-+-+ ....+..+++.-++.+++...|++.+. .+|-+. ..-+.+ +..++ T Consensus 2 ~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~ 81 (401) T protein:vir:95 2 LNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDAS 81 (401) T ss_pred CccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCcc Confidence 22 1111211 22233322 344555555568889999999998543 233222 211111 12223 Q ss_pred cccc----------C-------------------CCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHH-HH Q lcl|NC_018838. 62 GEVK----------P-------------------SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLIS-PA 111 (315) Q Consensus 62 g~~~----------~-------------------~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~-~~ 111 (315) |+++ . ....+-..+..+.++++.+..+|++++.-. ...+|.+-+. +- T Consensus 82 G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~---~D~~l~~h~s~el 158 (401) T protein:vir:95 82 GATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFD---SDDGLMEHLSREL 158 (401) T ss_pred cccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhh---cchHHHHHHHHHH Confidence 3322 1 111222345667899999999999887433 3344554332 22 Q ss_pred HHHHH---HHHHHHhhhcccccccccccccccccccccccccccccchhHHHHHHHHHhhhcccc--------------- Q lcl|NC_018838. 112 LGASI---GRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDATDSATTDLVKAVGLIAGAGLQ--------------- 173 (315) Q Consensus 112 la~~i---~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di~~~~~~~~~~~~~--------------- 173 (315) +.-+- ...+-+-++++-+ +..-+......++.+......+...++++..+...|..++.. T Consensus 159 l~g~~~~t~d~i~~dll~ag~--~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk 236 (401) T protein:vir:95 159 MNGATQITEAVLQKDLLAAAG--TVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTK 236 (401) T ss_pred hhhhhhhHHHHHHHHHHhhcC--eeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcc Confidence 22221 1222223443211 000010001111111122222334466776666665532211 Q ss_pred --cceE-EEEeHHHHHHHHHHhhccCcccccc---cccccc-ccCCCccccceeeEeecccCcccccc------------ Q lcl|NC_018838. 174 --VPNG-VALDPAFSFALSTEVYPKGSPLAGQ---PMYPAA-GFAGLDNWRGLNVGASSTVSGAPEMS------------ 234 (315) Q Consensus 174 --~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~---~~~~~~-~~~~~~~l~G~Pv~~s~~v~~~~~~~------------ 234 (315) ..+- -++|+.....|+.++|-.|.+-|-. |-.++. -.|.-+.|-++.++.++.+---.+.+ T Consensus 237 ~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~ 316 (401) T protein:vir:95 237 VIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRT 316 (401) T ss_pred ccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccc Confidence 1122 3569999999999988777654432 111221 23455788899999888853110110 Q ss_pred ----ccccc----eEEEecccceEEEeeccc-----eEEEeccC--CccccchhhhhcCcEEEEEEEEeccEeecccceE Q lcl|NC_018838. 235 ----PASGV----KAIVGDFSRVHWGFQRNF-----PIELIEYG--DPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFA 299 (315) Q Consensus 235 ----~~~~~----~~~~gDf~~~~i~~~~~~-----~v~~~~~~--~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~ 299 (315) .+++. ..++|+-....++..++- .+-+..-+ +.+ ...-|=|++.+.++ +..++.+++++-.+ T Consensus 317 ~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad-~~DPlgQ~g~vgwK--~~~a~~vL~~e~m~ 393 (401) T protein:vir:95 317 SMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETAD-RNDPYGETGFSSIK--WYYGILVKRPERLA 393 (401) T ss_pred ccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCC-CCCcccceehhhhh--hhhhhheeccceeE Confidence 11111 235677665555543321 22222221 111 11124455555554 35778899999999 Q ss_pred EEeeccCCCCC Q lcl|NC_018838. 300 VVKEKAAPKPN 310 (315) Q Consensus 300 ~l~~~~a~~~~ 310 (315) +|+.++ |. T Consensus 394 ~ies~a---~~ 401 (401) T protein:vir:95 394 LIKTVA---PL 401 (401) T ss_pred EEEeec---CC Confidence 887654 22 No 204 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=94.84 E-value=0.0034 Score=34.14 Aligned_cols=280 Identities=11% Similarity=-0.001 Sum_probs=149.1 Q ss_pred CC--CCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCccceee Q lcl|NC_018838. 1 MA--DDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSASVDVSA 74 (315) Q Consensus 1 m~--~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~~~~~~ 74 (315) +| +++ ..+-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-+. .+...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~ 95 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCC Confidence 22 222 2345778888999999999999999999988888763 223444444444444432 22223333345566 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc---- Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK---- 140 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~---- 140 (315) -.+..++.-.-..|+-+.|.... .+..++..+++.+.+.++.-.-.-.++|+--...+ +|.+ .. T Consensus 96 ~~Y~c~qtn~dt~i~y~~LD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~R 173 (337) T protein:vir:10 96 NRYRCEKTDYDTAIPYRKLDMWA--KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYR 173 (337) T ss_pred CccEEEEeeeeeeccHHHHHHHh--cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHH Confidence 66777777666778888886553 34457788888888887765556678886311111 1111 00 Q ss_pred --------c-cccccccccccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH--HHHHhhccCccccccc Q lcl|NC_018838. 141 --------V-SLDKTTKTVDATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPLAGQP 203 (315) Q Consensus 141 --------~-~~~~~~~~~~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~~~~~ 203 (315) + .+....+..-+..-.|.++.+++..+. +.....+. ++++-...... +..+. ....|. T Consensus 174 e~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n-~~~~pt---- 248 (337) T protein:vir:10 174 ERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAPT---- 248 (337) T ss_pred hcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhc-cCCCcH---- Confidence 0 000011111122335777777665432 22222222 45555544331 11111 111211 Q ss_pred cccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhhhhcC Q lcl|NC_018838. 204 MYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDLKGHN 278 (315) Q Consensus 204 ~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~f~~~ 278 (315) +..+.+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. +++ T Consensus 249 ---E~~Aa~~i~s~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~ 306 (337) T protein:vir:10 249 ---ERLAADLIVSQKRIGNLPAVRVPFFPKRA---------LMVTKLSNLSIYYQEGARRRTLKEVP----------ERD 306 (337) T ss_pred ---HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeechhcEEEEecCcEEEEEEEcc----------ccc Confidence 111111 1479999999999999764 344455555554433322 2222221 233 Q ss_pred cEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 279 ~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) ++.-.-..--|+.|-+.++++.+....-.+. T Consensus 307 rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 307 RIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cccchhhccceeeeeccccEEEEeceeecCC Confidence 3333333445778888888888876544444 No 205 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=269 Identities=9% Similarity=-0.003 Sum_probs=120.9 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHh-ccchhhhcceeecCCCceEEEEEeCCcee-EEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAID-SGVLAKLSPEQPTIFGPVKGAVFSGVPRA-KIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~-~s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) |..+...=. .+-..+...+.+.... .+..+++|++++......++......+.. .|+||- +..++.-...++. T Consensus 1 m~it~~~l~--~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~---~~~~l~~~~~~i~ 75 (302) T protein:vir:10 1 MLINKQSLN--AAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAK---VVKNLKAYKYVVE 75 (302) T ss_pred CcccHHHHH--HHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccce---eeccccccceeEE Confidence 665542211 1111222333333332 23467888887765555666666665654 676663 3334444557788 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhc----ccccc--cccc--ccccc-------ccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA--TGKP--AAAVK-------VSL 143 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~----G~g~~--~~~~--~~~~~-------~~~ 143 (315) -++++..+.||+|.+.++... .+ .-+.+.++++.++.+|+.++. |.+++ .|.. ..... +.. T Consensus 76 ~~~~g~~v~i~R~~i~nDdlg---~~-~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g 151 (302) T protein:vir:10 76 NEDFEATVEVDRNDIEDDQIG---IY-SPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKG 151 (302) T ss_pred eecccceecccHHhhcccccc---hh-HHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceeccccccccccccccc Confidence 999999999999999765433 23 345666777777777664432 21111 0100 00000 000 Q ss_pred cc--cccccccccchhHHHHHHHHHhhhccccc----ceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCcccc Q lcl|NC_018838. 144 DK--TTKTVDATDSATTDLVKAVGLIAGAGLQV----PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWR 217 (315) Q Consensus 144 ~~--~~~~~~~~~~~~~di~~~~~~~~~~~~~~----~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~ 217 (315) .. ...........+.....++..+....+.. +.-+++.|.....-+.+-. .++..+ +..+.+. T Consensus 152 ~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~-~~~~~~----------g~~Np~~ 220 (302) T protein:vir:10 152 TAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLT-NPKLAD----------NTPNPYV 220 (302) T ss_pred chhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhh-ccccCC----------CCcceec Confidence 00 00000111112233333333333322222 2235555555554444421 111111 1122233 Q ss_pred c-eeeEeecccCccccccccccceEEEecccceEEE---eeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEee Q lcl|NC_018838. 218 G-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWG---FQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIE 293 (315) Q Consensus 218 G-~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~---~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 293 (315) | ..++++..+.+ .....++.|.+.+... -+++.+++..+. |..+.+.+|.+.++|..-+ T Consensus 221 g~~~~vv~p~L~s-------~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~----------~~~dgv~~k~~~d~Gvd~R 283 (302) T protein:vir:10 221 GTAELVVDGRIES-------DTAWFLLDTTKPVKPFIFQPRKQPEFVSQVN----------LDSDDVFNLRKLKFGAEAR 283 (302) T ss_pred cceEEEEeeccCC-------CCceEEEecCCccceEEEcCccccEEEeccC----------CCCCceEEEEEEEEeeeee Confidence 3 35566655532 1234555665443322 244455554443 5556667776666653222 Q ss_pred ------cccceEEEeeccC Q lcl|NC_018838. 294 ------SLDSFAVVKEKAA 306 (315) Q Consensus 294 ------~~~af~~l~~~~a 306 (315) .+..-..-+..++ T Consensus 284 ~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 284 AAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred eecchhhhhhhhccCccCC Confidence 2221122233332 No 206 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=280 Identities=11% Similarity=-0.008 Sum_probs=149.2 Q ss_pred CCCCccC-CCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCccceeeEE Q lcl|NC_018838. 1 MADDFLS-AGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSASVDVSAFT 76 (315) Q Consensus 1 m~~~~~s-~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~~~~~~v~ 76 (315) =.+++.+ +-.|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-+. .+...|..-..++.-. T Consensus 18 ~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~~~ 97 (337) T protein:vir:79 18 KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNR 97 (337) T ss_pred HhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCCCc Confidence 1233322 34578888999999999999999999988888763 223444444444444432 2222333334556666 Q ss_pred EeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cc------ Q lcl|NC_018838. 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VK------ 140 (315) Q Consensus 77 l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~------ 140 (315) +..++.-.-..|+-+.|.... .+..++..+++.+.+.++.-.-.-.++|+--...+ +|.+ .. T Consensus 98 Y~c~qtn~dt~i~y~~LD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ 175 (337) T protein:vir:79 98 YRCEKTDYDTAIPYRKLDAWA--KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRER 175 (337) T ss_pred cEEEEeeeeeeccHHHHHHHh--cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHhc Confidence 777777666778888886553 34457788888888887765556678886311111 1111 00 Q ss_pred ------c-cccccccccccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH--HHHHhhccCccccccccc Q lcl|NC_018838. 141 ------V-SLDKTTKTVDATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPLAGQPMY 205 (315) Q Consensus 141 ------~-~~~~~~~~~~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~~~~~~~ 205 (315) + .+....+..-+..-.|.++.+++..+. +.....+. +.++-...... +..+. ....|. T Consensus 176 ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n-~~~~pt------ 248 (337) T protein:vir:79 176 AAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVN-ATQAPT------ 248 (337) T ss_pred chhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhc-cCCCcH------ Confidence 0 000111111123335777777665432 22222222 45555544331 11111 111211 Q ss_pred cccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhhhhcCcE Q lcl|NC_018838. 206 PAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDLKGHNEV 280 (315) Q Consensus 206 ~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~f~~~~v 280 (315) +..+.+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. +++++ T Consensus 249 -E~~Aa~~i~s~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~ri 308 (337) T protein:vir:79 249 -ERLAADLIVSQKRIGNLPAVRVPFFPKRA---------LMVTKLSNLSIYYQEGARRRTLKEVP----------ERDRI 308 (337) T ss_pred -HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeechhcEEEEecCcEEEEEEEcc----------ccccc Confidence 111111 1479999999999999764 344455555554433322 2222221 23333 Q ss_pred EEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 281 MVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 281 ~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) .-.-..--|+.|-+.++++.+....-.+. T Consensus 309 e~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 309 ENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cchhhccceeeeeccccEEEEeceeecCC Confidence 33333445778888888888876554444 No 207 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=93.92 E-value=0.006 Score=32.78 Aligned_cols=283 Identities=10% Similarity=-0.018 Sum_probs=139.6 Q ss_pred CC--CCc-----cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccccCCCccce Q lcl|NC_018838. 1 MA--DDF-----LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSASVDV 72 (315) Q Consensus 1 m~--~~~-----~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~ 72 (315) +| +++ ..+..|.|.+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-..-+ ....++.+ T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~--R~~~~~~l 90 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQTG--RNLANLDH 90 (336) T ss_pred HHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccCCC--ccccccCc Confidence 21 222 2234699999999999999999999999988888763 22334433334433332221 22223455 Q ss_pred eeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-ccccccc------------- Q lcl|NC_018838. 73 SAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA-TGKPAAA------------- 138 (315) Q Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~-~~~~~~~------------- 138 (315) +.-.+..++.-.-..|+-+.|.... .+-......+..-+.++++.-.-.-.++|+--. +..+|.+ T Consensus 91 ~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re 169 (336) T protein:vir:37 91 TQNGFELAETDSGIIVPWALFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADLSDVNKGWLKLLQE 169 (336) T ss_pred CCcccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcccccchhHHHHHHh Confidence 5666666666666778888875542 221112233344444444443444567776311 1111111 Q ss_pred ------cccccccccccc-ccccchhHHHHHHHHH----hhhcccccce-EEEEeHHHHHH-HHHHhhccC-cccccccc Q lcl|NC_018838. 139 ------VKVSLDKTTKTV-DATDSATTDLVKAVGL----IAGAGLQVPN-GVALDPAFSFA-LSTEVYPKG-SPLAGQPM 204 (315) Q Consensus 139 ------~~~~~~~~~~~~-~~~~~~~~di~~~~~~----~~~~~~~~~~-~~~m~~~~~~~-L~~l~d~~g-~~~~~~~~ 204 (315) +........+.. .+....|.++.+++.. |.+.....+. +.++....... .-.+....+ +|. T Consensus 170 ~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt----- 244 (336) T protein:vir:37 170 QRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT----- 244 (336) T ss_pred ccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH----- Confidence 000001111111 1233447776665543 3222222222 45555543321 111222211 221 Q ss_pred ccccccC----CCccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhhhhcCc Q lcl|NC_018838. 205 YPAAGFA----GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDLKGHNE 279 (315) Q Consensus 205 ~~~~~~~----~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~f~~~~ 279 (315) +.... ...++-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. ++++ T Consensus 245 --E~~Aa~~~~~~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~r 303 (336) T protein:vir:37 245 --EKAALGSHNLMGSFGGMNAITPPNFPARA---------AAVTTLKNLSVYTEAESVRRSLRNDE----------DKKG 303 (336) T ss_pred --HHHHHHHHHHHHhhCCceeEEccccCCCc---------eEEeechhcEEEEecCcEEEEEEEcc----------cccc Confidence 11111 12579999999999999764 344455555554433322 2222221 1233 Q ss_pred EEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 280 VMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 280 v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) +.-.=..--|+.|-+.++++.+....-..|.+- T Consensus 304 ie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 304 LVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred ccchhhhcceeeeeccccEEEeeeeeeeecCcC Confidence 333333446778888888888876654443333 No 208 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=93.85 E-value=0.0062 Score=32.70 Aligned_cols=281 Identities=12% Similarity=0.005 Sum_probs=144.7 Q ss_pred CC--CCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEee--cccccCCCccceee Q lcl|NC_018838. 1 MA--DDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVG--EGEVKPSASVDVSA 74 (315) Q Consensus 1 m~--~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~--Eg~~~~~s~~~~~~ 74 (315) +| +++ ..+..|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-.-..++-++-+. -++..|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDTTQQDRETSDISTMDG 95 (339) T ss_pred HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccCCCCCcccccccccCC Confidence 22 222 2345688999999999999999999998888888763 223444434444444321 11222222235555 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------c----- Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------V----- 139 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~----- 139 (315) -....++.-.-..|+-+.|.... .+..++..+++.+.+.++.-...-.++|+--...+ +|.+ . T Consensus 96 ~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~R 173 (339) T protein:vir:79 96 RRYRCEQTNSDTHITYQKLDAWA--KFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNLR 173 (339) T ss_pred CccEEEEeeeeceecHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcCccccchhHHHHHH Confidence 56666666666678888875543 33457777888888877655555678886311111 1100 0 Q ss_pred -------cccc-cccccccc-cccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHH--HHHHHhhccCcccccc Q lcl|NC_018838. 140 -------KVSL-DKTTKTVD-ATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSF--ALSTEVYPKGSPLAGQ 202 (315) Q Consensus 140 -------~~~~-~~~~~~~~-~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~--~L~~l~d~~g~~~~~~ 202 (315) .+.. ....+... +....|.++.+++..+. +.....+. ++++-..... ++..+. ....|. T Consensus 174 e~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pt--- 249 (339) T protein:vir:79 174 EQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLVN-RDRDPV--- 249 (339) T ss_pred hhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHhh-cCCChH--- Confidence 0000 00111111 22335777776665432 22222332 4555554433 122221 111221 Q ss_pred ccccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhc Q lcl|NC_018838. 203 PMYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGH 277 (315) Q Consensus 203 ~~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~ 277 (315) +....+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+ .+-.+.+.. ++ T Consensus 250 ----E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------llVT~L~NLsIY~Q~gs~RR~~~d~p----------~r 306 (339) T protein:vir:79 250 ----QQIAADLIISQKRIGNLPAIRVPYFPANG---------LLVTRLDNLSIYYQEGGRRRTILDNA----------KR 306 (339) T ss_pred ----HHHHHHHHHHhhhhCCceeEEccccCCCc---------eEEeechhcEEEEecCcEEEEEEecc----------cc Confidence 111111 1478999999999999764 3344455555433332 222222222 12 Q ss_pred CcEEEEEEEEeccEeecccceEEEeeccCCCCC Q lcl|NC_018838. 278 NEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) Q Consensus 278 ~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~ 310 (315) +++.-.-..--|+.|-+.+.++.+....-.+.. T Consensus 307 ~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 307 DRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred ccccchhhccceeeeeccccEEEeeeeecccCC Confidence 333332233457778888888887754333333 No 209 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=93.04 E-value=0.009 Score=31.80 Aligned_cols=283 Identities=10% Similarity=-0.023 Sum_probs=138.7 Q ss_pred CC--CCc-----cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccccCCCccce Q lcl|NC_018838. 1 MA--DDF-----LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSASVDV 72 (315) Q Consensus 1 m~--~~~-----~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~ 72 (315) +| +++ ..+..|.|.+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-..-+.... ...+ T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~r~r~--~~~l 90 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQTGRNLA--TLDH 90 (336) T ss_pred HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccCCCCCcc--ccCC Confidence 22 222 2234799999999999999999999999988888763 223444333444443332222211 1234 Q ss_pred eeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc-cccc------------- Q lcl|NC_018838. 73 SAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK-PAAA------------- 138 (315) Q Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~-~~~~------------- 138 (315) +.-.+..++.-.-..|+-+.|.... .+-......+..-+.++++.-.-.-.++|+--...+ +|.+ T Consensus 91 ~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re 169 (336) T protein:vir:37 91 SQNGYELSETDSGILVNWSLFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDLSDVNKGWLKLLQE 169 (336) T ss_pred CCCccEEEEeeeeeeccHHHHHHHh-cChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccccccchhHHHHHHh Confidence 4455566666556778888875542 211112333344444444433344567775311111 1111 Q ss_pred ------cccccccccccc-ccccchhHHHHHHHHH----hhhcccccce-EEEEeHHHHHH-HHHHhhccC-cccccccc Q lcl|NC_018838. 139 ------VKVSLDKTTKTV-DATDSATTDLVKAVGL----IAGAGLQVPN-GVALDPAFSFA-LSTEVYPKG-SPLAGQPM 204 (315) Q Consensus 139 ------~~~~~~~~~~~~-~~~~~~~~di~~~~~~----~~~~~~~~~~-~~~m~~~~~~~-L~~l~d~~g-~~~~~~~~ 204 (315) +........+.. .+....|.++.+++.. |.+.....+. ++++....... --.+....+ +|. T Consensus 170 ~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt----- 244 (336) T protein:vir:37 170 QRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT----- 244 (336) T ss_pred ccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH----- Confidence 000001111111 1233447776665543 3222222222 45555543321 111111111 121 Q ss_pred ccccccC----CCccccceeeEeecccCccccccccccceEEEecccceEEEeeccce-EEEeccCCccccchhhhhcCc Q lcl|NC_018838. 205 YPAAGFA----GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP-IELIEYGDPDQTGRDLKGHNE 279 (315) Q Consensus 205 ~~~~~~~----~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~-v~~~~~~~~~~~~~~~f~~~~ 279 (315) +.... ...++-|+|.+..+++|.+. +++--|+++.|-...+-. -.+.+.. ++++ T Consensus 245 --E~~Aa~~~~~~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~r 303 (336) T protein:vir:37 245 --EKAALGSHNLMGSFGGMNAITPPNFPARA---------AAVTTLKNLSVYTEAESVRRSLRNDE----------DKKG 303 (336) T ss_pred --HHHHHHHHHHHHhhCCceEEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEEcc----------cccc Confidence 11111 12579999999999999764 344455555554433322 2222221 1233 Q ss_pred EEEEEEEEeccEeecccceEEEeeccCCCCCCC Q lcl|NC_018838. 280 VMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) Q Consensus 280 v~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~ 312 (315) +.-.=..--|+.|-+.+.++.+....-..|.+- T Consensus 304 ie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 304 LVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred ccchhhhcceeeeeccccEEEeeeeeeeccccC Confidence 333333446778889999998887655444444 No 210 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=92.95 E-value=0.0094 Score=31.71 Aligned_cols=280 Identities=11% Similarity=0.004 Sum_probs=146.2 Q ss_pred CC--CCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeec--ccccCCCccceee Q lcl|NC_018838. 1 MA--DDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGE--GEVKPSASVDVSA 74 (315) Q Consensus 1 m~--~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E--g~~~~~s~~~~~~ 74 (315) +| +++ ..+..|.|-+...+.+.+.+++.|-+.+.-+++++.. .+-.+-....++-++-..- ++..|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecCCCcccccccccccCC Confidence 22 222 2345788999999999999999999998888888763 2234443334444443322 2222222344556 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------------ Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------------ 138 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------------ 138 (315) -....++.-.-..|+-+.|.... .+..++..+++.+.+.++.-...-.++|+--...+ +|.+ T Consensus 96 ~~Y~c~qTn~dt~i~Y~~lD~WA--~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~R 173 (337) T protein:vir:78 96 NRYRCEKTDYDTAIPYRKLDMWA--KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYR 173 (337) T ss_pred CccEEEEeceecccCHHHHHHHh--cChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHH Confidence 66666666666778888875543 34457777888888877655555678886311111 1100 Q ss_pred -------ccccccccccccccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH--HHHHhhccCccccccc Q lcl|NC_018838. 139 -------VKVSLDKTTKTVDATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA--LSTEVYPKGSPLAGQP 203 (315) Q Consensus 139 -------~~~~~~~~~~~~~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~--L~~l~d~~g~~~~~~~ 203 (315) +...+.......-+..-.|.++.+++..+. +.....+. ++++-...... +..+. ....|. T Consensus 174 e~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n-~~~~pt---- 248 (337) T protein:vir:78 174 ERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAPT---- 248 (337) T ss_pred hcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHh-cCCCcH---- Confidence 000000011111123335777776665432 22222222 45555554432 11111 111221 Q ss_pred cccccccCC----CccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhcC Q lcl|NC_018838. 204 MYPAAGFAG----LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHN 278 (315) Q Consensus 204 ~~~~~~~~~----~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~~ 278 (315) +....+ ..+|-|+|.+..+++|.+. +++--|+++.|-...+ .+-.+.+.. +++ T Consensus 249 ---E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~---------ilVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~ 306 (337) T protein:vir:78 249 ---ERLAADLIVSQKRIGNLPAVRVPFFPKRA---------LMVTKLSNLSIYYQEGARRRTLKEVP----------ERD 306 (337) T ss_pred ---HHHHHHHHHHhhhhcCcceEEccccCCCc---------eEEeechhcEEEEecCcEEEEEEecc----------ccc Confidence 111111 1478999999999999764 3334455555433322 222222222 223 Q ss_pred cEEEEEEEEeccEeecccceEEEeeccCCCC Q lcl|NC_018838. 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) Q Consensus 279 ~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~ 309 (315) ++.-.-..--|+.|-+.++++.+....-.+. T Consensus 307 rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 307 RIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cccchhhccceeeeeccccEEEEeceeecCC Confidence 3333333445778888888888876544444 No 211 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=91.59 E-value=0.015 Score=30.56 Aligned_cols=287 Identities=9% Similarity=-0.002 Sum_probs=130.7 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccch-hhh---cceeecCC-CceEEEEEeC-CceeEEe-ecccccCCCcccee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVL-AKL---SPEQPTIF-GPVKGAVFSG-VPRAKIV-GEGEVKPSASVDVS 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i-~~l---~~~~~~~~-~~~~ip~~~~-~~~a~wv-~Eg~~~~~s~~~~~ 73 (315) |.....+.=.-+-=.+.+.++.+.+-..+++ +.| +++.+.++ -++..|..-. ..+++|- +|..-...-.-.|. T Consensus 1 mp~~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~ 80 (321) T protein:vir:34 1 MPFPNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVIS 80 (321) T ss_pred CCCchHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhcc Confidence 4432211000000012223334444444443 333 34444443 3466666544 7788995 55433333445688 Q ss_pred eEEEeeEEEEEeehhhH-HHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cccccccccccccccccc-- Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSD-EFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA-TGKPAAAVKVSLDKTTKT-- 149 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~-ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~-~~~~~~~~~~~~~~~~~~-- 149 (315) +-++.++.+++.+.||- |+|..+......+|-+.=.+.+.+.+.+.+|..+ |.+|.+ ++....|+...+....++ T Consensus 81 ~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l-~sdGTa~g~~~i~GL~~lv~~~p~tGt 159 (321) T protein:vir:34 81 SAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAAL-YGDGTAFGGRAINGLDGAVPVDPTVGT 159 (321) T ss_pred ccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhh-hccccccccchhhhhhhhcccCCCCce Confidence 99999999999888875 4444444444333333333444556677776654 444432 344444444433211110 Q ss_pred ----------------ccccc-chhHHHHHHHHHhhh---cccccceEEEEeHHHHHHHHHHhhccCccccccccccccc Q lcl|NC_018838. 150 ----------------VDATD-SATTDLVKAVGLIAG---AGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG 209 (315) Q Consensus 150 ----------------~~~~~-~~~~di~~~~~~~~~---~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~ 209 (315) .+.+. .+...+..++..+.- .....|+-|++....+...+.-....-|+.... ... T Consensus 160 vGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~----~a~ 235 (321) T protein:vir:34 160 YGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAE----EAN 235 (321) T ss_pred eccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecccc----ccc Confidence 00111 111234434443321 233456667877776665554443333433221 122 Q ss_pred cCCCc-cccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEE--EE Q lcl|NC_018838. 210 FAGLD-NWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRA--EA 286 (315) Q Consensus 210 ~~~~~-~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~--~~ 286 (315) .|-.+ .++|..|+..+.+...... ...||-|=+++.+....+-.+....... +--.||-++.. .. T Consensus 236 ~Gf~~Lky~~~div~D~~~g~~~pa-----n~~yfiNT~yl~~r~h~~~~~~pi~p~r-------~~~~NqdA~~q~I~~ 303 (321) T protein:vir:34 236 LGFRSLKFLSTDVVLDGGIGGFAGA-----NTMYFLNTKYLHFRPHKDRNMVPLSPSR-------RAAFNQDAEAQILAW 303 (321) T ss_pred ccceeeeeeeEEEEEeCCCCCCccc-----cceeeeecceEEEEEcCCCceeecCccc-------ccccchhHHhhhhhh Confidence 22222 4667788877754332111 1366667677766655544444333221 00112222111 11 Q ss_pred EeccEeecccceEEEeec Q lcl|NC_018838. 287 VLYVAIESLDSFAVVKEK 304 (315) Q Consensus 287 r~~~~v~~~~af~~l~~~ 304 (315) +..-.+-++.+=.+|+.- T Consensus 304 ~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 304 AGNLTCSGAQFQGRLIAE 321 (321) T ss_pred hheeeeecccceeEEeeC Confidence 122223445544555544 No 212 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=89.31 E-value=0.027 Score=29.18 Aligned_cols=284 Identities=14% Similarity=0.066 Sum_probs=124.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccch-----hhhcceeecCCCceEEEEEeCCceeEEe-------------ecc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVL-----AKLSPEQPTIFGPVKGAVFSGVPRAKIV-------------GEG 62 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i-----~~l~~~~~~~~~~~~ip~~~~~~~a~wv-------------~Eg 62 (315) -++-..+.+-+.|+..-. +++...+ -.+-++..+....+++-|-.++..|.-+ .|| T Consensus 69 ta~~~a~~T~i~V~~~~~------f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEG 142 (418) T protein:vir:96 69 TAEALADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEG 142 (418) T ss_pred EEEEecCceEEEecCCcc------cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcccc Confidence 122222222355554432 3333332 1223445555556677665555333222 355 Q ss_pred cccCCCccceeeEEEeeEEEEEeehhhHHHhccChhhhH-------HHHHHHHHHHHHHHHHHHHHHhhhcccc---ccc Q lcl|NC_018838. 63 EVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRL-------GVLQDLISPALGASIGRAVDLIAFHGID---PAT 132 (315) Q Consensus 63 ~~~~~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~-------~~l~~~i~~~la~~i~~~~d~a~~~G~g---~~~ 132 (315) ++.|... ...+..+.-+..|-+|-+.-|..... .++....++.|... ...++.+.++|.- ..+ T Consensus 143 sd~~ta~------~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~n 215 (418) T protein:vir:96 143 SQRPTAR------SIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYN 215 (418) T ss_pred cccCCcc------eecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCC Confidence 5544432 23333333333333333332222111 11222222333332 3456777777752 111 Q ss_pred ccc-------ccccccccccccccccccc---chhHHHHHHHHHhhh---cccccce----EEEEeHHHHHHHHHHhhcc Q lcl|NC_018838. 133 GKP-------AAAVKVSLDKTTKTVDATD---SATTDLVKAVGLIAG---AGLQVPN----GVALDPAFSFALSTEVYPK 195 (315) Q Consensus 133 ~~~-------~~~~~~~~~~~~~~~~~~~---~~~~di~~~~~~~~~---~~~~~~~----~~~m~~~~~~~L~~l~d~~ 195 (315) +.+ ..++...+ .++.+++.. ..++.+.+++..... +.+.... .+..+.+...+|.++.. + T Consensus 216 g~p~~~t~R~m~gI~~f~--~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~ 292 (418) T protein:vir:96 216 GQPLHTTQGIVDAIRQYA--PDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-E 292 (418) T ss_pred CcccccccchhHHHHhhc--cccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-e Confidence 211 12222221 223333332 234555555554322 1122221 14568888998987752 2 Q ss_pred CccccccccccccccCCC----ccccce-eeEeecccCccccccccccceEEEecccceEEEee--ccceEEEeccCC-- Q lcl|NC_018838. 196 GSPLAGQPMYPAAGFAGL----DNWRGL-NVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQ--RNFPIELIEYGD-- 266 (315) Q Consensus 196 g~~~~~~~~~~~~~~~~~----~~l~G~-Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~--~~~~v~~~~~~~-- 266 (315) -+. ...+-..|.. -+-+|. +++.++++|..--. ...+++.|.+.+.+..- +....+.+.... T Consensus 293 I~~-----~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~----~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~ 363 (418) T protein:vir:96 293 VTV-----TQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGIS----PGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGE 363 (418) T ss_pred eEe-----ccccceeceEEEEEEeeccEEEEEecCCCCccccC----cceEEEEecCceEEEEecCCCccchhcccCCCc Confidence 221 1111112211 122344 88888888754211 12367779888877665 444444442211 Q ss_pred --------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec----cCCCCCCCCC Q lcl|NC_018838. 267 --------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK----AAPKPNPPAG 314 (315) Q Consensus 267 --------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~----~a~~~~~~~~ 314 (315) +++.+++ -+.+++. ..+...++++++-++|++. ..-.||.|+. T Consensus 364 ~~~~~~~~~~~~~~D-~~~G~l~----~Eltle~~N~~a~a~itgl~~~~~~~~~~~~~~ 418 (418) T protein:vir:96 364 NKSGATDYSYGHGVD-AQGGSLT----SEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ccccccccccccccc-cccCEEE----EEEEEEeecccccEEeecccccccccccCCCCC Confidence 1222222 2334333 4567778999999999853 2223444444 No 213 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=89.29 E-value=0.027 Score=29.17 Aligned_cols=278 Identities=10% Similarity=-0.053 Sum_probs=120.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhcc------eeecCCCceEEEEEeCCceeEEeec--ccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSP------EQPTIFGPVKGAVFSGVPRAKIVGE--GEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~------~~~~~~~~~~ip~~~~~~~a~wv~E--g~~~~~s~~~~ 72 (315) ||.- . ..+.++..+.+.++..+....|+. +...++..++||+.....-..+--- +......+.++ T Consensus 1 MA~~--n-----~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~ 73 (299) T protein:vir:79 1 MAAL--N-----YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAW 73 (299) T ss_pred Cccc--h-----hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcce Confidence 8831 1 237889999999998887766642 2334556799999875443333211 21111234455 Q ss_pred eeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccccccccc Q lcl|NC_018838. 73 SAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDA 152 (315) Q Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~ 152 (315) ...+|.-.|--.+. |-. +...+.+....+...+.+.....+.-.+|.-.+...- .+. ..+.+.... ... . T Consensus 74 ~t~~ldqdr~~~f~-vD~--~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~--~~a--~~~g~~~~~--~~~-T 143 (299) T protein:vir:79 74 EPKVLTNQRKWSTL-VHP--ADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIY--ADW--TALGNTADT--TVL-T 143 (299) T ss_pred eEEEeeccccceec-cch--hhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHH--Hhh--hhcCCcccc--ccc-C Confidence 55666655543321 110 0000001011111122222222333334442221110 000 001111000 111 1 Q ss_pred ccchhHHHHHHHHHhhhcccccceE-EEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe--ecccCc Q lcl|NC_018838. 153 TDSATTDLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA--SSTVSG 229 (315) Q Consensus 153 ~~~~~~di~~~~~~~~~~~~~~~~~-~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~--s~~v~~ 229 (315) ....|+.++++...+.+++....+. .+++|.....|.+...- .+...... -.....+..++|.|.||+. ++.|.. T Consensus 144 ~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f-~k~~~~~~-~~~~~~g~Vg~idG~~Ii~Vps~r~~t 221 (299) T protein:vir:79 144 TTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEI-QRTVNIKD-AGTSLNRQTTDIDTVKIIKVPSNLMKT 221 (299) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhh-hccccccc-ccceeeeeeeeecceEEEEechhhcCc Confidence 2345788899998887776554444 56788888877543211 11111100 0123455567899999985 333432 Q ss_pred c-------ccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeec--ccce-E Q lcl|NC_018838. 230 A-------PEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIES--LDSF-A 299 (315) Q Consensus 230 ~-------~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~--~~af-~ 299 (315) . .....+.+.-+++.. ....+...+.-.+++.+- ...+.+..+ +.-..++|.=|.+ .+++ + T Consensus 222 ~~~~~~G~~~~~~ak~in~ii~~-~~a~~~~~K~~~~~~~~P-~~~~~~~~~-------~~~r~y~d~~v~~nk~~~i~~ 292 (299) T protein:vir:79 222 AYDFTTGWKVGAGAKQIFMSLVH-PSAIITPVSYQFSKLDEP-TAVTEGKYF-------YFEESFEDVFILNKKADAIQF 292 (299) T ss_pred cceeccCccccCcccccceEEEc-CCeeeeeEeeeeEEeecC-CCCCcccee-------eeeeeeeeeeeeccccCeEEE Confidence 1 111112222334443 345555555545555432 222222111 2223344443333 2333 2 Q ss_pred EEeeccC Q lcl|NC_018838. 300 VVKEKAA 306 (315) Q Consensus 300 ~l~~~~a 306 (315) -++.+=+ T Consensus 293 ~~~~a~~ 299 (299) T protein:vir:79 293 VVEGAGA 299 (299) T ss_pred EeeecCC Confidence 2222222 No 214 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=86.98 E-value=0.041 Score=28.21 Aligned_cols=272 Identities=14% Similarity=0.149 Sum_probs=126.4 Q ss_pred CCCCc--cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEE-EeCCceeEEeecccccCCCccceeeEEE Q lcl|NC_018838. 1 MADDF--LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAV-FSGVPRAKIVGEGEVKPSASVDVSAFTA 77 (315) Q Consensus 1 m~~~~--~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~-~~~~~~a~wv~Eg~~~~~s~~~~~~v~l 77 (315) +++.. ..+..+-+|.-+...|-..+...+++.+...+..++. +-+.+ +..+.+++....|+.+.+...++.--+| T Consensus 35 laengvtitdttfqlprklvesintallntnpvfkvfhvtnvga--llvsrsfdssneaqvhkdgqtkteqaatltidtl 112 (318) T protein:vir:94 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA--LLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTL 112 (318) T ss_pred hhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhh--eeeeccccccchhhhhcccccccccceeeeeccc Confidence 44331 2344567888888888888888888888877765542 32333 4455678888899999988888877777 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHH-HHhhhccccccccccccccccccc------cccccc Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAV-DLIAFHGIDPATGKPAAAVKVSLD------KTTKTV 150 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~-d~a~~~G~g~~~~~~~~~~~~~~~------~~~~~~ 150 (315) .|--++....+-+..-+. ..+...|..+|..++.++|.+++ |.++.-|+|..+ ..++..... .+++.- T Consensus 113 epvmvyklqslaervkrl--qmsyselynlivaeltqaivnkivdlalvegdgtng---fksidkeadvkkikkittkak 187 (318) T protein:vir:94 113 EPVMVYKLQSLAERVKRL--QMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG---FKSIDKEADVKKIKKITTKAK 187 (318) T ss_pred chhHHHHHHHHHHHHHHH--hhhHHHHHHHHHHHHHHHHHhhhhheeeeecCCcch---hhhhchhhhHHHHHHhhhhhh Confidence 776666555555443222 12234477888899999988877 667777876322 122222211 122233 Q ss_pred ccccchhHH-HHHHHHHhhhcccccceEEEEeHH-HHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccC Q lcl|NC_018838. 151 DATDSATTD-LVKAVGLIAGAGLQVPNGVALDPA-FSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVS 228 (315) Q Consensus 151 ~~~~~~~~d-i~~~~~~~~~~~~~~~~~~~m~~~-~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~ 228 (315) .++..+++| +..++..+.+..+.+ -.++... -.+.|..|+.+.... +. .+-++... -..=.|..-+ +- T Consensus 188 sagktpfadaieeavdfvrptagrr--ylivktedrkalldelrqatana-nv-riknddte--iasevgvdei----iv 257 (318) T protein:vir:94 188 SAGKTPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKALLDELRQATANA-NV-RIKNDDTE--IASEVGVDEI----IV 257 (318) T ss_pred hcCCCchhHHHHHHHhhhccCCCce--EEEEeccchHHHHHHHHhhhccc-ce-EEeccchh--hhhhcCccee----EE Confidence 334444443 444555554332211 1333333 244455555443221 11 11111100 0011111110 00 Q ss_pred ccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEe Q lcl|NC_018838. 229 GAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVK 302 (315) Q Consensus 229 ~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~ 302 (315) - ++...-.+-++-| ..|+|.+..=..++..++. -.+||+.+.....--....+..|+..+. T Consensus 258 y---tgskavkptvlvd-qkyhidmqdltkvdafewk---------tnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 258 Y---TGSKAVKPTVLVD-QKYHIDMQDLTKVDAFEWK---------TNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred e---eccccccceeEec-cceecchhhhhhhhceeec---------cCCceEEEEecccCcceeecCceeEEeC Confidence 0 0000011122334 2344433221112222211 1134444433222222223344433332 No 215 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=86.62 E-value=0.018 Score=30.14 Aligned_cols=276 Identities=16% Similarity=0.116 Sum_probs=124.6 Q ss_pred CCCCcc--CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFL--SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~--s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) +++.-- ++-.+.+|+-+...|-..+.++.++.+...+...+.--++....+ ...|.-.-.|..+.+...+|..-++. T Consensus 35 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s-~AeAq~HkdGqTK~eqa~~~~~~Tl~ 113 (318) T protein:vir:86 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SAEAQVHKDGQTKTEQAATLTIDTLE 113 (318) T ss_pred hhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhhhhh-hhhhhhhccCCccccceeeeeeechh Confidence 333322 445677899888888888999999988877765553323333222 35667777888888887777766766 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHH-HHHHHhhhccccccccc---cccccccccccccccccccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIG-RAVDLIAFHGIDPATGK---PAAAVKVSLDKTTKTVDATD 154 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~-~~~d~a~~~G~g~~~~~---~~~~~~~~~~~~~~~~~~~~ 154 (315) +--++....+ .|+.++..-. -..|-.+|..++++++- +..|.|+.-|+|+-+-. ..+-+..-...+++.-.++. T Consensus 114 ~~~VY~~~S~-Ae~~K~~~~s-Ysel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagt 191 (318) T protein:vir:86 114 PVMVYKLQSL-AERVKRLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGT 191 (318) T ss_pred HHHHHHHHHH-HHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCC Confidence 6443333333 3444444333 34477889999999888 78899999998743211 00111111122333333444 Q ss_pred chhHH-HHHHHHHhhhcccccceEEEEeHHH-HHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc Q lcl|NC_018838. 155 SATTD-LVKAVGLIAGAGLQVPNGVALDPAF-SFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 155 ~~~~d-i~~~~~~~~~~~~~~~~~~~m~~~~-~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) .++++ +..++..+.+..+.+ -.++.... ...|..|+.+.... ...+-++... -..=.|..-+ +-- T Consensus 192 tpfanaieeavdfvrptagrr--ylivkaedrkalldelrqatana--hvriknddte--iasevgvdei----ivy--- 258 (318) T protein:vir:86 192 TPFANAIEEAVDFVRPTAGRR--YLIVKAEDRKALLDELRQATANA--HVRIKNDDTE--IASEVGVDEI----IVY--- 258 (318) T ss_pred CchhhHHHHHHhhhccCCCce--EEEEeecchHHHHHHHHhhcccc--eeEEeccchh--hhhhcCccee----eee--- Confidence 44443 444555554332211 12333222 34455555443221 0011111000 0011111110 000 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEe Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVK 302 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~ 302 (315) ++...-.+-++-| ..|+|.+..=..++..++. -.+||+.+.....--....+..|+..+. T Consensus 259 tgskalkptvlvd-qkyhidmqdltkvdafewk---------tnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 259 TGSKALKPTVLVD-QKYHIDMQDLTKVDAFEWK---------TNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred eccccccceeeec-cceecchhhhhhhhcceec---------cCCceEEEeecccCcceeecCceeEEeC Confidence 0000011122233 2344432221112112211 1134444433222222223344433332 No 216 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=86.59 E-value=0.012 Score=31.02 Aligned_cols=276 Identities=16% Similarity=0.122 Sum_probs=123.7 Q ss_pred CCCCcc--CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFL--SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~--s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) +++.-- ++-.+.+|.-+...|-..+.++.++.+...+...+.--++....+. ..|.-.-.|..+.+...+|..-++. T Consensus 110 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s~-~eAq~HkdGqTK~eqa~~~~~~Tl~ 188 (393) T protein:vir:16 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA-NEAQVHKDGQTKTEQAATLTIDTLE 188 (393) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhhh-hhhhhhccCCccccceeeeeeechh Confidence 333322 4456778998888888889999999888776655432222222222 3666667788888777777766666 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHH-HHHHHhhhccccccccc---cccccccccccccccccccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIG-RAVDLIAFHGIDPATGK---PAAAVKVSLDKTTKTVDATD 154 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~-~~~d~a~~~G~g~~~~~---~~~~~~~~~~~~~~~~~~~~ 154 (315) +--++....+ .|+.++.... -..|-.+|..++++++- +..|.|+.-|+|.-+-. ..+-+-.-...+++.-.++. T Consensus 189 ~~~VY~~~S~-Ae~~K~~~~s-Ysel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagk 266 (393) T protein:vir:16 189 PVMVYKLQSL-AERVKRLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGK 266 (393) T ss_pred HHHHHHHHHH-HHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCC Confidence 6433333333 3444443323 34477889999999888 78899999998742210 00111111122333334444 Q ss_pred chhHH-HHHHHHHhhhcccccceEEEEeHH-HHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc Q lcl|NC_018838. 155 SATTD-LVKAVGLIAGAGLQVPNGVALDPA-FSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 155 ~~~~d-i~~~~~~~~~~~~~~~~~~~m~~~-~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) .+++| +..++..+.+..+.+ -.++... -.+.|..|+.+.... +. .+-++... -.+=.|..-+ +--. T Consensus 267 tpfadaieeavdfvrptagrr--ylivktedrkalldelrqatana-nv-riknddte--iasevgvdei----ivyt-- 334 (393) T protein:vir:16 267 TPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKALLDELRQATANA-NV-RIKNDDTE--IASEVGVDEI----IVYT-- 334 (393) T ss_pred CchhHHHHHHHhhhccCCCce--EEEEeccchHHHHHHHHhhhccC-ce-eeeccchh--hhhhcCccee----eeee-- Confidence 44443 445555554332211 1233332 344455555443221 11 11111100 0011111110 0000 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEe Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVK 302 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~ 302 (315) +...-.+-++-| ..|+|.+..=..++..++- -.+||+.+.....--....+..|+..+. T Consensus 335 -gskalkptvlvd-qkyhidmqdltkvdafewk---------tnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 335 -GSKALKPTVLVD-QKYHIDMQDLTKVDAFEWK---------TNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred -ccccccceeeec-cccccchhhhhhhhhheec---------cCCceEEEeecccCcceeeccceeEeeC Confidence 000011122233 2333332211111111111 1134444433222222223344433332 No 217 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=86.42 E-value=0.046 Score=27.93 Aligned_cols=284 Identities=9% Similarity=-0.044 Sum_probs=136.7 Q ss_pred CCCCc-cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCC-ceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~-~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) =.+++ ..+..|.|-+...+.+.+.+++.|-+.+.-+++++..- +-.+-....++-++-+.-+ ....++.++.-.+. T Consensus 22 ~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt~--R~~r~~~l~~~~Y~ 99 (341) T protein:vir:27 22 KSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAGG--RFTKQVGVGGHKYK 99 (341) T ss_pred HHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCCC--ceecccccCCcceE Confidence 12222 23456888889999999999999999888888877632 1333333334444433321 12223455666666 Q ss_pred eEEEEEeehhhHHHhccChhh-hHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc----cccc------cccc----- Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADY-RLGVLQDLISPALGASIGRAVDLIAFHGIDPATGK----PAAA------VKVS----- 142 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d-~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~----~~~~------~~~~----- 142 (315) .++.-.-..|+-+.|....-. ....++..+++.+.++++.-.-.-.++|+--...+ +|.+ ...- T Consensus 100 c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a 179 (341) T protein:vir:27 100 LAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRK 179 (341) T ss_pred EEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHHhhc Confidence 666666677888887444310 13456777888888887766666678886311111 1111 0000 Q ss_pred ---ccccccccccccchhHHHHHHHHHhh-----hcccccce-EEEEeHHHHHH-HHHHhhccCccccccccccccccCC Q lcl|NC_018838. 143 ---LDKTTKTVDATDSATTDLVKAVGLIA-----GAGLQVPN-GVALDPAFSFA-LSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) Q Consensus 143 ---~~~~~~~~~~~~~~~~di~~~~~~~~-----~~~~~~~~-~~~m~~~~~~~-L~~l~d~~g~~~~~~~~~~~~~~~~ 212 (315) +-..........-.|.++.+++..+. +.....+. ++++-...... --.|......|.- ..+.+ T Consensus 180 ~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE-------~~Aa~ 252 (341) T protein:vir:27 180 ASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSE-------QIAAQ 252 (341) T ss_pred ccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHH-------HHHHH Confidence 00111112223445777777655432 22222222 45555444331 1111111111211 11111 Q ss_pred --CccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEE-EeccCCccccchhhhhcCcEEEEEEEEec Q lcl|NC_018838. 213 --LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIE-LIEYGDPDQTGRDLKGHNEVMVRAEAVLY 289 (315) Q Consensus 213 --~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~-~~~~~~~~~~~~~~f~~~~v~~r~~~r~~ 289 (315) ..+|-|+|.+..+++|.+. +++--|+++.|-...+-.=+ +-+....+.- -+ |++ + T Consensus 253 ~i~k~iGGlpa~~~PffP~~~---------~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri-e~-yes-----------~ 310 (341) T protein:vir:27 253 KLDKTIAGRPAYVPPFLPDNA---------MVVTIPENLQVLTQHGTAQRKAKHESDRKRS-KT-HTG-----------A 310 (341) T ss_pred HHHHhhCCCeEEEccccCCCc---------eEEeeccceEEEEecCcEEEEEEeccccccc-cc-hhh-----------h Confidence 2489999999999999764 34445555555443333222 2222111110 01 222 2 Q ss_pred cEeecccceEEEeeccCCCCCCCC-----CC Q lcl|NC_018838. 290 VAIESLDSFAVVKEKAAPKPNPPA-----GN 315 (315) Q Consensus 290 ~~v~~~~af~~l~~~~a~~~~~~~-----~~ 315 (315) +.|-+-.+|+.+.......|+--- -| T Consensus 311 YvVEdyg~~~~~~~~~vkl~~~~~~~~~~~~ 341 (341) T protein:vir:27 311 WKVTQWVCWKRSPLTTQKKSTSALNHRSERN 341 (341) T ss_pred heeehhhhhhhccccccccCccccccccccC Confidence 444444444433333222222111 11 No 218 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=83.68 E-value=0.066 Score=27.04 Aligned_cols=286 Identities=8% Similarity=-0.055 Sum_probs=136.6 Q ss_pred CC--CCc-----cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCC-CceEEEEEeCCceeEEeecccccCCCccce Q lcl|NC_018838. 1 MA--DDF-----LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSASVDV 72 (315) Q Consensus 1 m~--~~~-----~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~ 72 (315) +| +++ ..+..|.|.+...+.+.+.+++.|-+.+.-+++++.. ++..+-...++..++-........+. ... T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~r~~t~~~~~~~-~~~ 94 (343) T protein:vir:98 16 AAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYGAHDRRTPIQQR-WTR 94 (343) T ss_pred HHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccCccccCCCcccc-ccC Confidence 22 222 2344689999999999999999999988888887753 11222222222222211110000000 001 Q ss_pred eeEEEeeEEEEEeehhhHHHhccChhhhHHH-HHHHHHHHHHHHHHHHHHHhhhccccccc-cccccccc---------- Q lcl|NC_018838. 73 SAFTAQPIKVVTQQRVSDEFMWADADYRLGV-LQDLISPALGASIGRAVDLIAFHGIDPAT-GKPAAAVK---------- 140 (315) Q Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~-l~~~i~~~la~~i~~~~d~a~~~G~g~~~-~~~~~~~~---------- 140 (315) +.-.+..++.-.-..|+-+.|.... . +.. ++..+++.+.+.++.-...-.++|+--.. .++|.+-. T Consensus 95 ~~~~Y~c~qTn~dt~i~Y~~lD~WA-~-~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPllqDVN~GWLQ~~R 172 (343) T protein:vir:98 95 QVMSMNVSRQIQACLIPWAKLDQWG-H-LKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPNLADVNKGWIQFVR 172 (343) T ss_pred CCCccEEEEeeeeeeccHHHHHHhh-c-ChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCcchhhcchHHHHHHH Confidence 1113444444444667777775542 2 233 66777777777766555556778863211 11211110 Q ss_pred ---------ccccccccccccccchhHHHHHHHHHhh----hcccccce-EEEEeHHHHHHHH-HHhhccCccccccccc Q lcl|NC_018838. 141 ---------VSLDKTTKTVDATDSATTDLVKAVGLIA----GAGLQVPN-GVALDPAFSFALS-TEVYPKGSPLAGQPMY 205 (315) Q Consensus 141 ---------~~~~~~~~~~~~~~~~~~di~~~~~~~~----~~~~~~~~-~~~m~~~~~~~L~-~l~d~~g~~~~~~~~~ 205 (315) ...........+....|.++.+++..+. +.....+. ++++.......=. .+....+++-. T Consensus 173 e~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~pt----- 247 (343) T protein:vir:98 173 ENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIAT----- 247 (343) T ss_pred hcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChH----- Confidence 0000001111122334777666655432 22222222 4555555433211 11112221100 Q ss_pred cccccC----CCccccceeeEeecccCccccccccccceEEEecccceEEEeecc-ceEEEeccCCccccchhhhhcCcE Q lcl|NC_018838. 206 PAAGFA----GLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHNEV 280 (315) Q Consensus 206 ~~~~~~----~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~-~~v~~~~~~~~~~~~~~~f~~~~v 280 (315) +..+. ...++-|+|.+..+++|.+. +++--|+++.|-...+ .+-.+.+.. +++++ T Consensus 248 -Ek~Aa~~~~~~k~iGGl~a~~~PfFP~~~---------llVT~L~NLsIY~Q~gs~RR~~~d~p----------~r~ri 307 (343) T protein:vir:98 248 -EKAALNTHDLMKSFGGMPAMIVPNMPPRA---------AIVTSLSNLSIYTQEGSMRRGMKDDD----------DKKAV 307 (343) T ss_pred -HHHHHHHHHHHHhhCCCeeEEccccCCCc---------eEEeeccccEEEEecCcEEEEEEecc----------ccccc Confidence 11111 12478999999999999764 3344455555433332 222222222 22333 Q ss_pred EEEEEEEeccEeecccceEEEeeccCCCCCCCCCC Q lcl|NC_018838. 281 MVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAGN 315 (315) Q Consensus 281 ~~r~~~r~~~~v~~~~af~~l~~~~a~~~~~~~~~ 315 (315) .-.=..--|+.|-+.+.++.+....-..+. ..|. T Consensus 308 e~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~-~~g~ 341 (343) T protein:vir:98 308 RDSYYRNEAYAVEDCGKFMAVDFTKVKLSS-GKGT 341 (343) T ss_pred cchhhhcceeeeeccccEEEeeeeeeeecC-CCCC Confidence 333334467788888998888876544442 2222 No 219 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=77.01 E-value=0.13 Score=25.45 Aligned_cols=280 Identities=14% Similarity=0.062 Sum_probs=118.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchh-----hhcceeecCCCceEEEEEeCCceeEEe-------------ecc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLA-----KLSPEQPTIFGPVKGAVFSGVPRAKIV-------------GEG 62 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~-----~l~~~~~~~~~~~~ip~~~~~~~a~wv-------------~Eg 62 (315) -+.-+.+.+-+.|+..-. +++...+. .+-++..+....+++-|..++..|.-+ .|| T Consensus 69 ta~a~a~~T~l~ve~~~~------f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEG 142 (418) T protein:vir:10 69 TAEAAADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEG 142 (418) T ss_pred EEEEecCceEEEEcCcce------eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccc Confidence 222222223344444322 33333321 123444555566777776655443322 355 Q ss_pred cccCCCccceeeEEEeeEEEE-------EeehhhHHHhccChhhhHHH-HHHHHHHHHHHHHHHHHHHhhhcccc----c Q lcl|NC_018838. 63 EVKPSASVDVSAFTAQPIKVV-------TQQRVSDEFMWADADYRLGV-LQDLISPALGASIGRAVDLIAFHGID----P 130 (315) Q Consensus 63 ~~~~~s~~~~~~v~l~~~kl~-------~~~~iS~ell~~~~~d~~~~-l~~~i~~~la~~i~~~~d~a~~~G~g----~ 130 (315) ++.+... ...+..+. -.+.||.-........-.++ .+.. .+++... +..+++++++|.- . T Consensus 143 sd~~ta~------~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese-~drk~~~-av~iEkalI~G~~~~~~~ 214 (418) T protein:vir:10 143 SQRPTAR------SIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESR-RDCMDFH-ATEQETAIFFGQAFMGTY 214 (418) T ss_pred cccCCcc------eecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHH-HHHHHHH-HHHHHHHHhcccccCCCc Confidence 5544432 22222222 11223332211000000011 1222 2222222 2367889999951 1 Q ss_pred ccc--ccccccccccc--ccccccccc---cchhHHHHHHHHHhhh--cc-cccce----EEEEeHHHHHHHHHHhhccC Q lcl|NC_018838. 131 ATG--KPAAAVKVSLD--KTTKTVDAT---DSATTDLVKAVGLIAG--AG-LQVPN----GVALDPAFSFALSTEVYPKG 196 (315) Q Consensus 131 ~~~--~~~~~~~~~~~--~~~~~~~~~---~~~~~di~~~~~~~~~--~~-~~~~~----~~~m~~~~~~~L~~l~d~~g 196 (315) ..+ -...|+...+- ..++.+++. ...++.+.+++..... .+ +.... ...++++...++.++- + T Consensus 215 ~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~ 291 (418) T protein:vir:10 215 NGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---G 291 (418) T ss_pred CCcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---h Confidence 111 12334433221 112323333 2345666666665422 11 11111 2456788888887763 1 Q ss_pred ccccccccccccccCCC--cc--------ccceeeEeecccCccccccccccceEEEecccceEEEee--ccceEEEecc Q lcl|NC_018838. 197 SPLAGQPMYPAAGFAGL--DN--------WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQ--RNFPIELIEY 264 (315) Q Consensus 197 ~~~~~~~~~~~~~~~~~--~~--------l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~--~~~~v~~~~~ 264 (315) + +. ...++-..|.. .. |.=+|++..=+||.+ .+++-|..++.+..- +.+..+.+.. T Consensus 292 ~-I~--~~~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g---------~mlVvD~~~vkL~~L~~R~~~~E~l~k 359 (418) T protein:vir:10 292 E-VT--VTQRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPG---------FAVVVDVPAVKLAYMDGRNAKVENYGQ 359 (418) T ss_pred h-ee--ecccceeeeEEEEEEEcceEEEEeecccccccccCCCc---------eEEEEccccceEEEeccccccchhccc Confidence 1 11 01111111110 01 112233333345532 467779888887765 5555555422 Q ss_pred CC----------ccccchhhhhcCcEEEEEEEEeccEeecccceEEEee----ccCCCCCCCCC Q lcl|NC_018838. 265 GD----------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKE----KAAPKPNPPAG 314 (315) Q Consensus 265 ~~----------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~----~~a~~~~~~~~ 314 (315) .. +++.+++ -+.+++. -.+...++++.+-++|++ +..-.+|||+. T Consensus 360 ~G~~~~~~~~~~~~~~~~D-~~kG~iv----~E~tLe~~N~~a~avitgl~~~~~~~~~t~p~~ 418 (418) T protein:vir:10 360 GGGENKSGATDYSYGHGVD-AQGGSLT----SEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred CCCcccccccccccccccc-cccceEE----EEeeeeeecccceEEeeccceecccccCCCCCC Confidence 11 1222222 2333333 456777899999999975 23334566666 No 220 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=72.13 E-value=0.19 Score=24.58 Aligned_cols=276 Identities=16% Similarity=0.127 Sum_probs=123.9 Q ss_pred CCCCcc--CCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCceeEEeecccccCCCccceeeEEEe Q lcl|NC_018838. 1 MADDFL--SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) Q Consensus 1 m~~~~~--s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~~~~v~l~ 78 (315) +++.-- ++-.+.+|.-+...|-..+.++.++.+...+...+.--++....+. ..|.-.-.|..+.+...+|..-++. T Consensus 117 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s~-~~Aq~HkdGqTK~eqa~~~~~~Tl~ 195 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA-NEAQVHKDGQTKTEQAATLTIDTLE 195 (400) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhhh-hhhhhhccCCccccceeeeeeechh Confidence 333322 4456778998888888889999999888777655432222222222 3666667788888777777766666 Q ss_pred eEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHH-HHHHHhhhccccccccc---cccccccccccccccccccc Q lcl|NC_018838. 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIG-RAVDLIAFHGIDPATGK---PAAAVKVSLDKTTKTVDATD 154 (315) Q Consensus 79 ~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~-~~~d~a~~~G~g~~~~~---~~~~~~~~~~~~~~~~~~~~ 154 (315) +--++....+ .|+.++..-. -..|-.+|..++++++- +..|.|+.-|+|.-+-. ..+-+-.-...+++.-.++. T Consensus 196 ~~~VY~~~S~-Ae~~K~~~~s-Ysel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagk 273 (400) T protein:vir:93 196 PVMVYKLQSL-AERVKRLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGK 273 (400) T ss_pred HHHHHHHHHH-HHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCC Confidence 6433333333 3444443322 34477889999999888 78899999998742210 00011111122333333444 Q ss_pred chhHH-HHHHHHHhhhcccccceEEEEeH-HHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc Q lcl|NC_018838. 155 SATTD-LVKAVGLIAGAGLQVPNGVALDP-AFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) Q Consensus 155 ~~~~d-i~~~~~~~~~~~~~~~~~~~m~~-~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 232 (315) .+++| +..++..+.+..+.+ -.++.. .-.+.|..|+.+.... ...+-++.. .-..=.|..-+ +-- T Consensus 274 tpfadaieeavdfvrptagrr--ylivktedrkalldelrqatana--hvrikndda--eiasevgvdei----ivy--- 340 (400) T protein:vir:93 274 TPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKALLDELRQATANA--HVRIKNDDA--EIASEVGVDEI----IVY--- 340 (400) T ss_pred CchhHHHHHHHhhhccCCCce--EEEEeccchHHHHHHHHhhcccc--ceEeecchh--hhhhhcCccee----eee--- Confidence 44443 445555554332211 123333 2344455555443321 001111100 00011121111 000 Q ss_pred ccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEe Q lcl|NC_018838. 233 MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVK 302 (315) Q Consensus 233 ~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~ 302 (315) ++...-.+-++-| ..|+|.+..=..++..++- -.+||+.+.....--....+..|+..+. T Consensus 341 tgskalkptvlvd-qkyhidmqdltkvdafewk---------tnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 341 TGSKALKPTVLVD-QKYHIDMQDLTKVDAFEWK---------TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eccccccceeeec-cccccchhhhhhhhhheec---------cCCceEEEeecccCcceeeccceeEeeC Confidence 0000011122233 2333332211111111111 1134444433222222223344433332 No 221 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=69.40 E-value=0.22 Score=24.15 Aligned_cols=290 Identities=11% Similarity=-0.019 Sum_probs=108.4 Q ss_pred cCCCceEcchhHHHHHHHHHHhccchhhhcceeecCCCceEEEEEeCCcee----EEeecccccCCCccceeeEEEeeEE Q lcl|NC_018838. 6 LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRA----KIVGEGEVKPSASVDVSAFTAQPIK 81 (315) Q Consensus 6 ~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a----~wv~Eg~~~~~s~~~~~~v~l~~~k 81 (315) .+.+-+++-+.+.+--+.--.+..+-..+++++|++.-..+|+.+...... .-++-+.+...-+++..+.+...+. T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~~ 80 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTED 80 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeecc Confidence 344445555555442232222223335678899999888888887432111 1123333322223333333333333 Q ss_pred EEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccc-ccccccccccccccccccccccccccchhHHH Q lcl|NC_018838. 82 VVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-DPATGKPAAAVKVSLDKTTKTVDATDSATTDL 160 (315) Q Consensus 82 l~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~di 160 (315) -+-..+|..+-..+... .-++++...+.+...|....+..+-.-. ++.+ .+++..-..+.+.+-.+.......+| T Consensus 81 ~~L~~~i~~~~~~~a~~--~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~--y~~~~k~~Lsgt~~wsd~~SDPi~~i 156 (309) T protein:vir:99 81 HGLDAPVPQADIDNAPT--NYNPLGHATEQTTNLILLDREARTSKLVFSPNS--YAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) T ss_pred cceeecCCchhhhhccC--CCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhh--cCCCceEEecCccccCCCCCCcHHHH Confidence 33444555554433211 0113344444444444333332211110 1111 11111111111111112223344566 Q ss_pred HHHHHHhhhcccccceEEEEeHHHHHHHHH---Hhhc-cCccccccccccccccCCCccccce-eeEeecccCccccccc Q lcl|NC_018838. 161 VKAVGLIAGAGLQVPNGVALDPAFSFALST---EVYP-KGSPLAGQPMYPAAGFAGLDNWRGL-NVGASSTVSGAPEMSP 235 (315) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~---l~d~-~g~~~~~~~~~~~~~~~~~~~l~G~-Pv~~s~~v~~~~~~~~ 235 (315) .+...++ +..+|..+|..+++.+|+. +... +++......+-++ .-..|+|+ .|++-...-.....+. T Consensus 157 ~~~~~~~----g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~----~la~l~~ve~V~vg~a~~n~a~~g~ 228 (309) T protein:vir:99 157 TDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQ 228 (309) T ss_pred HHHHHhh----CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHH----HHHHHhCcceEEeecceeecccccc Confidence 6666554 4678999999999887764 2211 2221111111111 12346666 4655433311110000 Q ss_pred cccceEEEecccceEEEeeccceEEEeccCCc-------cccch--hhhhcCcEEEEEEEEeccEeecccceEEEeeccC Q lcl|NC_018838. 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP-------DQTGR--DLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) Q Consensus 236 ~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~-------~~~~~--~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a 306 (315) +..-.-+.|+..-+.+-....-.++--.+++. .+... ++=+.+.-.+|+..++.=.++-+++=..|+.+.+ T Consensus 229 ~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred ccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhccc Confidence 00000011221111111111101110000000 00000 0111222335555555544455555445555554 Q ss_pred C Q lcl|NC_018838. 307 P 307 (315) Q Consensus 307 ~ 307 (315) - T Consensus 309 ~ 309 (309) T protein:vir:99 309 A 309 (309) T ss_pred C Confidence 4 No 222 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=53.94 E-value=0.52 Score=22.16 Aligned_cols=303 Identities=12% Similarity=0.070 Sum_probs=125.9 Q ss_pred CCCC------ccCCCceEcchhHHHHHHHHHHhccch--hhhcceeecCCCceEEEEEe---CCceeEEeecccccCCCc Q lcl|NC_018838. 1 MADD------FLSAGKLELPGSMIGAVRDRAIDSGVL--AKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~~------~~s~Gg~~vP~~~~~~ii~~~~~~s~i--~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~ 69 (315) |..+ +-.++|.+=-+.+..+|..+......+ .+--.+.+..+.--++-... +...+.+++|+..++.++ T Consensus 26 ~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (462) T protein:vir:96 26 YQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSD 105 (462) T ss_pred HhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 3322 222233333344555554443332221 11112333333322333222 225678999999999999 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc------cccccccccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------KPAAAVKVSL 143 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~------~~~~~~~~~~ 143 (315) +.+.+.....|-++..-.+|...=+.+. .....+...++-...+++.++.++|+|+...+. ...-|+...+ T Consensus 106 ~~~~R~~~~~k~l~~t~~vsi~~tl~n~---~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI 182 (462) T protein:vir:96 106 PNIRQKTVEMKYVSDTKNLSIASTLVNN---IQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLI 182 (462) T ss_pred CceEEEEEEEEEEeeeeeechhhhhccc---hhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhc Confidence 9999999999999877666654433222 222346667777778899999999999854332 2222332222 Q ss_pred ccccccccccc-chhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhh--------ccCc-cccccccccccc---- Q lcl|NC_018838. 144 DKTTKTVDATD-SATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVY--------PKGS-PLAGQPMYPAAG---- 209 (315) Q Consensus 144 ~~~~~~~~~~~-~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d--------~~g~-~~~~~~~~~~~~---- 209 (315) ...+.+++-. ....+++...+-....++..++-.+|+..+...|..-.. .+++ ...|.++.--.. T Consensus 183 -~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~ 261 (462) T protein:vir:96 183 -DKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGF 261 (462) T ss_pred -CCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeee Confidence 1223333332 222344443333334556667778999999888874321 1111 111111000000 Q ss_pred -cCCCccccceeeEeecccCccccc-----cccc---cceEEEeccc---ceEEEeeccceEEEeccCCc---------- Q lcl|NC_018838. 210 -FAGLDNWRGLNVGASSTVSGAPEM-----SPAS---GVKAIVGDFS---RVHWGFQRNFPIELIEYGDP---------- 267 (315) Q Consensus 210 -~~~~~~l~G~Pv~~s~~v~~~~~~-----~~~~---~~~~~~gDf~---~~~i~~~~~~~v~~~~~~~~---------- 267 (315) .-.++++++.|-+....+...+.. ..++ .....++|-. .|.|.. ..++.++.+ T Consensus 262 I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V-----~avs~dgeS~PS~~VtaTv 336 (462) T protein:vir:96 262 IKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGLFTDEHDRAELTYKV-----VVNSDDAQSAPSEAVTATV 336 (462) T ss_pred eeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCCCCCccCceeEEEEE-----EEECCCCccccceeeEeee Confidence 111234444555443333211111 1111 0011223321 111111 001111100 Q ss_pred --cccchhh------hhcCcEEEEEEEEeccEeecccceEEEeeccCCCC--------------CCCCCC Q lcl|NC_018838. 268 --DQTGRDL------KGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP--------------NPPAGN 315 (315) Q Consensus 268 --~~~~~~~------f~~~~v~~r~~~r~~~~v~~~~af~~l~~~~a~~~--------------~~~~~~ 315 (315) ..-++.+ -+.....+....|-+ .....|-.++..+..+. .|.-.| T Consensus 337 a~~~~gv~ltIt~~a~~~~~~~~~~IYRk~---~~sg~y~li~rv~~~~~n~~gt~tf~D~n~~iPgt~~ 403 (462) T protein:vir:96 337 NNATDGVKLEISVNAMYQQQPQFVSIYRQG---RKTGDFYLIKRLGMKEVNDEGKLVFYDLNETIPETTD 403 (462) T ss_pred ecccccceEEEEEcCCccccceEEEEEeec---CCccccceeeeeeceeecCCcceeEeeccCCCCCccc Confidence 0000000 000001111112221 13334444433321111 111111 No 223 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=49.95 E-value=0.62 Score=21.71 Aligned_cols=282 Identities=11% Similarity=0.033 Sum_probs=117.3 Q ss_pred CCC------CccCCCceEcchhHHHHHHHHHHhccch--hhhcceeecCCCceEEEEEe---CCceeEEeecccccCCCc Q lcl|NC_018838. 1 MAD------DFLSAGKLELPGSMIGAVRDRAIDSGVL--AKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~------~~~s~Gg~~vP~~~~~~ii~~~~~~s~i--~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~ 69 (315) |.. .+-.+|+.+=-+.+..+|..+......+ .+--.+.+..+.--++-... +...+.+++|+..++.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 333 2222334443334445444433322221 11112333333322333222 235678999999999999 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc------cccccccccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------KPAAAVKVSL 143 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~------~~~~~~~~~~ 143 (315) +.+.......|-++..-.+|.-+=+.. ..........++-.-.+++.++.++|+|+...+. ...-|+.+.+ T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~n---~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lI 182 (463) T protein:vir:99 106 PNIRQKTVSMKYVSDTKNMSIASGLVN---NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLI 182 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhhc---ccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhc Confidence 999999999998887766665433322 2334456667777788999999999999853322 1223333333 Q ss_pred cccccccccc-cchhHHHHH-HHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceee Q lcl|NC_018838. 144 DKTTKTVDAT-DSATTDLVK-AVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNV 221 (315) Q Consensus 144 ~~~~~~~~~~-~~~~~di~~-~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv 221 (315) .. .++.++- .....+++. +...+ ..++..++-++|+..+...|..-....-| .+-..+.+ +...|+|| T Consensus 183 d~-enviDarG~~Ls~~~ln~Aa~~i-~~~fGt~TD~~lp~~vka~f~~~~l~~qr------v~~~~N~~--~~~~G~~v 252 (463) T protein:vir:99 183 DK-NNVINAKGNQLTEKHLNEAAVRI-GKGFGTATDAYMPIGVHADFVNSILGRQM------QLMQDNSG--NVNTGYSV 252 (463) T ss_pred CC-CCeeecCCCcccHHHHhhhhhhh-hcccCChhheecchHHHHHHHHHhcCceE------EEEcCCCC--ceeeeeec Confidence 22 2333331 222233333 33333 44566677799999999988744322111 11000000 11223333 Q ss_pred ----------------------EeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCc Q lcl|NC_018838. 222 ----------------------GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNE 279 (315) Q Consensus 222 ----------------------~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~ 279 (315) +........++ +| ....+...-.++.++..-+--.... T Consensus 253 ~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~------------ap--------~~~~~tatv~~~~~~~~~~~~~~a~ 312 (463) T protein:vir:99 253 NGFYSSRGFIKLHGSTVMENELILDESLQPLPN------------AP--------QPAKVTATVETKQKGAFENEEDRAG 312 (463) T ss_pred cceeeeeeeeeeCCceecCCcccccchhhcCCC------------Cc--------cCceeEEEEeeccCCCCCCcccccc Confidence 22111110000 00 0000000000000000000001111 Q ss_pred EEEEEEEEeccEeecccceEEEeecc-----CCCCCCCCC------------------C Q lcl|NC_018838. 280 VMVRAEAVLYVAIESLDSFAVVKEKA-----APKPNPPAG------------------N 315 (315) Q Consensus 280 v~~r~~~r~~~~v~~~~af~~l~~~~-----a~~~~~~~~------------------~ 315 (315) ..|++...-+.+=-.|..++-.+.+. .=.-++|+. + T Consensus 313 ~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~ 371 (463) T protein:vir:99 313 LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGM 371 (463) T ss_pred eEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCc Confidence 12222222222111122221111100 000111111 1 No 224 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=49.95 E-value=0.62 Score=21.71 Aligned_cols=282 Identities=11% Similarity=0.033 Sum_probs=117.3 Q ss_pred CCC------CccCCCceEcchhHHHHHHHHHHhccch--hhhcceeecCCCceEEEEEe---CCceeEEeecccccCCCc Q lcl|NC_018838. 1 MAD------DFLSAGKLELPGSMIGAVRDRAIDSGVL--AKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSAS 69 (315) Q Consensus 1 m~~------~~~s~Gg~~vP~~~~~~ii~~~~~~s~i--~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~ 69 (315) |.. .+-.+|+.+=-+.+..+|..+......+ .+--.+.+..+.--++-... +...+.+++|+..++.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 333 2222334443334445444433322221 11112333333322333222 235678999999999999 Q ss_pred cceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc------cccccccccc Q lcl|NC_018838. 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------KPAAAVKVSL 143 (315) Q Consensus 70 ~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~------~~~~~~~~~~ 143 (315) +.+.......|-++..-.+|.-+=+.. ..........++-.-.+++.++.++|+|+...+. ...-|+.+.+ T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~n---~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lI 182 (463) T protein:vir:95 106 PNIRQKTVSMKYVSDTKNMSIASGLVN---NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLI 182 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhhc---ccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhc Confidence 999999999998887766665433322 2334456667777788999999999999853322 1223333333 Q ss_pred cccccccccc-cchhHHHHH-HHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceee Q lcl|NC_018838. 144 DKTTKTVDAT-DSATTDLVK-AVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNV 221 (315) Q Consensus 144 ~~~~~~~~~~-~~~~~di~~-~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv 221 (315) .. .++.++- .....+++. +...+ ..++..++-++|+..+...|..-....-| .+-..+.+ +...|+|| T Consensus 183 d~-enviDarG~~Ls~~~ln~Aa~~i-~~~fGt~TD~~lp~~vka~f~~~~l~~qr------v~~~~N~~--~~~~G~~v 252 (463) T protein:vir:95 183 DK-NNVINAKGNQLTEKHLNEAAVRI-GKGFGTATDAYMPIGVHADFVNSILGRQM------QLMQDNSG--NVNTGYSV 252 (463) T ss_pred CC-CCeeecCCCcccHHHHhhhhhhh-hcccCChhheecchHHHHHHHHHhcCceE------EEEcCCCC--ceeeeeec Confidence 22 2333331 222233333 33333 44566677799999999988744322111 11000000 11223333 Q ss_pred ----------------------EeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCc Q lcl|NC_018838. 222 ----------------------GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNE 279 (315) Q Consensus 222 ----------------------~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~ 279 (315) +........++ +| ....+...-.++.++..-+--.... T Consensus 253 ~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~------------ap--------~~~~~tatv~~~~~~~~~~~~~~a~ 312 (463) T protein:vir:95 253 NGFYSSRGFIKLHGSTVMENELILDESLQPLPN------------AP--------QPAKVTATVETKQKGAFENEEDRAG 312 (463) T ss_pred cceeeeeeeeeeCCceecCCcccccchhhcCCC------------Cc--------cCceeEEEEeeccCCCCCCcccccc Confidence 22111110000 00 0000000000000000000001111 Q ss_pred EEEEEEEEeccEeecccceEEEeecc-----CCCCCCCCC------------------C Q lcl|NC_018838. 280 VMVRAEAVLYVAIESLDSFAVVKEKA-----APKPNPPAG------------------N 315 (315) Q Consensus 280 v~~r~~~r~~~~v~~~~af~~l~~~~-----a~~~~~~~~------------------~ 315 (315) ..|++...-+.+=-.|..++-.+.+. .=.-++|+. + T Consensus 313 ~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~ 371 (463) T protein:vir:95 313 LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGM 371 (463) T ss_pred eEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCc Confidence 12222222222111122221111100 000111111 1 No 225 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=48.08 E-value=0.68 Score=21.50 Aligned_cols=284 Identities=11% Similarity=0.017 Sum_probs=110.6 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhh---c-ceeecCCCceEEEEEeCCceeEEe-ecccccCCC--cccee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKL---S-PEQPTIFGPVKGAVFSGVPRAKIV-GEGEVKPSA--SVDVS 73 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l---~-~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s--~~~~~ 73 (315) ||++ +-..+.+++.+-+.+...+.-..| . .+.-.++.+++||+..-+.-..+- ..+...... +.++. T Consensus 1 Mant------l~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~e 74 (312) T protein:vir:10 1 MANT------LAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYE 74 (312) T ss_pred CCcc------hhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccce Confidence 8865 333466777666666554432222 1 233456778999997644322221 111111111 22333 Q ss_pred eEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccccccccccccccc Q lcl|NC_018838. 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPAAAVKVSLDKTTKTVDAT 153 (315) Q Consensus 74 ~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 153 (315) ..+|.-.+--.+. |- -+.-.+.+....+...+.+-......=.+|.=.+.-.- ......+....... ...-.. T Consensus 75 t~tl~qDR~~~F~-vD--~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla--~~a~~~~~~~~~~~--~~~~T~ 147 (312) T protein:vir:10 75 TKTMTQDRGRKFT-LD--AMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLA--TIAIGIKGDTNVEY--SYSVNS 147 (312) T ss_pred eEEeeecccceee-cc--ccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHH--hhhhcccccccccc--ccccCH Confidence 4444333221110 10 00000111111122222222222333334442221100 00000000000000 000012 Q ss_pred cchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeecccCcccc- Q lcl|NC_018838. 154 DSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE- 232 (315) Q Consensus 154 ~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~- 232 (315) ...++.+++++..+.+++...+....|+|.....|++. ..+.....-.-.....+..++|.|.||+. +|+.-. T Consensus 148 ~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~---~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~---VPs~r~~ 221 (312) T protein:vir:10 148 STIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEK---VLEKLTAVTFAQGGIQTQVPSIDGCALIK---TPQNRMY 221 (312) T ss_pred HHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhh---hhceecccccccceeeeeeeeecccEEEE---chhhhcc Confidence 34577888888888776554334467888877666642 11111111111223345557899999983 342211 Q ss_pred --------------------ccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEe Q lcl|NC_018838. 233 --------------------MSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI 292 (315) Q Consensus 233 --------------------~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v 292 (315) ...+.+.-+++.. ....+...+.-.+++.+-... . +.|...+.-..++|.=| T Consensus 222 t~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~-~~a~i~~~K~~~~~if~P~~~-~------~~d~~~~~~R~Y~D~fv 293 (312) T protein:vir:10 222 SSILLNDGTTSNQTAGGYLKGTKALDTNFIIAP-VDVPLAITKQDKMRIFDPETN-Q------TANAWSMDYRRYHDLWV 293 (312) T ss_pred ceeeeccCcccccccCceeecCcccccceEEeC-CceeeceeeeeeeeeeCCCCC-C------Ccceeeeeeeeeeeeee Confidence 1111122223332 334455555545555432211 1 11112233334455444 Q ss_pred ecccceEEEeeccCCCCCCCCC Q lcl|NC_018838. 293 ESLDSFAVVKEKAAPKPNPPAG 314 (315) Q Consensus 293 ~~~~af~~l~~~~a~~~~~~~~ 314 (315) .+-+.=.. -+.-+...|.| T Consensus 294 ~~nk~~~I---yv~~k~a~~~~ 312 (312) T protein:vir:10 294 TDNKANSV---YANFKDAKPVG 312 (312) T ss_pred eccccCeE---EEEeecccCCC Confidence 33221111 12223344555 No 226 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=37.99 E-value=1.1 Score=20.38 Aligned_cols=274 Identities=13% Similarity=0.014 Sum_probs=121.2 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhc--ceeecCCCceEEEEEeCCceeEEe-ecccccCCCccceeeEEE Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLS--PEQPTIFGPVKGAVFSGVPRAKIV-GEGEVKPSASVDVSAFTA 77 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~--~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~~~~s~~~~~~v~l 77 (315) |+.... +.++..+.+.++..+....+. ++.-.++.+++||+.....-..+- ..|-....-+.++...+| T Consensus 1 Main~a--------~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl 72 (290) T protein:vir:78 1 MAINYV--------DKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTI 72 (290) T ss_pred CchhHH--------HHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEe Confidence 776542 467888888887777655553 334456677999997654322221 222222222344455555 Q ss_pred eeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhh---ccccccccccccccccccccccccccccc Q lcl|NC_018838. 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAF---HGIDPATGKPAAAVKVSLDKTTKTVDATD 154 (315) Q Consensus 78 ~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~---~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (315) .-.+--.+. |- -+.-.+......+...+.+..+..+.-.+|.-.+ .+.. ++ .+... . ... ... T Consensus 73 ~qdR~~~F~-vD--~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a-~~----~~~~~----~-~t~-t~~ 138 (290) T protein:vir:78 73 DFDRDVEFF-VD--VMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAA-KT----NSNSV----A-EEI-TKD 138 (290) T ss_pred eccccceee-cc--ccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhh-hc----cCccc----c-ccc-CHH Confidence 554432221 10 0000011122223444444455555555664322 2211 00 00000 0 000 123 Q ss_pred chhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec---cc---- Q lcl|NC_018838. 155 SATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS---TV---- 227 (315) Q Consensus 155 ~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~---~v---- 227 (315) ..|+.+++++..+.+. .......+++|.....|.+...-. +.++....-.....+..++|.|.+|+... .| T Consensus 139 n~~~~i~~~~~~ldev-p~~~rvl~vtp~~~~lL~~~~~f~-r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~ 216 (290) T protein:vir:78 139 NVFTKLKAAIRKVKKY-GTQNLVMYVSPDVMAALELSDDFV-RAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTF 216 (290) T ss_pred HHHHHHHHHHHHHHhc-CCCCeEEEECHHHHHHHhhChhhh-ccccccccccccccceeeeecCcEEEEecccchhhhhh Confidence 4577788888777542 222223568898888775432111 11111111112235566789999998421 11 Q ss_pred ---CccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Q lcl|NC_018838. 228 ---SGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) Q Consensus 228 ---~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~~ 304 (315) .+......+.+.-+++... ...+...+.-.+++.+-.. .+ +-+...+.-..++|.=|.+-+.=...... T Consensus 217 ~f~~G~~~~~~ak~in~ii~~~-~a~i~~~K~~~~~~~~P~~-~~------~~d~~~~~~r~y~d~~v~~nk~~~i~~~~ 288 (290) T protein:vir:78 217 DFTDGYKPAAGAKKLNFLLVNK-GSVVGGAKHASIYLHAPGS-VG------QGDGWLYQYRVYHDIFVLDQQKDGVIAST 288 (290) T ss_pred hhcccccccCCccceeEEEEcC-CceeeeeeeeEEEeeCCCC-Cc------CcceeeeeeeeeeeeeeeccccCeeEEEe Confidence 0111122233333444443 4455555554555554221 11 11213333445566555544433333333 Q ss_pred cC Q lcl|NC_018838. 305 AA 306 (315) Q Consensus 305 ~a 306 (315) +. T Consensus 289 ~~ 290 (290) T protein:vir:78 289 EV 290 (290) T ss_pred eC Confidence 33 No 227 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=35.28 E-value=1.2 Score=20.07 Aligned_cols=289 Identities=11% Similarity=0.032 Sum_probs=125.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhh-hcc---------eeecC---CCceEEEEEeCCceeEEeecccc--c Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAK-LSP---------EQPTI---FGPVKGAVFSGVPRAKIVGEGEV--K 65 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~-l~~---------~~~~~---~~~~~ip~~~~~~~a~wv~Eg~~--~ 65 (315) |+.|....|.-.....++..+.......|.... +.- ..-.. +..+++..... ....+|-+++. - T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVH-LRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeee-cccCCcccCceeec Confidence 999988888777778888888877766665443 421 00011 11133332221 13344443333 2 Q ss_pred CCCccceeeEEEeeEEEEEeeh----hhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhh-ccccccccc------ Q lcl|NC_018838. 66 PSASVDVSAFTAQPIKVVTQQR----VSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAF-HGIDPATGK------ 134 (315) Q Consensus 66 ~~s~~~~~~v~l~~~kl~~~~~----iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~-~G~g~~~~~------ 134 (315) .+...+|.+-++....+..-+. +|++ -..-.|+..-++.|+.-+.+..|..++ +-.|..+-. T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~q-------Rt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~ 152 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSRK-------RTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIET 152 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhhh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 3556777766666655554444 4433 122346677777777777777887544 322210000 Q ss_pred -cccc--------------cccccccccccccccc-chhHHHHHHHHHhhhcccc-------------cce--EEEEeHH Q lcl|NC_018838. 135 -PAAA--------------VKVSLDKTTKTVDATD-SATTDLVKAVGLIAGAGLQ-------------VPN--GVALDPA 183 (315) Q Consensus 135 -~~~~--------------~~~~~~~~~~~~~~~~-~~~~di~~~~~~~~~~~~~-------------~~~--~~~m~~~ 183 (315) ..++ +..........++.++ ...+-|.++...+...... ... .++|||. T Consensus 153 ~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~ 232 (364) T protein:vir:93 153 PDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEY 232 (364) T ss_pred cCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcch Confidence 0000 0000001111122222 2233344444433211100 001 3678999 Q ss_pred HHHHHHHHhhcc---------CccccccccccccccCCCccccceeeEeecccCccccccccccce----EEEeccc-ce Q lcl|NC_018838. 184 FSFALSTEVYPK---------GSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK----AIVGDFS-RV 249 (315) Q Consensus 184 ~~~~L~~l~d~~---------g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~----~~~gDf~-~~ 249 (315) .+..|+.-++.+ ..-...+|+|. |..+++.|.+|+-...+......+.+..+. +++|--. -+ T Consensus 233 q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~----G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~ 308 (364) T protein:vir:93 233 QATDMRTAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVI 308 (364) T ss_pred hhhhhhhcCCHHHHHHHHHhhhcccccCCcee----cCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEE Confidence 999987544311 11122344553 556789999998766665443333333321 2344311 11 Q ss_pred EEEeeccceEEEeccC----CccccchhhhhcCcEEEEEEEEeccEeecccceEEEee Q lcl|NC_018838. 250 HWGFQRNFPIELIEYG----DPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKE 303 (315) Q Consensus 250 ~i~~~~~~~v~~~~~~----~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~af~~l~~ 303 (315) .+|-.++....-.++. ......+. +--++-..|... -|++|.--+..+++-. T Consensus 309 a~g~~~g~~~~w~Ee~~D~gn~~~i~~~-~i~G~kK~rF~~-~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 309 AYGTANGLRFDWEETVKDYGNEPAIAAG-FIAGMKKARFNN-KDFGVISIDTAAKKHS 364 (364) T ss_pred EeecCCCCCceeeecccCCCCchhhhhh-hHhhhhhcccCC-ccceEEEecccccccC Confidence 2233344444333322 11111100 001111111111 1333322222222221 No 228 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=33.41 E-value=1.4 Score=19.85 Aligned_cols=255 Identities=11% Similarity=0.049 Sum_probs=98.3 Q ss_pred CC------CCccCCCceEcchhHHHHHHHHHHhccc---hhhhcceeecCCCceEEEE---EeCCceeEEeecccccCCC Q lcl|NC_018838. 1 MA------DDFLSAGKLELPGSMIGAVRDRAIDSGV---LAKLSPEQPTIFGPVKGAV---FSGVPRAKIVGEGEVKPSA 68 (315) Q Consensus 1 m~------~~~~s~Gg~~vP~~~~~~ii~~~~~~s~---i~~l~~~~~~~~~~~~ip~---~~~~~~a~wv~Eg~~~~~s 68 (315) |. -++-.+|+.+=-+.+..++..+...... +.++ ...+..+.-..+-. ..+...+.+++|+.-.+.+ T Consensus 45 ~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i-~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~ 123 (514) T protein:vir:10 45 FTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDI-AKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVN 123 (514) T ss_pred hccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhc-CCchhhHHHhhhhhhcccCcccccccccccccCcCC Confidence 11 1111112222111222222221111111 1112 12222222122222 2223466789999999999 Q ss_pred ccceeeEEEeeEEEEEeehhhHHHhcc-ChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc------cccccccc Q lcl|NC_018838. 69 SVDVSAFTAQPIKVVTQQRVSDEFMWA-DADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------KPAAAVKV 141 (315) Q Consensus 69 ~~~~~~v~l~~~kl~~~~~iS~ell~~-~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~------~~~~~~~~ 141 (315) ++.+....+..|-++....+|.-+=.. +..| ......++-.-.+++.++.++++|+...+. ...-|+.+ T Consensus 124 d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d----~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~ 199 (514) T protein:vir:10 124 NPNERQRTINIKYIVDTHVTSIALQRANTIVD----SLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFK 199 (514) T ss_pred CcceEEEEEeeeeeeeeeeeeehhhhccchhh----HHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHH Confidence 999999988888887664444333222 2222 334455666778899999999999753221 22233333 Q ss_pred ccccccccccccc-chhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccc-cccccCCCccccce Q lcl|NC_018838. 142 SLDKTTKTVDATD-SATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGL 219 (315) Q Consensus 142 ~~~~~~~~~~~~~-~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~-~~~~~~~~~~l~G~ 219 (315) .+.. -+.+++-. ....+++...+-+...++..++-++|+..+...+..--... +.++ |.. ..+...|+ T Consensus 200 lI~~-~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~------qRV~~~~n---~~~~~~G~ 269 (514) T protein:vir:10 200 LIAP-ENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNG------QRVMLPGQ---TGGMTTGL 269 (514) T ss_pred hhcC-CCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCc------ceEEeecC---ccceeeee Confidence 3322 23333322 22234444333333444555666778777777653322111 1110 000 00112233 Q ss_pred eeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEee-cccce Q lcl|NC_018838. 220 NVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIE-SLDSF 298 (315) Q Consensus 220 Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~-~~~af 298 (315) |+- + ++..++.+.+.-+ +.++-+.++++... .|.| T Consensus 270 ~v~-------------------------~-f~s~~G~I~L~gs-----------------~im~~~n~L~~~~~~~~~A- 305 (514) T protein:vir:10 270 DID-------------------------K-FLSAHGSIRIQGS-----------------TIMDSDNKLDFDRPVSPTA- 305 (514) T ss_pred ecc-------------------------c-eeEeccceeecCC-----------------eeecccccCccCCccCCcC- Confidence 321 0 0011111111100 11111111221111 0000 Q ss_pred EEEeeccCCCCCCC----------------------CCC Q lcl|NC_018838. 299 AVVKEKAAPKPNPP----------------------AGN 315 (315) Q Consensus 299 ~~l~~~~a~~~~~~----------------------~~~ 315 (315) ..-...+..+||- +|. T Consensus 306 -p~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~ 343 (514) T protein:vir:10 306 -PTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGV 343 (514) T ss_pred -CCCCcceEEEecCcccccCcccccccccccccccccce Confidence 0000001111111 111 No 229 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=31.88 E-value=1.5 Score=19.67 Aligned_cols=267 Identities=11% Similarity=0.046 Sum_probs=103.3 Q ss_pred CCC---------------CccCCCceEcchhHHHHHHHHHHhccc---hhhhcceeecCCCceEEEEEe---CCceeEEe Q lcl|NC_018838. 1 MAD---------------DFLSAGKLELPGSMIGAVRDRAIDSGV---LAKLSPEQPTIFGPVKGAVFS---GVPRAKIV 59 (315) Q Consensus 1 m~~---------------~~~s~Gg~~vP~~~~~~ii~~~~~~s~---i~~l~~~~~~~~~~~~ip~~~---~~~~a~wv 59 (315) |.- ..+..|+.+=-+.+..++..+...... +.++ ...+..+.-.++-... +......+ T Consensus 1 ~~~~~~~~~~~a~~~al~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i-~k~~a~STV~ey~~~~~rhG~~g~s~~ 79 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNAAGQVAESLEREDLEPEVTQLNVLDTPLTDLLSK-NAVKAKAYEHEYNVVTARHDKIGYAAF 79 (470) T ss_pred CChhHhhhhhHHHHHHHHHhhhcchhhhhhhhccceeEeeecCccchhhhhc-CCchhhhHhhhhhhhccccccccceee Confidence 100 011111111001111111111000000 0111 1112221112222211 22233356 Q ss_pred ecccccCCCccceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc------ Q lcl|NC_018838. 60 GEGEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------ 133 (315) Q Consensus 60 ~Eg~~~~~s~~~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~------ 133 (315) .|+.-.+.+++.+.+.....|-++....+|.-.++-. .+.+..+...+.++---.+++.++.++|+|+...+. T Consensus 80 ~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~-~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~ 158 (470) T protein:vir:10 80 REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTT-QNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSP 158 (470) T ss_pred cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhh-hccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCccc Confidence 8999999999999999999999998888886532111 112223455555666677899999999999753321 Q ss_pred --cccccccccccc--ccccccccc--chhHHHHHHHHHhh-hcccccceEEEEeHHHHHHHHHHhhccCcccccccccc Q lcl|NC_018838. 134 --KPAAAVKVSLDK--TTKTVDATD--SATTDLVKAVGLIA-GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYP 206 (315) Q Consensus 134 --~~~~~~~~~~~~--~~~~~~~~~--~~~~di~~~~~~~~-~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~ 206 (315) ...-|+.+.+.. ..+..++-. .....|..+...+. ..++..++-++|+..+...|..-....-| ++- T Consensus 159 ~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qR------v~~ 232 (470) T protein:vir:10 159 NNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISR------VMT 232 (470) T ss_pred CceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceE------EEE Confidence 122222222221 123333321 11223333433443 24555667788999988888655443222 110 Q ss_pred ccccCCCccccceeeEeecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEE Q lcl|NC_018838. 207 AAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEA 286 (315) Q Consensus 207 ~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~ 286 (315) .. .++ ....|+||- -|.+- ++.+.+.-+.-.... +.+ . .. T Consensus 233 ~~-N~~-~~~~G~~v~------------------~f~sa--------~G~I~L~~s~~m~~~----~k~----~----p~ 272 (470) T protein:vir:10 233 TA-DRR-AGLLGADAQ------------------SYIGV--------RGEHSLYPSQFLGDF----HKF----N----PA 272 (470) T ss_pred ec-CCC-ceeeeeecc------------------ceeee--------eeeeeecccccccch----hhc----C----cc Confidence 00 011 123566652 11110 111111101000000 000 0 01 Q ss_pred EeccEe---ecccceEEEeec---cCCCCCCCCCC Q lcl|NC_018838. 287 VLYVAI---ESLDSFAVVKEK---AAPKPNPPAGN 315 (315) Q Consensus 287 r~~~~v---~~~~af~~l~~~---~a~~~~~~~~~ 315 (315) +++-.+ .-|..++-+... ++..+.--+|+ T Consensus 273 ~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~ 307 (470) T protein:vir:10 273 RFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGD 307 (470) T ss_pred cCCcccCCcccCceeEEeecCCCceeecccCCCCc Confidence 122111 112211111110 00000011111 No 230 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=29.16 E-value=1.7 Score=19.34 Aligned_cols=265 Identities=11% Similarity=0.041 Sum_probs=107.1 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccchhhhc------ceeecCCCceEEEEEeC--CceeEEeecccccCCCccce Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLS------PEQPTIFGPVKGAVFSG--VPRAKIVGEGEVKPSASVDV 72 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i~~l~------~~~~~~~~~~~ip~~~~--~~~a~wv~Eg~~~~~s~~~~ 72 (315) |+..- -+.+...+.+..+..+....+. .+...++++++||+... +...+=-.-|-....-+.++ T Consensus 1 Main~--------~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~ 72 (285) T protein:vir:79 1 MTVVL--------DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGK 72 (285) T ss_pred Ccchh--------hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceee Confidence 66542 2456777888887776665552 24456677899999742 22222222221111222233 Q ss_pred eeEEEeeEEEEEeehhhHHHhccC--h-hhhHHHHHHHHHHHHHH-HHHHHHHHhh---hcccccccccccccccccccc Q lcl|NC_018838. 73 SAFTAQPIKVVTQQRVSDEFMWAD--A-DYRLGVLQDLISPALGA-SIGRAVDLIA---FHGIDPATGKPAAAVKVSLDK 145 (315) Q Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~~--~-~d~~~~l~~~i~~~la~-~i~~~~d~a~---~~G~g~~~~~~~~~~~~~~~~ 145 (315) ...+|.-.+--. ++-|. . ++....+ ..|..++.+ ...-.+|.=. +++.. +. . T Consensus 73 et~tl~~DR~~~-------f~iD~mDvdEn~~~~~-~ni~~ef~~~~vvPEiDayrfskla~~a---~~----------~ 131 (285) T protein:vir:79 73 ETVKLTHEDWFG-------YDLDQFDMDENGAYTV-ENVVREHNKMITIPHRDKVAVQKLFDSA---AK----------K 131 (285) T ss_pred eEEEeeccccce-------ecccccchhhhhhhhH-HHHHHHHHhhhhcchhhHHHHHHHHhhc---cc----------c Confidence 333333322111 11110 0 0111111 112222222 2222334211 22110 00 0 Q ss_pred cccccccccchhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccc-eeeEe- Q lcl|NC_018838. 146 TTKTVDATDSATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRG-LNVGA- 223 (315) Q Consensus 146 ~~~~~~~~~~~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G-~Pv~~- 223 (315) .+... .....+..+++++..+.+++...+...+|+|.....|.+-+.-.......+........+..+.|.| .|++. T Consensus 132 ~~~~~-T~~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~V 210 (285) T protein:vir:79 132 ATDSI-TKDNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRV 210 (285) T ss_pred ccccc-CHHHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEc Confidence 00111 1234577888888888776554333456889888877654432211101111101112334568998 89984 Q ss_pred -ecccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeec--ccceEE Q lcl|NC_018838. 224 -SSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIES--LDSFAV 300 (315) Q Consensus 224 -s~~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~--~~af~~ 300 (315) ++.|.. ...+.+.-+++... ...+...+.-.+.+.+-... . .-|...+.-..++|.=|.+ .+++-. T Consensus 211 ps~r~kt---~~~~k~Infiiv~~-~a~i~~~K~~~~~~f~P~~~-~------~~d~~~~~~R~Y~d~fv~~nk~~~Iy~ 279 (285) T protein:vir:79 211 SSDRLKG---LGITNHVNFILTPL-SAIAPIVKYDSVSVIDPSTD-R------SGNRWTIKGLSYYDAIVLDNAKKGIYV 279 (285) T ss_pred chhhccC---cCcchhccEEEecC-ceeccceeeeeeEeECCCCC-C------Ccceeeeeeeeeeeeeehhhccceeee Confidence 233321 11112222333332 24454444444444432211 1 1111222233444443432 333333 Q ss_pred EeeccC Q lcl|NC_018838. 301 VKEKAA 306 (315) Q Consensus 301 l~~~~a 306 (315) ...++- T Consensus 280 ~~~a~~ 285 (285) T protein:vir:79 280 AATAGV 285 (285) T ss_pred eecccC Confidence 333332 No 231 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=28.57 E-value=1.7 Score=19.27 Aligned_cols=283 Identities=11% Similarity=-0.009 Sum_probs=111.4 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccch-hh------hcceeecCCCceEEEEEeC--CceeEEeeccccc-CCCcc Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVL-AK------LSPEQPTIFGPVKGAVFSG--VPRAKIVGEGEVK-PSASV 70 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i-~~------l~~~~~~~~~~~~ip~~~~--~~~a~wv~Eg~~~-~~s~~ 70 (315) |+-.-. +.+++.+.+.+...+.- .. ..++...++.+++||+.+. +...+--.-|-.. ..-+. T Consensus 1 Mainya--------~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~ 72 (346) T protein:vir:10 1 MTINYA--------EKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSN 72 (346) T ss_pred CcchhH--------HHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccc Confidence 766542 45667676666654322 11 2234456677899999862 2221111111111 11133 Q ss_pred ceeeEEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhh---hcccccccccccccccccccccc Q lcl|NC_018838. 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIA---FHGIDPATGKPAAAVKVSLDKTT 147 (315) Q Consensus 71 ~~~~v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~---~~G~g~~~~~~~~~~~~~~~~~~ 147 (315) ++...+|.-.+--.+. |- -+.-.+.+....+...+.+-......-.+|.=. ++... ++.. + ... .. T Consensus 73 ~~et~tl~qDR~~~F~-vD--~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a--~~~~--~-~~~--~~- 141 (346) T protein:vir:10 73 DWDSYELKNERYWSTL-VD--PSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGK--EAAH--D-GGI--TT- 141 (346) T ss_pred ceeEEEeeccccceec-cc--ccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhh--hhhc--c-ccc--cc- Confidence 3444444443321110 11 000000011111222222222222222334321 11100 0000 0 000 00 Q ss_pred cccccccchhHHHHHHHHHhhhcccccce-EEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEe--e Q lcl|NC_018838. 148 KTVDATDSATTDLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGA--S 224 (315) Q Consensus 148 ~~~~~~~~~~~di~~~~~~~~~~~~~~~~-~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~--s 224 (315) ... .....|+.+++++..+.++.....+ ..+|+|.....|.+-..-. +.+... -.....+..++|.|.||+. + T Consensus 142 ~a~-T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~-k~~~v~--~~~~i~~~V~siDGv~Ii~VPs 217 (346) T protein:vir:10 142 NTL-DEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMN-RALTLK--DPNNIQRTVYSLDDVTIRVVPS 217 (346) T ss_pred ccc-CHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhhe-eccccc--cccccceeeeeecCeEEEEcch Confidence 001 1234577888888888766654333 3568888888765433211 111111 0122355667899999984 3 Q ss_pred cccC-------ccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEeeccc- Q lcl|NC_018838. 225 STVS-------GAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLD- 296 (315) Q Consensus 225 ~~v~-------~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~- 296 (315) +.|. +......+.+..+++.. ....+...+.-.+++..-. ... .|...+.-..++|.=|.+-+ T Consensus 218 ~r~~t~~~f~~G~~~~t~ak~INfiiv~-~~A~ia~~K~~~~~if~P~-~~~-------~g~~l~~~R~Y~D~fv~~nk~ 288 (346) T protein:vir:10 218 DLMQTAYDFSDGSKIIDTAKQIEMFLIY-NGVQIAPEKYSFVGFDQPS-AAT-------SGNYLYYEQSYDDVLLLNTKT 288 (346) T ss_pred hhcccchhhccCccccCCccceeEEEEC-CceeeeeeeeeeeEeeCCC-CCc-------ccceeeeeeeeeeeeeecccc Confidence 3332 11112222233344443 3455555555556555442 111 12222333344554444322 Q ss_pred -ce-EEEeecc-CCCCCCC------CCC Q lcl|NC_018838. 297 -SF-AVVKEKA-APKPNPP------AGN 315 (315) Q Consensus 297 -af-~~l~~~~-a~~~~~~------~~~ 315 (315) ++ +-++.+. ...-++. +.+ T Consensus 289 ~~Iyv~~~~a~~~~~~~~~~~~kpt~~~ 316 (346) T protein:vir:10 289 KGIQFVVSDKPKKDQEQSGQDAKPTAES 316 (346) T ss_pred ceEEEeeecccccCccCcccccCccccc Confidence 22 1122221 1111111 111 No 232 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=278 Identities=12% Similarity=0.085 Sum_probs=115.5 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccch---hhhcceeecCCCceEEEEEe---CCceeEEeecccccCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVL---AKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i---~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~~~~~~ 74 (315) ..-.+-..|+.+=-+.+..+|..+......+ .++ .+.+..+.--++-... +...+.+++|+..++.+++.+.. T Consensus 31 ~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di-~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r 109 (467) T protein:vir:80 31 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDI-AKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQ 109 (467) T ss_pred cCCccccCcchhhhhhhhhhhheeeccccchhhhhhc-ccchhhhhhhhheeeeccCccccccccccccccccCCCceEE Confidence 2222222334443344555554443333222 111 1222222222232222 23567899999999999999999 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc-------ccccccccccc Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPA-------AAVKVSLDKTT 147 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~-------~~~~~~~~~~~ 147 (315) .....|-++....+|...=+.+. +....+...++-.-.+++.++.++|+|+......+. -|+...... - T Consensus 110 ~~~~~k~l~~~~~vs~~~~l~n~---i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~-e 185 (467) T protein:vir:80 110 KTVNMKFASDTKNISIAAGLVNN---IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-D 185 (467) T ss_pred EEEEeeeeeeeeeehhhhhhhcc---hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC-C Confidence 99999999886666654433322 223346666777778899999999999853321111 122222211 2 Q ss_pred cccccccc--hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 148 KTVDATDS--ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 148 ~~~~~~~~--~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) +..+.-.. .-.++..+...+. .++..++-++|+..+...|.... +..++.++.. .......|+||- . T Consensus 186 nviDa~G~~ls~~~lneaa~~i~-~gfG~~td~~~p~~v~a~~~~~~------L~~q~~v~~~--n~~~~~~G~~v~--g 254 (467) T protein:vir:80 186 NVHDARGASLTESLLNQAAVMIS-KGYGTPTDAYMPVGVQADFVNQQ------LSKQTQLVRD--NGNNVSVGFNIQ--G 254 (467) T ss_pred ceeccCCCccCHHHHHHHhhhcc-ccccChhhhhcchhHHhhhhhhh------cCceEEEEcC--CCCceeeeeccc--c Confidence 33333222 1223333433333 34555666888888877763221 1111222111 112335566662 2 Q ss_pred ccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEe--------ecccc Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI--------ESLDS 297 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v--------~~~~a 297 (315) .+..-+..... ...|++|+... +-++..-.. ...-..+. +..-.++.= ...=+ T Consensus 255 ~~sa~G~I~l~--gs~il~~~~~l--------~~~~~~~~~-------Apsp~~vs--aT~~~~~~g~~~~~~~a~y~Y~ 315 (467) T protein:vir:80 255 FHSARGFIKLH--GSTVMENEQIL--------DERILALPT-------APQPAKVT--ATQEAGKKGQFRAEDLAAHEYK 315 (467) T ss_pred eecceeeeeec--CceeeccccCC--------Ccccccccc-------cccCCccc--eeeecccCCcccCCCcceEEEE Confidence 23222111110 01223332211 110100000 00000010 011001100 01111 Q ss_pred eEEEeeccCCCCCC-CCCC Q lcl|NC_018838. 298 FAVVKEKAAPKPNP-PAGN 315 (315) Q Consensus 298 f~~l~~~~a~~~~~-~~~~ 315 (315) |+.... -. +..| +.-+ T Consensus 316 v~~vs~-~G-ES~pS~~vt 332 (467) T protein:vir:80 316 VVVSSD-DA-ESIASEVAT 332 (467) T ss_pred EEEECC-CC-ccccccceE Confidence 111111 11 1111 1122 No 233 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=25.58 E-value=2 Score=18.89 Aligned_cols=278 Identities=12% Similarity=0.085 Sum_probs=115.3 Q ss_pred CCCCccCCCceEcchhHHHHHHHHHHhccch---hhhcceeecCCCceEEEEEe---CCceeEEeecccccCCCccceee Q lcl|NC_018838. 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVL---AKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSASVDVSA 74 (315) Q Consensus 1 m~~~~~s~Gg~~vP~~~~~~ii~~~~~~s~i---~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~~~~~~ 74 (315) ..-.+-..|+.+=-+.+..+|..+......+ .++ .+.+..+.--++-... +...+.+++|+..++.+++.+.. T Consensus 32 ~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di-~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r 110 (468) T protein:vir:63 32 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDI-AKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQ 110 (468) T ss_pred cCCccccCcchhhhhhhhhhhheeeecccchhhhhhc-ccchhhhhhhhheeeeccCccccccccccccccccCCCceEE Confidence 2222222334443344555554443332222 112 1222232222232222 23567899999999999999999 Q ss_pred EEEeeEEEEEeehhhHHHhccChhhhHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc-------ccccccccccc Q lcl|NC_018838. 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKPA-------AAVKVSLDKTT 147 (315) Q Consensus 75 v~l~~~kl~~~~~iS~ell~~~~~d~~~~l~~~i~~~la~~i~~~~d~a~~~G~g~~~~~~~-------~~~~~~~~~~~ 147 (315) .....|-++....+|...=+.+. +....+...++-.-.+++.++.++|+|+......+. -|+...... - T Consensus 111 ~~~~~k~l~~~~~vs~~~~l~n~---i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~-e 186 (468) T protein:vir:63 111 KTVNMKFASDTKNISIAAGLVNN---IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-D 186 (468) T ss_pred EEEEeeeeeeeeeehhhhhhhcc---hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC-C Confidence 99999999886666654433322 223346666777778899999999999853321111 122222211 2 Q ss_pred cccccccc--hhHHHHHHHHHhhhcccccceEEEEeHHHHHHHHHHhhccCccccccccccccccCCCccccceeeEeec Q lcl|NC_018838. 148 KTVDATDS--ATTDLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASS 225 (315) Q Consensus 148 ~~~~~~~~--~~~di~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~l~d~~g~~~~~~~~~~~~~~~~~~~l~G~Pv~~s~ 225 (315) +..+.-.. .-.++..+...+. .++..++-++|+..+...|.... +..++.++.. .......|+||- . T Consensus 187 nviDa~G~~ls~~~lneaa~~i~-~gfG~~td~~~~~~v~a~~~~~~------L~~q~~v~~~--n~~~~~~G~~v~--g 255 (468) T protein:vir:63 187 NVHDARGASLTESLLNQAAVMIS-KGYGTPTDAYMPVGVQADFVNQQ------LSKQTQLVRD--NGNNVSVGFNIQ--G 255 (468) T ss_pred ceeccCCCccCHHHHHHHhhhcc-ccccChhhhhcchhHHhhhhhhh------cCceEEEEcC--CCCceeeeeccc--c Confidence 33333222 1223333433333 34555666888888877763221 1111222111 112335566662 2 Q ss_pred ccCccccccccccceEEEecccceEEEeeccceEEEeccCCccccchhhhhcCcEEEEEEEEeccEe--------ecccc Q lcl|NC_018838. 226 TVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI--------ESLDS 297 (315) Q Consensus 226 ~v~~~~~~~~~~~~~~~~gDf~~~~i~~~~~~~v~~~~~~~~~~~~~~~f~~~~v~~r~~~r~~~~v--------~~~~a 297 (315) .+..-+..... ...|++|+... +-++..-.. ...-..+. +..-.++.= ...=+ T Consensus 256 ~~sa~G~I~l~--gs~il~~~~~l--------~~~~~~~~~-------Apsp~~vs--aT~~~~~~g~~~~~~~a~y~Y~ 316 (468) T protein:vir:63 256 FHSARGFIKLH--GSTVMENEQIL--------DERILALPT-------APQPAKVT--ATQEAGKKGQFRAEDLAAHEYK 316 (468) T ss_pred eecceeeeeec--CceeeccccCC--------Ccccccccc-------cccCCccc--eeeecccCCcccCCCcceEEEE Confidence 23222111110 01223332211 110100000 00000010 011001100 01111 Q ss_pred eEEEeeccCCCCCC-CCCC Q lcl|NC_018838. 298 FAVVKEKAAPKPNP-PAGN 315 (315) Q Consensus 298 f~~l~~~~a~~~~~-~~~~ 315 (315) |+.... -. +..| |.-+ T Consensus 317 v~~vs~-~G-ES~pS~~vt 333 (468) T protein:vir:63 317 VVVSSD-DA-ESIASEVAT 333 (468) T ss_pred EEEECC-CC-ccccccceE Confidence 111111 11 1111 1122 Done!