Query lcl|NC_010147.1_cdsid_YP_001604135.1 [gene=orf44] [protein=putative major capsid protein] [protein_id=YP_001604135.1] [location=21888..22712] Match_columns 274 No_of_seqs 129 out of 269 Neff 8.9 Searched_HMMs 1612 Date Thu Nov 7 13:23:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_45 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_45_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:1239 Length: 274 # 100.0 2E-77 1.2E-80 441.1 27.7 274 1-274 1-274 (274) 2 protein:vir:97433 Length: 274 100.0 5.5E-77 3.4E-80 438.7 28.6 274 1-274 1-274 (274) 3 protein:vir:94494 Length: 274 100.0 5.5E-77 3.4E-80 438.7 28.6 274 1-274 1-274 (274) 4 protein:vir:95898 Length: 274 100.0 7.6E-77 4.7E-80 437.9 27.5 274 1-274 1-274 (274) 5 protein:vir:96262 Length: 274 100.0 7.6E-77 4.7E-80 437.9 27.5 274 1-274 1-274 (274) 6 protein:vir:93742 Length: 274 100.0 2.9E-76 1.8E-79 434.7 28.6 274 1-274 1-274 (274) 7 protein:vir:96833 Length: 275 100.0 5.4E-74 3.4E-77 422.3 27.1 274 1-274 1-275 (275) 8 protein:vir:96123 Length: 274 100.0 1.3E-73 7.8E-77 420.3 27.9 274 1-274 1-274 (274) 9 protein:vir:105334 Length: 276 100.0 1.1E-73 6.9E-77 420.5 27.4 274 1-274 1-274 (276) 10 protein:vir:80930 Length: 278 100.0 9E-70 5.6E-73 399.1 26.8 271 1-271 1-278 (278) 11 protein:vir:3613 Length: 272 # 100.0 4E-69 2.5E-72 395.6 26.3 268 1-270 1-272 (272) 12 protein:vir:95107 Length: 270 100.0 1.2E-68 7.5E-72 392.9 26.1 268 3-274 1-270 (270) 13 protein:vir:3033 Length: 272 # 100.0 3E-67 1.8E-70 385.3 29.1 272 1-273 1-272 (272) 14 protein:vir:9820 Length: 272 # 100.0 3E-67 1.8E-70 385.3 29.1 272 1-273 1-272 (272) 15 protein:vir:102944 Length: 330 100.0 5.9E-53 3.6E-56 306.9 23.5 264 1-274 1-301 (330) 16 protein:vir:739 Length: 231 # 100.0 7.4E-53 4.6E-56 306.4 22.0 227 40-270 1-231 (231) 17 protein:vir:5974 Length: 324 # 100.0 6.5E-52 4.1E-55 301.2 22.5 263 1-274 1-295 (324) 18 protein:vir:1583 Length: 351 # 100.0 2.1E-50 1.3E-53 293.0 23.0 261 1-274 1-299 (351) 19 protein:vir:7990 Length: 273 # 100.0 5.2E-47 3.2E-50 274.3 23.9 261 1-270 1-273 (273) 20 protein:vir:80446 Length: 367 100.0 2.6E-47 1.6E-50 275.9 20.4 265 1-274 1-340 (367) 21 protein:vir:102605 Length: 273 100.0 1.7E-46 1.1E-49 271.5 24.6 261 1-270 1-273 (273) 22 protein:vir:105822 Length: 273 100.0 1.7E-46 1.1E-49 271.5 24.6 261 1-270 1-273 (273) 23 protein:vir:94989 Length: 349 100.0 9E-43 5.6E-46 251.1 23.7 263 1-274 1-323 (349) 24 protein:vir:94622 Length: 341 100.0 1.8E-43 1.1E-46 254.9 19.0 268 1-272 1-341 (341) 25 protein:vir:78387 Length: 349 100.0 1.1E-42 6.8E-46 250.6 23.2 263 1-274 1-323 (349) 26 protein:vir:3364 Length: 347 # 100.0 7.7E-38 4.8E-41 224.1 19.2 265 1-272 1-347 (347) 27 protein:vir:94711 Length: 347 100.0 2.9E-37 1.8E-40 220.9 16.5 264 1-271 1-347 (347) 28 protein:vir:1541 Length: 347 # 100.0 1.3E-36 8E-40 217.4 19.6 266 1-272 1-347 (347) 29 protein:vir:10450 Length: 344 100.0 1.1E-36 6.6E-40 217.8 17.2 263 1-270 1-344 (344) 30 protein:vir:78739 Length: 332 100.0 1.6E-36 9.8E-40 216.9 18.0 263 1-268 7-332 (332) 31 protein:vir:3136 Length: 322 # 100.0 2.7E-36 1.7E-39 215.6 17.6 264 1-274 1-322 (322) 32 protein:vir:8885 Length: 347 # 100.0 2.3E-36 1.4E-39 216.0 17.1 264 1-271 1-347 (347) 33 protein:vir:2201 Length: 345 # 100.0 4.6E-36 2.9E-39 214.3 17.4 263 1-270 1-345 (345) 34 protein:vir:80180 Length: 381 100.0 2.2E-35 1.3E-38 210.6 20.0 270 1-274 1-309 (381) 35 protein:vir:108303 Length: 418 100.0 4.3E-34 2.7E-37 203.5 25.0 262 1-271 1-418 (418) 36 protein:vir:94576 Length: 347 100.0 2.2E-35 1.3E-38 210.6 16.7 263 1-270 1-347 (347) 37 protein:vir:99075 Length: 392 100.0 2E-34 1.2E-37 205.3 21.3 265 1-274 1-320 (392) 38 protein:vir:9309 Length: 324 # 100.0 1.1E-32 7.1E-36 195.7 25.0 263 1-274 25-319 (324) 39 protein:vir:80684 Length: 315 100.0 1.3E-32 8.2E-36 195.4 24.6 268 1-274 1-310 (315) 40 protein:vir:41 Length: 299 # N 100.0 1.3E-32 8.3E-36 195.3 24.6 262 1-271 6-299 (299) 41 protein:vir:80213 Length: 334 100.0 1.5E-33 9.5E-37 200.5 18.2 267 1-272 1-334 (334) 42 protein:vir:97148 Length: 324 100.0 2.9E-32 1.8E-35 193.5 25.1 263 1-274 27-319 (324) 43 protein:vir:96392 Length: 324 100.0 2.9E-32 1.8E-35 193.5 24.9 263 1-274 27-319 (324) 44 protein:vir:78830 Length: 324 100.0 2.9E-32 1.8E-35 193.5 24.9 263 1-274 27-319 (324) 45 protein:vir:96223 Length: 324 100.0 5.5E-32 3.4E-35 191.9 24.8 263 1-274 25-319 (324) 46 protein:vir:99749 Length: 324 100.0 7.3E-32 4.5E-35 191.3 25.0 263 1-274 25-319 (324) 47 protein:vir:103955 Length: 324 100.0 1.5E-31 9.2E-35 189.6 25.0 263 1-274 25-319 (324) 48 protein:vir:9574 Length: 300 # 100.0 1.8E-31 1.1E-34 189.1 24.2 262 1-270 1-300 (300) 49 protein:vir:9759 Length: 303 # 100.0 3E-31 1.8E-34 187.9 23.9 262 1-270 1-303 (303) 50 protein:vir:1328 Length: 392 # 100.0 3E-31 1.9E-34 187.9 23.8 265 1-271 110-392 (392) 51 protein:vir:100057 Length: 375 100.0 8.7E-32 5.4E-35 190.9 20.7 268 1-274 1-373 (375) 52 protein:vir:105905 Length: 304 100.0 5.4E-31 3.4E-34 186.5 23.9 257 1-269 1-304 (304) 53 protein:vir:94142 Length: 304 100.0 5.4E-31 3.4E-34 186.5 23.9 257 1-269 1-304 (304) 54 protein:vir:6242 Length: 390 # 100.0 4.2E-31 2.6E-34 187.1 22.7 263 1-271 110-390 (390) 55 protein:vir:2344 Length: 397 # 100.0 7.3E-31 4.5E-34 185.8 23.3 269 1-274 10-310 (397) 56 protein:vir:95763 Length: 297 100.0 2.4E-30 1.5E-33 183.0 24.7 260 1-271 9-297 (297) 57 protein:vir:104085 Length: 320 100.0 1.8E-30 1.1E-33 183.7 23.9 269 1-273 14-320 (320) 58 protein:vir:78223 Length: 333 100.0 2.3E-30 1.5E-33 183.0 24.1 267 1-271 10-333 (333) 59 protein:vir:2430 Length: 318 # 100.0 2.7E-30 1.7E-33 182.7 24.0 270 1-274 14-318 (318) 60 protein:vir:7771 Length: 330 # 100.0 4.1E-30 2.5E-33 181.7 24.5 270 1-274 1-327 (330) 61 protein:vir:8187 Length: 311 # 100.0 3.3E-30 2E-33 182.2 23.8 262 1-271 1-311 (311) 62 protein:vir:191 Length: 385 # 100.0 3.8E-30 2.3E-33 181.9 23.9 261 1-271 105-385 (385) 63 protein:vir:1886 Length: 385 # 100.0 3.8E-30 2.3E-33 181.9 23.9 261 1-271 105-385 (385) 64 protein:vir:99675 Length: 324 100.0 2.7E-31 1.7E-34 188.2 16.6 233 37-274 1-305 (324) 65 protein:vir:103323 Length: 364 100.0 3.8E-30 2.4E-33 181.9 22.9 270 1-274 1-343 (364) 66 protein:vir:78523 Length: 338 100.0 1.4E-29 8.8E-33 178.7 24.5 269 1-273 10-338 (338) 67 protein:vir:94771 Length: 298 100.0 1.3E-29 8.4E-33 178.9 24.1 258 1-269 1-298 (298) 68 protein:vir:97053 Length: 390 100.0 1.2E-29 7.6E-33 179.1 23.7 259 1-268 113-390 (390) 69 protein:vir:105374 Length: 423 99.9 2.6E-29 1.6E-32 177.3 25.0 262 1-269 1-423 (423) 70 protein:vir:485 Length: 407 # 99.9 1.7E-29 1E-32 178.3 23.5 267 1-274 106-404 (407) 71 protein:vir:100135 Length: 418 99.9 3.5E-29 2.2E-32 176.6 25.0 264 1-273 135-418 (418) 72 protein:vir:81070 Length: 390 99.9 2.8E-29 1.7E-32 177.1 23.8 259 1-268 113-390 (390) 73 protein:vir:9927 Length: 295 # 99.9 4.2E-30 2.6E-33 181.6 19.2 264 1-274 1-293 (295) 74 protein:vir:4856 Length: 293 # 99.9 4.7E-29 2.9E-32 175.9 25.0 267 1-274 5-285 (293) 75 protein:vir:101607 Length: 379 99.9 5.4E-29 3.3E-32 175.6 24.7 262 1-270 106-379 (379) 76 protein:vir:4339 Length: 395 # 99.9 5.3E-29 3.3E-32 175.6 24.6 261 1-270 113-395 (395) 77 protein:vir:1638 Length: 298 # 99.9 6E-29 3.7E-32 175.3 24.2 258 1-269 1-298 (298) 78 protein:vir:3525 Length: 423 # 99.9 1.2E-29 7.3E-33 179.2 20.2 263 1-274 1-309 (423) 79 protein:vir:94800 Length: 319 99.9 6.7E-29 4.2E-32 175.0 24.3 269 1-274 19-319 (319) 80 protein:vir:97331 Length: 319 99.9 6.7E-29 4.2E-32 175.0 24.3 269 1-274 19-319 (319) 81 protein:vir:100247 Length: 425 99.9 3.9E-29 2.4E-32 176.3 22.8 264 1-271 130-425 (425) 82 protein:vir:105522 Length: 423 99.9 8.5E-29 5.3E-32 174.5 24.4 262 1-269 1-423 (423) 83 protein:vir:10364 Length: 390 99.9 8.8E-29 5.5E-32 174.4 24.1 259 1-268 113-390 (390) 84 protein:vir:4226 Length: 326 # 99.9 6.9E-29 4.3E-32 175.0 23.5 268 1-273 20-326 (326) 85 protein:vir:4456 Length: 401 # 99.9 7.6E-29 4.7E-32 174.7 22.4 263 1-270 107-401 (401) 86 protein:vir:104256 Length: 458 99.9 1.8E-28 1.1E-31 172.7 24.3 265 1-270 161-458 (458) 87 protein:vir:4953 Length: 397 # 99.9 2E-28 1.3E-31 172.4 24.1 267 1-274 109-389 (397) 88 protein:vir:94673 Length: 419 99.9 1.9E-28 1.2E-31 172.5 23.5 265 1-272 121-419 (419) 89 protein:vir:174 Length: 423 # 99.9 7.7E-29 4.8E-32 174.7 20.9 264 1-274 1-309 (423) 90 protein:vir:4830 Length: 397 # 99.9 4.2E-28 2.6E-31 170.7 24.3 267 1-274 109-389 (397) 91 protein:vir:81160 Length: 371 99.9 4.2E-28 2.6E-31 170.7 24.2 262 1-270 91-371 (371) 92 protein:vir:8102 Length: 543 # 99.9 3.9E-28 2.4E-31 170.9 23.4 261 1-271 249-543 (543) 93 protein:vir:9410 Length: 415 # 99.9 6.5E-28 4E-31 169.6 24.4 268 1-274 120-411 (415) 94 protein:vir:81100 Length: 415 99.9 8.6E-28 5.4E-31 169.0 25.0 268 1-274 120-411 (415) 95 protein:vir:98339 Length: 415 99.9 8.6E-28 5.4E-31 169.0 25.0 268 1-274 120-411 (415) 96 protein:vir:79987 Length: 415 99.9 8.6E-28 5.4E-31 169.0 25.0 268 1-274 120-411 (415) 97 protein:vir:4600 Length: 415 # 99.9 1E-27 6.3E-31 168.6 25.0 268 1-274 120-411 (415) 98 protein:vir:4700 Length: 415 # 99.9 1E-27 6.3E-31 168.6 25.0 268 1-274 120-411 (415) 99 protein:vir:4997 Length: 397 # 99.9 1E-27 6.4E-31 168.5 24.2 267 1-274 109-389 (397) 100 protein:vir:107120 Length: 329 99.9 1.2E-27 7.5E-31 168.2 24.5 268 1-274 30-310 (329) 101 protein:vir:2504 Length: 305 # 99.9 5.6E-28 3.5E-31 170.0 22.5 260 1-274 1-303 (305) 102 protein:vir:102655 Length: 322 99.9 1.5E-28 9.3E-32 173.1 19.3 266 1-271 1-322 (322) 103 protein:vir:95131 Length: 325 99.9 4E-28 2.5E-31 170.8 21.4 261 1-274 1-301 (325) 104 protein:vir:96762 Length: 632 99.9 4.8E-28 3E-31 170.4 21.0 258 1-269 355-632 (632) 105 protein:vir:78935 Length: 335 99.9 2.7E-28 1.7E-31 171.8 19.3 267 1-274 1-333 (335) 106 protein:vir:93616 Length: 645 99.9 2E-27 1.3E-30 166.9 23.9 264 1-273 338-645 (645) 107 protein:vir:6324 Length: 335 # 99.9 1.7E-28 1E-31 172.9 17.9 267 1-274 1-333 (335) 108 protein:vir:5739 Length: 366 # 99.9 2.1E-27 1.3E-30 166.9 22.3 259 1-270 64-366 (366) 109 protein:vir:1268 Length: 397 # 99.9 3.4E-27 2.1E-30 165.7 23.3 262 1-270 123-397 (397) 110 protein:vir:81227 Length: 413 99.9 7.7E-27 4.8E-30 163.7 24.8 268 1-273 118-413 (413) 111 protein:vir:95376 Length: 425 99.9 3.8E-27 2.4E-30 165.4 23.1 264 1-273 138-425 (425) 112 protein:vir:99920 Length: 311 99.9 3.5E-27 2.2E-30 165.6 22.6 261 1-270 1-311 (311) 113 protein:vir:1025 Length: 408 # 99.9 7.8E-27 4.9E-30 163.7 24.1 267 1-274 116-397 (408) 114 protein:vir:100172 Length: 394 99.9 1.3E-26 8E-30 162.5 24.4 267 1-274 111-388 (394) 115 protein:vir:7409 Length: 408 # 99.9 2.1E-26 1.3E-29 161.4 24.5 267 1-274 116-397 (408) 116 protein:vir:105038 Length: 428 99.9 9.9E-27 6.2E-30 163.1 22.7 260 1-270 125-428 (428) 117 protein:vir:80376 Length: 435 99.9 1.6E-26 9.8E-30 162.0 23.3 261 1-272 130-435 (435) 118 protein:vir:1433 Length: 435 # 99.9 1.6E-26 9.7E-30 162.1 23.3 262 1-272 130-435 (435) 119 protein:vir:9704 Length: 394 # 99.9 2.3E-26 1.4E-29 161.1 23.6 262 1-274 127-394 (394) 120 protein:vir:7855 Length: 497 # 99.9 2.1E-26 1.3E-29 161.3 23.4 269 1-274 151-497 (497) 121 protein:vir:101650 Length: 497 99.9 2.1E-26 1.3E-29 161.3 23.4 269 1-274 151-497 (497) 122 protein:vir:3991 Length: 404 # 99.9 3.6E-26 2.2E-29 160.1 24.6 267 1-274 116-398 (404) 123 protein:vir:100884 Length: 389 99.9 4.1E-26 2.6E-29 159.8 24.8 267 1-274 109-386 (389) 124 protein:vir:78640 Length: 352 99.9 3.8E-27 2.3E-30 165.5 18.5 257 1-274 83-350 (352) 125 protein:vir:3870 Length: 400 # 99.9 1.9E-26 1.2E-29 161.6 22.0 259 1-271 133-400 (400) 126 protein:vir:102119 Length: 404 99.9 5E-26 3.1E-29 159.3 24.0 269 1-274 110-404 (404) 127 protein:vir:4511 Length: 409 # 99.9 3.3E-26 2.1E-29 160.3 22.8 265 1-273 115-409 (409) 128 protein:vir:3845 Length: 395 # 99.9 5.8E-26 3.6E-29 158.9 24.1 267 1-274 105-387 (395) 129 protein:vir:97031 Length: 402 99.9 3.6E-27 2.2E-30 165.6 17.2 270 1-274 1-342 (402) 130 protein:vir:94424 Length: 387 99.9 5.4E-27 3.3E-30 164.6 17.4 257 1-274 118-385 (387) 131 protein:vir:96978 Length: 387 99.9 5.4E-27 3.3E-30 164.6 17.4 257 1-274 118-385 (387) 132 protein:vir:2685 Length: 387 # 99.9 5.4E-27 3.3E-30 164.6 17.4 257 1-274 118-385 (387) 133 protein:vir:1383 Length: 421 # 99.9 4.9E-26 3E-29 159.3 22.6 261 1-274 114-387 (421) 134 protein:vir:93881 Length: 387 99.9 1.3E-26 8.2E-30 162.5 18.8 257 1-274 118-385 (387) 135 protein:vir:4092 Length: 390 # 99.9 9.4E-26 5.8E-29 157.8 23.1 263 1-274 84-374 (390) 136 protein:vir:102873 Length: 392 99.9 2.1E-25 1.3E-28 155.9 24.2 266 1-274 106-388 (392) 137 protein:vir:105004 Length: 392 99.9 2.1E-25 1.3E-28 155.9 24.2 266 1-274 106-388 (392) 138 protein:vir:102082 Length: 392 99.9 2.1E-25 1.3E-28 155.9 24.2 266 1-274 106-388 (392) 139 protein:vir:107593 Length: 392 99.9 2.1E-25 1.3E-28 155.9 24.2 266 1-274 106-388 (392) 140 protein:vir:9361 Length: 402 # 99.9 1.5E-26 9.4E-30 162.1 17.6 257 1-274 133-400 (402) 141 protein:vir:1084 Length: 437 # 99.9 1.2E-24 7.7E-28 151.7 22.1 265 1-274 156-431 (437) 142 protein:vir:6212 Length: 434 # 99.9 1E-24 6.3E-28 152.1 21.5 265 1-273 141-434 (434) 143 protein:vir:108211 Length: 318 99.9 2.4E-25 1.5E-28 155.5 17.6 265 1-271 1-318 (318) 144 protein:vir:96792 Length: 315 99.9 4E-24 2.5E-27 148.8 22.4 263 3-274 1-285 (315) 145 protein:vir:962 Length: 397 # 99.9 4.9E-24 3E-27 148.4 21.3 257 1-270 132-397 (397) 146 protein:vir:80128 Length: 466 99.9 3.3E-24 2.1E-27 149.3 19.8 265 1-274 148-452 (466) 147 protein:vir:79928 Length: 393 99.9 2.8E-24 1.8E-27 149.7 19.1 269 1-274 72-387 (393) 148 protein:vir:106647 Length: 303 99.9 2.9E-24 1.8E-27 149.6 17.9 265 1-274 1-301 (303) 149 protein:vir:9875 Length: 296 # 99.9 1.1E-23 6.8E-27 146.5 20.4 255 1-271 1-296 (296) 150 protein:vir:9643 Length: 377 # 99.9 2.8E-23 1.8E-26 144.2 20.7 254 1-270 79-377 (377) 151 protein:vir:7019 Length: 401 # 99.9 5.5E-24 3.4E-27 148.1 16.2 270 1-274 1-344 (401) 152 protein:vir:95963 Length: 395 99.9 4.5E-23 2.8E-26 143.1 20.8 258 1-274 86-380 (395) 153 protein:vir:105645 Length: 400 99.9 1.2E-23 7.3E-27 146.3 17.5 270 1-274 1-338 (400) 154 protein:vir:98635 Length: 377 99.9 1E-23 6.3E-27 146.7 16.8 265 1-270 79-377 (377) 155 protein:vir:9509 Length: 381 # 99.9 4E-23 2.5E-26 143.4 19.8 258 1-274 76-374 (381) 156 protein:vir:101291 Length: 381 99.9 4E-23 2.5E-26 143.4 19.8 258 1-274 76-374 (381) 157 protein:vir:8420 Length: 477 # 99.8 9.7E-23 6E-26 141.3 18.4 271 1-274 155-476 (477) 158 protein:vir:100632 Length: 381 99.8 2E-22 1.2E-25 139.5 19.7 260 1-274 76-372 (381) 159 protein:vir:78350 Length: 383 99.8 1.7E-22 1.1E-25 139.9 17.7 258 1-274 83-379 (383) 160 protein:vir:79008 Length: 299 99.8 5.8E-21 3.6E-24 131.5 23.1 261 1-272 1-299 (299) 161 protein:vir:4197 Length: 314 # 99.8 9.3E-21 5.7E-24 130.4 23.6 264 1-272 14-314 (314) 162 protein:vir:4159 Length: 315 # 99.8 8.9E-20 5.5E-23 125.0 23.0 262 1-269 17-315 (315) 163 protein:vir:78920 Length: 290 99.8 1.8E-19 1.1E-22 123.3 21.3 254 1-269 1-290 (290) 164 protein:vir:3158 Length: 321 # 99.7 1.1E-18 6.6E-22 119.1 22.1 267 1-274 15-315 (321) 165 protein:vir:105464 Length: 346 99.7 1.8E-17 1.1E-20 112.4 22.4 264 1-274 1-310 (346) 166 protein:vir:79712 Length: 285 99.6 2.1E-16 1.3E-19 106.5 21.1 261 1-271 1-285 (285) 167 protein:vir:102335 Length: 312 99.6 1.6E-15 9.8E-19 101.7 21.2 263 1-272 1-312 (312) 168 protein:vir:8324 Length: 410 # 99.6 1.8E-16 1.1E-19 106.9 16.0 257 1-268 127-410 (410) 169 protein:vir:2106 Length: 430 # 99.6 1E-15 6.5E-19 102.7 20.1 259 1-271 1-430 (430) 170 protein:vir:100939 Length: 430 99.5 4.8E-15 3E-18 99.1 21.8 259 1-271 1-430 (430) 171 protein:vir:9265 Length: 430 # 99.5 4.8E-15 3E-18 99.1 21.8 259 1-271 1-430 (430) 172 protein:vir:99523 Length: 311 99.5 2.2E-14 1.4E-17 95.5 21.9 265 1-270 1-311 (311) 173 protein:vir:1781 Length: 221 # 99.5 1.2E-15 7.2E-19 102.5 13.5 179 83-274 1-208 (221) 174 protein:vir:78090 Length: 302 99.4 3.3E-14 2.1E-17 94.5 20.1 260 1-270 1-302 (302) 175 protein:vir:97397 Length: 517 99.4 2E-14 1.2E-17 95.7 18.6 259 1-272 237-517 (517) 176 protein:vir:94933 Length: 330 99.2 2.2E-11 1.4E-14 79.0 20.1 262 1-270 25-330 (330) 177 protein:vir:95875 Length: 401 99.0 3.7E-11 2.3E-14 77.8 16.8 268 1-272 1-401 (401) 178 protein:vir:95451 Length: 313 99.0 1.8E-11 1.1E-14 79.5 14.4 266 3-272 1-313 (313) 179 protein:vir:97255 Length: 310 99.0 6.2E-10 3.9E-13 71.0 21.3 264 1-270 1-310 (310) 180 protein:vir:4074 Length: 480 # 99.0 9.2E-12 5.7E-15 81.1 11.1 251 1-273 171-480 (480) 181 protein:vir:93696 Length: 364 98.9 2E-10 1.2E-13 73.8 16.7 271 1-273 1-364 (364) 182 protein:vir:79548 Length: 652 98.9 6.2E-10 3.8E-13 71.1 19.1 255 1-267 359-652 (652) 183 protein:vir:95512 Length: 693 98.8 2E-09 1.2E-12 68.3 18.9 256 1-268 394-693 (693) 184 protein:vir:8843 Length: 317 # 98.5 2.8E-07 1.8E-10 56.5 21.8 263 1-272 1-317 (317) 185 protein:vir:105610 Length: 430 98.4 9.3E-08 5.7E-11 59.1 17.3 271 1-274 1-426 (430) 186 protein:vir:103285 Length: 296 98.4 2.6E-07 1.6E-10 56.7 18.9 261 1-268 1-296 (296) 187 protein:vir:2770 Length: 318 # 98.3 4.3E-07 2.7E-10 55.5 17.3 223 1-231 1-318 (318) 188 protein:vir:80068 Length: 301 98.2 1.8E-06 1.1E-09 52.1 20.4 259 1-268 1-301 (301) 189 protein:vir:107687 Length: 319 98.2 7.9E-07 4.9E-10 54.0 17.4 261 1-268 19-319 (319) 190 protein:vir:99424 Length: 360 98.1 3.6E-06 2.3E-09 50.4 21.5 263 1-273 23-360 (360) 191 protein:vir:103886 Length: 302 98.1 2.4E-06 1.5E-09 51.4 18.9 253 3-271 1-302 (302) 192 protein:vir:3298 Length: 404 # 98.0 4E-06 2.5E-09 50.2 17.2 271 1-273 1-404 (404) 193 protein:vir:819 Length: 404 # 98.0 4E-06 2.5E-09 50.2 17.2 271 1-273 1-404 (404) 194 protein:vir:10123 Length: 404 98.0 4E-06 2.5E-09 50.2 17.2 271 1-273 1-404 (404) 195 protein:vir:104439 Length: 404 98.0 4E-06 2.5E-09 50.2 17.2 271 1-273 1-404 (404) 196 protein:vir:104342 Length: 314 97.9 6.3E-06 3.9E-09 49.1 17.8 262 1-270 1-314 (314) 197 protein:vir:79642 Length: 329 97.9 9.4E-06 5.8E-09 48.1 18.8 262 1-271 26-329 (329) 198 protein:vir:2736 Length: 348 # 97.1 0.00016 1E-07 41.3 16.7 257 1-271 1-348 (348) 199 protein:vir:94070 Length: 339 97.0 0.00022 1.4E-07 40.6 15.8 255 1-268 35-339 (339) 200 protein:vir:5942 Length: 523 # 96.6 7.8E-05 4.8E-08 43.1 10.5 268 1-272 162-523 (523) 201 protein:vir:95318 Length: 328 96.6 0.00034 2.1E-07 39.6 13.7 211 1-218 1-328 (328) 202 protein:vir:78148 Length: 123 96.6 2.8E-05 1.7E-08 45.6 7.7 107 163-270 1-123 (123) 203 protein:vir:101557 Length: 336 96.6 0.00018 1.1E-07 41.1 12.0 255 1-268 31-336 (336) 204 protein:vir:78558 Length: 336 96.4 0.00024 1.5E-07 40.5 12.0 252 1-268 31-336 (336) 205 protein:vir:4902 Length: 348 # 96.3 0.0008 5E-07 37.6 15.6 258 1-271 1-348 (348) 206 protein:vir:3643 Length: 336 # 96.2 0.00033 2.1E-07 39.7 11.5 255 1-268 31-336 (336) 207 protein:vir:94528 Length: 286 96.1 0.0011 6.6E-07 36.9 17.1 259 1-271 1-286 (286) 208 protein:vir:106734 Length: 336 96.0 0.00043 2.6E-07 39.1 11.3 252 1-268 31-336 (336) 209 protein:vir:96490 Length: 348 96.0 0.0012 7.2E-07 36.7 16.4 258 1-271 1-348 (348) 210 protein:vir:3969 Length: 287 # 95.9 0.0013 7.9E-07 36.5 16.4 257 7-271 1-287 (287) 211 protein:vir:93858 Length: 400 95.9 0.001 6.4E-07 36.9 13.1 257 1-268 101-400 (400) 212 protein:vir:107882 Length: 307 95.0 0.003 1.9E-06 34.4 16.4 255 1-270 2-307 (307) 213 protein:vir:103759 Length: 330 94.0 0.0056 3.5E-06 32.9 13.1 211 1-218 1-330 (330) 214 protein:vir:3424 Length: 341 # 93.6 0.007 4.3E-06 32.4 19.5 256 9-268 1-341 (341) 215 protein:vir:96079 Length: 382 93.3 0.0079 4.9E-06 32.1 14.8 257 1-268 61-382 (382) 216 protein:vir:107732 Length: 379 93.3 0.0081 5E-06 32.1 14.7 256 1-268 56-379 (379) 217 protein:vir:107826 Length: 331 93.2 0.0085 5.3E-06 31.9 14.5 210 1-218 1-331 (331) 218 protein:vir:107388 Length: 331 93.2 0.0085 5.3E-06 31.9 14.5 210 1-218 1-331 (331) 219 protein:vir:98525 Length: 331 93.2 0.0085 5.3E-06 31.9 14.5 210 1-218 1-331 (331) 220 protein:vir:98871 Length: 314 93.0 0.0093 5.8E-06 31.7 18.9 268 1-273 17-314 (314) 221 protein:vir:106590 Length: 349 92.9 0.0097 6E-06 31.6 16.7 257 1-268 1-349 (349) 222 protein:vir:7214 Length: 521 # 92.8 0.0099 6.1E-06 31.6 15.3 273 1-274 147-510 (521) 223 protein:vir:79078 Length: 307 92.7 0.01 6.4E-06 31.5 15.6 254 1-270 2-307 (307) 224 protein:vir:106286 Length: 534 92.5 0.011 7E-06 31.3 15.2 269 1-274 174-516 (534) 225 protein:vir:348 Length: 321 # 92.1 0.013 8.1E-06 30.9 12.4 258 1-268 1-321 (321) 226 protein:vir:99888 Length: 309 92.0 0.013 8.2E-06 30.9 14.0 253 1-271 1-309 (309) 227 protein:vir:99576 Length: 388 91.5 0.016 9.7E-06 30.5 12.5 257 1-268 65-388 (388) 228 protein:vir:96442 Length: 418 91.0 0.018 1.1E-05 30.2 13.2 267 1-274 61-410 (418) 229 protein:vir:103181 Length: 457 90.4 0.021 1.3E-05 29.8 12.4 267 1-274 114-444 (457) 230 protein:vir:5670 Length: 514 # 89.9 0.024 1.5E-05 29.5 15.7 271 1-274 142-494 (514) 231 protein:vir:393 Length: 341 # 89.3 0.027 1.7E-05 29.2 19.3 256 9-268 1-341 (341) 232 protein:vir:7324 Length: 335 # 87.9 0.036 2.2E-05 28.5 13.3 212 1-219 1-335 (335) 233 protein:vir:103463 Length: 521 87.1 0.041 2.5E-05 28.2 15.3 270 1-274 166-510 (521) 234 protein:vir:1991 Length: 305 # 87.0 0.042 2.6E-05 28.1 12.1 187 1-210 1-305 (305) 235 protein:vir:5255 Length: 304 # 86.3 0.046 2.9E-05 27.9 14.5 254 7-267 1-304 (304) 236 protein:vir:98480 Length: 348 79.7 0.1 6.3E-05 26.0 20.3 258 1-269 1-348 (348) 237 protein:vir:100603 Length: 529 79.2 0.11 6.6E-05 25.9 14.3 273 1-274 160-520 (529) 238 protein:vir:98143 Length: 524 74.3 0.16 9.9E-05 24.9 15.3 264 1-274 167-509 (524) 239 protein:vir:104549 Length: 462 73.8 0.17 0.0001 24.9 13.6 270 1-274 97-451 (462) 240 protein:vir:103370 Length: 418 70.1 0.21 0.00013 24.3 15.0 262 1-274 61-411 (418) 241 protein:vir:101039 Length: 529 65.9 0.28 0.00017 23.6 17.7 272 1-274 165-520 (529) 242 protein:vir:4786 Length: 295 # 64.8 0.29 0.00018 23.5 13.6 242 1-250 1-295 (295) 243 protein:vir:6378 Length: 346 # 63.4 0.32 0.0002 23.3 19.0 255 9-268 1-346 (346) 244 protein:vir:6901 Length: 522 # 63.3 0.32 0.0002 23.3 17.3 266 1-274 167-512 (522) 245 protein:vir:101811 Length: 529 62.7 0.33 0.00021 23.2 17.8 272 1-274 165-520 (529) 246 protein:vir:10324 Length: 320 60.8 0.37 0.00023 23.0 14.6 250 10-273 1-320 (320) 247 protein:vir:80835 Length: 464 60.6 0.37 0.00023 23.0 13.2 271 1-274 1-338 (464) 248 protein:vir:6601 Length: 528 # 58.7 0.41 0.00025 22.7 19.5 271 1-274 79-513 (528) 249 protein:vir:80986 Length: 528 58.1 0.42 0.00026 22.7 18.9 271 1-274 79-513 (528) 250 protein:vir:106998 Length: 468 43.0 0.86 0.00054 20.9 15.2 272 1-274 117-451 (468) 251 protein:vir:1153 Length: 338 # 39.2 1 0.00064 20.5 14.7 255 1-273 16-338 (338) 252 protein:vir:107947 Length: 519 36.6 1.2 0.00072 20.2 16.0 273 1-274 134-507 (519) 253 protein:vir:104915 Length: 470 22.9 2.4 0.0015 18.5 20.7 269 1-274 69-459 (470) No 1 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=2e-77 Score=441.11 Aligned_cols=274 Identities=99% Similarity=1.338 Sum_probs=269.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|++++.++++++|++|++|+||+|+.+|++++|.+|++++++++++++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++++||++++.++++++|++++|+++++.+.+++..+...++++|.|++|..+|++++..++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~lgd~~~~~~ 160 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+.++++|++++++|.||+++|++||+|+++|+|++|+++++|++++.++++++|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:12 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) T ss_pred EEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCCcceEEEEeccceeeeecCCceeccccchh Confidence 99999999999999998999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||++++||+++|++|++.||+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 9999999999999999999999999999999999 No 2 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=5.5e-77 Score=438.66 Aligned_cols=274 Identities=99% Similarity=1.345 Sum_probs=269.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|+++++++++++|++|++|+||+|+.++++++|.||++++++++++++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++++||++++.++++++|++++|+.+++.+.+++..+.++.++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..++.|++++++|.||+++|++|++|+++|+|++|+++++|++++.++++++|++||++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:97 161 VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) T ss_pred EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||+++++|+++|++|+++||+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 9999999999999999999999999999999999 No 3 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=5.5e-77 Score=438.66 Aligned_cols=274 Identities=99% Similarity=1.345 Sum_probs=269.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|+++++++++++|++|++|+||+|+.++++++|.||++++++++++++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++++||++++.++++++|++++|+.+++.+.+++..+.++.++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..++.|++++++|.||+++|++|++|+++|+|++|+++++|++++.++++++|++||++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:94 161 VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) T ss_pred EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||+++++|+++|++|+++||+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 9999999999999999999999999999999999 No 4 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=7.6e-77 Score=437.90 Aligned_cols=274 Identities=90% Similarity=1.272 Sum_probs=269.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|++++.++++++|++|++|+||+|+.+|+++++.+|++++++++++++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|.++|++..++..||++++.++++++||+++|+++++.+++++..+.+++++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~ 160 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..+++|++++++|.||+++|++||+|+++|+|++|+++++|++++.++++++|++||++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:95 161 VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||++++||+++|+++++++|+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 9999999999999999999999999999999999 No 5 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=7.6e-77 Score=437.90 Aligned_cols=274 Identities=90% Similarity=1.272 Sum_probs=269.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|++++.++++++|++|++|+||+|+.+|+++++.+|++++++++++++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|.++|++..++..||++++.++++++||+++|+++++.+++++..+.+++++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~ 160 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..+++|++++++|.||+++|++||+|+++|+|++|+++++|++++.++++++|++||++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:96 161 VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||++++||+++|+++++++|+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 9999999999999999999999999999999999 No 6 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=2.9e-76 Score=434.73 Aligned_cols=274 Identities=100% Similarity=1.349 Sum_probs=269.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+++|+++|+|+||+|++|+.+++.++++|.+++++++++++++|++|+||+|+.++++++|.||++++++++++++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++++||++++.++++++|++++|+.+++.+.+++..+.++.++++.|++|..+|+++++.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhhccCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..++.|++++++|.||+++|++|++|+++|+|++|+++++|++++.++++++|++|++. T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:93 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) T ss_pred EEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCCeEEEEecCCcccccccchh Confidence 99999999999999998999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||+++++|++++++|+++||+|| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 9999999999999999999999999999999999 No 7 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=5.4e-74 Score=422.25 Aligned_cols=274 Identities=76% Similarity=1.116 Sum_probs=267.7 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |||.+ |+++|+|+||+|++|+.+++.+.++|++++.++++++|++|++|+||+|+.+|+++++.||++|+++++++++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 88854 99999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEP 159 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~ 159 (274) ++++++++++|+++|++..++..||+.++.++++++||+++|+++++.+++++.++.++.+++|.|++|+.+|+++++++ T Consensus 81 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~lgd~~~~~ 160 (275) T protein:vir:96 81 QATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLAGLQTAIDKFNDEDLEP 160 (275) T ss_pred eEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecch Q lcl|NC_010147. 160 MVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDA 239 (274) Q Consensus 160 ~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~ 239 (274) ++++|||++++.|+|++.++|+..+..|++++++|.||+++|++||+|+++|++++|+++++|++++.++++++|++|++ T Consensus 161 ~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~ 240 (275) T protein:vir:96 161 MVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVKLITKRDFFLETERHA 240 (275) T ss_pred cEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcceEEEEeccceeeeecCCcccccccch Confidence 99999999999999999889999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++++|.|++++|||+++++|+++|+++++.|.+-. T Consensus 241 ~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 241 SHKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred hhcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 99999999999999999999999999999999999 No 8 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=1.3e-73 Score=420.25 Aligned_cols=274 Identities=80% Similarity=1.158 Sum_probs=269.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+++|+++|+|+||+|++|+.+++.+.++|.+++++++++++++|++++||+|+.++++++|.||++++++++++++++ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|.++|++..++..||++++.++++++|++++|+.+++.+++++..+.++.+++|.|++|..+|+++++.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccccHHHHHHHHHHhcccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..++.|++++++|.||+++|++|++|+++|++++|+++++|++++.++++++|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:96 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) T ss_pred EEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCcceeeeecCCcccccccchh Confidence 99999999999999998999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.|++++|||+++++|+++|++|+++|-+-| T Consensus 241 ~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 9999999999999999999999999999999999 No 9 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=1.1e-73 Score=420.53 Aligned_cols=274 Identities=79% Similarity=1.157 Sum_probs=269.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++|+|+||+|++|+.+++.++++|.+++.++++++|++|++|+||+|+.+++++++.||++++++++++++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|.++|++.+++..||+.++.++++++||+++|+++++.+++++....++.++++.|++|..+|++++++++ T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~lgd~~~~~~ 160 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAGLEAAIDTFDDEDLEPM 160 (276) T ss_pred EEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+|++.++|+..++.|++++++|.||+++|++||+|+++|+|++|+++++|++++.++++++|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~ 240 (276) T protein:vir:10 161 VLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKLITKRDFFLETDRDPS 240 (276) T ss_pred EEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcceEEEEeccceeeeecCCceeecccchh Confidence 99999999999999988999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++|.+++++|||+++++|+++|++|+++.|.+- T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (276) T protein:vir:10 241 TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS 274 (276) T ss_pred hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcC Confidence 9999999999999999999999999999999888 No 10 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=9e-70 Score=399.12 Aligned_cols=271 Identities=46% Similarity=0.689 Sum_probs=257.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++++|+||+|++|+.+++.+++++.+++.+++.++|++|++|+||+|+.++++++|.||++++++++++++++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc------cCHHHHHHHHHHHhh Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADI------TKLNGLQSAIDKFND 154 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~------~~~d~i~~A~~~l~~ 154 (274) +++++++++|+++|++..++..|++++++++++++|++++|+.+++.+.+++.....+. ..++.|++|..+|++ T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l~~ 160 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAIED 160 (278) T ss_pred EeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999999999998876654433 347889999999998 Q ss_pred cCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCcee Q lcl|NC_010147. 155 EDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFL 233 (274) Q Consensus 155 ~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~v 233 (274) ++. ..++++|||++|+.|+|++.++|+..+..|++++++|.||+++|++|++|+++|++++|+++++|++++.++++++ T Consensus 161 ~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~v 240 (278) T protein:vir:80 161 ESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKAGALKTFLKRNLLA 240 (278) T ss_pred cCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcceEEEEeccceeeeecCCccc Confidence 875 4678999999999999999889999999999999999999999999999999999999999999999999999999 Q ss_pred eeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 234 EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 234 e~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) |++|++++++|.|++++|||++++||+++|+++++++- T Consensus 241 E~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 241 ESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ccccchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 99999999999999999999999999999999999888 No 11 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=4e-69 Score=395.59 Aligned_cols=268 Identities=49% Similarity=0.752 Sum_probs=254.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+..|+++|+|+||+|++|+.+++.++++|++++.+++.++|++|++|+||+|+.+++++++.||++++++++++++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++.+++++||++++.++++++|++++|+++++.+.+++..+. ...++|.|++|+.+|+++++.++ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~-~~~~~d~i~~A~~~lgd~~~~~~ 159 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAY 159 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-ccccHHHHHHHHHHhhhcCCCce Confidence 99999999999999999999999999999999999999999999999999887764 46789999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceE----EEEeCCeEEEEeecCceeeee Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA----ILAKKGAVKLILKRDFFLEVA 236 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~----~~~~~~a~~~~~~~~~~ve~~ 236 (274) +++|||++++.|+|++.+.+... ..+++++++|.||+++|++|++|+++|+++. |+++++|++++.++++++|++ T Consensus 160 ~ivv~p~~~~~L~k~~~~~~~~~-~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~ 238 (272) T protein:vir:36 160 VLIVNPKDAAKIRKDANAKNIGS-EVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETD 238 (272) T ss_pred EEEEcHHHHHHHhcccccccccc-cccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccc Confidence 99999999999999987665543 4467889999999999999999999998765 789999999999999999999 Q ss_pred cchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 237 RDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 237 rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) |++++++|.+++++|||+++++|+++|++|++.- T Consensus 239 R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 239 RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred cchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999999999998887 No 12 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=1.2e-68 Score=392.92 Aligned_cols=268 Identities=33% Similarity=0.463 Sum_probs=253.3 Q ss_pred CccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeEEE Q lcl|NC_010147. 3 QGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAK 82 (274) Q Consensus 3 ~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~~~ 82 (274) |..|+++|+++||+|++|+.+++.++++|.+++.++++++|++|++|+||+|+.+|+++++.||++++++++++++.+++ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 44799999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCceEE Q lcl|NC_010147. 83 IRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVL 162 (274) Q Consensus 83 ~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~~~ 162 (274) +++++++|+++|++.+.+++||+.++.+|++.+|+|++|+++++.+++++.+.+ ..++++.|++|+.+||++...++++ T Consensus 81 i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~-~~~t~~~~~dA~~~lgd~~~~~~~i 159 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTAT-VSADATGILDAIEVFNSENDEDYVL 159 (270) T ss_pred eehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-cccCHHHHHHHHHHhccccCCCcEE Confidence 999999999999999999999999999999999999999999999999988764 4578999999999999999999999 Q ss_pred EEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC-CCcceEEEEeCCeEEEEeecCceeeeecchhh Q lcl|NC_010147. 163 FINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK-LEAGTAILAKKGAVKLILKRDFFLEVARDAST 241 (274) Q Consensus 163 vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~-v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~ 241 (274) +|||++++.|+|++.+ ..++.+++++++|.||+++|++||++++ .++|++|+|++||++++.++++.+|++||+++ T Consensus 160 ~vhs~~~~~Lrk~~~~---~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~~~~~~vEtdRd~~~ 236 (270) T protein:vir:95 160 YVNPKDYNKLVKSLFK---VGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVNKKKPEAYTDFDILK 236 (270) T ss_pred EEcHHHHHHHHhhhcc---cccccccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeeecCCceeeeccchhh Confidence 9999999999998744 4566678899999999999999988665 56899999999999999999999999999999 Q ss_pred cceEEEEEEEEEEEEEcCccEEEEE-ecCCCCCC Q lcl|NC_010147. 242 KTTALYSDKHYVAYLYDESKAVKIT-KGSGSLEM 274 (274) Q Consensus 242 ~~~~v~~~~~yg~~~~~~~~~v~~~-~~~a~~~~ 274 (274) ++|.+++++||++++++|+++|++| +.+.+.|| T Consensus 237 ~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 237 RTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred cccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 9999999999999999999999999 47788999 No 13 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=3e-67 Score=385.32 Aligned_cols=272 Identities=47% Similarity=0.709 Sum_probs=263.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++++++||+|++++.+++.+++++++++.+++.+++++|++++||+|+.+++++|++||++++++++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++.+|+++++.++++++|++++|+.+++.+.+++.... ...+++.|++|..+|++++..++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~-~~~t~d~i~da~~~l~~~~~~~~ 159 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE-ATATVDGVSKALDIFNDEDDAET 159 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-cccCHHHHHHHHHHHhccCCCcc Confidence 99999999999999999999999999999999999999999999999998877664 46789999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+++..++|...++.+++.+++|.+|+++|+||++|+++|++++|++++++++++.++++++|++|++. T Consensus 160 ~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~ 239 (272) T protein:vir:30 160 VIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT 239 (272) T ss_pred EEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc Confidence 99999999999999988899999998889999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) +++|.+++++|||+++++|+++|++|+++|++. T Consensus 240 ~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 240 KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 999999999999999999999999999999999 No 14 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=3e-67 Score=385.32 Aligned_cols=272 Identities=47% Similarity=0.709 Sum_probs=263.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+.+|+++++++||+|++++.+++.+++++++++.+++.+++++|++++||+|+.+++++|++||++++++++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCce Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~ 160 (274) +++++++++|+++|++..++.+|+++++.++++++|++++|+.+++.+.+++.... ...+++.|++|..+|++++..++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~-~~~t~d~i~da~~~l~~~~~~~~ 159 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE-ATATVDGVSKALDIFNDEDDAET 159 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-cccCHHHHHHHHHHHhccCCCcc Confidence 99999999999999999999999999999999999999999999999998877664 46789999999999999999999 Q ss_pred EEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh Q lcl|NC_010147. 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) Q Consensus 161 ~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~ 240 (274) +++|||++++.|+++..++|...++.+++.+++|.+|+++|+||++|+++|++++|++++++++++.++++++|++|++. T Consensus 160 ~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~ 239 (272) T protein:vir:98 160 VIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT 239 (272) T ss_pred EEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc Confidence 99999999999999988899999998889999999999999999999999999999999999999999999999999999 Q ss_pred hcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 241 ~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) +++|.+++++|||+++++|+++|++|+++|++. T Consensus 240 ~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 240 KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 999999999999999999999999999999999 No 15 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=100.00 E-value=5.9e-53 Score=306.94 Aligned_cols=264 Identities=16% Similarity=0.159 Sum_probs=225.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhccccc------ccccccCCCceEEEEeeccC-CccccccCCC-cCCcc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV------DSTLQGQPGDTLTFPAFVYS-GDAQVVAEGE-KIPTD 72 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~------~~~~~~~~g~tv~ip~~~~~-~~~~~~~eg~-~i~~~ 72 (274) ||+++|+++|+|+||+|++|+.+++.+++.|.++... +..+++ +|++++||+|+++ |+++.+.||+ +++++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~-~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITS-GGLLVNMPFWNDLTGDSEVLGNGDKALETG 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhc-CCCEEEecccccCCCcccccCCCccccchh Confidence 9999999999999999999999999999988665322 333445 8999999999988 8899999996 79999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------------c Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------------T 134 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------------~ 134 (274) +++++++.+++++++++|.++|++...++.|||.++++|++++|+++.++.+++.+++... + T Consensus 80 ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~ 159 (330) T protein:vir:10 80 KITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQS 159 (330) T ss_pred hcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheeccc Confidence 9999999999999999999999999999999999999999999999999999998874211 1 Q ss_pred ccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC--- Q lcl|NC_010147. 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--- 211 (274) Q Consensus 135 ~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~--- 211 (274) ...+.++++.|++|.++|||+.....+++|||.+|..|+++++++|...++. ++.|++|+|++||+|+++| T Consensus 160 ~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~~~~G~~VivdD~~p~~~ 233 (330) T protein:vir:10 160 KASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTA------TINIPTYLGYRVIIDDGIAPTG 233 (330) T ss_pred ccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhccccc------CcccccccceEEEEeCCCCCCC Confidence 1233467899999999999999999999999999999999999999887664 3568999999999999997 Q ss_pred -cceEEEEeCCeEEEEeec---CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEe----cCCCCCC Q lcl|NC_010147. 212 -AGTAILAKKGAVKLILKR---DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK----GSGSLEM 274 (274) Q Consensus 212 -~~~~~~~~~~a~~~~~~~---~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~----~~a~~~~ 274 (274) +|++|++++||+++..+. .+.+|++|++.+++|.+..|+||..+ |.++-.-.. +.-||-. T Consensus 234 ~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~h---p~G~s~~~~~~~~~~~sPt~ 301 (330) T protein:vir:10 234 DIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMH---PYGVKWTGAEVDAGNITPSN 301 (330) T ss_pred CceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEee---eeeeeecccccccCcCCcCh Confidence 788999999999998654 37899999999999999999998877 555544322 2223333 No 16 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=7.4e-53 Score=306.39 Aligned_cols=227 Identities=45% Similarity=0.661 Sum_probs=210.7 Q ss_pred ccccCCCceEEEEeeccCCccccccCCCcCCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHH Q lcl|NC_010147. 40 TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) Q Consensus 40 ~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~ 119 (274) +=....|+||+||+| +|++++++||++++++++++++.++++++++++|+++|++.+.+.+||+.++.+|++.++|++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~k 78 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHHh Confidence 111235999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 120 VDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 120 ~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) +|+++++.+.++++++.+ .+++|.|++|..+|++++..+++++|||++++.|||+....+ ..+..|++++++|.||++ T Consensus 79 vD~di~~~~~~a~l~~~~-~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~-~~~~~g~~i~~~G~iG~i 156 (231) T protein:vir:73 79 VDDDLLKAAKTTSQTVST-KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADV 156 (231) T ss_pred hhHHHHHhhccccccccc-cccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhh-hhhhhccceeeecccceE Confidence 999999999999988765 589999999999999999999999999999999999875433 346778899999999999 Q ss_pred ccceEEEcCCCCcceE----EEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 200 LGAIIVRTNKLEAGTA----ILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 200 ~G~~Vv~s~~v~~~~~----~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +|++|++|+++|++++ |++.++|++++.++++++|++||+++++|.++++.||+++++||+++|++|++.- T Consensus 157 ~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 9999999999999887 5677999999999999999999999999999999999999999999999999888 No 17 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=100.00 E-value=6.5e-52 Score=301.19 Aligned_cols=263 Identities=17% Similarity=0.165 Sum_probs=223.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccc------cccccc-cCCCceEEEEeeccC-CccccccCCCcCCcc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAE------VDSTLQ-GQPGDTLTFPAFVYS-GDAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~------~~~~~~-~~~g~tv~ip~~~~~-~~~~~~~eg~~i~~~ 72 (274) || +|+++|+|+||||++|+.+++.+++.|.+++. .+..+. +.+|+++++|+|+++ |+++++.++++++++ T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 77 89999999999999999999999999966543 233443 458999999999998 899999999999999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------ccc---cccc Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------TVN---ADIT 140 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------~~~---~~~~ 140 (274) +++++++.+++++++++|.++|+....++.||+.++.+|++.+|+++.|+.+++.|++... .+. ...+ T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~ 158 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIY 158 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecccccee Confidence 9999999999999999999999999999999999999999999999999999999875311 111 2236 Q ss_pred CHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC--------- Q lcl|NC_010147. 141 KLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--------- 211 (274) Q Consensus 141 ~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~--------- 211 (274) +++.|++|.++|||+.....+++|||.+|..|++++.++|+..++. .+.|++|+|++||+|+.|| T Consensus 159 s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~~~~G~~VivdD~~p~~~~~~~~~ 232 (324) T protein:vir:59 159 SAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQS------GIRFPTYMNKRVIVDDSMPVETLEDGTK 232 (324) T ss_pred cHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhcccccc------CceeeeecccEEEEeCCCCccccCCCCc Confidence 8899999999999999999999999999999999999999887664 2468999999999999987 Q ss_pred cceEEEEeCCeEEEEee-cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEe--cCCCCCC Q lcl|NC_010147. 212 AGTAILAKKGAVKLILK-RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK--GSGSLEM 274 (274) Q Consensus 212 ~~~~~~~~~~a~~~~~~-~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~--~~a~~~~ 274 (274) +|++|++++||+++..+ .++.+|++|++.++.|.++.|+||..++ .|+-.... +..||.. T Consensus 233 ~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p---~G~s~~~~~~~~~sPt~ 295 (324) T protein:vir:59 233 VFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHP---RGVKFTENAMAGTTPTD 295 (324) T ss_pred eEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEe---eeEEecccccCCCCCCh Confidence 57899999999999985 4588999999999999999999987664 34322111 1223332 No 18 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=100.00 E-value=2.1e-50 Score=292.96 Aligned_cols=261 Identities=17% Similarity=0.130 Sum_probs=216.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccc--cc----cccccCCCceEEEEeeccC-CccccccCCCcCCccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAE--VD----STLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~--~~----~~~~~~~g~tv~ip~~~~~-~~~~~~~eg~~i~~~~ 73 (274) || +|+++|+|+||+|++|+.++..+++.|.++.. .+ ..+.+ +|++++||+|+++ |+++++.++++|++++ T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~-~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLE-AGTRITVPFLNDLTGDPDNWTDSDDIDVNN 77 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhc-CCCEEEecccccCCCcccccCCCcccchhe Confidence 76 79999999999999999999999998866432 22 23344 8999999999988 8999999999999999 Q ss_pred cccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------ccccc Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------------TVNAD 138 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------------~~~~~ 138 (274) ++++++.+++++++++|.++|++.+.++.|||.++++|++.+|+++.|+.+|+.+++... ..... T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~ 157 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEP 157 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccccc Confidence 999999999999999999999999999999999999999999999999999999875311 11233 Q ss_pred ccCHHHHHHHHHHHhhcCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC------ Q lcl|NC_010147. 139 ITKLNGLQSAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE------ 211 (274) Q Consensus 139 ~~~~d~i~~A~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~------ 211 (274) .++++.|++|.++|||... ....++|||.++..|++++.++|+..++. ++.|++|+|++||+|+.|| T Consensus 158 ~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~t~~G~~VivdD~~p~~~~~~ 231 (351) T protein:vir:15 158 MFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNG------ATPFEAYNGLRIVLDDDIEIDLTDK 231 (351) T ss_pred ccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhcccccc------CcccceecceEEEEcCCCccccCCC Confidence 4688999999999999765 58999999999999999999999988764 2458999999999999997 Q ss_pred ---cceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEe----cCCCCCC Q lcl|NC_010147. 212 ---AGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITK----GSGSLEM 274 (274) Q Consensus 212 ---~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~----~~a~~~~ 274 (274) +|++|++++||+++..+.+ .+|++|++.+ +.|.++.|+||..+ |.++-.-.. +..|+.. T Consensus 232 ~~~~ytsyl~~~GAi~~~~~~~-~ve~~rd~~~~~g~d~l~~r~~~~~h---p~G~s~~~~~~~~~~~sPt~ 299 (351) T protein:vir:15 232 TKPVSTSYIFAPGAVRYSTNMR-STETKYDPLINGGQDVIVQKRVGTIH---VAGTSIKASFSPSKASFPTI 299 (351) T ss_pred CCceeEEEEEecceeeeecCCc-CcceeecccCCCCceEEEEeeeeeee---eeeeeecccccccCcCCcCh Confidence 4689999999999987654 5788888765 78999999987644 455443211 1223333 No 19 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=5.2e-47 Score=274.33 Aligned_cols=261 Identities=18% Similarity=0.196 Sum_probs=222.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+ +.|+||+|++++.+++.+++++.+++++++++.+..|+||+||+|...+......+|..+++++++.++++ T Consensus 1 MA~------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEE Confidence 665 45899999999999999999999999999988888999999999987654444568888999999999999 Q ss_pred EEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-c----cccCHHHHHHHHHHHhh Q lcl|NC_010147. 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-A----DITKLNGLQSAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~-~----~~~~~d~i~~A~~~l~~ 154 (274) +++++ .+..+.++|++..++..|+. .+.+++++++|+++|+++++.+.++..... + ....++.|++|..+|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhh Confidence 99987 58999999999999988875 589999999999999999999877543322 1 12347889999999999 Q ss_pred cCC--CceEEEEcHHHHHHHHhhccccccccccc-cccceeccccceeccceEEEcCCCCcce---EEEEeCCeEEEEee Q lcl|NC_010147. 155 EDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEAGT---AILAKKGAVKLILK 228 (274) Q Consensus 155 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~-~~~~~~~g~ig~~~G~~Vv~s~~v~~~~---~~~~~~~a~~~~~~ 228 (274) +++ ++|+++|+|++++.|+++..+ +...... .+..+++|.+|+++|++|++|+++|.++ ++.++++|++++.+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~Ll~~~~~-~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~ 232 (273) T protein:vir:79 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred ccCCccCcEEEECHHHHHHHhhchhh-hhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeee Confidence 986 689999999999999987532 2333333 3467899999999999999999999654 56889999998765 Q ss_pred cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 229 ~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) . ..+|..|++.+++|.++++++||+++++|++++.++++++ T Consensus 233 ~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred h-hhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 4 5899999999999999999999999999999999999888 No 20 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=100.00 E-value=2.6e-47 Score=275.94 Aligned_cols=265 Identities=11% Similarity=0.132 Sum_probs=219.8 Q ss_pred CCC--ccceeeeeechHHHHHHHHHHHHHHhhhhcccc--cccccc---cCCCceEEEEeeccC-CccccccCCC---cC Q lcl|NC_010147. 1 MPQ--GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAE--VDSTLQ---GQPGDTLTFPAFVYS-GDAQVVAEGE---KI 69 (274) Q Consensus 1 Ma~--~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~--~~~~~~---~~~g~tv~ip~~~~~-~~~~~~~eg~---~i 69 (274) ||. ..|+++|+|+||+|.+|+.++..+++.|.++.. .+..+. ..+|++++||+|+++ |+.+.|.+++ ++ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 885 449999999999999999999999988876643 344443 357999999999988 5566675544 58 Q ss_pred CccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------- Q lcl|NC_010147. 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------------- 134 (274) Q Consensus 70 ~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------------- 134 (274) ++.+++++++.+++..++++|..+|+....++.|||+++++|++.||.|..++.||+.|++.... T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~ 160 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999988752210 Q ss_pred --------------c------cccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceecc Q lcl|NC_010147. 135 --------------V------NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) Q Consensus 135 --------------~------~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g 194 (274) . ....++++.|++|.++|||+..++..++|||.+++.|+++++++|+..++. .. T Consensus 161 ~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~------~~ 234 (367) T protein:vir:80 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG------QL 234 (367) T ss_pred ccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCC------cc Confidence 0 112367899999999999999999999999999999999999999998874 24 Q ss_pred ccceeccceEEEcCCCC--------cceEEEEeCCeEEEEeecC-ceeeeecchhh----cceEEEEEEEEEEEEEcCcc Q lcl|NC_010147. 195 AFGEALGAIIVRTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDAST----KTTALYSDKHYVAYLYDESK 261 (274) Q Consensus 195 ~ig~~~G~~Vv~s~~v~--------~~~~~~~~~~a~~~~~~~~-~~ve~~rd~~~----~~~~v~~~~~yg~~~~~~~~ 261 (274) .|++|+|++||+||.|| +|++|+|++|||+|....+ +.+|++||+++ +.|.++.|+| .+++|.+ T Consensus 235 ~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP~G 311 (367) T protein:vir:80 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGG 311 (367) T ss_pred ccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee---EEeecce Confidence 59999999999999998 6889999999999997655 44799999986 5699999999 4555777 Q ss_pred EEEEEecCC----------------CCCC Q lcl|NC_010147. 262 AVKITKGSG----------------SLEM 274 (274) Q Consensus 262 ~v~~~~~~a----------------~~~~ 274 (274) +-.....-+ |+.. T Consensus 312 ~s~~~~~v~~~~~~~~~~~~~~~~~sPt~ 340 (367) T protein:vir:80 312 FNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) T ss_pred eeecccccccccccccccccccccCCCCh Confidence 765443322 1222 No 21 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1.7e-46 Score=271.49 Aligned_cols=261 Identities=18% Similarity=0.194 Sum_probs=221.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+ +.|+||+|++.+.+++.+.+++.++++++++.++..|++++||++...+......+|..+++++++.++++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 554 56789999999999999999999999999988888999999999987644333456778899999999999 Q ss_pred EEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cc----ccCHHHHHHHHHHHhh Q lcl|NC_010147. 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-AD----ITKLNGLQSAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~-~~----~~~~d~i~~A~~~l~~ 154 (274) +++++ .+..+.++|++..++..++. .+.+++++++|+++|+++++.+.++..... +. ...++.|++|..+|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99977 58999999999999988874 589999999999999999999887644332 21 2347899999999999 Q ss_pred cCC--CceEEEEcHHHHHHHHhhccccccccccc-cccceeccccceeccceEEEcCCCCcc---eEEEEeCCeEEEEee Q lcl|NC_010147. 155 EDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEAG---TAILAKKGAVKLILK 228 (274) Q Consensus 155 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~-~~~~~~~g~ig~~~G~~Vv~s~~v~~~---~~~~~~~~a~~~~~~ 228 (274) +++ ++|+++|+|++++.|+++..+ +...... +++.+++|.+|+++|++|++|+++|.+ +++.++++|++++.+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhh-hhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 886 689999999999999987532 2222322 346789999999999999999999964 467899999999865 Q ss_pred cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 229 ~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) ..++|..|++.+++|.|+++++||+++++|++++.++++++ T Consensus 233 -~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 233 -IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -eehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 45899999999999999999999999999999999999888 No 22 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1.7e-46 Score=271.49 Aligned_cols=261 Identities=18% Similarity=0.194 Sum_probs=221.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||+ +.|+||+|++.+.+++.+.+++.++++++++.++..|++++||++...+......+|..+++++++.++++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 554 56789999999999999999999999999988888999999999987644333456778899999999999 Q ss_pred EEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cc----ccCHHHHHHHHHHHhh Q lcl|NC_010147. 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-AD----ITKLNGLQSAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~-~~----~~~~d~i~~A~~~l~~ 154 (274) +++++ .+..+.++|++..++..++. .+.+++++++|+++|+++++.+.++..... +. ...++.|++|..+|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99977 58999999999999988874 589999999999999999999887644332 21 2347899999999999 Q ss_pred cCC--CceEEEEcHHHHHHHHhhccccccccccc-cccceeccccceeccceEEEcCCCCcc---eEEEEeCCeEEEEee Q lcl|NC_010147. 155 EDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEAG---TAILAKKGAVKLILK 228 (274) Q Consensus 155 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~-~~~~~~~g~ig~~~G~~Vv~s~~v~~~---~~~~~~~~a~~~~~~ 228 (274) +++ ++|+++|+|++++.|+++..+ +...... +++.+++|.+|+++|++|++|+++|.+ +++.++++|++++.+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhh-hhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 886 689999999999999987532 2222322 346789999999999999999999964 467899999999865 Q ss_pred cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 229 ~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) ..++|..|++.+++|.|+++++||+++++|++++.++++++ T Consensus 233 -~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 233 -IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -eehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 45899999999999999999999999999999999999888 No 23 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=100.00 E-value=9e-43 Score=251.08 Aligned_cols=263 Identities=14% Similarity=0.131 Sum_probs=212.8 Q ss_pred CCCccceeeeeechH--HHHHHHHHHHHHHhhhhccc--ccccccc---cCCCceEEEEeeccC-Ccccc-ccCC---Cc Q lcl|NC_010147. 1 MPQGITKTSNQIIPE--VLAPMMQAQLEKKLRFASFA--EVDSTLQ---GQPGDTLTFPAFVYS-GDAQV-VAEG---EK 68 (274) Q Consensus 1 Ma~~~T~~~~~~~Pe--v~~~~v~~~~~~~~v~~~~~--~~~~~~~---~~~g~tv~ip~~~~~-~~~~~-~~eg---~~ 68 (274) || +|+++|+++|| +|.+|+.++..+++.|.++. ..+..+. ..+|+.++||+|+.+ |+.+. +... +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 55 89999999998 89999999999988887654 3344443 357999999999986 56442 3322 35 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-------------- 134 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~-------------- 134 (274) +++.+++++++.+.+..++++|..+|+..+.++.|||++|+++++.+|.|..++.+++.|++.... T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~ 158 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDM 158 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCce Confidence 889999999999999999999999999999999999999999999999999999999998763211 Q ss_pred ----ccccccCHHHHHHHHHHHhhc-----CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEE Q lcl|NC_010147. 135 ----VNADITKLNGLQSAIDKFNDE-----DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) Q Consensus 135 ----~~~~~~~~d~i~~A~~~l~~~-----~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv 205 (274) ...+.++.+.|++|..+||++ ...+..++|||.+++.|+++++++|+..++. ...|++|+|++|| T Consensus 159 ~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~------~~~i~ty~G~~Vi 232 (349) T protein:vir:94 159 VVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAEN------NTMFATYQGYRVI 232 (349) T ss_pred eEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCccc------CcccceecCcEEE Confidence 012336788999999999886 4678999999999999999999999987763 3358999999999 Q ss_pred EcCCCC--------cceEEEEeCCeEEEEeecC-ceeeeecchhh----cceEEEEEEEEEEEEEcCccEEEEEecC--- Q lcl|NC_010147. 206 RTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDAST----KTTALYSDKHYVAYLYDESKAVKITKGS--- 269 (274) Q Consensus 206 ~s~~v~--------~~~~~~~~~~a~~~~~~~~-~~ve~~rd~~~----~~~~v~~~~~yg~~~~~~~~~v~~~~~~--- 269 (274) +||.|| +|++|+|++|||++....+ +.+|++|++.+ +.|.+..|+||..| |.++-...... T Consensus 233 vDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~h---p~G~s~~~a~v~~~ 309 (349) T protein:vir:94 233 VDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTWLLH---PFGYSFTSAVITGN 309 (349) T ss_pred EeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEEEee---eeeeeecccccCCC Confidence 999998 6789999999999998754 56899999976 56999999996544 56654433211 Q ss_pred ------CCCC---C Q lcl|NC_010147. 270 ------GSLE---M 274 (274) Q Consensus 270 ------a~~~---~ 274 (274) .|+. | T Consensus 310 ~~~~~~~sPt~aeL 323 (349) T protein:vir:94 310 GTETIARSASWQDL 323 (349) T ss_pred ccccccCCCChHHh Confidence 1222 3 No 24 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.8e-43 Score=254.90 Aligned_cols=268 Identities=18% Similarity=0.190 Sum_probs=223.3 Q ss_pred CCCccceee--------eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc Q lcl|NC_010147. 1 MPQGITKTS--------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~~~T~~~--------~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~ 72 (274) |++.+|-.. +.|+||+|+.++.+++.++++|.+++ ++++.....|++|+||.++. +.+.++++|..++++ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~~~Gdtv~ip~~g~-~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQVKKGDTFHVPRISE-LGVEDKATDVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccccCCceEEEeccCc-ceeeeecCCCccccc Confidence 888776444 45899999999999999999998876 56677777799999999874 578899999999999 Q ss_pred ccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------ccc Q lcl|NC_010147. 73 ILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------------VNA 137 (274) Q Consensus 73 ~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------------~~~ 137 (274) +++.++.++++++ .+..+.++|++..++..|+++++.+++++++|+++|+.+++.+..++.. ... T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 9999999999966 5899999999999999999999999999999999999999887543221 112 Q ss_pred cccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceE Q lcl|NC_010147. 138 DITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA 215 (274) Q Consensus 138 ~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~ 215 (274) ...+++.|++|...|+++++ ++|+++++|++++.|+++. .|......++..+++|.+|+++|++|++|+++|.++. T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~--~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~ 236 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIP--QFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSA 236 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhch--hhhhhhccccchhheeeeeeEeceEEEEecccccccc Confidence 23468999999999999876 6899999999999999875 4566666777789999999999999999999996543 Q ss_pred -------------------------------------EEEeCCeEEEEe-----------ecCceeeeecchhhcceEEE Q lcl|NC_010147. 216 -------------------------------------ILAKKGAVKLIL-----------KRDFFLEVARDASTKTTALY 247 (274) Q Consensus 216 -------------------------------------~~~~~~a~~~~~-----------~~~~~ve~~rd~~~~~~~v~ 247 (274) +++++.+++.+. .+...+|.+|++.++.|.|. T Consensus 237 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 316 (341) T protein:vir:94 237 TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMV 316 (341) T ss_pred ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhh Confidence 233444443331 23356778899999999999 Q ss_pred EEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 248 SDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 248 ~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) +++.||++++||++++.|+.+++-- T Consensus 317 ~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 317 GRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhhhcccccCcceeEEEecCcCCC Confidence 9999999999999999998877776 No 25 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=100.00 E-value=1.1e-42 Score=250.63 Aligned_cols=263 Identities=14% Similarity=0.128 Sum_probs=212.1 Q ss_pred CCCccceeeeeechH--HHHHHHHHHHHHHhhhhccc--ccccccc---cCCCceEEEEeeccC-Cccc-cc-cCC--Cc Q lcl|NC_010147. 1 MPQGITKTSNQIIPE--VLAPMMQAQLEKKLRFASFA--EVDSTLQ---GQPGDTLTFPAFVYS-GDAQ-VV-AEG--EK 68 (274) Q Consensus 1 Ma~~~T~~~~~~~Pe--v~~~~v~~~~~~~~v~~~~~--~~~~~~~---~~~g~tv~ip~~~~~-~~~~-~~-~eg--~~ 68 (274) || +|+++|+++|| +|.+|+.++..+++.|.++. ..+..+. ..+|++++||+|+.+ |+.+ .+ +++ +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 55 89999999998 89999999999988887654 3344443 357999999999986 4444 22 333 46 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------- 133 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--------------- 133 (274) +++.+++++++.+.+..++++|..+|+..+.++.|||++|+++++.+|.|..++.+++.|++... T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~ 158 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDM 158 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccc Confidence 79999999999999999999999999999999999999999999999999999999999875321 Q ss_pred c--c-cccccCHHHHHHHHHHHhhc-----CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEE Q lcl|NC_010147. 134 T--V-NADITKLNGLQSAIDKFNDE-----DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) Q Consensus 134 ~--~-~~~~~~~d~i~~A~~~l~~~-----~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv 205 (274) + + ..+.++++.|++|.++||++ ...+..++|||.++..|+++++++|+..++. ...|++|+|++|| T Consensus 159 t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~------~~~i~ty~G~~Vi 232 (349) T protein:vir:78 159 VVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAEN------NTMFATYQGYRVI 232 (349) T ss_pred eeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCccc------CcccceecCeEEE Confidence 0 1 12236888999999999986 4678999999999999999999999987763 3358999999999 Q ss_pred EcCCCC--------cceEEEEeCCeEEEEeecC-ceeeeecchhh----cceEEEEEEEEEEEEEcCccEEEEEecC--- Q lcl|NC_010147. 206 RTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDAST----KTTALYSDKHYVAYLYDESKAVKITKGS--- 269 (274) Q Consensus 206 ~s~~v~--------~~~~~~~~~~a~~~~~~~~-~~ve~~rd~~~----~~~~v~~~~~yg~~~~~~~~~v~~~~~~--- 269 (274) +||.|| +|++|+|++|||++....+ +.+|++||+.+ +.|.+..|+||..| |.++-...... T Consensus 233 vDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~h---p~G~s~~~a~v~~~ 309 (349) T protein:vir:78 233 VDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTWLLH---PFGYRFTSAVITGN 309 (349) T ss_pred EeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEEEee---eeeeeeccccccCC Confidence 999998 6789999999999988654 56899999976 56999999996544 55554433211 Q ss_pred ------CCC---CC Q lcl|NC_010147. 270 ------GSL---EM 274 (274) Q Consensus 270 ------a~~---~~ 274 (274) .|+ || T Consensus 310 ~~~~~~~sPt~aeL 323 (349) T protein:vir:78 310 GTETIARSASWQDL 323 (349) T ss_pred ccccccCCCChHHh Confidence 122 23 No 26 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=7.7e-38 Score=224.05 Aligned_cols=265 Identities=16% Similarity=0.129 Sum_probs=216.3 Q ss_pred CCCccce------ee--------e-eechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccC Q lcl|NC_010147. 1 MPQGITK------TS--------N-QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE 65 (274) Q Consensus 1 Ma~~~T~------~~--------~-~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~e 65 (274) |||..|- .. + +|+ |+|+..|.+.+.+++++.++... .++ +.|++++||+.+.. .+..+.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~-r~~--~~G~sv~i~~iG~~-t~~~~~~ 75 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHML-RSI--ASGKSAQFPVIGRT-KAAYLKP 75 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhcc-ccc--cccceeEeeeccce-eeeeecC Confidence 8865431 11 1 477 99999999999999999999875 233 35999999998764 7788999 Q ss_pred CCcCC--ccccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------c-- Q lcl|NC_010147. 66 GEKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--------K-- 132 (274) Q Consensus 66 g~~i~--~~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--------~-- 132 (274) |++++ +.+++..+.++++++. +..+.|+|++..++..|+++++.+++++++|++.|+.++..+... . T Consensus 76 g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:33 76 GENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENI 155 (347) T ss_pred CCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 99885 4668899999999875 788999999999999999999999999999999999987554210 0 Q ss_pred -------cc----cccc---------ccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccc Q lcl|NC_010147. 133 -------LT----VNAD---------ITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDI 190 (274) Q Consensus 133 -------~~----~~~~---------~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~ 190 (274) .. ..+. ..-++.|++|..+|+++++ ++|+++|+|++|..|+++.. +......+++. T Consensus 156 ~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~--~~~~d~~~~~~ 233 (347) T protein:vir:33 156 EGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM--PNAANYQALLD 233 (347) T ss_pred ccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccc--ccccccccccc Confidence 00 0010 0126788999999999886 68999999999999999874 33444456677 Q ss_pred eeccccceeccceEEEcCCCCcce--------------------------------EEEEeCCeEEEEeecCceeeeecc Q lcl|NC_010147. 191 IVKGAFGEALGAIIVRTNKLEAGT--------------------------------AILAKKGAVKLILKRDFFLEVARD 238 (274) Q Consensus 191 ~~~g~ig~~~G~~Vv~s~~v~~~~--------------------------------~~~~~~~a~~~~~~~~~~ve~~rd 238 (274) +.+|.+++++|++|++|+++|.+. ++++|++|++.+...++++|..|+ T Consensus 234 ~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~ 313 (347) T protein:vir:33 234 PERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred cccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccc Confidence 899999999999999999998531 258899999999989999999999 Q ss_pred hhhcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 239 ~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) +.++.|.|++++.||+++++|++++.|++..-+. T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 9999999999999999999999999998765555 No 27 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=2.9e-37 Score=220.87 Aligned_cols=264 Identities=18% Similarity=0.148 Sum_probs=208.4 Q ss_pred CCCccce-e-------------eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC Q lcl|NC_010147. 1 MPQGITK-T-------------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) Q Consensus 1 Ma~~~T~-~-------------~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg 66 (274) |||.+-. + .++|+ |+|...+...+.+.++|.++..+. + .+.|++++||+.+.. .+..+.+| T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r-~--i~~G~sv~i~~iG~~-tv~~~t~G 75 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR-T--IQNGKSAQFPVMGRT-SGVYLAPG 75 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc-c--ccccceEEEecccce-eeeeecCC Confidence 8875421 1 12333 456666666677888888876543 2 345999999998654 77889999 Q ss_pred CcC--CccccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc---c---cc---- Q lcl|NC_010147. 67 EKI--PTDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG---A---KL---- 133 (274) Q Consensus 67 ~~i--~~~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~---a---~~---- 133 (274) +++ .++.++.++.++++++. +..+.|+|.+..++..|+++++.+++++++++.+|+.++..+.. . +. T Consensus 76 ~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 76 ERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred CCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 998 55678999999999885 78899999999999999999999999999999999999865532 0 00 Q ss_pred ------cc----ccc----c----cCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceec Q lcl|NC_010147. 134 ------TV----NAD----I----TKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) Q Consensus 134 ------~~----~~~----~----~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~ 193 (274) .. .+. . .-++.|.+|..+|.++++ ++|+++|+|++|..|+++.. +......+++.+.+ T Consensus 156 g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~--~~~~~~~~~~~~~~ 233 (347) T protein:vir:94 156 GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALM--PNAANYAALIDPET 233 (347) T ss_pred CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccch--hhhhhccccccccc Confidence 00 000 0 115678888899998875 68999999999999987753 34444455667889 Q ss_pred cccceeccceEEEcCCCCcc------------------------------------eEEEEeCCeEEEEeecCceeeeec Q lcl|NC_010147. 194 GAFGEALGAIIVRTNKLEAG------------------------------------TAILAKKGAVKLILKRDFFLEVAR 237 (274) Q Consensus 194 g~ig~~~G~~Vv~s~~v~~~------------------------------------~~~~~~~~a~~~~~~~~~~ve~~r 237 (274) |.+++++|++|++|+++|.+ .+.+||+.|++.+...++++|.+| T Consensus 234 G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r 313 (347) T protein:vir:94 234 GNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDR 313 (347) T ss_pred cceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchh Confidence 99999999999999999831 235778999999988999999999 Q ss_pred chhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 238 DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 238 d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.++.|.|++++.||+++++|++++.|+.+.|- T Consensus 314 ~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 314 DVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred chhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 9999999999999999999999999999998666 No 28 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=1.3e-36 Score=217.35 Aligned_cols=266 Identities=17% Similarity=0.116 Sum_probs=213.9 Q ss_pred CCCcccee------ee-eech-------HHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC Q lcl|NC_010147. 1 MPQGITKT------SN-QIIP-------EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) Q Consensus 1 Ma~~~T~~------~~-~~~P-------ev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg 66 (274) ||+.++-- .. ...+ |+|+..+.+.+.+.+++.++..+. + ...|++++||+.+.. .++.+.+| T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~-~--~~~G~sv~i~~ig~~-t~~~~~~g 76 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR-S--IASGKSAQFPVIGRT-KAAYLKPG 76 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc-c--ccccceeEeeeccce-eeeeeccC Confidence 88765421 11 1233 567888889999999999987653 2 335999999998763 77889999 Q ss_pred CcCC--ccccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Q lcl|NC_010147. 67 EKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------- 133 (274) Q Consensus 67 ~~i~--~~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------- 133 (274) ++++ +..++.++.++++++. +..+.|+|++..++..|+++++.+++++++|+++|+.++..+.++.. T Consensus 77 ~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:15 77 ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIE 156 (347) T ss_pred CCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 9885 4668999999999875 78899999999999999999999999999999999999876542100 Q ss_pred -----------cccccc-c----C----HHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccce Q lcl|NC_010147. 134 -----------TVNADI-T----K----LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDII 191 (274) Q Consensus 134 -----------~~~~~~-~----~----~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~ 191 (274) ...+.. . . ++.+.+|..+|+++++ ++||++|+|++|..|+++.. +......+.+.+ T Consensus 157 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~--~~~~d~~~~~~~ 234 (347) T protein:vir:15 157 GLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM--PNAANYQALIDH 234 (347) T ss_pred ccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccc--cccccccccccc Confidence 000111 1 1 5667778889999886 68999999999999999864 445555566778 Q ss_pred eccccceeccceEEEcCCCCcc--------------------------------eEEEEeCCeEEEEeecCceeeeecch Q lcl|NC_010147. 192 VKGAFGEALGAIIVRTNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARDA 239 (274) Q Consensus 192 ~~g~ig~~~G~~Vv~s~~v~~~--------------------------------~~~~~~~~a~~~~~~~~~~ve~~rd~ 239 (274) ++|.+++++|++|++|+++|.+ ...++|+.|++.+..+++++|.+|++ T Consensus 235 ~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~ 314 (347) T protein:vir:15 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred cceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccc Confidence 9999999999999999999832 13578999999999999999999999 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) .++.|.|++++.||+++++|++++.|.+..-+. T Consensus 315 ~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 315 NYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 999999999999999999999999998765555 No 29 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=1.1e-36 Score=217.82 Aligned_cols=263 Identities=18% Similarity=0.166 Sum_probs=213.5 Q ss_pred CCCcccee----------------eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccccc Q lcl|NC_010147. 1 MPQGITKT----------------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVA 64 (274) Q Consensus 1 Ma~~~T~~----------------~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~ 64 (274) |||.+|-. -++|+ |+|+..|.+.+.+.++|.++..+ +++. .|++++||+.+.. .+..+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~-r~i~--~g~s~~~~~iG~~-~~~~~~ 75 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMV-RSIS--SGKSAQFPVLGRT-QAAYLA 75 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhccccee-eeec--ccceEEEEeecee-EEEeee Confidence 88765431 01356 89999999999999999998866 3444 4999999998654 667789 Q ss_pred CCCcCCc--cccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Q lcl|NC_010147. 65 EGEKIPT--DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------- 133 (274) Q Consensus 65 eg~~i~~--~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------- 133 (274) +|++++. +++..+++++++++ .+..+.|+|++..++..|+++++.+++++++|+.+|+.++..+..+.. T Consensus 76 ~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~ 155 (344) T protein:vir:10 76 PGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN 155 (344) T ss_pred cCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 9999864 57899999999988 588999999999999999999999999999999999999866532110 Q ss_pred ------------ccc----ccc-c----CHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccc Q lcl|NC_010147. 134 ------------TVN----ADI-T----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDI 190 (274) Q Consensus 134 ------------~~~----~~~-~----~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~ 190 (274) ... +++ . -++.|.+|..+|.++++ ++||++|+|++|..|+++..+ ......+++. T Consensus 156 ~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~--~~~~~~~~~~ 233 (344) T protein:vir:10 156 ITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMP--NAANYAALID 233 (344) T ss_pred cccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccc--cccccccccc Confidence 000 011 1 25678889999999886 679999999999999987643 3444456677 Q ss_pred eeccccceeccceEEEcCCCCcc-------------------------------eEEEEeCCeEEEEeecCceeeeecch Q lcl|NC_010147. 191 IVKGAFGEALGAIIVRTNKLEAG-------------------------------TAILAKKGAVKLILKRDFFLEVARDA 239 (274) Q Consensus 191 ~~~g~ig~~~G~~Vv~s~~v~~~-------------------------------~~~~~~~~a~~~~~~~~~~ve~~rd~ 239 (274) .++|.+++++|++|++|+++|.+ .+.+||+.|++.+...++++|..|++ T Consensus 234 ~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~ 313 (344) T protein:vir:10 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA 313 (344) T ss_pred eeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch Confidence 89999999999999999999842 12578899999998899999999999 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++.|.|++++.||+++++|++++++.++.- T Consensus 314 ~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 314 NFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hHHHHHHHHHhhcccceecccceEEEEeecC Confidence 9999999999999999999998877666554 No 30 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=1.6e-36 Score=216.87 Aligned_cols=263 Identities=13% Similarity=0.099 Sum_probs=210.2 Q ss_pred CCCccce-------eee----eechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcC Q lcl|NC_010147. 1 MPQGITK-------TSN----QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI 69 (274) Q Consensus 1 Ma~~~T~-------~~~----~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i 69 (274) |.++..- .+| +|+ |+|+.+|.+++.+.++|.++.... ++ ..|++++||+.+.. .+..+.+|+++ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r-~i--~~G~tv~i~~ig~~-~~~~~~~g~~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGKL-SAGYHTPGTPI 81 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc-cc--cccceEEEEeccce-eEeeecCCCCC Confidence 4443322 222 567 899999999999999999987653 33 35999999998653 77789999999 Q ss_pred Ccc-ccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Q lcl|NC_010147. 70 PTD-ILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------- 134 (274) Q Consensus 70 ~~~-~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~------------- 134 (274) .++ +++++++++++++ .+..+.|+|++..++..|+++++.+++++++|+.+|+.++..+..+... T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 876 5999999999998 6899999999999999999999999999999999999998877543211 Q ss_pred -cc-cccc----CHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccc-cccccceeccc-cceeccceE Q lcl|NC_010147. 135 -VN-ADIT----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRAT-ELGDDIIVKGA-FGEALGAII 204 (274) Q Consensus 135 -~~-~~~~----~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s-~~~~~~~~~g~-ig~~~G~~V 204 (274) .. +..+ -|+.|++|..+|.++++ ++||++|+|++|..|++.....+.... ...++.+++|. +++++|++| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V 241 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRI 241 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEE Confidence 01 1112 36889999999999986 679999999999999984222333321 22335677774 899999999 Q ss_pred EEcCCCCcc------------------------eEEEEeCCeEEEEeecCcee---eeecchhhcceEEEEEEEEEEEEE Q lcl|NC_010147. 205 VRTNKLEAG------------------------TAILAKKGAVKLILKRDFFL---EVARDASTKTTALYSDKHYVAYLY 257 (274) Q Consensus 205 v~s~~v~~~------------------------~~~~~~~~a~~~~~~~~~~v---e~~rd~~~~~~~v~~~~~yg~~~~ 257 (274) ++|+++|.. .+++||+.+++++...+.++ |.+|++.++.|.|++++.||++++ T Consensus 242 ~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~ 321 (332) T protein:vir:78 242 LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSL 321 (332) T ss_pred EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCcee Confidence 999999832 24788999999988777655 468899999999999999999999 Q ss_pred cCccEEEEEec Q lcl|NC_010147. 258 DESKAVKITKG 268 (274) Q Consensus 258 ~~~~~v~~~~~ 268 (274) +|++++.|+.+ T Consensus 322 rPe~~v~l~~a 332 (332) T protein:vir:78 322 RTSVAGSFQAA 332 (332) T ss_pred cccceEEEeeC Confidence 99999999988 No 31 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=2.7e-36 Score=215.56 Aligned_cols=264 Identities=14% Similarity=0.097 Sum_probs=204.7 Q ss_pred CCCcc-ceeeee-echHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGI-TKTSNQ-IIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~-T~~~~~-~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) |+.++ |.-... |+||+|+++++..+.+++++..++.+... + .|+||+||.+.. +.+.+|.++++++++.+++.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~--g-~GDtV~InsIg~-~tV~dY~~~~~i~~d~ltt~~ 76 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDF--P-DGDKLTIPSVGT-PVVRSRPEQGDFTFDNLDTGE 76 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhccccc--C-CCCeEEeccccc-cccccccCCCCcccccCCCce Confidence 88866 333334 55999999999999999998887664322 3 499999999865 478899999999999999999 Q ss_pred eEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccc-----------cc Q lcl|NC_010147. 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------LTV-----------NA 137 (274) Q Consensus 79 ~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------~~~-----------~~ 137 (274) .++.+.+ .+++|.++| +..|...|+++...+++++++++.+|+.+...++++. .+. +. T Consensus 77 ~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 77 ISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 9999987 589999999 8899999999999999999999999998877665321 111 12 Q ss_pred cccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHH---------Hhhccccccccccccccceecc--ccceeccceE Q lcl|NC_010147. 138 DITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKL---------RGDASTNFTRATELGDDIIVKG--AFGEALGAII 204 (274) Q Consensus 138 ~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L---------~k~~~~~~~~~s~~~~~~~~~g--~ig~~~G~~V 204 (274) ....|+.|+++..+|+++++ .+||+||+|.++..| .++..+..+..+.. ..| .+|+++|++| T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~-----a~g~~~Vg~~~GF~V 230 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGI-----APDMQFVRSVYGIDL 230 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccc-----hhhHHHHHHHhceee Confidence 23479999999999999987 589999999998755 55554333322221 223 3899999999 Q ss_pred EEcCCCCc--ceEEEEeCCeEEEE--------------------eecCceeeeecchhhcceEEEEEEEEEEEEEcCccE Q lcl|NC_010147. 205 VRTNKLEA--GTAILAKKGAVKLI--------------------LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKA 262 (274) Q Consensus 205 v~s~~v~~--~~~~~~~~~a~~~~--------------------~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~ 262 (274) ++|++++. ++++.-+.++...+ .|+-.+.|..|+++++.|.++++++||.++.+|+.+ T Consensus 231 ~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l 310 (322) T protein:vir:31 231 FVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENL 310 (322) T ss_pred eeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccce Confidence 99999974 33333333222221 112234689999999999999999999999999999 Q ss_pred EEEEecCCCCCC Q lcl|NC_010147. 263 VKITKGSGSLEM 274 (274) Q Consensus 263 v~~~~~~a~~~~ 274 (274) +.+...++..-. T Consensus 311 ~~~~a~~~~~~~ 322 (322) T protein:vir:31 311 VCVLANADKVTF 322 (322) T ss_pred EEEEeccccccC Confidence 999988887777 No 32 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=2.3e-36 Score=216.01 Aligned_cols=264 Identities=16% Similarity=0.126 Sum_probs=214.1 Q ss_pred CCCcccee------------e---eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccC Q lcl|NC_010147. 1 MPQGITKT------------S---NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE 65 (274) Q Consensus 1 Ma~~~T~~------------~---~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~e 65 (274) |||.++-- + ++|+ |+|+..+...+.+.++|.++..+- + .+.|++++||+.+.. .+..+.+ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r-~--i~~G~sv~~~~iG~~-~~~~~~~ 75 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR-T--IQNGKSASFPVMGRT-KGYYLAP 75 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccc-c--ccCcceEEEeeecce-eeeeecc Confidence 88755411 2 3466 899999999999999999987653 3 346999999998764 5667788 Q ss_pred CCcCCc--cccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|NC_010147. 66 GEKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------- 133 (274) Q Consensus 66 g~~i~~--~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--------- 133 (274) |++++. .++.++++++++++. +..+.|+|.+..++..|+++++.+++++++|+..|+.++..+..+.. T Consensus 76 g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:88 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) T ss_pred ccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 888753 578999999999985 88999999999999999999999999999999999998866532110 Q ss_pred --------cc-c-ccc---------cCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhcccccccccccccccee Q lcl|NC_010147. 134 --------TV-N-ADI---------TKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIV 192 (274) Q Consensus 134 --------~~-~-~~~---------~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~ 192 (274) .. . +.. ..++.|++|..+|+++++ ++|+++|+|++|..|++... +......++..++ T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~--~~~~~~~~~~~~~ 233 (347) T protein:vir:88 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALM--PNAANYAALIDPE 233 (347) T ss_pred CCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchh--hhhhhhccccchh Confidence 00 0 000 126889999999999885 68999999999999998764 3344444555678 Q ss_pred ccccceeccceEEEcCCCCcce-----------------------------------EEEEeCCeEEEEeecCceeeeec Q lcl|NC_010147. 193 KGAFGEALGAIIVRTNKLEAGT-----------------------------------AILAKKGAVKLILKRDFFLEVAR 237 (274) Q Consensus 193 ~g~ig~~~G~~Vv~s~~v~~~~-----------------------------------~~~~~~~a~~~~~~~~~~ve~~r 237 (274) +|.+++++|++|++|+++|.+. ..++++.+++.+...++++|.+| T Consensus 234 ~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r 313 (347) T protein:vir:88 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) T ss_pred cceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeee Confidence 9999999999999999998310 14567888888888888999999 Q ss_pred chhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 238 DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 238 d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.++.|.|++++.||+++++|+.++.|+.+.|+ T Consensus 314 ~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred chhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 9999999999999999999999999999998888 No 33 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=4.6e-36 Score=214.30 Aligned_cols=263 Identities=18% Similarity=0.159 Sum_probs=214.5 Q ss_pred CCCccce--------e-----e---eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccccc Q lcl|NC_010147. 1 MPQGITK--------T-----S---NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVA 64 (274) Q Consensus 1 Ma~~~T~--------~-----~---~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~ 64 (274) ||+.++. - . ++|+ |+|+..+.+.+.+.++|+++..+ +++. .|++++||+.+.. .+..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~-r~i~--~gks~~~~~iG~~-~~~~~~ 75 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMV-RSIS--SGKSAQFPVLGRT-QAAYLA 75 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhccccee-eecc--ccceEEEeeecce-EEEeee Confidence 7764441 1 1 2455 88999999999999999998865 3444 4899999998654 677889 Q ss_pred CCCcCCcc--ccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Q lcl|NC_010147. 65 EGEKIPTD--ILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------- 133 (274) Q Consensus 65 eg~~i~~~--~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------- 133 (274) +|++++.+ ++..++.++++++ .+..+.|+|++..++..|+++++.+++++++|+.+|+.++..+..+.. T Consensus 76 ~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 76 PGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred cCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 99998654 5788999999987 588899999999999999999999999999999999998866532100 Q ss_pred ------------ccccc---------ccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccc Q lcl|NC_010147. 134 ------------TVNAD---------ITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDI 190 (274) Q Consensus 134 ------------~~~~~---------~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~ 190 (274) +..+. ..-++.|.+|..+|+++++ ..||++|+|++|..|+++..+ ......+++. T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~--~~~~~~~~~~ 233 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMP--NAANYAALID 233 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccc--cccccccccc Confidence 00011 1127889999999999886 679999999999999987643 4444556666 Q ss_pred eeccccceeccceEEEcCCCCcc--------------------------------eEEEEeCCeEEEEeecCceeeeecc Q lcl|NC_010147. 191 IVKGAFGEALGAIIVRTNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARD 238 (274) Q Consensus 191 ~~~g~ig~~~G~~Vv~s~~v~~~--------------------------------~~~~~~~~a~~~~~~~~~~ve~~rd 238 (274) ..+|.+++++|++|++|+++|.+ .+.+||+.|++.+...++++|..|+ T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 313 (345) T ss_pred cccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeec Confidence 78999999999999999998731 2267899999999999999999999 Q ss_pred hhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 239 ~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +.++.|.|++++.||+++++|++++.|++..- T Consensus 314 ~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 314 ANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 99999999999999999999999999987655 No 34 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=2.2e-35 Score=210.64 Aligned_cols=270 Identities=13% Similarity=0.089 Sum_probs=201.9 Q ss_pred CCC----------cc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcC Q lcl|NC_010147. 1 MPQ----------GI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI 69 (274) Q Consensus 1 Ma~----------~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i 69 (274) ||+ .+ |...+.|+||+|+..+.+.+.++++|.+++. ...++++.|++++||+++. +.+.++.+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~-~~~~~~~~GdTV~ip~~g~-~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATK-KIPFEGKKGDLIHIPNISR-AAVYDKQPQTPV 78 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccc-cccceeecCceEEeeccCc-ceeeeecCCCcc Confidence 333 22 3444678999999999999999999988765 4567888899999999975 478899999999 Q ss_pred CccccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_010147. 70 PTDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------- 133 (274) Q Consensus 70 ~~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--------------- 133 (274) +.++++.++.++++++ .+..+.++|++..++..|+.+++.++++.++|+++|+.++..+..... T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~ 158 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGD 158 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccc Confidence 9999999999999966 588899999999999999999999999999999999999877643111 Q ss_pred -------cccccccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceE Q lcl|NC_010147. 134 -------TVNADITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAII 204 (274) Q Consensus 134 -------~~~~~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~V 204 (274) +......+++.|++|..+|+++++ ++|+++++|+++..|+++. +|......++..+++|.+|+++|++| T Consensus 159 ~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~--~~~~ad~~~~~~l~~G~Ig~i~G~~V 236 (381) T protein:vir:80 159 GTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSIN--QFISVDFSQVKPVTSGVVGTILGMEV 236 (381) T ss_pred cccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhch--hhhhhhhccchhhhceeeeEEcceEE Confidence 011223468899999999999886 6899999999999999876 34555455667899999999999999 Q ss_pred EEcCCCCcceEEEEeCCeEEEEeecCce--eeeecchhhcceEEEEEEEEEEEEEcCccEEE-EEecCCCCCC Q lcl|NC_010147. 205 VRTNKLEAGTAILAKKGAVKLILKRDFF--LEVARDASTKTTALYSDKHYVAYLYDESKAVK-ITKGSGSLEM 274 (274) Q Consensus 205 v~s~~v~~~~~~~~~~~a~~~~~~~~~~--ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~-~~~~~a~~~~ 274 (274) ++|+++|.+.+..+...+..-....+.. .....+.....+.++..+.|+.++...--.+. ++++..-.+- T Consensus 237 v~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~ 309 (381) T protein:vir:80 237 IVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAAD 309 (381) T ss_pred EeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecC Confidence 9999999865543333332222112211 11233345567899999999999854333333 2322111111 No 35 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=4.3e-34 Score=203.49 Aligned_cols=262 Identities=15% Similarity=0.117 Sum_probs=210.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc-cCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~-~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) || |+-++++.||+|++.+++.|+++++|.++++++++-+ ...|++|+||.+... . ..+|..+++++++.++. T Consensus 1 m~---~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~-~---v~dg~~~~~~~~te~~v 73 (418) T protein:vir:10 1 MA---VQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRV-K---SASGRTLVKQPMVDQTI 73 (418) T ss_pred CC---ccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCce-e---ecccCCccccccccceE Confidence 76 5556677899999999999999999999998887543 345999999996532 3 34467899999999999 Q ss_pred EEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---ccccCHHHHHHHHHHHhhc Q lcl|NC_010147. 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---ADITKLNGLQSAIDKFNDE 155 (274) Q Consensus 80 ~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~---~~~~~~d~i~~A~~~l~~~ 155 (274) ++++.+ .+..|.++|++..++..|+++++.+++++++|+++|++++..+.+++.... +....|+.|++|..+|+++ T Consensus 74 ~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~~~Ld~~ 153 (418) T protein:vir:10 74 PFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANAGAKQTTY 153 (418) T ss_pred EEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHHHHHHHhc Confidence 999966 689999999999999999999999999999999999999998887655433 3334699999999999999 Q ss_pred CC--C-ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCc-------------------- Q lcl|NC_010147. 156 DL--E-PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA-------------------- 212 (274) Q Consensus 156 ~~--~-~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~-------------------- 212 (274) ++ + .|++|++|+.++.|+++..+. ......+..+++|.||+++|++|++|+++|. T Consensus 154 ~VP~~G~R~lVv~P~~~~~L~~~~~~~--~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~ 231 (418) T protein:vir:10 154 AVPQDGMRHAVLDPFTCASLSDEVTKL--FKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGD 231 (418) T ss_pred CCCCCCceEEEeCHHHHHHHhhhcccc--ccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecccccce Confidence 86 3 499999999999999876543 3334455679999999999999999999971 Q ss_pred -----------------ceE------------------------------------------------------------ Q lcl|NC_010147. 213 -----------------GTA------------------------------------------------------------ 215 (274) Q Consensus 213 -----------------~~~------------------------------------------------------------ 215 (274) |+. T Consensus 232 ~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~ 311 (418) T protein:vir:10 232 TVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPV 311 (418) T ss_pred eEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccccccccccc Confidence 110 Q ss_pred -------------------------------EEEeCCeEEEEeecC--------------------ceeeeecchhhcce Q lcl|NC_010147. 216 -------------------------------ILAKKGAVKLILKRD--------------------FFLEVARDASTKTT 244 (274) Q Consensus 216 -------------------------------~~~~~~a~~~~~~~~--------------------~~ve~~rd~~~~~~ 244 (274) +.||+++|.++.+.. +++-.++|...+.+ T Consensus 312 ~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~ 391 (418) T protein:vir:10 312 SLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSE 391 (418) T ss_pred cccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccce Confidence 233444554443221 22334567777888 Q ss_pred EEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 245 ALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 245 ~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) .+|.+..||++.++|+-.+++-..+|| T Consensus 392 ~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 392 IHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred EEEEEeecCceeecccceEEEEeecCC Confidence 888899999999999999999999999 No 36 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=2.2e-35 Score=210.63 Aligned_cols=263 Identities=16% Similarity=0.137 Sum_probs=213.6 Q ss_pred CCCcccee------e--------e-eechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccC Q lcl|NC_010147. 1 MPQGITKT------S--------N-QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE 65 (274) Q Consensus 1 Ma~~~T~~------~--------~-~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~e 65 (274) |||.+|-- . + +|+ |+|+..|.+.+.+.++|.++..+- ++ ++|++++||+.+.. .++.+.+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r-ti--~~G~sv~~~~iG~~-~~~~~~~ 75 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVR-SI--QSGKSAQFPVLGRT-KAAYLQP 75 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhhe-ec--cccceEEeeeccce-eEeeeec Confidence 88755421 1 1 466 999999999999999999998652 33 45999999998764 6677899 Q ss_pred CCcCC--ccccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|NC_010147. 66 GEKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------- 133 (274) Q Consensus 66 g~~i~--~~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--------- 133 (274) |+++. ..++.+++.++++++. +..+.|+|++..++..|+.+++.+++++++|+..|+.++..+..+.. T Consensus 76 G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:94 76 GENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENI 155 (347) T ss_pred CcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 99884 3679999999999985 88899999999999999999999999999999999998865532100 Q ss_pred ---------cc------cc-----cccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccce Q lcl|NC_010147. 134 ---------TV------NA-----DITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDII 191 (274) Q Consensus 134 ---------~~------~~-----~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~ 191 (274) .+ .. +...++.|.+|..+|.++++ .+|+++++|++|..|++.... ...+..+...+ T Consensus 156 ~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~--~~~~~~~~~~~ 233 (347) T protein:vir:94 156 AGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMP--NAANYQALIDP 233 (347) T ss_pred ccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcc--ccccccccccc Confidence 00 00 01126789999999999886 589999999999999986432 23333344557 Q ss_pred eccccceeccceEEEcCCCCcc-----------------------------------eEEEEeCCeEEEEeecCceeeee Q lcl|NC_010147. 192 VKGAFGEALGAIIVRTNKLEAG-----------------------------------TAILAKKGAVKLILKRDFFLEVA 236 (274) Q Consensus 192 ~~g~ig~~~G~~Vv~s~~v~~~-----------------------------------~~~~~~~~a~~~~~~~~~~ve~~ 236 (274) .+|.+++++|++|++|+++|.+ .+.+|++.|++.+...++.+|.+ T Consensus 234 ~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~ 313 (347) T protein:vir:94 234 STGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERA 313 (347) T ss_pred ccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeee Confidence 8999999999999999999832 13678889999888888999999 Q ss_pred cchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 237 RDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 237 rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) |++.++.|.|.++..||+.++||+.++.|+.+.| T Consensus 314 ~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 314 RRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred echhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 9999999999999999999999999999999888 No 37 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=2e-34 Score=205.32 Aligned_cols=265 Identities=14% Similarity=0.150 Sum_probs=194.0 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhccccccc--ccccCCCceEEEEeeccCCccccc-----cCCCcCCccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVV-----AEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~--~~~~~~g~tv~ip~~~~~~~~~~~-----~eg~~i~~~~ 73 (274) ||| ++|+||+|++.+.+.|++.++|.+++++++ ++.++.|++|+||+++. ..+.++ .++.++++++ T Consensus 1 Ma~------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Ccc------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccc Confidence 664 559999999999999999999999998886 56777899999999865 344444 3567899999 Q ss_pred cccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ccccCHHHHH Q lcl|NC_010147. 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN------ADITKLNGLQ 146 (274) Q Consensus 74 ~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~------~~~~~~d~i~ 146 (274) ++.+++++++.+ .++.|.++|++..++..|+++++.+++++++|+++|.+++..+.++..... .....|+.|+ T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~ 153 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHH Confidence 999999999965 689999999999999999999999999999999999999998876543222 2334689999 Q ss_pred HHHHHHhhcCC-CceEEEEcHHHHHHHHhhcccccccccccc---ccceeccccceeccceEEEcCCCCcceEEEEeCCe Q lcl|NC_010147. 147 SAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELG---DDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA 222 (274) Q Consensus 147 ~A~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~---~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a 222 (274) +|..+|++++. ++|+++++|++++.|++++. |......| ...+++|.+|+++|++|++|+++|+++.+.+++.+ T Consensus 154 ~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~--~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a 231 (392) T protein:vir:99 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) T ss_pred HHHHHHhhcCCCCCCEEEEcHHHHHHHhcccc--eeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccc Confidence 99999999885 67999999999999998864 44443333 34688999999999999999999999999998888 Q ss_pred EEEEeecCceee-------------------eecchhhcceEEEEEEEEEEEEEcCcc---EE---EEEecC-------- Q lcl|NC_010147. 223 VKLILKRDFFLE-------------------VARDASTKTTALYSDKHYVAYLYDESK---AV---KITKGS-------- 269 (274) Q Consensus 223 ~~~~~~~~~~ve-------------------~~rd~~~~~~~v~~~~~yg~~~~~~~~---~v---~~~~~~-------- 269 (274) +.++.+.++..+ .+.+....++.......+|.+.+.... +. .++... T Consensus 232 ~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v 311 (392) T protein:vir:99 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) T ss_pred cccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeee Confidence 776655432211 111111122222222334444432111 10 011100 Q ss_pred ----CCCCC Q lcl|NC_010147. 270 ----GSLEM 274 (274) Q Consensus 270 ----a~~~~ 274 (274) .+..+ T Consensus 312 ~~~~~~~~~ 320 (392) T protein:vir:99 312 AGANATITA 320 (392) T ss_pred ecccceeEe Confidence 01111 No 38 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1.1e-32 Score=195.71 Aligned_cols=263 Identities=14% Similarity=0.104 Sum_probs=215.3 Q ss_pred CCCcc--ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGI--TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~--T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) =|+.+ +..+..++|+.+.+.+.+.+.+.+++.+++.+.. .++..++||++...+.+.|++||+.++..++++++ T Consensus 25 ~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~ 100 (324) T protein:vir:93 25 NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVN 100 (324) T ss_pred ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeeecCCccccccccceeE Confidence 12222 3334568999999999999999999999886542 23556899999888889999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~ 146 (274) +++.++|++..+.+|+|...++..++.+.+.+++++++++++|+.++....+. .........++++|+ T Consensus 101 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) T protein:vir:93 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) T ss_pred EEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHH Confidence 99999999999999999999999999999999999999999999998543221 112233456899999 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEE Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVK 224 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~ 224 (274) ++...+..++.....|+|||..+..|++.. +. .|..++..+..++++|+||++++. .+++..++.+.+.+. T Consensus 181 ~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~------d~-~G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~ 253 (324) T protein:vir:93 181 DLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI 253 (324) T ss_pred HHHHhhhhccCCCCEEEEcHHHHHHHHHhh------CC-CCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEE Confidence 999999998888889999999999998642 11 244556667788999999999876 456778888888888 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.+.+++++.+++.. +....+|...||++++.+|+++++|+.+.+..+- T Consensus 254 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~ 319 (324) T protein:vir:93 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC Confidence 8888899998888742 3557888889999999999999999987777666 No 39 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.3e-32 Score=195.36 Aligned_cols=268 Identities=13% Similarity=0.100 Sum_probs=208.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||...+.....++|+.++..|.+.+.+.+++++++.+-.. ++..++||++...+.+.|++||+.++.+++++++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~----~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPT----IFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeCCccccccccceeeeE Confidence 9999888899999999999999999999999998865432 344689999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCcc----HHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------ccccccccC Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LTVNADITK 141 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------------~~~~~~~~~ 141 (274) +.++|++..+.+|+|...++..+ +.+.+.+++++++++++|..++......+ ......... T Consensus 77 l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) T protein:vir:80 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSA 156 (315) T ss_pred eeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccc Confidence 99999999999999998887776 55888899999999999999986532111 111122335 Q ss_pred HHHHHHHHHHHhhcCC-CceEEEEcHHHHHHHHhhcccccccccccccc---ceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 142 LNGLQSAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDD---IIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 142 ~d~i~~A~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~---~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) +++|+++..++..++. ....|+|||..+..|++....+... . .+.. .+..|..++++|+||+++++||.+ T Consensus 157 ~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~-~-~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~ 234 (315) T protein:vir:80 157 TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSP-L-AGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMS 234 (315) T ss_pred hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCc-c-cccccccccccCCCceecceeeEecCcCCcccccc Confidence 7889999888865543 5567999999999998753211111 0 1111 133455679999999999999854 Q ss_pred -----eEEEEeCCeEEEEeecCceeeeecchh----------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 214 -----TAILAKKGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 214 -----~~~~~~~~a~~~~~~~~~~ve~~rd~~----------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .+|+.+.+.+.++.+++.+++..++.. ++...+++..|+|+++.+|+++++++.++|++.= T Consensus 235 ~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~ 310 (315) T protein:vir:80 235 PASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) T ss_pred cccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCC Confidence 234556666777777888887776532 3446778889999999999999999999998887 No 40 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.3e-32 Score=195.33 Aligned_cols=262 Identities=16% Similarity=0.114 Sum_probs=210.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |+..+|..+..++|+.++..+.+.+.+.+++.+++.+-. .++.+.++|.+.. +.+.|++||++++.++++++++. T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~f~~v~ 80 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVP----MTKPEEEFTFMSG-VGAFWVDEAERIQTSKPTFTKAK 80 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeee----cCCCcEEEEEEcC-CceeeeecCccccccccceeEEE Confidence 555444455578999999999999999999999886532 2356688999864 67889999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc------------ccccccccccCHHHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG------------AKLTVNADITKLNGLQSA 148 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~------------a~~~~~~~~~~~d~i~~A 148 (274) +.+++++..+++|+|...++..++.+.+.+++++++++++|+.++..-.+ +...+.....++++|++| T Consensus 81 l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l~~~ 160 (299) T protein:vir:41 81 MRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDLNEA 160 (299) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHHHHH Confidence 99999999999999999999999999999999999999999999864321 122333455689999999 Q ss_pred HHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEeCCeEE Q lcl|NC_010147. 149 IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAKKGAVK 224 (274) Q Consensus 149 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~~~a~~ 224 (274) ..++..++..+..++|||..+..|++... ........+.. .+..++++|+||++++++|.++ .++.+...+. T Consensus 161 ~~~l~~~~~~~~~~v~n~~~~~~L~~lkd---~~G~~l~~~~~-~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~ 236 (299) T protein:vir:41 161 IGLIEAEDLEPNGIATIRKQRVKYRSTKD---GNGMPIFNTAT-SNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAY 236 (299) T ss_pred HHhhhcccCCcCEEEEcHHHHHHHHHhhc---cCCceeecCCc-CCCCceecceeeEEecccCCCCCceEEEEEecccEE Confidence 99999888888999999999999986421 11111111112 2334689999999999999776 5666667677 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.+++++++..|+.. ++...+|...|+++++.+|+++++++..+|. T Consensus 237 i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 237 YGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 8888899999888764 2345678889999999999999999999999 No 41 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1.5e-33 Score=200.48 Aligned_cols=267 Identities=14% Similarity=0.069 Sum_probs=211.8 Q ss_pred CCCc--cc--e------ee--eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQG--IT--K------TS--NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~~--~T--~------~~--~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~ 68 (274) |++. .+ . -+ ++|+ |+|+..|...+.++++|.++..+ .++ ..|++++||+.+.. .++.+..|++ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~~~~~~~-r~i--~~G~s~~~~~iG~~-~~~~~~~g~~ 75 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKFASWMNV-RSL--RGTNQLRVDRVGAS-TIAGRKAGEE 75 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhhhcccee-eec--cccceEEEeeecce-eeeeecCCCC Confidence 8775 22 1 11 2344 99999999999999999998866 333 45999999998653 6778899999 Q ss_pred CCccccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Q lcl|NC_010147. 69 IPTDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------- 134 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~------------- 134 (274) ++.+.+.+++.+++++. .+..+.|+|++..++..|+.+++.+++++++|+..|+.++..+..+... T Consensus 76 l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 76 LVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 99999999999999998 5888999999999999999999999999999999999887665321100 Q ss_pred ----------ccccccCH----HHHHHHHHHHhhcCCC-----ceEEEEcHHHHHHHHhhccccccccc-cccccceecc Q lcl|NC_010147. 135 ----------VNADITKL----NGLQSAIDKFNDEDLE-----PMVLFINPLDAGKLRGDASTNFTRAT-ELGDDIIVKG 194 (274) Q Consensus 135 ----------~~~~~~~~----d~i~~A~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~s-~~~~~~~~~g 194 (274) ......+. +.+.+|.+.|.++++. +|+++|+|++|..|+++..+...... ..+.....+| T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g 235 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGG 235 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccce Confidence 00011223 4566788889888754 59999999999999998643221111 1123456789 Q ss_pred ccceeccceEEEcCCCCcce---------------------EEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 195 AFGEALGAIIVRTNKLEAGT---------------------AILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 195 ~ig~~~G~~Vv~s~~v~~~~---------------------~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) .+++++|++|++|+++|... +.++++.|++.+...++..|.+|++.++.|.|.+.+.|| T Consensus 236 ~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G 315 (334) T protein:vir:80 236 RIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYN 315 (334) T ss_pred eEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcC Confidence 99999999999999999431 256789999999988999999999999999999999999 Q ss_pred EEEEcCccEEEEEecCCCC Q lcl|NC_010147. 254 AYLYDESKAVKITKGSGSL 272 (274) Q Consensus 254 ~~~~~~~~~v~~~~~~a~~ 272 (274) +++++|++++.+.++.--+ T Consensus 316 ~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 316 IGQRRPDAVAVHDITVTNP 334 (334) T ss_pred CceeccceEEEEEEeeecC Confidence 9999999998888766666 No 42 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.9e-32 Score=193.50 Aligned_cols=263 Identities=13% Similarity=0.077 Sum_probs=215.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) +....+..+..++|+.|...+.+.+.+.+++.+++.+.+ .++.++++|++...+.+.|++||+.++.+++++++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 102 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceee----ccCCceEEEEEecCcceeEeccCccccccccceeEEE Confidence 221223445668999999999999999999999876543 2355799999988888999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQSA 148 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~~A 148 (274) +.++|++..+.+|++...++..++.+.+.+++++++++++|+.++....+. ......+..++++|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:97 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHH Confidence 999999999999999999999999999999999999999999998654321 12223455689999999 Q ss_pred HHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCC--CcceEEEEeCCeEEEE Q lcl|NC_010147. 149 IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v--~~~~~~~~~~~a~~~~ 226 (274) ...+..++.....|+|||..+..|++.. ++ .|...+..+..++++|+||++++.. ++++.++.+...+.++ T Consensus 183 ~~~l~~~~~~~~~~v~n~~~~~~L~~lk------d~-~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~ 255 (324) T protein:vir:97 183 EALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhh------cC-CCceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEE Confidence 9999988888889999999999988642 11 2344455667789999999998875 4667788888888888 Q ss_pred eecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 227 LKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 227 ~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .+.+++++.+++.. +....++...||++++.+|+++++|+.+.+..+- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) T protein:vir:97 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCC Confidence 88899998887642 3557788889999999999999999998887776 No 43 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=2.9e-32 Score=193.46 Aligned_cols=263 Identities=14% Similarity=0.085 Sum_probs=216.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) +....+..+..++|+.+...|.+.+.+.+.+.+++.+.. .+|.++++|+....+.+.|++||+.++.+++++++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeEecCCccccccccceeEEE Confidence 333334445678999999999999999999999876542 2355799999988888999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQSA 148 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~~A 148 (274) +.+++++..+.+|++...++..|+.+.+.+++++++++++|+.++....+. ......+..+++.|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:96 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 999999999999999999999999999999999999999999998554221 11223355689999999 Q ss_pred HHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEE Q lcl|NC_010147. 149 IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~ 226 (274) ...+..++.....|+|||..+..|++.. .. .|...+..+..++++|+||++++. ++++..++.+.+.+.++ T Consensus 183 ~~~l~~~~~~~~~~vmn~~~~~~L~~l~------d~-~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g 255 (324) T protein:vir:96 183 EALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhh------cc-CCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEE Confidence 9999988888889999999999997642 11 244455667778999999999876 45677888888888888 Q ss_pred eecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 227 LKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 227 ~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .+.+++++.+++.. +....++...||++++.+|+++++|+.+.+..|- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:96 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 88899998887643 3456788889999999999999999998777766 No 44 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=2.9e-32 Score=193.46 Aligned_cols=263 Identities=14% Similarity=0.085 Sum_probs=216.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) +....+..+..++|+.+...|.+.+.+.+.+.+++.+.. .+|.++++|+....+.+.|++||+.++.+++++++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeEecCCccccccccceeEEE Confidence 333334445678999999999999999999999876542 2355799999988888999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQSA 148 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~~A 148 (274) +.+++++..+.+|++...++..|+.+.+.+++++++++++|+.++....+. ......+..+++.|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:78 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 999999999999999999999999999999999999999999998554221 11223355689999999 Q ss_pred HHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEE Q lcl|NC_010147. 149 IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~ 226 (274) ...+..++.....|+|||..+..|++.. .. .|...+..+..++++|+||++++. ++++..++.+.+.+.++ T Consensus 183 ~~~l~~~~~~~~~~vmn~~~~~~L~~l~------d~-~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g 255 (324) T protein:vir:78 183 EALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhh------cc-CCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEE Confidence 9999988888889999999999997642 11 244455667778999999999876 45677888888888888 Q ss_pred eecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 227 LKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 227 ~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .+.+++++.+++.. +....++...||++++.+|+++++|+.+.+..|- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:78 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 88899998887643 3456788889999999999999999998777766 No 45 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=5.5e-32 Score=191.95 Aligned_cols=263 Identities=14% Similarity=0.096 Sum_probs=213.4 Q ss_pred CCCcc--ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGI--TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~--T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) =|+.+ +..+..++|+.+++.|.+.+.+.+.+.+++.+.+ .++.++++|++...+.+.|++||+.++.+++++++ T Consensus 25 ~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~ 100 (324) T protein:vir:96 25 NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVN 100 (324) T ss_pred ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeeecCCccccccccceeE Confidence 11212 2233458899999999999999999999886543 23557999999877889999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~ 146 (274) +++.+++++..+.+|++...++..++.+.+.+++++++++++|+.+|....+. ......+..++++|+ T Consensus 101 v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) T protein:vir:96 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNII 180 (324) T ss_pred EEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHH Confidence 99999999999999999999999999999999999999999999988553221 112233456899999 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCC--CcceEEEEeCCeEE Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKGAVK 224 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v--~~~~~~~~~~~a~~ 224 (274) ++..++..++.....|+|||..+..|++.. +. .|...+..+..++++|+||++++.. +++..++.+.+.+. T Consensus 181 ~~~~~i~~~~~~~~~~i~n~~~~~~L~~lk------d~-~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~ 253 (324) T protein:vir:96 181 DLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI 253 (324) T ss_pred HHHHhhhhccCCCCEEEEcHHHHHHHHHhh------CC-CCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEE Confidence 999999988888889999999999998642 11 2444455677789999999997764 56678888888888 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.+.+++++.+++.. +....+|...||++++.+|+++++|+.+.+..+- T Consensus 254 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~ 319 (324) T protein:vir:96 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC Confidence 8888899998887743 3456788889999999999999999988777777 No 46 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=7.3e-32 Score=191.28 Aligned_cols=263 Identities=14% Similarity=0.095 Sum_probs=213.9 Q ss_pred CCCcc--ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGI--TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~--T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) =|+.+ +..+..++|+.|++.+.+.+.+.+.+.+++.+.+ .++.+++||++...+.+.|++||+.++..++++++ T Consensus 25 ~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~ 100 (324) T protein:vir:99 25 NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVN 100 (324) T ss_pred cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeEeccCccccccccceeE Confidence 12222 2333458899999999999999999999876543 23457999999888889999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~ 146 (274) +++.++|++..+.+|+|...++..++.+.+.+++++++++++|+.++....+. ......+..+++.|+ T Consensus 101 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) T protein:vir:99 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) T ss_pred EEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHH Confidence 99999999999999999999999999999999999999999999998553322 112233557899999 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC--cceEEEEeCCeEE Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--AGTAILAKKGAVK 224 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~--~~~~~~~~~~a~~ 224 (274) ++...|..++.....|+|||..+..|++.. +. .|...+..+.-++++|+||++++.++ ++..++.+...+. T Consensus 181 ~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~ 253 (324) T protein:vir:99 181 DLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLI 253 (324) T ss_pred HHHHhhhhccCCCCEEEEcHHHHHHHHHhh------cC-CCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEE Confidence 999999988888889999999999998642 11 23444555666789999999998865 5567777888888 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.+.+++++..++.. ++...++...||++++.+|+++++++.+.+..|- T Consensus 254 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~ 319 (324) T protein:vir:99 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC Confidence 8888899998887742 3456788889999999999999999999988887 No 47 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=1.5e-31 Score=189.60 Aligned_cols=263 Identities=14% Similarity=0.095 Sum_probs=213.5 Q ss_pred CCCc--cceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQG--ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~--~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) =|+. .+..+..++|+.+.+.|.+.+.+.+.+.+++.+.. .++.++++|++...+.+.|++||++++.+++++++ T Consensus 25 ~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~ 100 (324) T protein:vir:10 25 NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVN 100 (324) T ss_pred cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEeCCcceeEeccCccccccccceeE Confidence 1221 22333458999999999999999999999886543 23456999999888899999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccccccCHHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a------------~~~~~~~~~~~d~i~ 146 (274) +++.+++++..+.+|+|...++..++.+.+.+++++++++++|..++....+. ......+..++++|+ T Consensus 101 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~ 180 (324) T protein:vir:10 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) T ss_pred EEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHH Confidence 99999999999999999999999999999999999999999999998654321 112233456899999 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC--cceEEEEeCCeEE Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--AGTAILAKKGAVK 224 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~--~~~~~~~~~~a~~ 224 (274) ++...+..++.....|+|||..+..|++.. .. .|...+..+..++++|+||++++.++ ++..++.+.+.+. T Consensus 181 ~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~ 253 (324) T protein:vir:10 181 DLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLI 253 (324) T ss_pred HHHHhhhhccCCCCEEEEcHHHHHHHHHhh------cc-CCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEE Confidence 999999988888889999999999998642 11 23444556667889999999988754 6677887888888 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.+.+++++..++.. ++...++...||++++.+|+++++|+.+.+..|- T Consensus 254 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:10 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC Confidence 8888888888877642 3456788889999999999999999999888886 No 48 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.96 E-value=1.8e-31 Score=189.15 Aligned_cols=262 Identities=14% Similarity=0.120 Sum_probs=199.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||..++..+. ++|+.++..+.+.+.+.+++++++.+-. .++..+++|++...+.+.|++||++++.+++++++++ T Consensus 1 ma~~t~~~G~-lip~~~~~~ii~~l~~~s~i~~l~~~~~----~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQLSKGN-LFNPELVTKVINKVKGHSSIAKLSPQKP----IPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVT 75 (300) T ss_pred CcccccCCcc-eechhhHHHHHHHHHhhhhhhhhcceee----ccCCceEEEEEecCcceEEeeCCcccccccccceeeE Confidence 9977666555 5677789999999999999988875432 2344689999987788999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHh---hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc---cc--------------cc-ccccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEALMG---AK--------------LT-VNADI 139 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~---~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~---a~--------------~~-~~~~~ 139 (274) ++++|++..+.+|+|... .+..++.+.+.+++++++++++|..++..... .. .. ..... T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 999999999999999874 45678999999999999999999999976421 10 01 11234 Q ss_pred cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----- Q lcl|NC_010147. 140 TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----- 214 (274) Q Consensus 140 ~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----- 214 (274) ..++.|.++..++...+..+..|+|||..+..|++.... .....-.+....|..++++|+||++|+.+|.+. T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~---~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 232 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNA---EGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKN 232 (300) T ss_pred chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhcc---CCCeeccCccccCCCceecceeeEEecCCCCCCCCCcc Confidence 568999999999988888888999999999999764211 111111122345667899999999999998543 Q ss_pred -EEEEeCC-eEEEEeecCceeeeecch----------hhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 215 -AILAKKG-AVKLILKRDFFLEVARDA----------STKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 215 -~~~~~~~-a~~~~~~~~~~ve~~rd~----------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++.+.. .+.+..+++++++..+.. .+....+|...|+++++.+|+++++|++.+. T Consensus 233 ~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 233 TAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred EEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 3444433 455667777777655432 2334677888999999999999999998888 No 49 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.96 E-value=3e-31 Score=187.94 Aligned_cols=262 Identities=13% Similarity=0.076 Sum_probs=199.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |+ ++..+..++|+.++..|.+.+.+.+++++++.+.. .++.++++|++...+.+.|++||++++.+++++++++ T Consensus 1 m~--t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~ 74 (303) T protein:vir:97 1 MG--TETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKP----IPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVT 74 (303) T ss_pred Cc--ccCCCCeEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEecCcceEEeecCccccccccceeeEE Confidence 66 44556778999999999999999999999986643 2345689999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHh---hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------------cccccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------------LTVNAD 138 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~---~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-------------------~~~~~~ 138 (274) +.++|++..+++|+|... .+..++.+.+.+++++++++++|..++....+.+ ....+. T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (303) T protein:vir:97 75 IVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTES 154 (303) T ss_pred eeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccc Confidence 999999999999999874 4556789999999999999999999997642110 011133 Q ss_pred ccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce---- Q lcl|NC_010147. 139 ITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT---- 214 (274) Q Consensus 139 ~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~---- 214 (274) ...++.|+++..++..++..+..++|||..+..|++.. +.....-...+.-..+..++++|+||++|++||... T Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk--d~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 232 (303) T protein:vir:97 155 EDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVT--NGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAE 232 (303) T ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh--ccCCCeEEecCccCCCCCceecceeeEEecccCCccccCC Confidence 45689999999999888888889999999999997532 110000000011123445689999999999998421 Q ss_pred ---EEEEe--CCeEEEEeecCceeeeecch----------hhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 215 ---AILAK--KGAVKLILKRDFFLEVARDA----------STKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 215 ---~~~~~--~~a~~~~~~~~~~ve~~rd~----------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++++ ..++.++.++++++|..+.. .+....+|...||++++.+|+++++|+++-= T Consensus 233 ~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 233 SKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 23443 35677788888887755421 1234577778999999999999999998776 No 50 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.96 E-value=3e-31 Score=187.93 Aligned_cols=265 Identities=15% Similarity=0.123 Sum_probs=206.4 Q ss_pred CCCcccee-eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKT-SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~-~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +...++.. ...+.|+++...+.+.+.+..++.+++.+. ....+..+.+|.....+.+.|++||+.++.++++++++ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v 186 (392) T protein:vir:13 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTF---TTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQR 186 (392) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceee---ecCCCceeEEEEEcCCcceeeecccccccccccceeeE Confidence 22222222 235677788888888777777888777543 22346678999998888899999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc---------c------cccccccccCHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG---------A------KLTVNADITKLNG 144 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~---------a------~~~~~~~~~~~d~ 144 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++..-.+ . .....+..++++. T Consensus 187 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~ 266 (392) T protein:vir:13 187 SMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDA 266 (392) T ss_pred EeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHH Confidence 999999999999999999999999999999999999999999999864211 1 1112344567999 Q ss_pred HHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEE Q lcl|NC_010147. 145 LQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVK 224 (274) Q Consensus 145 i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~ 224 (274) ++++...|......+..|+|||..+..|++.. + ..+...-.+-+..|..++++|+||++++++|.++.++.+.+.+. T Consensus 267 l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk--d-~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~ 343 (392) T protein:vir:13 267 LIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLK--D-ANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYR 343 (392) T ss_pred HHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhh--c-cCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeecccee Confidence 99999888776666778999999999887531 1 01101111123345567899999999999999998888888888 Q ss_pred EEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.+.++.++.+++..+ +...++...|+++++.+|++++.++.++|+ T Consensus 344 i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 344 VRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 88888888887777765 456788889999999999999999998888 No 51 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.96 E-value=8.7e-32 Score=190.86 Aligned_cols=268 Identities=14% Similarity=0.125 Sum_probs=210.0 Q ss_pred CCC----------ccceee--------eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccc Q lcl|NC_010147. 1 MPQ----------GITKTS--------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV 62 (274) Q Consensus 1 Ma~----------~~T~~~--------~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~ 62 (274) |++ ..|.-. ++|+ |+|+..|...+.+.+++.++... +++. .|++++||+.+.. .+.. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~-rti~--~Gksv~f~~iG~~-t~~~ 75 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTK-RTLK--NGKSLQFIYTGRM-TSSF 75 (375) T ss_pred CccccccccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccc-cccc--cCceEEEEeeeee-EEee Confidence 333 222221 3556 88999999999999999998765 3444 4999999998653 6778 Q ss_pred ccCCCcCCc---cccccceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----- Q lcl|NC_010147. 63 VAEGEKIPT---DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL----- 133 (274) Q Consensus 63 ~~eg~~i~~---~~~t~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~----- 133 (274) +..|+++.- .++.++++++++++. +..|.|+|++..++..|+++++.+++++++|+.+|+.++..+..+.. T Consensus 76 ~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~ 155 (375) T protein:vir:10 76 HTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPV 155 (375) T ss_pred ecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 999998753 367788889999885 89999999999999999999999999999999999999877642110 Q ss_pred ------------------cccc----cccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhcccc-ccccccccc Q lcl|NC_010147. 134 ------------------TVNA----DITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTN-FTRATELGD 188 (274) Q Consensus 134 ------------------~~~~----~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~-~~~~s~~~~ 188 (274) .... +...++.|.+|..+|.++++ ..||++|+|++|..|+++...+ +......++ T Consensus 156 ~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~ 235 (375) T protein:vir:10 156 SATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGS 235 (375) T ss_pred ccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccc Confidence 0001 11236889999999999886 5899999999999998763222 233233456 Q ss_pred cceeccccceeccceEEEcCCCCcce--------------------------------------------------EEEE Q lcl|NC_010147. 189 DIIVKGAFGEALGAIIVRTNKLEAGT--------------------------------------------------AILA 218 (274) Q Consensus 189 ~~~~~g~ig~~~G~~Vv~s~~v~~~~--------------------------------------------------~~~~ 218 (274) ++..+|.+++++|++|++|+++|..+ +.+| T Consensus 236 ~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~ 315 (375) T protein:vir:10 236 ALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIF 315 (375) T ss_pred ceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEE Confidence 67788999999999999999998311 3678 Q ss_pred eCCeEEEEeecCceeeee---cchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 219 KKGAVKLILKRDFFLEVA---RDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 219 ~~~a~~~~~~~~~~ve~~---rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.|.+.+.-.++++|.. |+...+.|.|.+++-||+..+||+.++.|+.++ ..+. T Consensus 316 ~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~-~~~~ 373 (375) T protein:vir:10 316 QKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA-TAPS 373 (375) T ss_pred chhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc-Cccc Confidence 888998887777788765 689999999999999999999999999998774 5555 No 52 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.96 E-value=5.4e-31 Score=186.50 Aligned_cols=257 Identities=16% Similarity=0.120 Sum_probs=206.6 Q ss_pred CCC--------ccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc Q lcl|NC_010147. 1 MPQ--------GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~--------~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~ 72 (274) ||- .+|..+..++|+.+...+.+.+.+.+++.+++.+.+ .++..++||++...+.+.|++|+++++.+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEP----MTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceee----ccCCceEEEEEeCCcceEEeecCcccccc Confidence 543 224444568999999999999999999988876543 23456899999877889999999999999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------------cccc Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA-----------------KLTV 135 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a-----------------~~~~ 135 (274) +++++++++.++|++..+.+|++...++..|+.+.+.+++++++++++|+.++..-.+. .... T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:10 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998643211 1112 Q ss_pred cccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC---- Q lcl|NC_010147. 136 NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE---- 211 (274) Q Consensus 136 ~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~---- 211 (274) .....++++|+++..++..++.....|+|||..+..|++.. ... |. .+..+..++++|+||++++++| T Consensus 157 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lk------d~~-G~-~l~~~~~~~l~G~PV~~~~~~~~~~~ 228 (304) T protein:vir:10 157 TDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNAL------DAN-DR-PLFDANGNEIMGLPLSYTGADVYDKK 228 (304) T ss_pred ccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhh------ccC-Cc-EeecCCCccccceeeEEecccccCCC Confidence 23345699999999999988888889999999999998632 111 22 2344456899999999999997 Q ss_pred cceEEEEeCCeEEEEeecCceeeeecch------------------hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 212 AGTAILAKKGAVKLILKRDFFLEVARDA------------------STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 212 ~~~~~~~~~~a~~~~~~~~~~ve~~rd~------------------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ++..++.+...+.++.+.+++++..++. .+....+|...||++++.+|++++++|.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 4456777777777888888888776653 224467788899999999999999999998 No 53 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.96 E-value=5.4e-31 Score=186.50 Aligned_cols=257 Identities=16% Similarity=0.120 Sum_probs=206.6 Q ss_pred CCC--------ccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc Q lcl|NC_010147. 1 MPQ--------GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~--------~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~ 72 (274) ||- .+|..+..++|+.+...+.+.+.+.+++.+++.+.+ .++..++||++...+.+.|++|+++++.+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEP----MTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceee----ccCCceEEEEEeCCcceEEeecCcccccc Confidence 543 224444568999999999999999999988876543 23456899999877889999999999999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------------cccc Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA-----------------KLTV 135 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a-----------------~~~~ 135 (274) +++++++++.++|++..+.+|++...++..|+.+.+.+++++++++++|+.++..-.+. .... T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:94 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998643211 1112 Q ss_pred cccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC---- Q lcl|NC_010147. 136 NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE---- 211 (274) Q Consensus 136 ~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~---- 211 (274) .....++++|+++..++..++.....|+|||..+..|++.. ... |. .+..+..++++|+||++++++| T Consensus 157 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lk------d~~-G~-~l~~~~~~~l~G~PV~~~~~~~~~~~ 228 (304) T protein:vir:94 157 TDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNAL------DAN-DR-PLFDANGNEIMGLPLSYTGADVYDKK 228 (304) T ss_pred ccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhh------ccC-Cc-EeecCCCccccceeeEEecccccCCC Confidence 23345699999999999988888889999999999998632 111 22 2344456899999999999997 Q ss_pred cceEEEEeCCeEEEEeecCceeeeecch------------------hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 212 AGTAILAKKGAVKLILKRDFFLEVARDA------------------STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 212 ~~~~~~~~~~a~~~~~~~~~~ve~~rd~------------------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ++..++.+...+.++.+.+++++..++. .+....+|...||++++.+|++++++|.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 4456777777777888888888776653 224467788899999999999999999998 No 54 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.96 E-value=4.2e-31 Score=187.13 Aligned_cols=263 Identities=16% Similarity=0.141 Sum_probs=206.6 Q ss_pred CCCccce-eeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITK-TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~-~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) ....++. ....+.|+++...|.+.+.+..++++++++.. ...+..++||+....+.+.|++||+.++.++++++++ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~---~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i 186 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFT---TSDANPLDFTVITGRSSASIVGETAEIPESYPATAQR 186 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeee---cCCCceeEEEEEcCCcceeeecccccccccccceeee Confidence 2222222 23457788888888888888888888876633 2235568999998878899999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHh-------hcc------cccccccccCHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEAL-------MGA------KLTVNADITKLNGLQ 146 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~-------~~a------~~~~~~~~~~~d~i~ 146 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++..- ... .....+...+++.++ T Consensus 187 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~ 266 (390) T protein:vir:62 187 SMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALI 266 (390) T ss_pred EeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHH Confidence 9999999999999999999999999999999999999999999988642 111 111223456899999 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhh--ccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEE Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVK 224 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~--~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~ 224 (274) ++...|..+......|+|||..+..|++. ..-.++ -.+-+..|..++++|+||++++++|.+..++.+.+.+. T Consensus 267 ~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l-----~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~ 341 (390) T protein:vir:62 267 DLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYL-----WQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYR 341 (390) T ss_pred HHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCee-----ecCCcCCCccceecccceEEecCCCCccEEEeecccee Confidence 99998877666677899999999988653 211111 11123456667899999999999999998877777777 Q ss_pred EEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) +..+.++.++.+.+..+ +...++...|+++++++|+++++++.++|+ T Consensus 342 i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 342 VRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEeecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 77788888887777765 556778889999999999999999999988 No 55 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.96 E-value=7.3e-31 Score=185.81 Aligned_cols=269 Identities=15% Similarity=0.077 Sum_probs=204.0 Q ss_pred CCCccceee-eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTS-NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~-~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |+...|... ..+.|+ +...+.+.+.+.+.+.+++.+.. .++..++||++...+.+.|++||+.++.++++++++ T Consensus 10 ~~~~~t~~~~g~l~~~-~~~~ii~~l~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPV-QAKDYFAEAEKTSIVQRVAQKIP----MGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchh-HHHHHHHHHHhccchhhhcceee----ccCCceEEEEEcCCcceEEecCCccccccccceeEE Confidence 665444333 345555 56677788888888888875532 235568999998888899999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------cccccccCHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------TVNADITKLNGLQSA 148 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~~~~~~~~~d~i~~A 148 (274) ++.++|++..+.+|+|...++..++.+.+.+++++++++++|+.++....+... ...+....++.++++ T Consensus 85 ~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (397) T protein:vir:23 85 DVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVSG 164 (397) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHHHHH Confidence 999999999999999999999999999999999999999999999865433211 112334567889999 Q ss_pred HHHHhhcCCCceEEEEcHHHHHHHHhhc--cccccccccccccceeccccceeccceEEEcCCCCcceEE--EEeCCeEE Q lcl|NC_010147. 149 IDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAI--LAKKGAVK 224 (274) Q Consensus 149 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~--~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~--~~~~~a~~ 224 (274) ...|..++.....++|||..+..|++.. .-.++-......+....+..++++|+||++++++|.++.. +.+...+. T Consensus 165 ~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~ 244 (397) T protein:vir:23 165 LTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQII 244 (397) T ss_pred HHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEE Confidence 9999988888899999999999998632 1111111111222222334568999999999999988763 34556666 Q ss_pred EEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.++++.++.+|+.. +....+|...|+++++.+|+++++++++..+-.- T Consensus 245 i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 245 WGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred EEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 7778888888877643 2445778889999999999999999986654443 No 56 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.96 E-value=2.4e-30 Score=182.95 Aligned_cols=260 Identities=14% Similarity=0.082 Sum_probs=211.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |...+|..+..++|+.+++.+.+.+.+.+.+.+++.+... ..+..+.+|.....+.+.|++||++++..++++++++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 85 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEM---EGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVT 85 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeec---CCCccEEEEEEcCCceeEEeecCccccccccceeEEE Confidence 4444455556689999999999999999999998866432 1233467888777778899999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------cccccccccCHHHHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA-----------KLTVNADITKLNGLQSAI 149 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a-----------~~~~~~~~~~~d~i~~A~ 149 (274) +.+++++..+.+|+|...++..++.+.+.+++++++++++|+.++....+. .....++.+++++|+++. T Consensus 86 l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~ 165 (297) T protein:vir:95 86 LKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNILKLQ 165 (297) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCHHHHHHHH Confidence 999999999999999999999999999999999999999999998543221 112234567899999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEEe Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLIL 227 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~~ 227 (274) .++.+++.....++|||..+..|++.. +.. |. .+.++..++++|+||+.++. +++++.++.+...+.++. T Consensus 166 ~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~~-G~-~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~ 237 (297) T protein:vir:95 166 DALYDADVEPNAFVSKIQNRSALREAR------DGN-KV-SIYDKAANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGV 237 (297) T ss_pred HHhhhccCCcCEEEEcHHHHHHHHHhh------ccC-Cc-eeecCCCCcccceeeEeecCCCCCCceEEEEecccEEEEE Confidence 999998888889999999999998632 111 22 34566778999999998664 578888888888888888 Q ss_pred ecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 228 KRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 228 ~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) +.+++++..++.. ++...+|...|+++++.+|+++++|+.+.-. T Consensus 238 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 238 PYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 8898888777642 3456778889999999999999999987766 No 57 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.96 E-value=1.8e-30 Score=183.66 Aligned_cols=269 Identities=14% Similarity=0.056 Sum_probs=201.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |+...|..+.-++|+.++..+.+.+.+.+.+.+++.+.. .++..++||++...+.+.|++||++++.+++++++++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 89 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVP----MGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQN 89 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceee----ccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 555444444446788899999999999999988876532 2355789999988888999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------ccc-----cccc-CHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------TVN-----ADIT-KLN 143 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~~~-----~~~~-~~d 143 (274) +.++|++..+.+|+|...++..++.+.+.+++++++++++|+.++..-.+... ... .+.. .++ T Consensus 90 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (320) T protein:vir:10 90 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYD 169 (320) T ss_pred EeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccccHH Confidence 99999999999999999999999999999999999999999999865332110 001 1111 222 Q ss_pred -HHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcc--ccccccccccccceeccccceeccceEEEcCCCCcceEE--EE Q lcl|NC_010147. 144 -GLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAI--LA 218 (274) Q Consensus 144 -~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~--~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~--~~ 218 (274) .++++...+...+.....++|||..+..|++... -.++-......+......-++++|+||++++++|.++.. +. T Consensus 170 ~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~g 249 (320) T protein:vir:10 170 AVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMG 249 (320) T ss_pred HHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEe Confidence 4667777777777788999999999999976321 111111111111111222357999999999999998753 34 Q ss_pred eCCeEEEEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 219 KKGAVKLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 219 ~~~a~~~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) +...+.++.+.+++++.+|+.. +....++...|+++++.+|+++++|+...|++- T Consensus 250 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 250 DFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred ecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 5666778888888888877643 244567788999999999999999998888777 No 58 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.96 E-value=2.3e-30 Score=183.03 Aligned_cols=267 Identities=13% Similarity=0.068 Sum_probs=200.7 Q ss_pred CCCcccee------eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC-------- Q lcl|NC_010147. 1 MPQGITKT------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-------- 66 (274) Q Consensus 1 Ma~~~T~~------~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg-------- 66 (274) |+++...- ...++|+.+.+.|.+.+.+.+++.+++.+.. ..+..+++|++...+.+.|++|| T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~ 85 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIP----ISYGETIIPTTVKRPEVGQVGVGTSNEQREG 85 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEeCCceeEeecCccccccccc Confidence 22221111 1127899999999999999999999886643 23456899999877766666554 Q ss_pred CcCCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------- Q lcl|NC_010147. 67 EKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------- 132 (274) Q Consensus 67 ~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-------------- 132 (274) +.++.++++++++++..+|++..+.+|++...++..++.+.+.+++++++++.+|..++..-.... T Consensus 86 ~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~ 165 (333) T protein:vir:78 86 GLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIA 165 (333) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccc Confidence 567888999999999999999999999999999999999999999999999999999985443211 Q ss_pred ------ccccccccCHHHHHHHHHHHhhcC-CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEE Q lcl|NC_010147. 133 ------LTVNADITKLNGLQSAIDKFNDED-LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) Q Consensus 133 ------~~~~~~~~~~d~i~~A~~~l~~~~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv 205 (274) ........+++.|+++...+..+. .....++|||..+..|++...................|..++++|+||+ T Consensus 166 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~ 245 (333) T protein:vir:78 166 NTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQ 245 (333) T ss_pred ccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeE Confidence 011233457899999998887654 4567899999999988764321111111111223445667899999999 Q ss_pred EcCCCCcc---------eEEEEeCCeEEEEeecCceeeeecch-------------hhcceEEEEEEEEEEEEEcCccEE Q lcl|NC_010147. 206 RTNKLEAG---------TAILAKKGAVKLILKRDFFLEVARDA-------------STKTTALYSDKHYVAYLYDESKAV 263 (274) Q Consensus 206 ~s~~v~~~---------~~~~~~~~a~~~~~~~~~~ve~~rd~-------------~~~~~~v~~~~~yg~~~~~~~~~v 263 (274) ++++||.+ .+++.+...+.++.+++++++.+++. .++...+|...|+++++.+|++++ T Consensus 246 ~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~ 325 (333) T protein:vir:78 246 FGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFV 325 (333) T ss_pred EccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceE Confidence 99999854 45677777787888888888887763 223456788899999999999999 Q ss_pred EEEecCCC Q lcl|NC_010147. 264 KITKGSGS 271 (274) Q Consensus 264 ~~~~~~a~ 271 (274) +|+++.|+ T Consensus 326 ~l~~~~a~ 333 (333) T protein:vir:78 326 KFVDDEQP 333 (333) T ss_pred EEeccCCC Confidence 99999999 No 59 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.96 E-value=2.7e-30 Score=182.67 Aligned_cols=270 Identities=14% Similarity=0.061 Sum_probs=204.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |+...|..+..++|+.+...+.+.+.+.+++.+++.+.. .++.+++||++...+.+.|++||++++.+++++++++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 89 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVP----MGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQT 89 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 665555555668899999999999999999999886532 2355799999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------c-cccccc-CHHHHH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------T-VNADIT-KLNGLQ 146 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------~-~~~~~~-~~d~i~ 146 (274) +.+++++..+.+|+|...++..++.+.+.+++++.+++++|+.++....+... . ..+... ..+.++ T Consensus 90 ~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (318) T protein:vir:24 90 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVAV 169 (318) T ss_pred EeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHHHHH Confidence 99999999999999999999999999999999999999999999865432110 0 111112 234567 Q ss_pred HHHHHHhhcCCCceEEEEcHHHHHHHHhhc--cccccccccccccceeccccceeccceEEEcCCCCcceE--EEEeCCe Q lcl|NC_010147. 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA--ILAKKGA 222 (274) Q Consensus 147 ~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~--~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~--~~~~~~a 222 (274) ++...+...+.....++|||..+..|++.. .-.++-......+......-+++.|+||++++++|.++. ++.+.+. T Consensus 170 ~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~ 249 (318) T protein:vir:24 170 NGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQ 249 (318) T ss_pred HHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecce Confidence 788888777778889999999999987532 111111111111111122235799999999999998875 4446666 Q ss_pred EEEEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEe-cCCCCCC Q lcl|NC_010147. 223 VKLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITK-GSGSLEM 274 (274) Q Consensus 223 ~~~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~-~~a~~~~ 274 (274) +.++.+.++.++..|+.. +....++...||++++.+|+++++|+. ++++.|= T Consensus 250 ~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 250 LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 778888898898877643 345678889999999999999999996 4444444 No 60 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.96 E-value=4.1e-30 Score=181.72 Aligned_cols=270 Identities=13% Similarity=0.056 Sum_probs=200.3 Q ss_pred CCCcc--------ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc Q lcl|NC_010147. 1 MPQGI--------TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~~~--------T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~ 72 (274) |+... |..+..++|+.+.+.+.+.+.+.+++.+++.... ..+..+++|++...+.+.|++||+.++.+ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVP----MGPTGISIPHWTGAVSASWTGEAERKPIT 76 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceee----ccCCceEEEEEcCCcceeEecCCCccccc Confidence 55431 1112234555567778899999999999876532 23556899999887889999999999999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------cc Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------------LT 134 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~------------------~~ 134 (274) +++++++++.+++++..+.+|++...++..++.+.+.+++++++++++|+.++..-.... .. T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 999999999999999999999999999999999999999999999999999985432110 00 Q ss_pred cc---ccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhc--cccccccccccccceeccccceeccceEEEcCC Q lcl|NC_010147. 135 VN---ADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNK 209 (274) Q Consensus 135 ~~---~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~--~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~ 209 (274) .. .....+++++++...+..++.....++|||..+..|++.. .-.++-......+......-++++|+||+++++ T Consensus 157 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~ 236 (330) T protein:vir:77 157 TTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADN 236 (330) T ss_pred cccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecc Confidence 11 1123478899999999888888889999999999987632 111111111111111223446899999999999 Q ss_pred CCcce------EEEEeCCeEEEEeecCceeeeecchh--------------------hcceEEEEEEEEEEEEEcCccEE Q lcl|NC_010147. 210 LEAGT------AILAKKGAVKLILKRDFFLEVARDAS--------------------TKTTALYSDKHYVAYLYDESKAV 263 (274) Q Consensus 210 v~~~~------~~~~~~~a~~~~~~~~~~ve~~rd~~--------------------~~~~~v~~~~~yg~~~~~~~~~v 263 (274) +|.++ .++.+.+.+.++.+.+++++..++.. +....+|...|+++++.+|++++ T Consensus 237 ~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 316 (330) T protein:vir:77 237 VVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFV 316 (330) T ss_pred ccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceE Confidence 99764 45566677777888888887766642 34567888899999999999999 Q ss_pred EEEecCCCCCC Q lcl|NC_010147. 264 KITKGSGSLEM 274 (274) Q Consensus 264 ~~~~~~a~~~~ 274 (274) +++.+.|+.+= T Consensus 317 ~i~~~~~~~~~ 327 (330) T protein:vir:77 317 KLTDQVAGTDP 327 (330) T ss_pred EEEeccCCcCC Confidence 99976666544 No 61 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.95 E-value=3.3e-30 Score=182.23 Aligned_cols=262 Identities=13% Similarity=0.092 Sum_probs=198.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) || ++..+..++|+.+++.|.+.+.+.+++++++.+... ++..+++|++...+.+.|++||++++.+++++++++ T Consensus 1 ma--t~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~----~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~ 74 (311) T protein:vir:81 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) T ss_pred Cc--eecCCceEcchhHHHHHHHHHHhcchhhhhcceeec----CCCceEEEEEeCCceeEEeecCcccccccceeeEEE Confidence 55 555567899999999999999999999999866432 334689999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhc---CccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---c------------c---ccccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---L------------T---VNADI 139 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---~------------~---~~~~~ 139 (274) +.++|++..+.+|+|...++ ..++.+.+.+++++++++++|..++......+ . . ..... T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:81 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) T ss_pred EeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeeccccc Confidence 99999999999999987544 45588999999999999999999997642110 0 0 01111 Q ss_pred cC-HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc----- Q lcl|NC_010147. 140 TK-LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG----- 213 (274) Q Consensus 140 ~~-~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~----- 213 (274) .. +..+.++..++...+.....|+|||..+..|++... ......-.+....+..++++|+||++++.||.+ T Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~ 231 (311) T protein:vir:81 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRD---SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) T ss_pred chHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhc---cCCCeeecCccccCCCceecceeEEecccccccccccc Confidence 22 345666777777777788889999999999986321 111111112234556789999999999999732 Q ss_pred -------------eEEEEeCCeEEEEeecCceeeeecchh---------hcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 214 -------------TAILAKKGAVKLILKRDFFLEVARDAS---------TKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 214 -------------~~~~~~~~a~~~~~~~~~~ve~~rd~~---------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ..++.+.+.+.+..+.+++++..++.. ++...+|...|+++++.+|+++++++.+.-+ T Consensus 232 ~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 235666677777778888888776642 3445777789999999999999999976666 No 62 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.95 E-value=3.8e-30 Score=181.89 Aligned_cols=261 Identities=16% Similarity=0.125 Sum_probs=202.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |... +..+..++|+.+...+.+.+.+.+.+.+++.+.. ..+..+++|++.. .+.+.|++||+.++..++++.++ T Consensus 105 ~~~~-~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:19 105 LGSD-ADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGR----TSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQ 179 (385) T ss_pred hccc-cccCCceecchhhhHHHHHhhhccchhhhcceec----ccCcceEEEEEecCCcceeeeccCccccccccceeEE Confidence 3322 2333345677788889999999999988876532 2355789999864 45778999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) .+.+++++..+.+|++...++ +++.+.+.+++++++++++|..++....+.. ....++...++.| T Consensus 180 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:19 180 TANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 999999999999999976654 7899999999999999999999986532211 1112344678999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCC-eEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~-a~~ 224 (274) +++...+..++.....|+|||..+..|++... ..+.....+ ..+|..++++|+||++++.+|.+++++.+.. ++. T Consensus 259 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd---~~G~~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~ 334 (385) T protein:vir:19 259 AHAIYQVTESEFSASGIVLNPRDWHNIALLKD---NEGRYIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQ 334 (385) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhc---CCCceeccC-cccCCCceecceeeEEcCcCCCCcEEEeecccEEE Confidence 99999998888888999999999999876421 111111111 2356678999999999999999998887765 577 Q ss_pred EEeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.+.++.++..+... +....++...||++++.+|++++++++++|+ T Consensus 335 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 335 VWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 7778888887765442 4556788889999999999999999999999 No 63 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.95 E-value=3.8e-30 Score=181.89 Aligned_cols=261 Identities=16% Similarity=0.125 Sum_probs=202.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |... +..+..++|+.+...+.+.+.+.+.+.+++.+.. ..+..+++|++.. .+.+.|++||+.++..++++.++ T Consensus 105 ~~~~-~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:18 105 LGSD-ADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGR----TSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQ 179 (385) T ss_pred hccc-cccCCceecchhhhHHHHHhhhccchhhhcceec----ccCcceEEEEEecCCcceeeeccCccccccccceeEE Confidence 3322 2333345677788889999999999988876532 2355789999864 45778999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) .+.+++++..+.+|++...++ +++.+.+.+++++++++++|..++....+.. ....++...++.| T Consensus 180 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:18 180 TANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 999999999999999976654 7899999999999999999999986532211 1112344678999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCC-eEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~-a~~ 224 (274) +++...+..++.....|+|||..+..|++... ..+.....+ ..+|..++++|+||++++.+|.+++++.+.. ++. T Consensus 259 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd---~~G~~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~ 334 (385) T protein:vir:18 259 AHAIYQVTESEFSASGIVLNPRDWHNIALLKD---NEGRYIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQ 334 (385) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhc---CCCceeccC-cccCCCceecceeeEEcCcCCCCcEEEeecccEEE Confidence 99999998888888999999999999876421 111111111 2356678999999999999999998887765 577 Q ss_pred EEeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ++.+.++.++..+... +....++...||++++.+|++++++++++|+ T Consensus 335 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 335 VWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 7778888887765442 4556788889999999999999999999999 No 64 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.95 E-value=2.7e-31 Score=188.20 Aligned_cols=233 Identities=15% Similarity=0.153 Sum_probs=184.5 Q ss_pred cccccccCCCceEEEEeeccCCccccccCCCcC--CccccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHH Q lcl|NC_010147. 37 VDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHG 113 (274) Q Consensus 37 ~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i--~~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a 113 (274) .-.++. .|++++||+.+.. .+..+..|+++ ++.++...+.++++++ .+..+.|+|++..++..|+++++.++++ T Consensus 1 ~vr~i~--~g~s~~~~~iG~~-~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTIT--SGKSAQFPVMGRT-KARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred Ceeeee--cCceEEEeeeeee-EeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHH Confidence 222333 4999999998654 67788999998 4688999999999988 4889999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhc----ccc---------------ccc----ccccC----HHHHHHHHHHHhhcCC--CceEEEE Q lcl|NC_010147. 114 LAHANKVDNDVLEALMG----AKL---------------TVN----ADITK----LNGLQSAIDKFNDEDL--EPMVLFI 164 (274) Q Consensus 114 ~~~a~~~d~~~~~~~~~----a~~---------------~~~----~~~~~----~d~i~~A~~~l~~~~~--~~~~~vv 164 (274) +++|+.+|+.++..+.. .+. ... ....+ ++.|.+|..+|.++++ .+||++| T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv 157 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYT 157 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEe Confidence 99999999998766421 000 000 11111 6778889999999886 6799999 Q ss_pred cHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc------------------------------- Q lcl|NC_010147. 165 NPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG------------------------------- 213 (274) Q Consensus 165 ~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~------------------------------- 213 (274) +|++|..|+++... ......+++.+++|.|++++|++|++|+++|.. T Consensus 158 ~P~~y~~Ll~~~~~--~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d 235 (324) T protein:vir:99 158 DPDTYSAILAALMP--NAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVG 235 (324) T ss_pred ChHHHHHHhhcccc--cccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccc Confidence 99999988866432 233334567789999999999999999999842 Q ss_pred ----eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC-----CCC Q lcl|NC_010147. 214 ----TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS-----LEM 274 (274) Q Consensus 214 ----~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~-----~~~ 274 (274) .+.+|++.+++.+...++++|..|++.++.|.|++++.||++++||++++.+++...+ +|. T Consensus 236 ~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~ 305 (324) T protein:vir:99 236 ADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDV 305 (324) T ss_pred cCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchh Confidence 1157888888888888899999999999999999999999999999988766653333 344 No 65 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.95 E-value=3.8e-30 Score=181.86 Aligned_cols=270 Identities=12% Similarity=0.043 Sum_probs=211.9 Q ss_pred CCCcccee--------eeeech-HHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKT--------SNQIIP-EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~--------~~~~~P-ev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) |+..++-. ++.-+. |+|+..|.+.+.+.+++.++..+ +++ ..|++++||+.+.. .++.+..|+++.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-rti--~~gkS~q~~~iG~~-~~~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-QEV--VGTNSVSNKYIGET-ELQVLSPGKSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-eee--cccceEEeeeeeee-EEeeeccCcccCC Confidence 77765421 112223 78999999999999999988755 343 35899999998654 6677889999999 Q ss_pred cccccceeEEEeee-ecceeeeeHHHHhhcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhhccc--c--------c----- Q lcl|NC_010147. 72 DILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAK--L--------T----- 134 (274) Q Consensus 72 ~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--~--------~----- 134 (274) +.+..++.+++++. ++..+.|.|.+..++..| +-+++.+++++++|+..|+.++..+..+. . . T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 99999999999987 477788999999999999 78899999999999999999876553210 0 0 Q ss_pred ------c--cccccC----HHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 135 ------V--NADITK----LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 135 ------~--~~~~~~----~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) . ....++ ++.|.+|.+.|++.++ +.|+++|+|++|..|++...+-.......+.+...+|.++++. T Consensus 157 ~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~ 236 (364) T protein:vir:10 157 FSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSW 236 (364) T ss_pred ceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEe Confidence 0 001112 4567788999999886 7799999999999999875422112112233456799999999 Q ss_pred cceEEEcCCCCc---------------------------------ceEEEEeCCeEEEEeecCceeeeecchhhcceEEE Q lcl|NC_010147. 201 GAIIVRTNKLEA---------------------------------GTAILAKKGAVKLILKRDFFLEVARDASTKTTALY 247 (274) Q Consensus 201 G~~Vv~s~~v~~---------------------------------~~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~ 247 (274) |++|++|+++|. ..+.+|++.|++.+...++.+|.+|+..++.+.+. T Consensus 237 Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id 316 (364) T protein:vir:10 237 NTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID 316 (364) T ss_pred ceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee Confidence 999999999982 11468899999999999999999999999999999 Q ss_pred EEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 248 SDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 248 ~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ..+.||++++||++++.++.+.+..-- T Consensus 317 a~~a~G~g~lRPeaa~~i~~~~~~~~~ 343 (364) T protein:vir:10 317 TFLAEGAIPDRWEAVAVVTAADTAELA 343 (364) T ss_pred eehcccCcccCccceEEEEecCCCCCc Confidence 999999999999999999876665443 No 66 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.95 E-value=1.4e-29 Score=178.74 Aligned_cols=269 Identities=12% Similarity=0.074 Sum_probs=200.6 Q ss_pred CCCcc------ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC--------CccccccCC Q lcl|NC_010147. 1 MPQGI------TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS--------GDAQVVAEG 66 (274) Q Consensus 1 Ma~~~------T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~--------~~~~~~~eg 66 (274) |++.. |.....++|+.+++.+.+.+.+.+.+.+++.+. ..++..+++|++... +.+.|++|| T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg 85 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI----PISYGETIIPTTVKRPEVGQVGVGTSNEQREG 85 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCccceeeccccccccccc Confidence 33322 111223799999999999999999999988653 234667999997543 345677899 Q ss_pred CcCCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------- Q lcl|NC_010147. 67 EKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------- 133 (274) Q Consensus 67 ~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------- 133 (274) +.++.++++++++++.++|++..+.+|+|...++..++.+.+.+++++.+++++|..++..-.+.+. T Consensus 86 ~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~ 165 (338) T protein:vir:78 86 GTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIV 165 (338) T ss_pred ccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999965432110 Q ss_pred ---cc----cccccCHHHHHHHHHHHhhcC-CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEE Q lcl|NC_010147. 134 ---TV----NADITKLNGLQSAIDKFNDED-LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) Q Consensus 134 ---~~----~~~~~~~d~i~~A~~~l~~~~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv 205 (274) .. ......++.|.++..++..+. .....|+|||..+..|++............-.+....|.-++++|+||+ T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~ 245 (338) T protein:vir:78 166 NTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQ 245 (338) T ss_pred cccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEE Confidence 00 011234788888888876543 4667899999999988653211111111111122345667899999999 Q ss_pred EcCCCCc---------ceEEEEeCCeEEEEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCc Q lcl|NC_010147. 206 RTNKLEA---------GTAILAKKGAVKLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDES 260 (274) Q Consensus 206 ~s~~v~~---------~~~~~~~~~a~~~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~ 260 (274) ++++||. ..+|+.+.+.+.++.+.++.++..|+.. +....+|...|+++++++|+ T Consensus 246 ~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~ 325 (338) T protein:vir:78 246 FGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQ 325 (338) T ss_pred EccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeeccc Confidence 9999984 3356667777888888888888877642 24456777899999999999 Q ss_pred cEEEEEecCCCCC Q lcl|NC_010147. 261 KAVKITKGSGSLE 273 (274) Q Consensus 261 ~~v~~~~~~a~~~ 273 (274) ++++|+++.++.- T Consensus 326 a~~~l~~~~~~~~ 338 (338) T protein:vir:78 326 AFVKFVDDEDPDA 338 (338) T ss_pred ceEEEecccCCCC Confidence 9999999888877 No 67 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.95 E-value=1.3e-29 Score=178.86 Aligned_cols=258 Identities=14% Similarity=0.090 Sum_probs=195.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||. .+-.++|+.+.+.+.+.+.+.+++.+++..... ++..++||++...+.+.|++||++++.+++++++++ T Consensus 1 ma~----~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 72 (298) T protein:vir:94 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred Cee----ccccccChhHHHHHHHHHHhhchhhhhcceeec----cCCceEEEEEecCcceEEeeCCccccccccceeEEE Confidence 543 334578888999999999999999888765322 334589999987788999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhc---CccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----------------cc---ccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LT---VNA 137 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-----------------~~---~~~ 137 (274) +.++|++..+.+|+|...++ ..++.+.+.+++++++++++|..++....... .. ... T Consensus 73 l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:94 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccc Confidence 99999999999999987544 45688999999999999999999997632100 00 011 Q ss_pred cccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 138 DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 138 ~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) ....++.|+++..++..++.....|+|||..+..|++.... .....-.+....|..++++|+||++++++|.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~---~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:94 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDL---QGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred cccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhcc---CCCeeecCcccCCCCceecceeeEEecccccccCCC Confidence 11236789999999998888888999999999999764211 11111122334566789999999999999853 Q ss_pred --eEEEEeCC-eEEEEeecCceeeeecch----------hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 214 --TAILAKKG-AVKLILKRDFFLEVARDA----------STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 214 --~~~~~~~~-a~~~~~~~~~~ve~~rd~----------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ..++.+.+ .+.++.+++++++..++. .++...++...|+++++.+|+++++++.+. T Consensus 230 ~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 33444443 356777888888776642 134456777899999999999999999988 No 68 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.95 E-value=1.2e-29 Score=179.10 Aligned_cols=259 Identities=14% Similarity=0.116 Sum_probs=202.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +...+|..+..++|+.+...+.+.+.+.+.+.+++.... .++.++++|.+.. .+.+.|++||++++.++++++++ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGR----TDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceee----ccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 444445555567788888889999999999988876533 2355788999865 35788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+.+|++...++ .++.+.+.+++++.+++++|..++..-.+.. ....++...++.+ T Consensus 189 ~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~ 267 (390) T protein:vir:97 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHH Confidence 999999999999999987665 6899999999999999999999986532111 1122345678899 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCC-eEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~-a~~ 224 (274) +++...+...+.....|+|||..+..|++... ..+.....+ ...+..++++|+||++++.+|+++.++.+.+ ++. T Consensus 268 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd---~~G~~l~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~ 343 (390) T protein:vir:97 268 RLAMLQASLAEYPASGIVINPIDWAAIELAKD---ANNQYLIGN-ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhc---CCCceeecC-ccCCCCceecceeeEEcCCCCCCcEEEEeccceEE Confidence 99999999888889999999999999975321 111111111 1244567899999999999999998887765 566 Q ss_pred EEeecCceeeeecch-hh--cceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 225 LILKRDFFLEVARDA-ST--KTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~rd~-~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) ++.+.++.++..++. .+ +...++...||++++.+|+++++++++ T Consensus 344 ~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 344 IFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 778888899887753 43 445677889999999999999999999 No 69 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=99.95 E-value=2.6e-29 Score=177.31 Aligned_cols=262 Identities=11% Similarity=0.073 Sum_probs=199.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCcccccc--CCCcCCccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVA--EGEKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~--eg~~i~~~~~t 75 (274) |||.. +. ++|++|++.+++.|++++++.++++++++-+ ++.|+||+||++.. ..+.++. ++..+.+++++ T Consensus 1 MaN~l--lT--~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~-~~~~d~~~~~~~~~~~~dl~ 75 (423) T protein:vir:10 1 MPNNL--DS--NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQ-FSSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Cccch--hh--hhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCc-eeeeccCCccccccccCccc Confidence 77542 11 2699999999999999999999999887533 45799999998864 3555554 44568899999 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----ccccCHHHHHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN----ADITKLNGLQSAID 150 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~----~~~~~~d~i~~A~~ 150 (274) .+++.+++.+ .+.+|+++|++..+...++ +++.++.++.+|+.+|+++++.......... +....++.|+++.. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~ 154 (423) T protein:vir:10 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTAS 154 (423) T ss_pred cceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHH Confidence 9999999976 5889999999998887776 7899999999999999999987665432221 22346899999999 Q ss_pred HHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceecccc-ceeccceEEEcCCCCcceE------------ Q lcl|NC_010147. 151 KFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAF-GEALGAIIVRTNKLEAGTA------------ 215 (274) Q Consensus 151 ~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~i-g~~~G~~Vv~s~~v~~~~~------------ 215 (274) +|++++. ..|++|++|+.++.|+++... +......+...+++|.+ |+++||+|++|+++|..+. T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~-~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~ 233 (423) T protein:vir:10 155 FLKDLGVNEGENYAVMDPWSAQRLADAQTG-LHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQ 233 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhccccc-eecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeec Confidence 9999886 679999999999999876542 33445556677899987 8999999999999983111 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010147. 216 -------------------------------------------------------------------------------- 215 (274) Q Consensus 216 -------------------------------------------------------------------------------- 215 (274) T Consensus 234 ~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i 313 (423) T protein:vir:10 234 PTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTL 313 (423) T ss_pred ceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeec Confidence Q ss_pred ---------------------------------------EEEeCCeEEEEeec-----------------Cceeeeecch Q lcl|NC_010147. 216 ---------------------------------------ILAKKGAVKLILKR-----------------DFFLEVARDA 239 (274) Q Consensus 216 ---------------------------------------~~~~~~a~~~~~~~-----------------~~~ve~~rd~ 239 (274) ++||+.||.++.+. .+++..++|. T Consensus 314 ~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~ 393 (423) T protein:vir:10 314 SGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADG 393 (423) T ss_pred cCccccccCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeeccccCceEEEEEeeec Confidence 12222233332211 1334456666 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ....+..|-+..||++.++|+-.+++-... T Consensus 394 ~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 394 DANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred cccceEEEEEeecceeeeccceEEEEEecC Confidence 677778888888999999999999998776 No 70 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.95 E-value=1.7e-29 Score=178.34 Aligned_cols=267 Identities=13% Similarity=0.053 Sum_probs=204.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccc-ccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI-LETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~-~t~~~~ 79 (274) |...++.-+-.++|+.|...|.+.+.+.+++.+++.+... .+..+.+|.....+.+.|++||+.++..+ .+++.+ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i 181 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITL----GGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLI 181 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeec----CCCceEEEEecCCcceeeecccccccccccccceeE Confidence 4433333344689999999999999999999888765332 24468888877767888999999998764 689999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------c----------------cc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------K----------------LT 134 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---------~----------------~~ 134 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++..-.+. . .. T Consensus 182 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 261 (407) T protein:vir:48 182 EPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIAS 261 (407) T ss_pred EeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999988542110 0 11 Q ss_pred ccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCc-- Q lcl|NC_010147. 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA-- 212 (274) Q Consensus 135 ~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~-- 212 (274) ..++.+++|+|+++...|..+......|+|||..+..|++....+ ....-..-+..|..++++|.||+++++||. T Consensus 262 ~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~---Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~ 338 (407) T protein:vir:48 262 GAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDND---GNYLWRPGIELGQPSSLAGYGIVENEQMPDIA 338 (407) T ss_pred ccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccC---CceeeccCcCCCCCceecceeeEEecCcCCcc Confidence 223446799999999999887777778999999999987532111 000011113456678999999999999985 Q ss_pred --ceEEEEeC--CeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 213 --GTAILAKK--GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 213 --~~~~~~~~--~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ...++|+. .++.++.+.+++++.++...++...++...||++++++|++++++++++|+.+= T Consensus 339 ~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~ 404 (407) T protein:vir:48 339 ADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQK 404 (407) T ss_pred CCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeeccCCCC Confidence 23355543 346666677777766655556778899999999999999999999999999998 No 71 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.95 E-value=3.5e-29 Score=176.60 Aligned_cols=264 Identities=12% Similarity=0.104 Sum_probs=203.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) .....+..+..++|+.++..+.+.+.+.+.+.+++.+.. .++.++++|.... .+.+.|++||+.++.++++++++ T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v 210 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQ----TSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLK 210 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceee----ccCCceeEEEEecCCCceeeeccCccccccccceeeE Confidence 222233444568999999999999999999999876532 2355788999765 46778999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) .+.+++++..+.+|++...++ .++.+.+.+++++.+++++|..++....+.. ....++..+++.+ T Consensus 211 ~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i 289 (418) T protein:vir:10 211 NQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKI 289 (418) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHH Confidence 999999999999999977655 6899999999999999999999986532211 0112234578999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCe-EE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA-VK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a-~~ 224 (274) +++...+...+.....|+|||..+..|++... ........+ ..+|..++++|+||+++++||.++.++.+.+. +. T Consensus 290 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd---~~G~~i~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~ 365 (418) T protein:vir:10 290 RLALLQAVLAEFPATGIVLNPIDWASIELTKD---SQGRYIVGN-PVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQ 365 (418) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhc---CCCceeccc-cccCCCceecceeeEEcCCCCCCcEEEeeccceEE Confidence 99999998888888899999999999876421 011111111 23566789999999999999999988877664 66 Q ss_pred EEeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 225 LILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 225 ~~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) ++.+.+++++.+++.. ++...++...||++++.+|+++++++.+.+..= T Consensus 366 ~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 366 IFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred EEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 6777888888776543 455678888999999999999999997544444 No 72 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.95 E-value=2.8e-29 Score=177.15 Aligned_cols=259 Identities=14% Similarity=0.130 Sum_probs=199.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +...++..+..++|+.+...+.+.+.+.+.+.+++.+.. ..+.++++|++.. .+.+.|++||+.++..+++++++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGR----TDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceee----ccCCceEEEEEecCCcceeeecCCcccccccceeeEE Confidence 332334444445666677788889999999988876532 2355789999865 35788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) .+.+++++..+.+|++...++ +++.+.+.+++++.+++++|..++..-.+.. ....++...++.| T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:81 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHH Confidence 999999999999999987776 6899999999999999999999986532211 1122344678999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCC-eEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~-a~~ 224 (274) +++...+...+.....|+|||..+..|++... ........+ ...+..++++|+||++++.+|+++.++.+.+ ++. T Consensus 268 ~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd---~~G~~l~~~-~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 343 (390) T protein:vir:81 268 RLAMLQASLAEYNPSGIVINPIDWAAIELAKD---ANNQYLIGN-ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhc---CCCceeecC-cccccCceecceeeEEcCCCCCCcEEEEehhceEE Confidence 99999999888888899999999999876321 011111111 1244557899999999999999998887765 466 Q ss_pred EEeecCceeeeecch-h--hcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 225 LILKRDFFLEVARDA-S--TKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~rd~-~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) ++.+.++.++.++.. . .+...++...||++++.+|+++|+++++ T Consensus 344 ~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 344 IFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 777888889887753 3 3556788889999999999999999999 No 73 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.95 E-value=4.2e-30 Score=181.65 Aligned_cols=264 Identities=16% Similarity=0.143 Sum_probs=196.4 Q ss_pred CCCc-cceeeeeechHHHHH--HHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccc Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAP--MMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETK 77 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~--~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~ 77 (274) ||.+ .|++.|+..|+.+.- .....+.+-+-+.+ +.+...-..|+++++|+|..++++++++||+.||.++++.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lg---i~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~ 77 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLG---VTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRT 77 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhc---cccccccccCCeEEeeeeeeecccccccCCcccchhhheee Confidence 9987 477778776765432 22222222222111 22233444599999999999999999999999999999976 Q ss_pred ---eeEEEeeeecceeeeeHHHH-hhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc--CHHHHHHHHHH Q lcl|NC_010147. 78 ---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT--KLNGLQSAIDK 151 (274) Q Consensus 78 ---~~~~~~~~~~~~~~vtd~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~--~~d~i~~A~~~ 151 (274) ..+++++|+.+++ |||+. ..+.+||+.+..+|+.+++++++|+++++.++++++++..+.+ .++.+.+++.. T Consensus 78 ~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~~ 155 (295) T protein:vir:99 78 KDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLAT 155 (295) T ss_pred eeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhhh Confidence 4788888888864 99996 6778999999999999999999999999999999998876644 67888888888 Q ss_pred HhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccce-EEEcCCCCcceEEEEeCCeEEEEeecC Q lcl|NC_010147. 152 FNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAI-IVRTNKLEAGTAILAKKGAVKLILKRD 230 (274) Q Consensus 152 l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~-Vv~s~~v~~~~~~~~~~~a~~~~~~~~ 230 (274) +.+.+....+++|||..++.||++..+++..++..|...+.+ ++|+. ||+|+++|+|++|.-...++.+++-.+ T Consensus 156 f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n-----fLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~ 230 (295) T protein:vir:99 156 FNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN-----FLGMQNVIVMPSVPEGKIYSTAVENLVFASLNV 230 (295) T ss_pred cccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh-----hhccceEEEcccCCCceEEEeeccceEEEEecC Confidence 888887889999999999999999999988888888877654 99997 999999999999999999988876432 Q ss_pred c--eeeeecchhhcceEEEEEEE-------------E-EEEEE--cCccEEEEEe-cCCCCCC Q lcl|NC_010147. 231 F--FLEVARDASTKTTALYSDKH-------------Y-VAYLY--DESKAVKITK-GSGSLEM 274 (274) Q Consensus 231 ~--~ve~~rd~~~~~~~v~~~~~-------------y-g~~~~--~~~~~v~~~~-~~a~~~~ 274 (274) - .+..--....+.+-+.+..| + |..+. .+++|++.+. +.+++-. T Consensus 231 ~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~~ 293 (295) T protein:vir:99 231 KGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPGI 293 (295) T ss_pred CchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCCC Confidence 1 12221122223333333333 1 11111 5678888886 3334444 No 74 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.95 E-value=4.7e-29 Score=175.87 Aligned_cols=267 Identities=12% Similarity=0.054 Sum_probs=205.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |+..++.-+..++|+.|+..|.+.+.+.+.+.+++.+... .+ ...++.+|.+.. .+.+.|++||++++. +++++++ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~-~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~ 82 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENV-TT-LTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSL 82 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeec-cC-CcceEEEEeecCCCcceeeecCCcccccccccceeE Confidence 7766666566789999999999999999999888765322 11 223577888753 467889999999986 6799999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+++|+|...++..|+.+.+.+++++.+++++|+.+++.+.+.+. .....++++|+++..++..+... T Consensus 83 i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 160 (293) T protein:vir:48 83 IKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--KPTLTKWDDIIDLEAKVDPAIKQ 160 (293) T ss_pred EEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--cccccCHHHHHHHHHhhhhhhcC Confidence 9999999999999999999999999999999999999999999999988765443 34567899999999999888778 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcce----EEEEe-C-CeEEEEeecC Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGT----AILAK-K-GAVKLILKRD 230 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~----~~~~~-~-~a~~~~~~~~ 230 (274) ...|+|||..+..|++... ......-..-+.+|..++++|+||+++++ +|... .++|+ . .++.++.+.+ T Consensus 161 ~a~~vmn~~~~~~L~~lkd---~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 237 (293) T protein:vir:48 161 TSFFLTNTSGFTALKKVKN---ALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQ 237 (293) T ss_pred CCEEEEcHHHHHHHHHhhc---cCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecc Confidence 8899999999999876321 01111111223456678999999987543 44322 23444 3 3567777888 Q ss_pred ceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 231 FFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 231 ~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++++.++.. .++...++...||++++.+|+++++++++.+...= T Consensus 238 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 238 MSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred eEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 888877643 35667889999999999999999999965544443 No 75 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.95 E-value=5.4e-29 Score=175.56 Aligned_cols=262 Identities=11% Similarity=0.010 Sum_probs=203.0 Q ss_pred CCC-ccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC--CccccccCCCcCCccccccc Q lcl|NC_010147. 1 MPQ-GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS--GDAQVVAEGEKIPTDILETK 77 (274) Q Consensus 1 Ma~-~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~--~~~~~~~eg~~i~~~~~t~~ 77 (274) .+. .++......+|+.|...|.+.+...+.+.+++.+... .+.++++|+.... +...|++||+.++..+++++ T Consensus 106 ~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~ 181 (379) T protein:vir:10 106 VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSI----SGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDIS 181 (379) T ss_pred hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeec----cCCceEEEEeecCCCcccccccCCcccccccccee Confidence 111 2222223368999999999999999888888765332 3557899987643 34568999999999999999 Q ss_pred eeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---ccccccccCHHHHHHHHHHHhh Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---LTVNADITKLNGLQSAIDKFND 154 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---~~~~~~~~~~d~i~~A~~~l~~ 154 (274) ++++.+++++..+.+|++...++ +++.+.+.+++++.+++++|..++..+.+.. ....+...+++.+.++...+.. T Consensus 182 ~i~~~~~k~~~~~~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~ 260 (379) T protein:vir:10 182 MIDVNTDFIAGFTRYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIITNKNKVEMLINEIAKQEN 260 (379) T ss_pred eeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccCcccHHHHHHHHHhhhh Confidence 99999999999999999987765 6799999999999999999999988776432 2233455678999999999988 Q ss_pred cCCCceEEEEcHHHHHHHHhhccccccccccccc--cceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCce Q lcl|NC_010147. 155 EDLEPMVLFINPLDAGKLRGDASTNFTRATELGD--DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFF 232 (274) Q Consensus 155 ~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~--~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ 232 (274) ++.....|+|||..+..|++.... .+..... ....+|...+++|+||++|+.||.|+.++.+.+.+.+..++++. T Consensus 261 ~~~~~~~~vmn~~~~~~l~~lkd~---~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~ 337 (379) T protein:vir:10 261 LDFPVTAIVLRPTDYYDILVTQKS---VGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLS 337 (379) T ss_pred ccCCCCEEEEcHHHHHHHHHhhcc---CCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceE Confidence 888888999999999988754211 1111111 11124455689999999999999999988888887777778888 Q ss_pred eeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 233 LEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 233 ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) ++..++.. +....++...|+|+++.+|+++|++++++= T Consensus 338 i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 338 LEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 88777653 455677888999999999999999999887 No 76 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.95 E-value=5.3e-29 Score=175.62 Aligned_cols=261 Identities=12% Similarity=0.100 Sum_probs=199.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +...++..+..++|+.++..|.+.+.+.+.+.+++.+... ++..+++|+... .+.+.|++||+.++.++++++++ T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~----~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i 188 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTT----ESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELE 188 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceec----CCCceEEEEEecCCCceeeecCCccccccccceeEE Confidence 2222233333467777889999999999999998766432 355689999854 45788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------------cccccccCHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL----------------TVNADITKLN 143 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~----------------~~~~~~~~~d 143 (274) .+.+++++..+.+|++...++ +++.+.+.+++++++++.+|..++....+... ...+....++ T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (395) T protein:vir:43 189 NAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRID 267 (395) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHH Confidence 999999999999999976654 67889999999999999999999865321110 1112234588 Q ss_pred HHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCe- Q lcl|NC_010147. 144 GLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA- 222 (274) Q Consensus 144 ~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a- 222 (274) .+.++...+..++.....|+|||..+..|++... ..+.....+ ..+|..++++|+||++++.||.++.++.+.+. T Consensus 268 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd---~~G~~i~~~-~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~ 343 (395) T protein:vir:43 268 RIRLAILQAQLAEFPASGIVLNPIDWALIELNKD---AENRYIIGS-PQNGTTPTLWRLPVVETQAITQDEFLTGAFSLG 343 (395) T ss_pred HHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhc---cCCceeccc-cccCCCceecceeeEEcCCCCCCcEEEEeccce Confidence 9999999998888888899999999998875421 011111111 23566788999999999999999988877554 Q ss_pred EEEEeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 223 VKLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 223 ~~~~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +.++.+.++.++.++... ++...++...||++++.+|++++++++++| T Consensus 344 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 344 AQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 556667788888776542 455678888999999999999999999999 No 77 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.95 E-value=6e-29 Score=175.32 Aligned_cols=258 Identities=15% Similarity=0.115 Sum_probs=193.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||.. .+. ++|+.+...+.+.+.+.+.+.+++.+... ++..++||++...+.+.|++||++++.+++++++++ T Consensus 1 ma~~---gG~-lvp~~~~~~ii~~~~~~s~i~~l~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MVLN---KGT-LFDPTLVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred Cccc---Ccc-eechhHHHHHHHHHHhhhhhhhhcceeec----cCCceEEEEEecCcceEEecCCccccccccceeEEE Confidence 7632 233 56666777888999999999998865422 334589999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhh---cccc--------------cc---cc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALM---GAKL--------------TV---NA 137 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~---~a~~--------------~~---~~ 137 (274) +.++|++..+.+|+|...+ +..++.+.+.+++++++++++|..++.... +... .. .. T Consensus 73 l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:16 73 MVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccc Confidence 9999999999999999854 446788999999999999999999997632 1110 00 01 Q ss_pred cccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 138 DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 138 ~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) ....++.|.++..++..++.+...++|||..+..|++... ......-.+....|..++++|+||++++++|.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd---~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:16 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKD---LQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred cccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhc---cCCCeeecCcccCCCCceecceeeEEecccccccCCC Confidence 1123678999999999888888899999999999986421 111111122234566789999999999999853 Q ss_pred --eEEEEeC-CeEEEEeecCceeeeecch----------hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 214 --TAILAKK-GAVKLILKRDFFLEVARDA----------STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 214 --~~~~~~~-~a~~~~~~~~~~ve~~rd~----------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ..++.+. .++.+..+.+++++..++. .++...++...|+++++.+|+++++++.+. T Consensus 230 ~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 2333343 4466777778787766542 124467778899999999999999999988 No 78 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=99.95 E-value=1.2e-29 Score=179.17 Aligned_cols=263 Identities=11% Similarity=0.072 Sum_probs=183.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCccccccC--CCcCCccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVAE--GEKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~e--g~~i~~~~~t 75 (274) |||... -++|++|++.+++.+++++||.++++++++-+ ++.|+||+||.+.. ..+.++.- +..+.+++++ T Consensus 1 MAN~ll----T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~-~~v~d~~~~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MANNLE----SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQ-FKSERTETGDITGKDKNGLF 75 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCc-ceeecccCcCCCCccccccc Confidence 775432 13699999999999999999999999987543 35699999999864 35666643 5678899999 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccccc---ccccCHHHHHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG-AKLTVN---ADITKLNGLQSAID 150 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-a~~~~~---~~~~~~d~i~~A~~ 150 (274) ..++.+++.+ .+.+|+++|++..++..++ +++.++.++++++++|.+++..+.. ++..+. +....|+.|++|.. T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~~ 154 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTAS 154 (423) T ss_pred cceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHHH Confidence 9999999977 5889999999999998888 5678888999999999999986654 333322 22346899999999 Q ss_pred HHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceecccc-ceeccceEEEcCCCCcceEEEEeCCeEEEEe Q lcl|NC_010147. 151 KFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAF-GEALGAIIVRTNKLEAGTAILAKKGAVKLIL 227 (274) Q Consensus 151 ~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~i-g~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~ 227 (274) +|++.+. .+|++|++|+.++.|++... .+......+...+++|.+ |+++||+|++|+++|..+...++...... . T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~-~ 232 (423) T protein:vir:35 155 FIKDIGIKTGENYAIMDPWSAQRLADAQS-GLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVK-T 232 (423) T ss_pred HHHHhcCCcCCCEEEeCHHHHHHHhcccc-ceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeec-c Confidence 9999886 57999999999999887543 344445556667888876 99999999999999987765543322110 0 Q ss_pred ecCceee-----------------eecchhhcceEEEEEEEEEEEEEcCccEEEE-----------Eec-----CCCCCC Q lcl|NC_010147. 228 KRDFFLE-----------------VARDASTKTTALYSDKHYVAYLYDESKAVKI-----------TKG-----SGSLEM 274 (274) Q Consensus 228 ~~~~~ve-----------------~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~-----------~~~-----~a~~~~ 274 (274) +.-+... ...+.....|.+ ..-|++.++|..-.++ ..+ .|+.+- T Consensus 233 a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~ 309 (423) T protein:vir:35 233 APNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQL---KFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDV 309 (423) T ss_pred ccccccccccccccceeeeeeeeeccCCcEEecceE---EeeeeeeccccccceeecccCCceeEEEEeccccccccCce Confidence 0000000 111222233322 3346666655444322 221 122222 No 79 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.95 E-value=6.7e-29 Score=175.04 Aligned_cols=269 Identities=14% Similarity=0.071 Sum_probs=199.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) .|+..--..++..-|.|+..+.+.+ .+.++.+....++.+++.+|++|+||++.. ....+|..++.+..++++.+..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~-~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t 96 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVT-AVNAYSTPALISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETT 96 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHH-HHhhhhhhcccCcceEeccCcEEEEeeecc-cccccccCCCCcccCCcccceeE Confidence 2332222223334456777766544 444454433334556777899999999986 57889999889999999999999 Q ss_pred EEeee-ecceeeeeHHHHhhcCccH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cccccCHHHHHHHHHHHh Q lcl|NC_010147. 81 AKIRK-IAKGTSITDEALLSGYGDP--QGEQVRQHGLAHANKVDNDVLEALMGAKLTV----NADITKLNGLQSAIDKFN 153 (274) Q Consensus 81 ~~~~~-~~~~~~vtd~~~~~~~~d~--~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~----~~~~~~~d~i~~A~~~l~ 153 (274) +++.+ ++..|.+++.+..++..++ .....+++...++..+|+..++.+.+..... .+....|+.|.++..+|. T Consensus 97 ~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Ld 176 (319) T protein:vir:94 97 YFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELD 176 (319) T ss_pred EEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHH Confidence 99976 7999999999999987654 4556677788889999998887775432222 233345899999999999 Q ss_pred hcCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEEeecC Q lcl|NC_010147. 154 DEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLILKRD 230 (274) Q Consensus 154 ~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~~~~~ 230 (274) ++++ ++|+++|+|+++..|+++. .|......++..+++|.+|++.|++|+..++ +.....++++++|+....+-. T Consensus 177 e~~VP~~Rvl~Vtp~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:94 177 EIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred hcCCCCCcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeee Confidence 9875 6899999999999999876 4566666677778999999999999998643 334456777888888877653 Q ss_pred ceeeeec-chhhcceEEEEEEEEEEEEEcCccEEEEE---------------------ecCCCCCC Q lcl|NC_010147. 231 FFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVKIT---------------------KGSGSLEM 274 (274) Q Consensus 231 ~~ve~~r-d~~~~~~~v~~~~~yg~~~~~~~~~v~~~---------------------~~~a~~~~ 274 (274) .+|..+ .+.++++.+++|++||++|++|++...+. +..+|+|| T Consensus 255 -~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 255 -LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred -eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCccccccccccCCcccccC Confidence 577655 57788999999999999999998554443 23455555 No 80 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.95 E-value=6.7e-29 Score=175.04 Aligned_cols=269 Identities=14% Similarity=0.071 Sum_probs=199.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) .|+..--..++..-|.|+..+.+.+ .+.++.+....++.+++.+|++|+||++.. ....+|..++.+..++++.+..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~-~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t 96 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVT-AVNAYSTPALISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETT 96 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHH-HHhhhhhhcccCcceEeccCcEEEEeeecc-cccccccCCCCcccCCcccceeE Confidence 2332222223334456777766544 444454433334556777899999999986 57889999889999999999999 Q ss_pred EEeee-ecceeeeeHHHHhhcCccH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cccccCHHHHHHHHHHHh Q lcl|NC_010147. 81 AKIRK-IAKGTSITDEALLSGYGDP--QGEQVRQHGLAHANKVDNDVLEALMGAKLTV----NADITKLNGLQSAIDKFN 153 (274) Q Consensus 81 ~~~~~-~~~~~~vtd~~~~~~~~d~--~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~----~~~~~~~d~i~~A~~~l~ 153 (274) +++.+ ++..|.+++.+..++..++ .....+++...++..+|+..++.+.+..... .+....|+.|.++..+|. T Consensus 97 ~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Ld 176 (319) T protein:vir:97 97 YFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELD 176 (319) T ss_pred EEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHH Confidence 99976 7999999999999987654 4556677788889999998887775432222 233345899999999999 Q ss_pred hcCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEEeecC Q lcl|NC_010147. 154 DEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLILKRD 230 (274) Q Consensus 154 ~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~~~~~ 230 (274) ++++ ++|+++|+|+++..|+++. .|......++..+++|.+|++.|++|+..++ +.....++++++|+....+-. T Consensus 177 e~~VP~~Rvl~Vtp~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:97 177 EIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred hcCCCCCcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeee Confidence 9875 6899999999999999876 4566666677778999999999999998643 334456777888888877653 Q ss_pred ceeeeec-chhhcceEEEEEEEEEEEEEcCccEEEEE---------------------ecCCCCCC Q lcl|NC_010147. 231 FFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVKIT---------------------KGSGSLEM 274 (274) Q Consensus 231 ~~ve~~r-d~~~~~~~v~~~~~yg~~~~~~~~~v~~~---------------------~~~a~~~~ 274 (274) .+|..+ .+.++++.+++|++||++|++|++...+. +..+|+|| T Consensus 255 -~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 255 -LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred -eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCccccccccccCCcccccC Confidence 577655 57788999999999999999998554443 23455555 No 81 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.95 E-value=3.9e-29 Score=176.32 Aligned_cols=264 Identities=11% Similarity=0.016 Sum_probs=201.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccc-ccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI-LETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~-~t~~~~ 79 (274) |...++.-+-.++|+.|...|.+.+.+.+++.+++.+-.. .+..+++|.....+.+.|++||+.++..+ .+++++ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v 205 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPV----SKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPL 205 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeec----cCCceEEEEEcCCcceeeecccccccccccccccee Confidence 5444444445689999999999999999999998865332 23457888877777889999999998775 689999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc---------cc----------------cc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG---------AK----------------LT 134 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~---------a~----------------~~ 134 (274) .+.+++++..+.+|++...++..++.+.+.+++++.+++++|..++..-.+ .+ .+ T Consensus 206 ~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 206 SFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999864211 00 01 Q ss_pred ccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCc-- Q lcl|NC_010147. 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA-- 212 (274) Q Consensus 135 ~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~-- 212 (274) ..+..+++++|+++...|......+..|+|||..+..|++.... .+...-..-+.+|.-++++|.||+++++||. T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~---~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~ 362 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDG---QGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVA 362 (425) T ss_pred cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcC---CCceeeccCccCCCCceecceeeEEecCcCCcc Confidence 12344689999999999988777888999999999998753210 1000111123456667899999999999984 Q ss_pred --ceEEEEe--CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 213 --GTAILAK--KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 213 --~~~~~~~--~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) .+.++|+ +.++.++.+.++++..++...++...++...||++++.+|+++++++.+++= T Consensus 363 ~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 363 ANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred CCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 2335554 2345666677766665555556777889999999999999999999887666 No 82 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=99.95 E-value=8.5e-29 Score=174.46 Aligned_cols=262 Identities=11% Similarity=0.093 Sum_probs=193.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCcccccc--CCCcCCccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVA--EGEKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~--eg~~i~~~~~t 75 (274) |||..|. |+|++|++.+++.++++++|.++++++++-+ ++.|+||+||.+... .+.+.. .....++++++ T Consensus 1 MANsl~~----l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~-~~~d~~~~~~t~~~~~~l~ 75 (423) T protein:vir:10 1 MANNLDA----NVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQF-KSERTMDGDITGKSKNSLI 75 (423) T ss_pred Ccccccc----ccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCce-eeecccCcccCcccccccc Confidence 7754332 6899999999999999999999999987543 456999999998643 343322 11233567888 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccccc---ccccCHHHHHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG-AKLTVN---ADITKLNGLQSAID 150 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-a~~~~~---~~~~~~d~i~~A~~ 150 (274) ..++.+++.+ .+..|+++|++..++..++ +++.++.++.+|+.+|+++...+.. ++.... .....++.+++|.. T Consensus 76 e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~~ 154 (423) T protein:vir:10 76 SAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTAS 154 (423) T ss_pred cceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHHH Confidence 8889999976 5889999999998888877 7899999999999999999765544 222221 22235899999999 Q ss_pred HHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceecccc-ceeccceEEEcCCCCc---ceE--------- Q lcl|NC_010147. 151 KFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAF-GEALGAIIVRTNKLEA---GTA--------- 215 (274) Q Consensus 151 ~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~i-g~~~G~~Vv~s~~v~~---~~~--------- 215 (274) +|++++. ..|++|++|+.++.|+++... +......+...+++|.+ |+++|++|++|+++|. ++. T Consensus 155 ~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~-~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~ 233 (423) T protein:vir:10 155 FLKDLGINSGENYAVMDPWAAQRLADAQSG-LHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGT 233 (423) T ss_pred HHhhccCCcCCCEEEeCHHHHHHHhhhhhh-hccccccchHHHHhcccceeecceEEEEecCCcccccccccceeeeeee Confidence 9999886 679999999999999875433 23334445667888876 9999999999999972 110 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010147. 216 -------------------------------------------------------------------------------- 215 (274) Q Consensus 216 -------------------------------------------------------------------------------- 215 (274) T Consensus 234 ~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i 313 (423) T protein:vir:10 234 PEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVKI 313 (423) T ss_pred eEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCceEEEe Confidence Q ss_pred ---------------------------------------EEEeCCeEEEEeec-----------------Cceeeeecch Q lcl|NC_010147. 216 ---------------------------------------ILAKKGAVKLILKR-----------------DFFLEVARDA 239 (274) Q Consensus 216 ---------------------------------------~~~~~~a~~~~~~~-----------------~~~ve~~rd~ 239 (274) ++||+.||.++.+. .+++..++|. T Consensus 314 ~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~ 393 (423) T protein:vir:10 314 SGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADG 393 (423) T ss_pred ccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCccceeecccccceEEEEEeeec Confidence 01222233222211 1334456677 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ....+..|-+..||++.++|+-.+++-... T Consensus 394 ~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 394 DANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred cccceEEEEEeecceeeeccceEEEEEecC Confidence 777788888888999999999999998776 No 83 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.94 E-value=8.8e-29 Score=174.39 Aligned_cols=259 Identities=14% Similarity=0.118 Sum_probs=195.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC-CccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~-~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +....|..+..++|+.+...+.+.+.+.+.+.+++.+.+ .++..+++|++... +.+.|++||+.++..+++++++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGR----TDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 222233333344555566677888888888888876533 23557899998753 5788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccccCHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LTVNADITKLNGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------~~~~~~~~~~d~i 145 (274) .+.+++++..+.+|++...++ +++.+.+.+++++.+++++|+.++..-.+.. ....+....++.+ T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:10 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeEEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHH Confidence 999999999999999977665 6899999999999999999999986532111 1112334568899 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCC-eEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~-a~~ 224 (274) +++...+..++.....|+|||..+..|++.... .......+. ..+..++++|+||++++.||.++.++.+.. ++. T Consensus 268 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~---~g~~l~~~~-~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~ 343 (390) T protein:vir:10 268 RLAMLQASLAEYPASGIVINPIDWAAIELAKDA---NNQYLIGNA-RGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC---CCceeecCC-cCcCCceecceeeEEcCCCCCCcEEEEeccceEE Confidence 999999998888889999999999998753210 111111111 233456899999999999999999887765 456 Q ss_pred EEeecCceeeeecch-hh--cceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 225 LILKRDFFLEVARDA-ST--KTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~rd~-~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) ++.+.++.++..+.. .+ +...+++..||++++.+|+++++++++ T Consensus 344 ~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 344 IFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 677888888877653 33 555777889999999999999999999 No 84 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.94 E-value=6.9e-29 Score=174.96 Aligned_cols=268 Identities=15% Similarity=0.082 Sum_probs=195.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) |... |.....++|+.+++.+.+.+.+.+.+.+++.+.. .++...++|+....+.+.|++||++++.+++++++++ T Consensus 20 ~~~~-~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 94 (326) T protein:vir:42 20 AQTG-DSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIP----MGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQT 94 (326) T ss_pred eecc-ccCCcceechhhHHHHHHHHHhcchhhhhcceee----ccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 3222 2222336788889999999999999888876532 2355789999988888999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------c-------ccccccCH Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------T-------VNADITKL 142 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~-------~~~~~~~~ 142 (274) +.+++++..+.+|++...++..++.+.+.+++++++++++|+.++..-.+... . ..+..... T Consensus 95 ~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~ 174 (326) T protein:vir:42 95 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVY 174 (326) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhH Confidence 99999999999999999999999999999999999999999999864322100 0 00111222 Q ss_pred HH-HHHHHHHHhhcCCCceEEEEcHHHHHHHHhhc--cccccccccccccceeccccceeccceEEEcCCCCcceEEEE- Q lcl|NC_010147. 143 NG-LQSAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILA- 218 (274) Q Consensus 143 d~-i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~--~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~- 218 (274) +. +.++...+.........|+|||..+..|++.. .-.++-......+.......++++|+||++++++|.++.+++ T Consensus 175 ~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 254 (326) T protein:vir:42 175 DAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQ 254 (326) T ss_pred HHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEE Confidence 22 34555556556667788999999999997532 111111111111111223456899999999999999987543 Q ss_pred -eCCeEEEEeecCceeeeecchh----------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 219 -KKGAVKLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 219 -~~~a~~~~~~~~~~ve~~rd~~----------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) +...+.++.+.++.++..++.. +....++...||++++.+|+++++|+...|++- T Consensus 255 Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 255 GDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred eecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 4455557777788887766643 345678899999999999999999998777777 No 85 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.94 E-value=7.6e-29 Score=174.73 Aligned_cols=263 Identities=12% Similarity=0.031 Sum_probs=201.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) |...+..-+-.++|+.|.+.|.+.+.+.+++.+++.+-. ..+..+++|.......+.|++||+.++. +..+++++ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v 182 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVIT----VGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeee----cCCCceEEEEecCCccceeeccccccCccccccceee Confidence 444333333468999999999999999999988876532 2355678888776667889999999886 45799999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------------------cc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------------------LT 134 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-------------------------~~ 134 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++..-.+.. .+ T Consensus 183 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t 262 (401) T protein:vir:44 183 EPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVS 262 (401) T ss_pred eeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999986422110 01 Q ss_pred ccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCc-- Q lcl|NC_010147. 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA-- 212 (274) Q Consensus 135 ~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~-- 212 (274) ..+..++|+.++++...|..+...+..|+|||..+..|++.... .....-..-+..|..++++|+||+++++||. T Consensus 263 ~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~---~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~ 339 (401) T protein:vir:44 263 GEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDT---EGNYLWRPGLELGQPSSLAGYGIAENEQMPDIA 339 (401) T ss_pred ccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhcc---CCceeecCCcCCCCCceecceeeEEecCcCCcc Confidence 12344679999999999987777788899999999998753210 1001111113456678899999999999984 Q ss_pred --ceEEEEe-C-CeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 213 --GTAILAK-K-GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 213 --~~~~~~~-~-~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) ...++|+ . .++.++.+.+++++.++...++...++...|+++++++|+++++++.++| T Consensus 340 ~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 340 ADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 2335544 3 35667777777777666655677778888999999999999999999999 No 86 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.94 E-value=1.8e-28 Score=172.72 Aligned_cols=265 Identities=15% Similarity=0.073 Sum_probs=199.0 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc------c Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD------I 73 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~------~ 73 (274) +...+ +.....++|+.+++.|.+.+.+.+++.+++.+.. .++....+|.....+.+.|+.|+...+.+ + T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~ 236 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELP----MSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVK 236 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceee----cCCcceEEEEecCCcceeeccccccccccccccccc Confidence 11111 2234568999999999999999999988876532 23556788888777889999998876643 5 Q ss_pred cccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------------cc Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------------LT 134 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-------------------~~ 134 (274) ++++++++.+++++..+.+|++...++..++.+.+.+++++++++++|..++..-.+.. .. T Consensus 237 ~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) T protein:vir:10 237 GALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKA 316 (458) T ss_pred ccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccc Confidence 68999999999999999999999999999999999999999999999999986421100 01 Q ss_pred ccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccc--cccccccccccceeccccceeccceEEEcCCCCc Q lcl|NC_010147. 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAST--NFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA 212 (274) Q Consensus 135 ~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~ 212 (274) ......+++.|+++...+..++..+..|+|||..+..|++.... .++.... .......|..++++|+||++++.||. T Consensus 317 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~-~~~~~~~~~~~~l~G~pv~~~~~~p~ 395 (458) T protein:vir:10 317 DGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG-NDSVKLQGQVGRIYGLPVVVSEYFPA 395 (458) T ss_pred cccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccc-cccccccCcCceecceeeEEcccccc Confidence 11234579999999999988888888999999999988753211 1111111 11223456667899999999999996 Q ss_pred ce----EEEEeC-CeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 213 GT----AILAKK-GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 213 ~~----~~~~~~-~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +. .++.+. .++.++.+.+++++.++...++...++...|+|..+++|+++|+.+++++ T Consensus 396 ~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 396 KANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 43 233333 45667777777777666555677788889999999999999999999988 No 87 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.94 E-value=2e-28 Score=172.39 Aligned_cols=267 Identities=14% Similarity=0.093 Sum_probs=204.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |+..++.-+..++|+.+...|.+.+.+.+.+.+++.+... .+..| .+.+|.+.. .+.+.|++||+.++. +++++++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV-TTLTG-SRVYEKWTDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec-ccCcc-ceEEEeeccCCcceeeecCccccccccccceee Confidence 6655555566789999999999999999999888765432 12122 356776653 356889999999986 6899999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..++....+... ....+++|.|+++...+..+... T Consensus 187 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:49 187 IKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPT--KPTLTKWDDIIDLEAKVDPAIKQ 264 (397) T ss_pred EEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccccccHHHHHHHHHhhhhhhcC Confidence 9999999999999999999999999999999999999999999999988765443 23457899999999999988888 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcce----EEEEe--CCeEEEEeecC Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGT----AILAK--KGAVKLILKRD 230 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~----~~~~~--~~a~~~~~~~~ 230 (274) ...|+|||..+..|++... ......-..-+..|.-++++|+||++.++ +|.++ .++++ +.++.++.+.+ T Consensus 265 ~a~~vmn~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:49 265 TSFFLTNTSGFTALKKVKN---ALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQH 341 (397) T ss_pred CCEEEEcHHHHHHHHHhhc---CCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecc Confidence 8999999999999976421 01111111113456678999999987543 45432 24554 33567777888 Q ss_pred ceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 231 FFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 231 ~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++++.++.. .++...++...|+++++.+|++++++++++++.+= T Consensus 342 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 342 MSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 389 (397) T ss_pred eEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecccCCC Confidence 888877643 34667889999999999999999999977665555 No 88 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.94 E-value=1.9e-28 Score=172.52 Aligned_cols=265 Identities=15% Similarity=0.130 Sum_probs=199.4 Q ss_pred CCC--cc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEee--------ccCCccccccCCCcC Q lcl|NC_010147. 1 MPQ--GI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAF--------VYSGDAQVVAEGEKI 69 (274) Q Consensus 1 Ma~--~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~--------~~~~~~~~~~eg~~i 69 (274) +.+ .. +.-...++|+.+...+.......+.+.+++.+... .+..+++|+. ...+.+.|++||+.+ T Consensus 121 ~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 196 (419) T protein:vir:94 121 RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAK 196 (419) T ss_pred cccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeec----cCCceeeeeeccccccccccCcccceecCCccc Confidence 222 22 33334678999999998888888788887765332 2334555543 334567899999999 Q ss_pred CccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------- Q lcl|NC_010147. 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------------- 132 (274) Q Consensus 70 ~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------------- 132 (274) +.++++++++++.+++++..+.+|++...++ .++.+.+.+++++.+++++|..++..-.+.. T Consensus 197 ~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~ 275 (419) T protein:vir:94 197 PQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK 275 (419) T ss_pred cccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccc Confidence 9999999999999999999999999987765 6899999999999999999999986422110 Q ss_pred -ccccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC Q lcl|NC_010147. 133 -LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE 211 (274) Q Consensus 133 -~~~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~ 211 (274) ....+....++.|+++...+..++..+..|+|||..+..|++..... ...-...+-+..|..++++|+||++++++| T Consensus 276 ~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~--~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~ 353 (419) T protein:vir:94 276 PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG--SGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) T ss_pred cccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcC--CCceeecCCcccCCCccccceeeEEcCCCC Confidence 00112234589999999999988888889999999999987542110 000001112345667799999999999999 Q ss_pred cceEEEEeCC-eEEEEeecCceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 212 AGTAILAKKG-AVKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 212 ~~~~~~~~~~-a~~~~~~~~~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) +++.++.+.. ++.++.+.++.++.++.. .++...++...||++++.+|++++++++++|.. T Consensus 354 ~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 354 QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred CccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 9998887665 456677788888776654 356778899999999999999999999999999 No 89 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=99.94 E-value=7.7e-29 Score=174.71 Aligned_cols=264 Identities=11% Similarity=0.085 Sum_probs=184.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCcccccc--CCCcCCccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVA--EGEKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~--eg~~i~~~~~t 75 (274) |||... -++|++|++.+++.++++++|.++++++++-+ ++.|+||+||.+... .+.++. .+..+++++++ T Consensus 1 MaN~ll----T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~-~~~~~~~~~~~~~~~~~l~ 75 (423) T protein:vir:17 1 MPNNLD----SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQF-SSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcc-eeecccCcccCCcccCccc Confidence 776432 13799999999999999999999999887543 457999999987543 444443 44567899999 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-c---cccccCHHHHHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-V---NADITKLNGLQSAID 150 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~-~---~~~~~~~d~i~~A~~ 150 (274) ..++.+++.+ .+.+|+++|++..+...++ +++.++.++.+|+.+|+++++.+...... . .+....|+.|+++.. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~~ 154 (423) T protein:vir:17 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTAS 154 (423) T ss_pred cceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHHH Confidence 9999999976 5889999999998888876 78999999999999999999886543222 1 122346999999999 Q ss_pred HHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceecccc-ceeccceEEEcCCCCcceEEEEeCCeEEEE- Q lcl|NC_010147. 151 KFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAF-GEALGAIIVRTNKLEAGTAILAKKGAVKLI- 226 (274) Q Consensus 151 ~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~i-g~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~- 226 (274) +|++++. .+|++|++|+.++.|+++... +......+...+++|.+ |+++||+|++|+++|..+.+.++....... T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~-~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~ 233 (423) T protein:vir:17 155 FLKDLGVNEGENYAVMDPWSAQRLADAQTG-LHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQ 233 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhccccc-eecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeeccc Confidence 9999886 679999999999998876543 33434455667899987 899999999999999887776654322111 Q ss_pred ------eecC-----c----eeeeecchhhcceEEEEEEEEEEEEEcCccE-----------EEEEecCCCCC-----C Q lcl|NC_010147. 227 ------LKRD-----F----FLEVARDASTKTTALYSDKHYVAYLYDESKA-----------VKITKGSGSLE-----M 274 (274) Q Consensus 227 ------~~~~-----~----~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~-----------v~~~~~~a~~~-----~ 274 (274) ...+ . ....+.+.....|.+ ..-|++.++|-.= -.++.++++.. - T Consensus 234 ~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~ 309 (423) T protein:vir:17 234 PTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQV---KFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDV 309 (423) T ss_pred ccccccccccccceeeeeeeeeeeccCceeecceE---EecceeeecccccccccccccccceEEEEEecccccccCce Confidence 0000 0 001112223334433 3335555544332 22333332222 1 No 90 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.94 E-value=4.2e-28 Score=170.69 Aligned_cols=267 Identities=12% Similarity=0.066 Sum_probs=202.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCCCcCCcc-ccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIPTD-ILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg~~i~~~-~~t~~~ 78 (274) |+..++.-+..++|+.|...|.+.+.+.+++.+++.+.+. .+..| ++.++.+. ..+.+.|++||+.++.. ++++++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV-TTLTG-SRVYEKWADITGLAKLDDEAGSIGTNDDPKLYP 186 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec-cCCcc-eEEEEeecCCCcceeeeccccccccccccceee Confidence 7666666667789999999999999999999998766432 22222 23344443 33567899999999865 689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+++|++...++..++.+.+.+++++.+++++|..++....+... .+..++++.|+++...|..+... T Consensus 187 v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:48 187 IRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPT--KPTLTKWDDIIDLQAKVDPAIKQ 264 (397) T ss_pred EEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--ccccccHHHHHHHHHHhhhhhcC Confidence 9999999999999999999999999999999999999999999999987655433 34567899999999999988888 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCc-----ceEEEEeCC-eEEEEeecC Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEA-----GTAILAKKG-AVKLILKRD 230 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~-----~~~~~~~~~-a~~~~~~~~ 230 (274) ...|+|||..+..|++.... ........-+.+|..++++|+||+++++ +|. ...++.+.+ ++.++.+.+ T Consensus 265 ~a~~v~n~~~~~~L~~lkd~---~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:48 265 TSFFLTNTSGFTALKKVKNA---FGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQ 341 (397) T ss_pred CCEEEECHHHHHHHHHhhcC---CCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecc Confidence 89999999999999764210 1111111123456778999999987653 442 223333434 567778888 Q ss_pred ceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 231 FFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 231 ~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.++.++.. ..+...++...||++++.+|+++++++.++++.+= T Consensus 342 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:48 342 MSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQK 389 (397) T ss_pred eEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEecccccCC Confidence 888887754 44667888889999999999999999976664444 No 91 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.94 E-value=4.2e-28 Score=170.70 Aligned_cols=262 Identities=16% Similarity=0.096 Sum_probs=201.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) |...++..+..++|+.+...+.+.+.+.+++.+++..... .+ ...++.+|.....+.+.|++||++++. ++++++++ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i 168 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPV-TT-LSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLL 168 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCcceeeeccccccccccccceeeE Confidence 6666666667799999999999999999999998765432 12 223456666666678889999999875 68999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~~~~ 158 (274) ++.+++++..+++|++...++..++.+.+.+++++.+++.+|..++....+.. .....+++++.++.. .+...... T Consensus 169 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~---~~~~~~~~~i~~~~~~~l~~~~~~ 245 (371) T protein:vir:81 169 QYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKA---KTAIADLDGLKQIINVQLDPVFRS 245 (371) T ss_pred EeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccccHHHHHHHHHhhcchhhhc Confidence 99999999999999999999989999999999999999999999988765443 234567888888764 45555556 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce-----------EEEEe--CCeEEE Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT-----------AILAK--KGAVKL 225 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~-----------~~~~~--~~a~~~ 225 (274) ...|+|||..+..|++... .........-+..|..++++|.||++++++|.+. .++|+ +..+.+ T Consensus 246 ~a~~vmn~~~~~~L~~lkd---~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~ 322 (371) T protein:vir:81 246 TSSVIVNQDAFNWLDTLKD---QNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVM 322 (371) T ss_pred CCEEEEcHHHHHHHHHhhc---cCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEE Confidence 7789999999999876421 1111111122345667899999999999998443 23444 234566 Q ss_pred EeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 226 ILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 226 ~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +.+.+++++.++... ++...++...||++++.+|+++++++++.| T Consensus 323 ~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 323 FDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 677788887776542 466788999999999999999999999999 No 92 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.94 E-value=3.9e-28 Score=170.86 Aligned_cols=261 Identities=13% Similarity=0.056 Sum_probs=194.5 Q ss_pred CCCccc-eeeeeechHHHHHHH-HHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGIT-KTSNQIIPEVLAPMM-QAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~T-~~~~~~~Pev~~~~v-~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) ++...| ..+-.++|+.+..-+ ...+.+.+.+.+++.+.. + ...+.+|+....+.+.|++||+.++.+++++++ T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~---~--~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 323 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVV---A--TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQ 323 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhccccc---C--CcceEEEEecCCcceeecccCccccccccccce Confidence 333333 333457887776654 455667777777765421 1 235778888777789999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------------cccccccccCH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA----------------KLTVNADITKL 142 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a----------------~~~~~~~~~~~ 142 (274) +.+.+++++..+++|++...++ .|+.+.+.+++++++++++|..++..-.+. ..++....+++ T Consensus 324 i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 402 (543) T protein:vir:81 324 PEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFAL 402 (543) T ss_pred eeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccccccccccccccH Confidence 9999999999999999988766 799999999999999999999998643211 01223445689 Q ss_pred HHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce-------- Q lcl|NC_010147. 143 NGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT-------- 214 (274) Q Consensus 143 d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~-------- 214 (274) +.++++...+..++.....|+|||..+..|++... ..+..+..+ +..|..++++|+||+++++||.+. T Consensus 403 ~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd---~~G~~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 478 (543) T protein:vir:81 403 ADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDT---QGGAGLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADN 478 (543) T ss_pred HHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhc---CCCceeccC-cCCCCCccccceeeEEeccccccccccccCCc Confidence 99999999998777777889999999999976321 111111111 234556789999999999998653 Q ss_pred --EEEEeCCeEEEEeecCceeeeecch------hhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 215 --AILAKKGAVKLILKRDFFLEVARDA------STKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 215 --~~~~~~~a~~~~~~~~~~ve~~rd~------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) .++.+...+.++.+.++.++.+... .++...++...|||+++.+|++++++++++++ T Consensus 479 ~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 479 FVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred ceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 3444556677777778777765432 23556788899999999999999999998888 No 93 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.94 E-value=6.5e-28 Score=169.65 Aligned_cols=268 Identities=14% Similarity=0.066 Sum_probs=201.7 Q ss_pred CCCc-cceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) .+.. .|..+...+|+.+...+.+.+.+.+.+.+++.+...- .+..++.+|.+...+.+.|++||.+++. +.+++++ T Consensus 120 ~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~ 197 (415) T protein:vir:94 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVT--NGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeecc--CCceeEEEEeecCCccceecccccccccccccccee Confidence 2222 2334456899999999999999999999987654321 1233566777666678889999999986 5689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +.+.+++++..+.+|++...++..++.+.+.+++++.+++.+|+.++....+... ...+...++++| T Consensus 198 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:94 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred eEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHH Confidence 9999999999999999999999999999999999999999999999987644221 112344679999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-- Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-- 219 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-- 219 (274) +++...+...+.....|+|||..+..|++... ........+-+.+|..++++|+||++++++|.++ .++++ T Consensus 278 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~ 354 (415) T protein:vir:94 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999998888888899999999999976321 0111111112345667899999999999998654 24444 Q ss_pred CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC---CCCCC Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS---GSLEM 274 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~---a~~~~ 274 (274) ..++.++.+.+++++..+.. ...+.+++..|+++++.+|+++++++++. .+.++ T Consensus 355 ~~~~~~~~~~~~~v~~~~~~-~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:94 355 KDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 33466677788888776543 45577899999999999999999999633 23334 No 94 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.94 E-value=8.6e-28 Score=168.96 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=200.5 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) ++..+ |.-+..++|+.|.+.+.+.+.+.+.+.+++.+...-. +..++.+|++.....+.|++||.+++. +.+++++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccC--CceeEEEEeecCCccceeeccccccCcccccceee Confidence 33322 3334568999999999999999999988876543211 123456666666667889999999986 4689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... .......++++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:81 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 9999999999999999999999999999999999999999999999987643211 122345689999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-- Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-- 219 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-- 219 (274) +++...+.+.+.....++|||..+..|++... .........-+.+|..++++|+||++++++|.+. .++|+ T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:81 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888888899999999999976311 0111111111345667799999999999998543 24554 Q ss_pred CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC-CCCC--C Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS-GSLE--M 274 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~-a~~~--~ 274 (274) +.++.++.+.+++++..+... ..+.+++..|+++++.+|+++++++++. ++++ + T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:81 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 334667777888888766543 4567889999999999999999999633 3333 3 No 95 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.94 E-value=8.6e-28 Score=168.96 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=200.5 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) ++..+ |.-+..++|+.|.+.+.+.+.+.+.+.+++.+...-. +..++.+|++.....+.|++||.+++. +.+++++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccC--CceeEEEEeecCCccceeeccccccCcccccceee Confidence 33322 3334568999999999999999999988876543211 123456666666667889999999986 4689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... .......++++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:98 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 9999999999999999999999999999999999999999999999987643211 122345689999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-- Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-- 219 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-- 219 (274) +++...+.+.+.....++|||..+..|++... .........-+.+|..++++|+||++++++|.+. .++|+ T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:98 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888888899999999999976311 0111111111345667799999999999998543 24554 Q ss_pred CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC-CCCC--C Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS-GSLE--M 274 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~-a~~~--~ 274 (274) +.++.++.+.+++++..+... ..+.+++..|+++++.+|+++++++++. ++++ + T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:98 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 334667777888888766543 4567889999999999999999999633 3333 3 No 96 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.94 E-value=8.6e-28 Score=168.96 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=200.5 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) ++..+ |.-+..++|+.|.+.+.+.+.+.+.+.+++.+...-. +..++.+|++.....+.|++||.+++. +.+++++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccC--CceeEEEEeecCCccceeeccccccCcccccceee Confidence 33322 3334568999999999999999999988876543211 123456666666667889999999986 4689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... .......++++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:79 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 9999999999999999999999999999999999999999999999987643211 122345689999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-- Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-- 219 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-- 219 (274) +++...+.+.+.....++|||..+..|++... .........-+.+|..++++|+||++++++|.+. .++|+ T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:79 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888888899999999999976311 0111111111345667799999999999998543 24554 Q ss_pred CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC-CCCC--C Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS-GSLE--M 274 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~-a~~~--~ 274 (274) +.++.++.+.+++++..+... ..+.+++..|+++++.+|+++++++++. ++++ + T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:79 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 334667777888888766543 4567889999999999999999999633 3333 3 No 97 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.94 E-value=1e-27 Score=168.57 Aligned_cols=268 Identities=14% Similarity=0.060 Sum_probs=198.5 Q ss_pred CCCc-cceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) ++.. .|.-+..++|+.|...|.+.+.+.+++.+++.+...-.+ ..++.++.....+.+.|++||..++. +.+++++ T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~ 197 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG--SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCC--ceeEEEEEecCCcceeecccccccccccccceee Confidence 2222 234455689999999999999999999998765332111 12344444455567789999999986 5789999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +.+.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... ...+...++++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:46 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 9999999999999999999999999999999999999999999999987643211 112344689999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-C Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-K 220 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-~ 220 (274) +++...+.+.+..+..|+|||..+..|++... .........-+.+|..++++|+||++++++|.+. .++|+ . T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd---~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~ 354 (415) T protein:vir:46 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEeh Confidence 99999999888888899999999999875321 0111111112345667899999999999998543 24554 3 Q ss_pred C-eEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec-CCCC--CC Q lcl|NC_010147. 221 G-AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG-SGSL--EM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~-~a~~--~~ 274 (274) . ++.++.+.++.++..+. ......+++..|+++++.+|+++++++++ ++++ ++ T Consensus 355 ~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:46 355 KDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEeecc-ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 3 46667778888876654 33456789999999999999999999963 3333 33 No 98 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.94 E-value=1e-27 Score=168.57 Aligned_cols=268 Identities=14% Similarity=0.060 Sum_probs=198.5 Q ss_pred CCCc-cceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) ++.. .|.-+..++|+.|...|.+.+.+.+++.+++.+...-.+ ..++.++.....+.+.|++||..++. +.+++++ T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~ 197 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG--SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCC--ceeEEEEEecCCcceeecccccccccccccceee Confidence 2222 234455689999999999999999999998765332111 12344444455567789999999986 5789999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccCHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +.+.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... ...+...++++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:47 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 9999999999999999999999999999999999999999999999987643211 112344689999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEEe-C Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAK-K 220 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~~-~ 220 (274) +++...+.+.+..+..|+|||..+..|++... .........-+.+|..++++|+||++++++|.+. .++|+ . T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd---~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~ 354 (415) T protein:vir:47 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD---KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhc---cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEeh Confidence 99999999888888899999999999875321 0111111112345667899999999999998543 24554 3 Q ss_pred C-eEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec-CCCC--CC Q lcl|NC_010147. 221 G-AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG-SGSL--EM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~-~a~~--~~ 274 (274) . ++.++.+.++.++..+. ......+++..|+++++.+|+++++++++ ++++ ++ T Consensus 355 ~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:47 355 KDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred hccEEEEeecceEEEeecc-ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 3 46667778888876654 33456789999999999999999999963 3333 33 No 99 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.93 E-value=1e-27 Score=168.52 Aligned_cols=267 Identities=13% Similarity=0.077 Sum_probs=205.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCccc-cccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDI-LETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~-~t~~~ 78 (274) |...++..+..++|+.+...|.+.+.+.+.+.+++.+...- + +..++.+|.+.. .+.+.|++||+.++..+ ++++. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~-~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVT-T-LTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSL 186 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeecc-C-CcceEEEEeeccCCcceeeeccccccccccccceee Confidence 66655555667899999999999999999998887654321 1 123466777753 45788999999998765 68999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..++....+... ....+++|.|+++...+..+... T Consensus 187 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:49 187 IRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPN--KPTLAKWDDIIDLQAKVDPAIKQ 264 (397) T ss_pred eEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--cccccCHHHHHHHHHhhhhhhcC Confidence 9999999999999999999999999999999999999999999999987655433 34557899999999999988888 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCCcce----EEEEe--CCeEEEEeecC Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEAGT----AILAK--KGAVKLILKRD 230 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~~~~----~~~~~--~~a~~~~~~~~ 230 (274) ...|+|||..+..|++.... ........-+..|.-++++|+||++++ .+|.++ .++|+ +.++.++.+.+ T Consensus 265 ~a~~v~n~~~~~~l~~lkd~---~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:49 265 TSLFLTNTSGFTALKKVKNA---MGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQH 341 (397) T ss_pred CCEEEEcHHHHHHHHHhhcc---CCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecc Confidence 89999999999998764210 111111111345666789999998854 455332 24444 33577888888 Q ss_pred ceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 231 FFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 231 ~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++++.++.. .++...++...|+++++.+|+++++++.++.+.+= T Consensus 342 ~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 342 LSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQK 389 (397) T ss_pred cEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccccccc Confidence 888887754 35667889999999999999999999987766655 No 100 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=99.93 E-value=1.2e-27 Score=168.16 Aligned_cols=268 Identities=10% Similarity=0.013 Sum_probs=198.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhc-ccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFAS-FAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~-~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) .||-.--.+++..-+.|...+.+.+...+.-.. +++ +.++..+|++|+||++.. ....+|..+..+..++++.+.. T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~~~g~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~ 106 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVIS--NDAIFMQGRSFTVIKGDV-TELKDYKRNATNEFDHPQIQET 106 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecc--cceeeccCcEEEEeeecc-cccccccCCCCcccccccccee Confidence 333333333334456777788777766544332 333 445666799999999975 5778999989999999999999 Q ss_pred EEEeee-ecceeeeeHHHHhhcCccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccccCHHHHHHHHHHH Q lcl|NC_010147. 80 EAKIRK-IAKGTSITDEALLSGYGDP--QGEQVRQHGLAHANKVDNDVLEALMGAKLT----VNADITKLNGLQSAIDKF 152 (274) Q Consensus 80 ~~~~~~-~~~~~~vtd~~~~~~~~d~--~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~----~~~~~~~~d~i~~A~~~l 152 (274) ++++.+ ++..|.+++.+..++...+ .....+++...++..+|+..++.+.+.... ..+....|+.|.+|..+| T Consensus 107 t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~~L 186 (329) T protein:vir:10 107 TYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSVEL 186 (329) T ss_pred EEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 999976 7999999999999986654 455566788888999999988877543222 223334589999999999 Q ss_pred hhcCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 153 NDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 153 ~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~a~~~~~~~ 229 (274) +++++ ++|+++|+|+++..|+++.. |.......+..+++|.+|++.|++|+.+++ +.....++++++|+....+. T Consensus 187 de~~vp~~Rvl~VtP~~~~~Lk~~~~--f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~ 264 (329) T protein:vir:10 187 DEIGAGASRILFVTPKFYKGIKKFVI--ELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQA 264 (329) T ss_pred HhcCCCCCcEEEeCHHHHHHHHhhhh--hhccccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeeeee Confidence 99875 67999999999999998764 555555566678899999999999998644 34445677888999888776 Q ss_pred Cceeeeec-chhhcceEEEEEEEEEEEEEcCccEEEEE-ecCCCCCC Q lcl|NC_010147. 230 DFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVKIT-KGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~r-d~~~~~~~v~~~~~yg~~~~~~~~~v~~~-~~~a~~~~ 274 (274) . .+|..+ .+.++++.+++|++||+++++|++...+. ...|...- T Consensus 265 ~-~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~ 310 (329) T protein:vir:10 265 N-EAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETN 310 (329) T ss_pred e-eeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccCcccC Confidence 5 677666 47788999999999999999999655444 22222222 No 101 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.93 E-value=5.6e-28 Score=169.99 Aligned_cols=260 Identities=15% Similarity=0.099 Sum_probs=196.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCc-----CCccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK-----IPTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~-----i~~~~~t 75 (274) ||..+|.-+..++|+.+++.|.+.+.+.+.+.+++.+.+ ..+.++++|++...+.+.|++||+. ++.++++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceee----ccCCcEEEEEEeCCcceEEeecccccccccccccccc Confidence 998888888889999999999999999999999886532 2355799999988889999999986 4556889 Q ss_pred cceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------cc--cccc Q lcl|NC_010147. 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LT--VNAD 138 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------------~~--~~~~ 138 (274) ++++++.++|++..+.+|+|...++..++.+.+.+++++.+++++|..++..-.+.. .. .... T Consensus 77 f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) T protein:vir:25 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) T ss_pred eeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999986532110 00 0111 Q ss_pred ccCHHHHHHHHHHH----hhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC--- Q lcl|NC_010147. 139 ITKLNGLQSAIDKF----NDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--- 211 (274) Q Consensus 139 ~~~~d~i~~A~~~l----~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~--- 211 (274) ...++++.++...+ ...+.....++|||..+..|++.. ++ .|..+... ++++|+||++++++| T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk------d~-~G~~i~~~---~~l~G~Pv~~~~~~~~~~ 226 (305) T protein:vir:25 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR------DA-NGNPVFRD---DSFAGFRTFFNRNGAWDA 226 (305) T ss_pred chhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhh------cc-CCceeecC---CcccccceEEcCccCCCC Confidence 12334444444333 333445566999999999997632 11 13333322 479999999999986 Q ss_pred -cceEEEEeCCeEEEEeecCceeeeecchh------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCC-CC Q lcl|NC_010147. 212 -AGTAILAKKGAVKLILKRDFFLEVARDAS------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL-EM 274 (274) Q Consensus 212 -~~~~~~~~~~a~~~~~~~~~~ve~~rd~~------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~-~~ 274 (274) ++..++.+...+.++.+.+++++.+++.. +....+|...|||+.+.||+++++++.+.++. +- T Consensus 227 ~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~p 303 (305) T protein:vir:25 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) T ss_pred CccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCC Confidence 34567777788888888888888777642 23457788899999999999999999854332 22 No 102 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.93 E-value=1.5e-28 Score=173.13 Aligned_cols=266 Identities=15% Similarity=0.176 Sum_probs=192.1 Q ss_pred CCCccc---------eeeeeechHHHHHHHHHHHHHH-hhhhcccccccccccCCCceEEEEeeccCCccc------ccc Q lcl|NC_010147. 1 MPQGIT---------KTSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ------VVA 64 (274) Q Consensus 1 Ma~~~T---------~~~~~~~Pev~~~~v~~~~~~~-~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~------~~~ 64 (274) |+.+.+ ++.+.|+ +.|++.+..-++++ ++|.+-+..- -.+..+++++.|.....+..+ -.. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHK--NESSESHNWETLASMDPDAVKRKRSRQQSA 77 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccc--cccccccceeeccccccccccccccccccc Confidence 443322 3344566 55777666555543 5555544322 123335556655543222111 111 Q ss_pred CCC-cCCccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|NC_010147. 65 EGE-KIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) Q Consensus 65 eg~-~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------- 134 (274) +++ +.|+.....+...+....++.++.|+|++..+...||.+.+.++++.+|+|+.|..+++.+.+.... T Consensus 78 d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~ 157 (322) T protein:vir:10 78 DGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEF 157 (322) T ss_pred CcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccccccccccc Confidence 222 3555666778778888778889999999999999999999999999999999999998766442211 Q ss_pred -------ccccccCHHHHHHHHHHHhhcCCC---ceEEEEcHHHHHHHHhhccccccccccccccc-eeccccceeccce Q lcl|NC_010147. 135 -------VNADITKLNGLQSAIDKFNDEDLE---PMVLFINPLDAGKLRGDASTNFTRATELGDDI-IVKGAFGEALGAI 203 (274) Q Consensus 135 -------~~~~~~~~d~i~~A~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~-~~~g~ig~~~G~~ 203 (274) ..+..++++.|++|.++|++++++ +||++++|+++.+|++++.+ ......+... .++|.+++++|++ T Consensus 158 ~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~--ts~D~~~~~~l~~~G~ig~~lGf~ 235 (322) T protein:vir:10 158 LATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEA--TSADYTSAMDLQSKGIITNWMGYT 235 (322) T ss_pred CCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhh--hhhhcccchhhhhcCeeeeeeeEE Confidence 112345789999999999998753 58999999999999998754 3333334344 4789999999999 Q ss_pred EEEcCCCCc------------------ceEEEEeCCeEEEEeecCceeeeecchh-hcceEEEEEEEEEEEEEcCccEEE Q lcl|NC_010147. 204 IVRTNKLEA------------------GTAILAKKGAVKLILKRDFFLEVARDAS-TKTTALYSDKHYVAYLYDESKAVK 264 (274) Q Consensus 204 Vv~s~~v~~------------------~~~~~~~~~a~~~~~~~~~~ve~~rd~~-~~~~~v~~~~~yg~~~~~~~~~v~ 264 (274) |++|+++|. ..+++++++|++++.+.+++++-+.++. ...+.|++.+.||+++++|++||. T Consensus 236 ~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~ 315 (322) T protein:vir:10 236 WIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFK 315 (322) T ss_pred EEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEE Confidence 999999983 3578999999999999888888665554 568999999999999999999999 Q ss_pred EEecCCC Q lcl|NC_010147. 265 ITKGSGS 271 (274) Q Consensus 265 ~~~~~a~ 271 (274) |....+- T Consensus 316 i~~~e~~ 322 (322) T protein:vir:10 316 LRLKNSL 322 (322) T ss_pred EEEeccC Confidence 9985544 No 103 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=99.93 E-value=4e-28 Score=170.81 Aligned_cols=261 Identities=14% Similarity=0.106 Sum_probs=175.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhccc----ccccccccCCCceEEEEeeccC-C---ccccccCCCcCCcc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFA----EVDSTLQGQPGDTLTFPAFVYS-G---DAQVVAEGEKIPTD 72 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~----~~~~~~~~~~g~tv~ip~~~~~-~---~~~~~~eg~~i~~~ 72 (274) |+-.--. +|.|.++..++.. +.+++..++.+ .+... ....|+.+++|+|+.+ | +.+.+.+.+.+++. T Consensus 1 m~lsD~~---vfN~~~~~a~~e~-~~q~~~~fn~as~gai~l~~-~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~ 75 (325) T protein:vir:95 1 MALSDLA---VYSEYAYSAFSET-LRQQVDLFNTATGGAIMLQS-AAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEK 75 (325) T ss_pred Cchhhhh---hhhhhhhhhhhhh-hhhhHhhhhhcccceeEecc-ccccCceeeccccccccccccccccCCCCceeccc Confidence 6532222 2788888777654 44443333332 11111 1123999999999864 4 34567778889999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHH----HHHHhhcc----cc-------cc-- Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDND----VLEALMGA----KL-------TV-- 135 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~----~~~~~~~a----~~-------~~-- 135 (274) ++++.+....+..++++|...|+.....+.++++.+.++++.+|++....+ +++.+.++ +. .. T Consensus 76 kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~ 155 (325) T protein:vir:95 76 VLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDA 155 (325) T ss_pred eeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCc Confidence 999999999999999999999999999999999887777777776666554 44444322 10 01 Q ss_pred cccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC---- Q lcl|NC_010147. 136 NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE---- 211 (274) Q Consensus 136 ~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~---- 211 (274) ....++++.|++|.++|||+...+..++|||.+|..|++++++++......+. . ..+++++|.+||+||.+| T Consensus 156 ~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g-~---~~i~t~~G~~VIVdD~~p~~~~ 231 (325) T protein:vir:95 156 ADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGT-V---NVVRDPFGKLLVMTDSPNLFAA 231 (325) T ss_pred ccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCC-c---ccccccCCcEEEEeCCCCCCCc Confidence 11235789999999999999999999999999999999998877655333221 1 146789999999999998 Q ss_pred ----cceEEEEeCCeEEEEeecCceee---eecchhhcceEEEEEEEEEEEEEcCccEEEEE-ecCCCC---CC Q lcl|NC_010147. 212 ----AGTAILAKKGAVKLILKRDFFLE---VARDASTKTTALYSDKHYVAYLYDESKAVKIT-KGSGSL---EM 274 (274) Q Consensus 212 ----~~~~~~~~~~a~~~~~~~~~~ve---~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~-~~~a~~---~~ 274 (274) +|++|++++||+++....+...+ ..|+..... +.+.+| ..+++|.++-.-+ ....|| || T Consensus 232 g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-tf~lhp~G~sw~~s~~g~sPt~aeL 301 (325) T protein:vir:95 232 GTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIR---TYQAEW-SYNIGVKGFAWDKANGGKSPTDAAL 301 (325) T ss_pred cCceeEEEEEEecCeEEecCCCCccccccccCcccceee---eeeeee-eEEeecceeeeecccccCCcChHhh Confidence 57789999999999987775443 344432221 112222 2344555554411 111233 33 No 104 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.93 E-value=4.8e-28 Score=170.36 Aligned_cols=258 Identities=14% Similarity=0.073 Sum_probs=200.5 Q ss_pred CCCccce--eeeeech-HHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccc Q lcl|NC_010147. 1 MPQGITK--TSNQIIP-EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETK 77 (274) Q Consensus 1 Ma~~~T~--~~~~~~P-ev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~ 77 (274) .|..++. .+..++| +++++.+++.+.+.+++.++... .+.+..| .++||+....+.+.|++||+.++.++++++ T Consensus 355 ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~--~~~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~ 431 (632) T protein:vir:96 355 RQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--MLPGLVG-DVDIPKKTSGANFYWIGEDEDVQDSDFDFT 431 (632) T ss_pred hhhhcccccccccccccccchHHHHHHHhhcchhhhhcce--EeecCCc-ceEEEEEeCCceeEeecCCcccccccccee Confidence 3333322 2234555 56688899999998888876322 1233223 589999987788999999999999999999 Q ss_pred eeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--c-----------ccccccccCHHH Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--K-----------LTVNADITKLNG 144 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--~-----------~~~~~~~~~~d~ 144 (274) ++++.+++++..+.+|++...++..++.+.+.+.++.++++++|+.+|..-.+. + ....++.++++. T Consensus 432 ~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 511 (632) T protein:vir:96 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWAS 511 (632) T ss_pred eEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHH Confidence 999999999999999999999999999999999999999999999998653321 1 112234567999 Q ss_pred HHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCe Q lcl|NC_010147. 145 LQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA 222 (274) Q Consensus 145 i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a 222 (274) ++++..++...+. ....++|||..+..|++.... +. .|..+... +++.|+||++|+.+|.++.++.+.+. T Consensus 512 i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~----d~-~G~~i~~~---~~l~G~pv~~s~~ip~~~~~~gd~s~ 583 (632) T protein:vir:96 512 VVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVF----DN-TGERIWQN---NEVNGYRAEASNQIPADTWIFGDWSQ 583 (632) T ss_pred HHHHHHHHhhcccccCccEEEEchhHHHHHHHHhcc----CC-CCceeecC---CeecccceEeccccccCcEEEeecce Confidence 9999988887664 455799999999888764321 11 23333322 57899999999999999998888888 Q ss_pred EEEEeecCceeeeecch--hhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 223 VKLILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 223 ~~~~~~~~~~ve~~rd~--~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) +.++...++.++.++.. .++...++..+++++++.+|++++.+++++ T Consensus 584 ~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 584 IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 87888888888777655 457778899999999999999999999988 No 105 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.93 E-value=2.7e-28 Score=171.75 Aligned_cols=267 Identities=12% Similarity=0.078 Sum_probs=208.2 Q ss_pred CCCcccee----------eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGITKT----------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T~~----------~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |.++.+.. .++|+ |+|+..|.+++..+++|.++..+- ++ +.|++++||+.+.. .++....|+++. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r-ti--~~g~s~~~~~iG~~-~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR-DL--RGSNVVRLDRLGNV-EAKGRRAGEELE 75 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhcccccee-ee--ccceeEEEeeeeee-eecccccCcccC Confidence 77654211 13566 899999999999999999987653 33 55999999998654 566788999999 Q ss_pred ccccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------- Q lcl|NC_010147. 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------------- 134 (274) Q Consensus 71 ~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------------- 134 (274) .+.+..++..++++. +...+.|.|++..++..|+..++.+++++++|+..|+.++..+..+... T Consensus 76 ~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcc Confidence 999999999999987 4677889999999999999999999999999999999887554322110 Q ss_pred ------ccccccCHH----HHHHHHHHHhhcCC-----CceEEEEcHHHHHHHHhhccccccccc---cccccceecccc Q lcl|NC_010147. 135 ------VNADITKLN----GLQSAIDKFNDEDL-----EPMVLFINPLDAGKLRGDASTNFTRAT---ELGDDIIVKGAF 196 (274) Q Consensus 135 ------~~~~~~~~d----~i~~A~~~l~~~~~-----~~~~~vv~p~~~~~L~k~~~~~~~~~s---~~~~~~~~~g~i 196 (274) ..+...+++ .+.+|...|.+.+. ..|+++|+|++|..|+++.. +.... -.+.+...+|.+ T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~--l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:78 156 EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDK--LMSVEYQATGATNDYVKSRV 233 (335) T ss_pred eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccc--ccccccccccccccccccee Confidence 011112344 44556666877665 25899999999999998753 33321 113345688999 Q ss_pred ceeccceEEEcCCCCcc---------------------eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEE Q lcl|NC_010147. 197 GEALGAIIVRTNKLEAG---------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAY 255 (274) Q Consensus 197 g~~~G~~Vv~s~~v~~~---------------------~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~ 255 (274) ++++|++|+.|+++|.+ .+.++++.|++.+...++..|.+|+...++|.|.+.+.||++ T Consensus 234 ~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g 313 (335) T protein:vir:78 234 AILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIG 313 (335) T ss_pred EEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCc Confidence 99999999999999932 346789999999998999999999999999999999999999 Q ss_pred EEcCccEEEEEecC-CCCCC Q lcl|NC_010147. 256 LYDESKAVKITKGS-GSLEM 274 (274) Q Consensus 256 ~~~~~~~v~~~~~~-a~~~~ 274 (274) ++||+.++.++.+. .+-+. T Consensus 314 ~lRPe~a~~i~~tg~~~~~~ 333 (335) T protein:vir:78 314 ARRPDTAGAIELKGIEAFDI 333 (335) T ss_pred ccCcceEEEEEecCCCcccc Confidence 99999999998543 33344 No 106 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.93 E-value=2e-27 Score=166.92 Aligned_cols=264 Identities=16% Similarity=0.106 Sum_probs=189.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccc-cccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVD-STLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~-~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |....+.....++|+.++..+.+.+.+.+++.+++... ..+.+.++ .++||.....+.+.|++||+.++.++++++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~-~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v 416 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPF-NIRVHAQVSGGAAGWVGEGKTKPLTKFDFESI 416 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccC-ceeeeeeecCcceEEeccCccccccccceeEE Confidence 22222223567899999999999999999998886442 22222222 47899988778899999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----c-------cccccccCHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----L-------TVNADITKLNGLQS 147 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-----~-------~~~~~~~~~d~i~~ 147 (274) ++.++|++..+.+|++...++.+++.+.+.+++++.+++++|..+|....+.. . ...+....+.++.. T Consensus 417 ~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~ 496 (645) T protein:vir:93 417 TFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEA 496 (645) T ss_pred EEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHH Confidence 99999999999999999999999999999999999999999999996533211 1 11122234566777 Q ss_pred HHHHHhhcCCC--ceEEEEcHHHHHHHHhhccccccccccccccce--eccccceeccceEEEcCCCCcceEEEEeCCeE Q lcl|NC_010147. 148 AIDKFNDEDLE--PMVLFINPLDAGKLRGDASTNFTRATELGDDII--VKGAFGEALGAIIVRTNKLEAGTAILAKKGAV 223 (274) Q Consensus 148 A~~~l~~~~~~--~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~--~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~ 223 (274) +...+..++.. ..+|+|||..+..|++...- . |+.+. ....-++++|+||++|++||.+.. +.+.+.+ T Consensus 497 ~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~------~-G~~~~~~~~~~~~tL~G~PV~~s~~vp~~~~-~gd~s~~ 568 (645) T protein:vir:93 497 AFGQFVAANLQPTGAVWLMSSTNALALSMRKNA------L-GQKEYPDMTLLGGSFQGLPVIVSQYVGDQLV-LVNAPDI 568 (645) T ss_pred HHHHHHhcCCCccccEEEEcHHHHHHHHhcccc------C-CceeecCCCCCCceeeceeeEEeccCCccee-EeccccE Confidence 77777766654 45799999999999764211 1 11111 111236899999999999997543 3444555 Q ss_pred EEEeecCceeeeecchh------------------------hcceEEEEEEEEEEEEEcCccEEEEEe---cCCCCC Q lcl|NC_010147. 224 KLILKRDFFLEVARDAS------------------------TKTTALYSDKHYVAYLYDESKAVKITK---GSGSLE 273 (274) Q Consensus 224 ~~~~~~~~~ve~~rd~~------------------------~~~~~v~~~~~yg~~~~~~~~~v~~~~---~~a~~~ 273 (274) .++...++.+...++.. +....++...|+++++.+|+++++||- ++|+.- T Consensus 569 ~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 569 YLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred EEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 55554455444333221 234577888999999999999999994 566666 No 107 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.93 E-value=1.7e-28 Score=172.88 Aligned_cols=267 Identities=13% Similarity=0.102 Sum_probs=209.5 Q ss_pred CCCccce----------eeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGITK----------TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T~----------~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |.++.+. -.++|+ |+|+..|...+...++|.++..+- ++ ..|++++||+.+.. .++...+|+++. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r-ti--~~g~s~~~~~iG~~-~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR-DL--RGSNVVRLDRLGNV-EAKGRRAGEELE 75 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhcccccee-ee--ccceeEEEeeeeee-eeecccCCcCcC Confidence 7665421 113566 899999999999999999987653 33 45999999998654 677889999999 Q ss_pred ccccccceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------- Q lcl|NC_010147. 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------------- 133 (274) Q Consensus 71 ~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------------- 133 (274) -+.+..++..++++. ++....|.|++..++..|+.+++.+++++++|+..|+.++..+..+.. T Consensus 76 ~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcc Confidence 998999999999987 466778999999999999999999999999999999988755432110 Q ss_pred -----cccccccCHHH----HHHHHHHHhhcCCC-----ceEEEEcHHHHHHHHhhccccccccc---cccccceecccc Q lcl|NC_010147. 134 -----TVNADITKLNG----LQSAIDKFNDEDLE-----PMVLFINPLDAGKLRGDASTNFTRAT---ELGDDIIVKGAF 196 (274) Q Consensus 134 -----~~~~~~~~~d~----i~~A~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~s---~~~~~~~~~g~i 196 (274) +..+...+++. +.+|.++|.++++. .|+++|+|++|..|+++.. +.... -.+.+...+|.+ T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~--l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:63 156 EKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDK--LMNVEYQATGATNDYVKSRV 233 (335) T ss_pred eeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccc--ccccccccccccccccCcee Confidence 01111123444 55788888888753 4999999999999998753 33321 112344678999 Q ss_pred ceeccceEEEcCCCCcc---------------------eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEE Q lcl|NC_010147. 197 GEALGAIIVRTNKLEAG---------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAY 255 (274) Q Consensus 197 g~~~G~~Vv~s~~v~~~---------------------~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~ 255 (274) ++++|++|+.|+++|.+ .+.++++.|++.+...++..|.+|+...++|.|.+.+.||++ T Consensus 234 ~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g 313 (335) T protein:vir:63 234 AILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIG 313 (335) T ss_pred EEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCc Confidence 99999999999999821 346889999999999999999999999999999999999999 Q ss_pred EEcCccEEEEEe-cCCCCCC Q lcl|NC_010147. 256 LYDESKAVKITK-GSGSLEM 274 (274) Q Consensus 256 ~~~~~~~v~~~~-~~a~~~~ 274 (274) +.||+.++.++. +..+-+. T Consensus 314 ~lRPe~a~~i~~tg~~~~~~ 333 (335) T protein:vir:63 314 ARRPDTAGAIELKGIGAFDI 333 (335) T ss_pred ccccceEEEEEEcCCCceee Confidence 999999998885 3344444 No 108 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.93 E-value=2.1e-27 Score=166.88 Aligned_cols=259 Identities=12% Similarity=0.041 Sum_probs=192.3 Q ss_pred CCCccce-eeeeechHHHHHHHHHHHHHHhhhhcc-cccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGITK-TSNQIIPEVLAPMMQAQLEKKLRFASF-AEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~T~-~~~~~~Pev~~~~v~~~~~~~~v~~~~-~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) |+..+|. ..-.++|+.+...+.+.+.+.++++++ +.+ +....| .+++|++...+.+.|++||++++.+++++++ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~---v~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~ 139 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARS---IPLPNG-NLSMPRLSGGATAGYVGEGKDVVATGATFDD 139 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceee---eecCCC-ceEEEEEeCCcceeeeccCccccccccceeE Confidence 5554433 234578999999999999999888877 332 222223 5899999877889999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc-----------cc--cccccC-- Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL-----------TV--NADITK-- 141 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--~~-----------~~--~~~~~~-- 141 (274) +++.+++++..+.+|++...++..++.+.+.+++++.+++++|+.++..-.+. +. .. .....+ T Consensus 140 i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~ 219 (366) T protein:vir:57 140 VKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT 219 (366) T ss_pred EEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh Confidence 99999999999999999999999999999999999999999999998653221 00 00 011122 Q ss_pred -HHHHHHHHHHHh---hcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 142 -LNGLQSAIDKFN---DEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 142 -~d~i~~A~~~l~---~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) .+..++.+.... ........|+|||..+..|++.. +++ |..+.....-|+++|+||++|++||.+ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk------d~~-G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~ 292 (366) T protein:vir:57 220 TIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR------DGN-GNKVYPEMSQGILKGYPIQRTSAIPANLGDD 292 (366) T ss_pred hHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh------ccC-CceeccCCCCCeecceeeEEccccccccccC Confidence 333444332222 22335677999999999997642 111 222222334468999999999999853 Q ss_pred ----eEEEEeCCeEEEEeecCceeeeecchh-------------hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 214 ----TAILAKKGAVKLILKRDFFLEVARDAS-------------TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 214 ----~~~~~~~~a~~~~~~~~~~ve~~rd~~-------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) ..++.+.+.+.++.+.+++++..|++. +....++...+|++++.+|+++++++...+ T Consensus 293 ~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 293 GNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 345567777778888888888887753 234588888999999999999999999999 No 109 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.93 E-value=3.4e-27 Score=165.66 Aligned_cols=262 Identities=13% Similarity=0.039 Sum_probs=198.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) |+..++.....++|+.|...|.+.+.+.+++.+++.+.+.-.+ ...+.+|+....+.+.|++||++++. +.++++.+ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v 200 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTR--SGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKV 200 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCC--ceeEEEEEecCCcceeeecccccccccccccceeE Confidence 5444445556789999999999999999999888765432111 23567777666678899999999986 56899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~~~~ 158 (274) ++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+.. ....++++.+.++.. .+..+... T Consensus 201 ~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~---~~g~~~~~~i~~~~~~~l~~~~~~ 277 (397) T protein:vir:12 201 SYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLK---KVDIDGLDGIKKALNVTLDPMVAP 277 (397) T ss_pred EeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccccccHHHHHHHHhhccchhhhC Confidence 99999999999999999999999999999999999999999999998765543 234568999999874 56555567 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc----e-EEEEe-CC-eEEEEeecCc Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG----T-AILAK-KG-AVKLILKRDF 231 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~----~-~~~~~-~~-a~~~~~~~~~ 231 (274) ...|+|||..+..|++... .........-+.+|..++++|+||+++++...+ . .++++ .. ++.++.+.++ T Consensus 278 ~a~~~~n~~~~~~L~~lkd---~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 354 (397) T protein:vir:12 278 GSIVLTNQDGYDWLDTLKD---GTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQ 354 (397) T ss_pred CCEEEEcHHHHHHHHHhhc---cCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecce Confidence 7889999999999876311 011111111234666789999999987764321 1 24444 33 4667778888 Q ss_pred eeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 232 FLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 232 ~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++.++... .+...++...|+++++.+|+++++++.++= T Consensus 355 ~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 355 SIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 888776543 466789999999999999999999998877 No 110 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.93 E-value=7.7e-27 Score=163.75 Aligned_cols=268 Identities=15% Similarity=0.056 Sum_probs=195.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC----CccccccCCCcCCcccc-c Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDIL-E 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~----~~~~~~~eg~~i~~~~~-t 75 (274) ++..++..+...+|+.|++.+.+.+.+.+.+.+++.+.. ..+..+++|+.... +.+.|++||+.++.++. + T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLT----MTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhcceee----ccCCceeEEEeccccccccccceecCcccccccCccc Confidence 444455566678899999999999999999988875432 23456778876533 34679999999998875 7 Q ss_pred cceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------ccccccccCH Q lcl|NC_010147. 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKL 142 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-------------~~~~~~~~~~ 142 (274) ++.+++.+++++..+.+|++...++ +++.+.+.+.+++.+++.+|+.++..-.+.. ....+....+ T Consensus 194 f~~i~~~~~k~~~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~ 272 (413) T protein:vir:81 194 FDIVTESLSKIAGLTKITDEMIEDY-DFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELA 272 (413) T ss_pred ceeeEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhH Confidence 9999999999999999999987766 5688999999999999999999986532211 1111233457 Q ss_pred HHHHHHHHHHhhc-CCCceEEEEcHHHHHHHHhhc--cccccc--cccccccceeccccceeccceEEEcCCCCcceEEE Q lcl|NC_010147. 143 NGLQSAIDKFNDE-DLEPMVLFINPLDAGKLRGDA--STNFTR--ATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAIL 217 (274) Q Consensus 143 d~i~~A~~~l~~~-~~~~~~~vv~p~~~~~L~k~~--~~~~~~--~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~ 217 (274) +.+.++...+..+ +.....|+|||..+..|++.. .-.++- ....+.+....+..++++|+||++|+++|.++.++ T Consensus 273 ~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~ 352 (413) T protein:vir:81 273 DSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVV 352 (413) T ss_pred HHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEE Confidence 7777887766544 345567999999999887532 111111 11111111112234689999999999999999888 Q ss_pred EeCC-eEEEEeecCceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 218 AKKG-AVKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 218 ~~~~-a~~~~~~~~~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) .+.. ++.++.+.++.++.++.. .++...++...||++++.+|+++++++.+.++.- T Consensus 353 gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 353 GAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 7765 566677778888877754 3466688888999999999999999997555544 No 111 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.93 E-value=3.8e-27 Score=165.41 Aligned_cols=264 Identities=15% Similarity=0.123 Sum_probs=200.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccc-cccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDIL-ETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~-t~~~~ 79 (274) .+..+|.....++|+.+.+.|.+.+.+.+.+.+++.+.+ . .| ..+||+....+.+.|++||++++.++. +++++ T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~-~---~g-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i 212 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIR-V---KG-TTRILVDTDTSPATWIEQSGALPTGDVGTIASI 212 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceee-c---Cc-eeEEEEecCCcccccccccccccccccccccee Confidence 343344445568999999999999999999999876533 2 23 368999888889999999999998874 79999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------ccccccccCHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LTVNADITKLNG 144 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------------~~~~~~~~~~d~ 144 (274) ++.+++++..+.+|++...++..++.+.+.+++++.+++++|..++..-.+.. ....++..+++. T Consensus 213 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 292 (425) T protein:vir:95 213 DFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKN 292 (425) T ss_pred eeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHH Confidence 99999999999999999999999999999999999999999999997542210 012234567899 Q ss_pred HHHHHHHHhhcC--CCceEEEEcHHHH-HHHHhhccccccccccccccc--eeccccceeccceEEEcCCCCcceEEEEe Q lcl|NC_010147. 145 LQSAIDKFNDED--LEPMVLFINPLDA-GKLRGDASTNFTRATELGDDI--IVKGAFGEALGAIIVRTNKLEAGTAILAK 219 (274) Q Consensus 145 i~~A~~~l~~~~--~~~~~~vv~p~~~-~~L~k~~~~~~~~~s~~~~~~--~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~ 219 (274) ++++...+..+. ....+++|||..+ ..|.+.. ...+++ |..+ .-.+..++++|.||++|+++|+++.++.+ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~---~~kd~~-g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd 368 (425) T protein:vir:95 293 LVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFS---IQVDSN-GNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGE 368 (425) T ss_pred HHHHHHhhhhhccccCceEEEEeChHHHHHHHHHH---hhcCCC-CceeeccCCCCCccccceeeEEcCcCCCccEEEEe Confidence 999988876544 3456788998764 3332211 111111 1111 12445678999999999999999988877 Q ss_pred CCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecC-CCCC Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGS-GSLE 273 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~-a~~~ 273 (274) .+.+.++.+.++.++.+++..+ +...+++..|+++++.+|+++++++.+. .++- T Consensus 369 ~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 369 FEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred cccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 7777777888888888877665 5567888899999999999999999754 2222 No 112 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.93 E-value=3.5e-27 Score=165.62 Aligned_cols=261 Identities=14% Similarity=0.057 Sum_probs=184.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||..+| ..-.++|+.+++.|.+.+.+.+++.+++.+... ++..++||++...+.+.|++||++++.+++++++++ T Consensus 1 Mat~tt-~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~----~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~ 75 (311) T protein:vir:99 1 MATFGT-GNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQ----RFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVT 75 (311) T ss_pred CceecC-CCceeccHHHHHHHHHHHHhhchhhhhcceeec----cCCceEEEEEeCCceeEEeecCcccccccceeeEEE Confidence 996544 444578999999999999999999998865322 234579999988889999999999999999999999 Q ss_pred EEeeeecceeeeeHHHHh---hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------ccc---cccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LTV---NADI 139 (274) Q Consensus 81 ~~~~~~~~~~~vtd~~~~---~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------------~~~---~~~~ 139 (274) +.++|++..+.+|+|... ++..++.+.+.+++++++++++|+.++......+ ..+ .... T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:99 76 STPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTI 155 (311) T ss_pred EeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccccc Confidence 999999999999999874 4567899999999999999999999996643211 000 0111 Q ss_pred c-CHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceE- Q lcl|NC_010147. 140 T-KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA- 215 (274) Q Consensus 140 ~-~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~- 215 (274) . .++.+.++..++..++. ....|+|||..+..|++... ........+....+..++++|+||++++++|.+.. T Consensus 156 ~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd---~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 156 ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARY---TDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhc---cCCCeeecCcccCCCCceecceeeEeeccccccccc Confidence 2 23455566666655443 44569999999999976421 01111111222344567899999999999873221 Q ss_pred --------------EEE-eC-CeEEEEeecCceeeeecch---------hhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 216 --------------ILA-KK-GAVKLILKRDFFLEVARDA---------STKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 216 --------------~~~-~~-~a~~~~~~~~~~ve~~rd~---------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +++ +. ..+.+..+++..++..+.. .+....+|...||++++.+| ++++++.++| T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 122 22 2345656666666655432 23445678889999999997 5667777777 No 113 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.93 E-value=7.8e-27 Score=163.71 Aligned_cols=267 Identities=14% Similarity=0.085 Sum_probs=198.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |...++.-+..++|+.++..|.+.+.+.+.+.+++.....-.. ...+.+|+... .+.+.|++||++++. +.+++++ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTS--NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCC--cceEEEeeccccccceeeecCccccccccCcceee Confidence 3322233334689999999999999999999998765432111 12345555533 356789999999986 5689999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhcCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~~~ 157 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++....+.+.. ....++++++++.. .+..... T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~--~~~~~~~~l~~~~~~~~~~~~~ 271 (408) T protein:vir:10 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK--PTIAKFDDVITMINTAVDPAII 271 (408) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccccHHHHHHHHHHhhhhhhc Confidence 99999999999999999999999999999999999999999999999887765443 33467999999874 5655555 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCCcce-----EEEEeCC-eEEEEeec Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEAGT-----AILAKKG-AVKLILKR 229 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~~~~-----~~~~~~~-a~~~~~~~ 229 (274) ....|+|||..+..|++.... .+......-+.+|..++++|+||++++ .+|... .++.+.+ ++.++.+. T Consensus 272 ~~a~~v~n~~~~~~l~~lkd~---~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:10 272 ATSSLLTNQSGLNKLALVKTA---EGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred cCCEEEEcHHHHHHHHHhhcc---CCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEec Confidence 667899999999999864211 111111112345666799999999965 355422 3444434 46777788 Q ss_pred Cceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.++.++... ++...++...||++++.+|+++++++++.++++- T Consensus 349 ~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:10 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) T ss_pred ceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccCC Confidence 88888777643 4667899999999999999999999998877776 No 114 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.92 E-value=1.3e-26 Score=162.52 Aligned_cols=267 Identities=12% Similarity=0.071 Sum_probs=198.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) +...++.-+..++|+.|...|.+.+.+.+++.+++.+.+ .++.+.++|.... .+.+.|++|+++.+. +++++++ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTP----VTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeee----ccCCceEEEEEecCCCcccccccccccccccccccee Confidence 332333334468999999999999999999999876532 2345677887753 367789999999884 7899999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +.+.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+.+....+...++|.+.++....-.... T Consensus 187 v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~- 265 (394) T protein:vir:10 187 VDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDTLVDSLKHILNVDLDPAY- 265 (394) T ss_pred EEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccHHHHHHHHHhhhhhhc- Confidence 9999999999999999999999999999999999999999999999999887776666777889999998764433333 Q ss_pred ceEEEEcHHHHHHHHhhccc--cccccccccccceeccccceeccceEEEcCCC--Cc--ce-EEEE-eCC-eEEEEeec Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDAST--NFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EA--GT-AILA-KKG-AVKLILKR 229 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~--~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v--~~--~~-~~~~-~~~-a~~~~~~~ 229 (274) ...|+|||..+..|++.... .++-.... .+....|.-++++|+||+++++. |. +. .+++ +.+ ++.++.+. T Consensus 266 ~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~ 344 (394) T protein:vir:10 266 SRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQ 344 (394) T ss_pred cCEEEecHHHHHHHHHhhccCCCeeeeccc-cccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeec Confidence 46899999999998864211 11110010 01112344468999999987654 32 12 2444 434 46667778 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++++..++..+ ...+++..|+++++.+|++++.++.+.++..= T Consensus 345 ~~~v~~~~~~~~-~~~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~ 388 (394) T protein:vir:10 345 QVTLAWEDSKIY-GRYLGAAFRFGVKQADSNAGYFVTNTDAASGS 388 (394) T ss_pred ceEEEEeccccc-ceeEEEEEEeccEEeccccEEEEEeecccCCC Confidence 888887776554 45689999999999999999999987666655 No 115 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.92 E-value=2.1e-26 Score=161.36 Aligned_cols=267 Identities=13% Similarity=0.062 Sum_probs=199.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC-CccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~-~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |...++..+..++|+.|...|.+.+.+.+.+.+++.....- + ....+.+|++... +.+.|++||++++. ++++++. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS-T-SSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeecc-C-CcceEEEEeecCCcccccccccccccccccccceee Confidence 43334444456899999999999999999998887653321 1 1234667777544 34568999999986 6799999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhcCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~~~ 157 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++++|..++....+... .....++++++++.. .+..+.. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~--~~~~~~~~~i~~~~~~~l~~~~~ 271 (408) T protein:vir:74 194 IKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK--KPTIANFDDVITMINTSVDPAII 271 (408) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--ccccccHHHHHHHHHHhhhhhhc Confidence 9999999999999999999999999999999999999999999999887654433 234568999999874 6666666 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCc-----ceEEEEeCC-eEEEEeec Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEA-----GTAILAKKG-AVKLILKR 229 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~-----~~~~~~~~~-a~~~~~~~ 229 (274) ....|+|||..+..|++... ..+...-..-+..|.-++++|+||+++++ +|. +..++.+.+ ++.++.+. T Consensus 272 ~~a~~v~n~~~~~~l~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:74 272 ATSSLLTNQSGLNKLALVKT---AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred CCCEEEEcHHHHHHHHHhhc---CCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEec Confidence 77789999999999976421 01111111112345567999999998754 553 223443433 57778888 Q ss_pred Cceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.++.++.. .++...++...||++++++|+++++++++..+.+- T Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:74 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQV 397 (408) T ss_pred ceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCC Confidence 8888887754 34667888999999999999999999987777776 No 116 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.92 E-value=9.9e-27 Score=163.14 Aligned_cols=260 Identities=11% Similarity=0.042 Sum_probs=191.1 Q ss_pred CCCccc-eeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGIT-KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T-~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +++.++ .....++|+.+.+.|.+.+.+.+++.++... .+.+. ...++||++...+.+.|++||+.++.++++++++ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~--~~~~~-~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i 201 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR--SIPLP-NGNMSLPRLAGGATASYTGENQDAKVSEARFDDV 201 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce--eeecC-CcceEEEEEeCCcceeeeccCccccccccceeeE Confidence 333322 2334689999999999999999998887322 12222 2248999998778899999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc--------------cccccccCHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL--------------TVNADITKLN 143 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--~~--------------~~~~~~~~~d 143 (274) ++.+++++..+++|++...++..++.+.+.+++++.+++++|+.++..-.+. +. .......+++ T Consensus 202 ~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 281 (428) T protein:vir:10 202 KLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLD 281 (428) T ss_pred EeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHH Confidence 9999999999999999999999999999999999999999999998653221 00 0011223444 Q ss_pred HH---HHHHHHH---hhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce--- Q lcl|NC_010147. 144 GL---QSAIDKF---NDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT--- 214 (274) Q Consensus 144 ~i---~~A~~~l---~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~--- 214 (274) .+ +++...+ .........|+|||..+..|++.. + ..|..+.....-|+++|+||+++++||.+. T Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk------d-~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~ 354 (428) T protein:vir:10 282 TIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR------D-GNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEG 354 (428) T ss_pred HHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh------c-cCCceeccCCCCCeeeceeeEEeccccccccCC Confidence 43 3333222 222335568999999999887532 1 113333333344689999999999998642 Q ss_pred -----EEEEeCCeEEEEeecCceeeeecchh-------------hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 215 -----AILAKKGAVKLILKRDFFLEVARDAS-------------TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 215 -----~~~~~~~a~~~~~~~~~~ve~~rd~~-------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++.+.+.+.++.+.++.++.+|+.. .....+|...||++++.+|+++++++...+ T Consensus 355 ~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 355 GKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred CccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 35556677777788888888888743 234678888999999999999999999999 No 117 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.92 E-value=1.6e-26 Score=162.04 Aligned_cols=261 Identities=15% Similarity=0.104 Sum_probs=191.4 Q ss_pred CCCccc--eeeeeechHHHHHHHHHHHHHHhhhhcc-cccccccccCCCceEEEEeeccCCccccccCCCcCCccccccc Q lcl|NC_010147. 1 MPQGIT--KTSNQIIPEVLAPMMQAQLEKKLRFASF-AEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETK 77 (274) Q Consensus 1 Ma~~~T--~~~~~~~Pev~~~~v~~~~~~~~v~~~~-~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~ 77 (274) ++..++ ..+..++|+.+.+.|.+.+.+.+++.++ +.+ +.+..| .+++|++...+.+.|++||+.++..+++++ T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~---v~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~ 205 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART---LPLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFD 205 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhcccee---eecCCC-ceEEEEEeCCcceeeeccCcccccccccee Confidence 332222 2234589999999999999988888776 322 222222 589999988788899999999999999999 Q ss_pred eeEEEeeeecceeeeeHHHHhhcC--ccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--c-----------cccccccC- Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHANKVDNDVLEALMGAK--L-----------TVNADITK- 141 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~--~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--~-----------~~~~~~~~- 141 (274) ++++.+++++..+.+|++...++. +++.+.+.+++++++++++|..++..-.+.. . .......+ T Consensus 206 ~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:80 206 DLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTL 285 (435) T ss_pred eEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccch Confidence 999999999999999999988874 4688999999999999999999987632210 0 00111112 Q ss_pred ---HHHHHHHHHHHhhcC--CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc--- Q lcl|NC_010147. 142 ---LNGLQSAIDKFNDED--LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG--- 213 (274) Q Consensus 142 ---~d~i~~A~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~--- 213 (274) +.++.++...+..++ .....|+|||..+..|++.. +.+ |..+.....-++++|+||++++.||.+ T Consensus 286 ~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk------d~~-G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 286 QKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR------DGN-GNKVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh------ccC-CceeccCCCCCeEeeeeeEEeccccccccC Confidence 345666666665443 35568999999999887532 111 222222223458999999999999853 Q ss_pred -----eEEEEeCCeEEEEeecCceeeeecchh-------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 214 -----TAILAKKGAVKLILKRDFFLEVARDAS-------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 214 -----~~~~~~~~a~~~~~~~~~~ve~~rd~~-------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) ..++.+.+.+.++.+.+++++..++.. +....++...||++++.+|+++++++...+.- T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 359 AGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred CCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 345566666767778889998888753 34578889999999999999999999766655 No 118 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.92 E-value=1.6e-26 Score=162.06 Aligned_cols=262 Identities=15% Similarity=0.100 Sum_probs=192.1 Q ss_pred CCCccc--eeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGIT--KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~T--~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) +++.++ ...-.++|+.+...|.+.+.+.+.+.++... ......| .+++|++...+.+.|++||+.++..+++++. T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~--~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~ 206 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGAR--TLPLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDD 206 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcce--eeecCCC-ceEEEEEeCCcceeeeccCccccccccceeE Confidence 332222 2223589999999999999988888776321 1222223 5899999877888999999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCc--cHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc---------c--ccccccC-- Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYG--DPQGEQVRQHGLAHANKVDNDVLEALMGA--KL---------T--VNADITK-- 141 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~--d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--~~---------~--~~~~~~~-- 141 (274) +++.+++++..+.+|++...++.. ++.+.+.+++++++++++|..++..-.++ +. . ......+ T Consensus 207 i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 286 (435) T protein:vir:14 207 LKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQ 286 (435) T ss_pred EEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchh Confidence 999999999999999999888854 47799999999999999999998653221 00 0 0111122 Q ss_pred --HHHHHHHHHHHhhcC--CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 142 --LNGLQSAIDKFNDED--LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 142 --~d~i~~A~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) ++.+.++...+..+. .....++|||..+..|++.. +++ |..++....-++++|+||++++.||.. T Consensus 287 ~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk------d~~-G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~ 359 (435) T protein:vir:14 287 KIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLR------DGN-GNKVYPELANGMLKGYPVGKTTQVPINLGET 359 (435) T ss_pred hHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhh------ccC-CceeccCCCCCeeecceeEeeccccccccCC Confidence 345666666665543 35668999999999987642 111 222222233468999999999999853 Q ss_pred ----eEEEEeCCeEEEEeecCceeeeecchh-------------hcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 214 ----TAILAKKGAVKLILKRDFFLEVARDAS-------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 214 ----~~~~~~~~a~~~~~~~~~~ve~~rd~~-------------~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) ..++.+.+.+.++.+.+++++.+++.. +....++...|+++++.+|+++++++...+.- T Consensus 360 ~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 360 GKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred CccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 456666677777788889998887643 35678899999999999999999999877666 No 119 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.92 E-value=2.3e-26 Score=161.12 Aligned_cols=262 Identities=15% Similarity=0.117 Sum_probs=199.0 Q ss_pred CCCcccee-eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccc Q lcl|NC_010147. 1 MPQGITKT-SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETK 77 (274) Q Consensus 1 Ma~~~T~~-~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~ 77 (274) +..+.|.. +..++|+.|...|.+.+.+.+.+.+++.+.+. .+.+.++|.+.. .+.+.|++||+..+. ++++++ T Consensus 127 ~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 202 (394) T protein:vir:97 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFK 202 (394) T ss_pred hccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeec----cCcceEEEEEecCCCccceecccccccccccccce Confidence 33333333 34589999999999999999999888765432 234578888763 356789999999985 679999 Q ss_pred eeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCC Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDL 157 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~ 157 (274) .+++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..++....+.+ .....++++++++....-+. . T Consensus 203 ~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~---~~~~~~~~~~~~~~~~~~~~-~ 278 (394) T protein:vir:97 203 DVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDP-A 278 (394) T ss_pred eEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhh-h Confidence 9999999999999999999999999999999999999999999999987765443 23456789999887654433 2 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcceEEEEeCC-eEEEEeecCceee Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKG-AVKLILKRDFFLE 234 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~~~~~~~~~-a~~~~~~~~~~ve 234 (274) ....|+|||..+..|++.... .+......-+.+|..++++|+||+++++ ++.+++++.+.. .+.++.+.++.++ T Consensus 279 ~~a~~v~n~~~~~~l~~lkd~---~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~ 355 (394) T protein:vir:97 279 YNVSLIVSQSFYQTLDTLKDG---NGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLR 355 (394) T ss_pred hCCEEEEcHHHHHHHHHhhcc---CCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEE Confidence 345799999999998753210 1111111113455667999999999654 566777766644 4667778888888 Q ss_pred eecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 235 VARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 235 ~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ..++... ...+++..||++++.+|+++++++.+.++..+ T Consensus 356 ~~~~~~~-~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 356 WADNEIY-GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred Eeccccc-ceeEEEEEEEccEEecccceEEEEecccccCC Confidence 7766544 56789999999999999999999999998888 No 120 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.92 E-value=2.1e-26 Score=161.31 Aligned_cols=269 Identities=17% Similarity=0.120 Sum_probs=187.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |+..+|.....++|+.+...|.+.+.+.+.+.+++.+-.. .+..++||+... .+.+.|++||+.++.++++++++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~----~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccccc----CCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 5555555556689999999999999999999998765322 344689998754 45788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------c-ccccc------------ Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------K-LTVNA------------ 137 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---------~-~~~~~------------ 137 (274) ++.+++++..+.+|++...++ +++.+.+.+++++.+++++|..++..-.+. + ..... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:78 227 YEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred EeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhh Confidence 999999999999999988765 679999999999999999999998652110 0 00000 Q ss_pred ------------------------------------------ccc---CHHHHHHHHHHHhhcC-CCceEEEEcHHHHHH Q lcl|NC_010147. 138 ------------------------------------------DIT---KLNGLQSAIDKFNDED-LEPMVLFINPLDAGK 171 (274) Q Consensus 138 ------------------------------------------~~~---~~d~i~~A~~~l~~~~-~~~~~~vv~p~~~~~ 171 (274) ..+ ..+.+..+...+...+ .....|+|||..+.. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:78 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 000 0112233333333322 345589999999998 Q ss_pred HHhh--cccccccccccc-ccceeccccceeccceEEEcCCCCcceEEEEe--CCeEEEEeecCceeeeecc----hhhc Q lcl|NC_010147. 172 LRGD--ASTNFTRATELG-DDIIVKGAFGEALGAIIVRTNKLEAGTAILAK--KGAVKLILKRDFFLEVARD----ASTK 242 (274) Q Consensus 172 L~k~--~~~~~~~~s~~~-~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~--~~a~~~~~~~~~~ve~~rd----~~~~ 242 (274) |++. ..-.++-....+ ......+.-+++.|+||+++++||.++.++.+ .+++.++.+.++.++..+. -.+. T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n 465 (497) T protein:vir:78 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG 465 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcC Confidence 8653 222222222111 11112223458999999999999999987744 3456677788888776543 2346 Q ss_pred ceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 243 TTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 243 ~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ...++...||++.+.+|+++++++.++++.-- T Consensus 466 ~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 466 KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred cEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 66788889999999999999999963322222 No 121 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.92 E-value=2.1e-26 Score=161.31 Aligned_cols=269 Identities=17% Similarity=0.120 Sum_probs=187.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |+..+|.....++|+.+...|.+.+.+.+.+.+++.+-.. .+..++||+... .+.+.|++||+.++.++++++++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~----~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccccc----CCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 5555555556689999999999999999999998765322 344689998754 45788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------c-ccccc------------ Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------K-LTVNA------------ 137 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---------~-~~~~~------------ 137 (274) ++.+++++..+.+|++...++ +++.+.+.+++++.+++++|..++..-.+. + ..... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:10 227 YEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred EeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhh Confidence 999999999999999988765 679999999999999999999998652110 0 00000 Q ss_pred ------------------------------------------ccc---CHHHHHHHHHHHhhcC-CCceEEEEcHHHHHH Q lcl|NC_010147. 138 ------------------------------------------DIT---KLNGLQSAIDKFNDED-LEPMVLFINPLDAGK 171 (274) Q Consensus 138 ------------------------------------------~~~---~~d~i~~A~~~l~~~~-~~~~~~vv~p~~~~~ 171 (274) ..+ ..+.+..+...+...+ .....|+|||..+.. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:10 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 000 0112233333333322 345589999999998 Q ss_pred HHhh--cccccccccccc-ccceeccccceeccceEEEcCCCCcceEEEEe--CCeEEEEeecCceeeeecc----hhhc Q lcl|NC_010147. 172 LRGD--ASTNFTRATELG-DDIIVKGAFGEALGAIIVRTNKLEAGTAILAK--KGAVKLILKRDFFLEVARD----ASTK 242 (274) Q Consensus 172 L~k~--~~~~~~~~s~~~-~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~--~~a~~~~~~~~~~ve~~rd----~~~~ 242 (274) |++. ..-.++-....+ ......+.-+++.|+||+++++||.++.++.+ .+++.++.+.++.++..+. -.+. T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n 465 (497) T protein:vir:10 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG 465 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcC Confidence 8653 222222222111 11112223458999999999999999987744 3456677788888776543 2346 Q ss_pred ceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 243 TTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 243 ~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ...++...||++.+.+|+++++++.++++.-- T Consensus 466 ~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 466 KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred cEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 66788889999999999999999963322222 No 122 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.92 E-value=3.6e-26 Score=160.08 Aligned_cols=267 Identities=13% Similarity=0.070 Sum_probs=194.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |...++..+..++|+.++..|.+.+.+.+.+.+++.+... .+ ...++.+|+.. ..+.+.|++||+.++. +++++++ T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 193 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-ST-SNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTI 193 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeec-cC-CcceEEEEeecCCccceeeecCccccccccccceee Confidence 4434444445689999999999999999999988765321 11 11234444443 3356789999999985 7899999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHH-HhhcCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~-l~~~~~ 157 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++++|+.++....+... .....+++++++++.. +..+.. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~--~~~~~~~~~i~~~~~~~~~~~~~ 271 (404) T protein:vir:39 194 IKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK--KPTIAKFDDVITMINTSVDPAII 271 (404) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--ccccccHHHHHHHHHHhhhhhhc Confidence 9999999999999999999999999999999999999999999999987655433 2344678999998764 444445 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC--CCcc-----eEEEEeCC-eEEEEeec Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAG-----TAILAKKG-AVKLILKR 229 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~--v~~~-----~~~~~~~~-a~~~~~~~ 229 (274) ....|+|||..+..|++... .........-+..+..++++|+||+++++ +|.. ..++.+.. ++.++.+. T Consensus 272 ~~a~~v~n~~~~~~L~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (404) T protein:vir:39 272 ATSSLLTNQSGLNKLALVKT---AEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (404) T ss_pred cCCEEEEcHHHHHHHHHhhc---cCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeec Confidence 66789999999999986321 01111111112445567999999999765 4432 23444444 56677788 Q ss_pred Cceeeeecch----hhcceEEEEEEEEEEEEEcCccEEEEEecCCCC-CC Q lcl|NC_010147. 230 DFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL-EM 274 (274) Q Consensus 230 ~~~ve~~rd~----~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~-~~ 274 (274) +++++.++.. .++...++...||++++.+|+++++++++.++. += T Consensus 349 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~ 398 (404) T protein:vir:39 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVG 398 (404) T ss_pred ceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeeccccCCC Confidence 8888887764 346678889999999999999999999644433 33 No 123 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.92 E-value=4.1e-26 Score=159.75 Aligned_cols=267 Identities=12% Similarity=0.048 Sum_probs=198.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCC-ccccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIP-TDILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~-~~~~t~~~ 78 (274) |+-.++.-...++|+.|...|.+.+.+.+.+.+++.+.+. .+.+.++|.+.. .+.+.|+.|+++.+ .++++++. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPV----TTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeec----cCCeeEEEEEecCCCcccccccccccccccccccee Confidence 5544444445789999999999999999999888765432 344577887753 34557899998887 47999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +.+.+++++..+.+|++...++..|+.+.+.+++++.+++..|..++..+.+......+...+++.+.++....-+... T Consensus 185 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~- 263 (389) T protein:vir:10 185 VDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTLVDSLKHILNVDLDPAY- 263 (389) T ss_pred eeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccHHHHHHHHHhhhhhhh- Confidence 9999999999999999999999999999999999999999999999999888777777777899999998764322222 Q ss_pred ceEEEEcHHHHHHHHhhcc--ccccccccccccceeccccceeccceEEEcCCC-Ccc---e-EEEEe-CC-eEEEEeec Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRTNKL-EAG---T-AILAK-KG-AVKLILKR 229 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~--~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v-~~~---~-~~~~~-~~-a~~~~~~~ 229 (274) ...|+|||..+..|++... -.++-.... .+....|..++++|+||++.++. +.. . .++|+ .. ++.++.+. T Consensus 264 ~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 342 (389) T protein:vir:10 264 SRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQ 342 (389) T ss_pred CcEEEecHHHHHHHHHhhccCCCeeeecCc-ccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeec Confidence 4689999999999986321 111111111 11112344568999999876553 321 1 24554 34 46777788 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.++..++... .+.+++..|+++++.+|+++++++.+.++..= T Consensus 343 ~~~i~~~~~~~~-~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 386 (389) T protein:vir:10 343 QVTLAWEDSKIY-GKYLGAAFRFGVQKADSKAGYFVTNTDVPGSA 386 (389) T ss_pred ceEEEeeccccc-cceEEEEEEeccEEecccceEEEEeeccCCCC Confidence 888888876554 45788889999999999999999975444333 No 124 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.92 E-value=3.8e-27 Score=165.46 Aligned_cols=257 Identities=12% Similarity=0.073 Sum_probs=195.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.+...|.+.+.+.+.+++++.+-. ..| .++|.... .+++.|++||+.++.++++++++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 156 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 156 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEe----cCC--ceEEEEecCCCcccccccccccccccccceee Confidence 333333444568999999999999999988888876532 122 35677653 46789999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------ccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------LTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------~~~~~~~~~~d~i~~A~ 149 (274) ++.+++++..+++|++...++..|+.+.+.+++++.+++..+..++..-.+.. ....++...+|.|+++. T Consensus 157 ~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~~ 236 (352) T protein:vir:78 157 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINAL 236 (352) T ss_pred eecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999875665664432211 11123345689999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|......+..|+||+..+..|++.. . +.+ +.+..|.-.+++|.||++++.+++ ++|+.-...+.... T Consensus 237 ~~l~~~~~~~a~~~mn~~t~~~l~~~~-----~--~~~-~~~~~~~~~~llG~PV~~~~~~~~---~~~Gdf~~~~~~~~ 305 (352) T protein:vir:78 237 ADLHEDYRDNATIYMRYADYVKIISVL-----S--NGT-TNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 305 (352) T ss_pred hccChhhhcCCEEEEehHHHHHHHHHH-----h--ccC-CcccccCCccccccceEEecCCCc---eeEeehhhhhhhhh Confidence 998877777888999999987776531 1 111 223445566899999999998764 33432222233345 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +..++.+++..++...++.+.||++++++|++++.++.++++-++ T Consensus 306 ~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~ 350 (352) T protein:vir:78 306 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSL 350 (352) T ss_pred hheeeeeccccCCeeEEEEEeeeCceeechhheEEEEeecccCCC Confidence 566778888888889999999999999999999999998888888 No 125 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.92 E-value=1.9e-26 Score=161.57 Aligned_cols=259 Identities=17% Similarity=0.152 Sum_probs=195.7 Q ss_pred CCCccc-eeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCCCcCCc-cccccc Q lcl|NC_010147. 1 MPQGIT-KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIPT-DILETK 77 (274) Q Consensus 1 Ma~~~T-~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg~~i~~-~~~t~~ 77 (274) |...+| .-+..++|+.|...|.+.+.+.+.+.+++.+.+. ++.++++|.+. ..+.+.|+.||...+. ++++++ T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~ 208 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQA----STQKGTYPTVANATTKMVTVAELEKNPAMAKPEFK 208 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEec----cCcceEEEEEecCCCccccccccccccccccccce Confidence 333322 2234689999999999999999999888765322 34467888876 3467889999999875 689999 Q ss_pred eeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCC Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDL 157 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~ 157 (274) .+++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+.+. ....+++.+.++....-+.. T Consensus 209 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~- 284 (400) T protein:vir:38 209 PVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA---KTISSVDDLKHINNVDLDPA- 284 (400) T ss_pred eeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc---cccccHHHHHHHHHhhhhhh- Confidence 99999999999999999999999999999999999999999999999877665443 34567888888876543332 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce----EEEE-eCC-eEEEEeecCc Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILA-KKG-AVKLILKRDF 231 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~----~~~~-~~~-a~~~~~~~~~ 231 (274) ....|+|||..+..|++.... .....-..-+..|..++++|+||++++++|.+. .++| +.+ ++.++.+.++ T Consensus 285 ~~a~~v~~~~~~~~l~~lkd~---~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~ 361 (400) T protein:vir:38 285 YSRVIIASQSFYNFLDTVKDG---NGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADF 361 (400) T ss_pred hCcEEEEcHHHHHHHHHhhcc---CCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecce Confidence 356899999999998764210 100111112345666799999999999988543 2444 434 4666777888 Q ss_pred eeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 232 FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 232 ~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) .++..++.. +...+++..||++++.+|+++++|+.+.++ T Consensus 362 ~~~~~~~~~-~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 362 MVRWVDDQI-YGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEecccc-cceeEEEEEEeccEEecccceEEEEeecCC Confidence 888776654 456899999999999999999999998888 No 126 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.92 E-value=5e-26 Score=159.29 Aligned_cols=269 Identities=12% Similarity=0.067 Sum_probs=197.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc--ccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD--ILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~--~~t~~~ 78 (274) |...++..+..++|+.|...+.+.+.+.+++.+++...+. .+ ....+.+|+....+.+.|+.||+..+.+ +++++. T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~-~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~ 187 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPV-FT-RSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLER 187 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeec-cC-CccceEEEEecCCcceeeccccccccccccccceee Confidence 5444444455689999999999999999999888755322 11 2335778887777788999999998875 578999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------cccccccCHHHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------TVNADITKLNGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------~~~~~~~~~d~i~ 146 (274) +++++++++..+.+|++...++..++.+.+.+++++.+++.+|..++....+... ...+...+++.+. T Consensus 188 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~ 267 (404) T protein:vir:10 188 FNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDFK 267 (404) T ss_pred eEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHHH Confidence 9999999999999999999999899999999999999999999999876443211 1123345688888 Q ss_pred HHHHH-HhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE-cCCCCcce----EEEEe- Q lcl|NC_010147. 147 SAIDK-FNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR-TNKLEAGT----AILAK- 219 (274) Q Consensus 147 ~A~~~-l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~-s~~v~~~~----~~~~~- 219 (274) ++... +.........|+|||..+..|++.. +. ........-+..|..++++|+||++ ++.+|.++ .++++ T Consensus 268 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk--d~-~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd 344 (404) T protein:vir:10 268 KCKNVELLNVFKATSSWIVNQDGFNYLDSLE--DK-TGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGD 344 (404) T ss_pred HHHHhhhhccccCCCEEEEcHHHHHHHHHhh--cc-CCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEe Confidence 87763 4444445667999999999988642 11 1111111113456677899999985 45555432 24444 Q ss_pred C-CeEEEEeecCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 220 K-GAVKLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 220 ~-~a~~~~~~~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) . .++.++.+.+++++.+++.. ++...++...|+++.+.+|+++++++++.|+.+- T Consensus 345 ~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 345 TKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 3 35667778888888776643 4667899999999999999999999987777777 No 127 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.92 E-value=3.3e-26 Score=160.27 Aligned_cols=265 Identities=13% Similarity=0.054 Sum_probs=194.9 Q ss_pred CCCccceee--eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC-CccccccCCCcCCccccccc Q lcl|NC_010147. 1 MPQGITKTS--NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETK 77 (274) Q Consensus 1 Ma~~~T~~~--~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~-~~~~~~~eg~~i~~~~~t~~ 77 (274) -++.++... -.++|+.|...|.+.+.+.+.+.+++.+... ..+..+.+|..... ..+.|++||+.++.+++++. T Consensus 115 ~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~ 191 (409) T protein:vir:45 115 RAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTT---SDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFG 191 (409) T ss_pred hhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeec---CCCceEEEEeeccCccccccccccccccccccccc Confidence 233333333 3589999999999999999998888765432 23445677776543 34569999999999999999 Q ss_pred eeEEEeeeec-ceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------cccccccccC Q lcl|NC_010147. 78 KREAKIRKIA-KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------------KLTVNADITK 141 (274) Q Consensus 78 ~~~~~~~~~~-~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---------------~~~~~~~~~~ 141 (274) ...+..+|++ ..+.+|++...++..|+.+.+.+++++.+++++|..++..-.+. .....++.++ T Consensus 192 ~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~ 271 (409) T protein:vir:45 192 MGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVK 271 (409) T ss_pred eeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccc Confidence 9999998875 56789999999999999999999999999999999998643221 1122345578 Q ss_pred HHHHHHHHHHHhhcCCCce-E-EEEcHHHHHHHHhhc--cccccccccccccceeccccceeccceEEEcCCCCc----c Q lcl|NC_010147. 142 LNGLQSAIDKFNDEDLEPM-V-LFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEA----G 213 (274) Q Consensus 142 ~d~i~~A~~~l~~~~~~~~-~-~vv~p~~~~~L~k~~--~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~----~ 213 (274) ++.|+++...|..+..... | +++||..+..|++.. .-.++ -.+-+.+|..++++|.||+++++||. . T Consensus 272 ~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i-----~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~ 346 (409) T protein:vir:45 272 WQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPL-----WLPDIVGVAPASVLNVPYVIDQEIDDIGAGK 346 (409) T ss_pred hHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCcee-----eccCcCCCCCceecceeeEEecCcCCccCCc Confidence 9999999999987665443 3 578999998886532 11111 11123456667899999999999984 1 Q ss_pred eEEEE-eCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 214 TAILA-KKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 214 ~~~~~-~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) ..++| +.+.+.+..+.+..++..++..+ +...++...||++++.+|+++++++..+++.- T Consensus 347 ~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 347 KFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred cEEEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 23444 44555566667777777777664 55678888999999999999999997555555 No 128 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.92 E-value=5.8e-26 Score=158.92 Aligned_cols=267 Identities=12% Similarity=0.066 Sum_probs=198.0 Q ss_pred CCCcccee--eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcc-cccc Q lcl|NC_010147. 1 MPQGITKT--SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTD-ILET 76 (274) Q Consensus 1 Ma~~~T~~--~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~-~~t~ 76 (274) |...++.. +..++|+.|+..|.+.+.+.+.+.+++..-+. .+ ....+.+|.... .+.+.|+.||+.++.. .+++ T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENV-TT-SHGSRVYEKLADITPLKDLDDESALIGDNDDPEL 182 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeec-cC-CcceEEEEeeccCCccccccccccccccccccce Confidence 44444433 34689999999999999999999998765321 11 112344444433 3456799999999865 6899 Q ss_pred ceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhc Q lcl|NC_010147. 77 KKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDE 155 (274) Q Consensus 77 ~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~ 155 (274) +.+++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..++....+.... ....+++.++++.. .+... T Consensus 183 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~--~~~~~~~~i~~~~~~~l~~~ 260 (395) T protein:vir:38 183 TVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKK--PTISQFDNIKDLENNTLDPA 260 (395) T ss_pred eeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccccHHHHHHHHHHhhhhh Confidence 9999999999999999999999999999999999999999999999999876554332 33467899999875 45555 Q ss_pred CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcc------eEEEEeCC-eEEEEee Q lcl|NC_010147. 156 DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG------TAILAKKG-AVKLILK 228 (274) Q Consensus 156 ~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~------~~~~~~~~-a~~~~~~ 228 (274) ......|+|||..+..|++... .........-+.+|..++++|+||+++++++.+ ..++.+.+ .+.++.+ T Consensus 261 ~~~~a~~v~n~~~~~~L~~lkd---~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~ 337 (395) T protein:vir:38 261 IESTSSFITNQSGYNILSKVKD---ADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDR 337 (395) T ss_pred hcCCCEEEEcHHHHHHHHHhhc---cCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEe Confidence 5567789999999999976321 111111112234566778999999999875422 23444434 4667778 Q ss_pred cCceeeeecchh----hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 229 RDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 229 ~~~~ve~~rd~~----~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .++.++..++.. +....++...||++++.+|+++++++++.++.+= T Consensus 338 ~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 387 (395) T protein:vir:38 338 QQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQA 387 (395) T ss_pred cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 888888877543 4567888889999999999999999998887777 No 129 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.91 E-value=3.6e-27 Score=165.56 Aligned_cols=270 Identities=12% Similarity=0.058 Sum_probs=208.4 Q ss_pred CCCcccee--------eeeech-HHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKT--------SNQIIP-EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~--------~~~~~P-ev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) |...++-. ++.-+. |+|+..|.+.+.+.++|.++..+ +++. .|++++||+.+.. .++.+..|+++.. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-rti~--~GkS~qf~~iG~~-~a~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QTVT--GTNTVSNKYLGET-ELQVLAPGQSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-eeec--ccceEEEEEEeee-EEeeeccccccCC Confidence 77765422 112222 78999999999999999988765 3433 5899999998654 6677889999999 Q ss_pred cccccceeEEEeee-ecceeeeeHHHHhhcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhhccc------c---------- Q lcl|NC_010147. 72 DILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAK------L---------- 133 (274) Q Consensus 72 ~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~~~~~~~~a~------~---------- 133 (274) +.+..++..++++. ++..+.|.|++..++..| +-.++.+++++++|+..|+.++.....+. . T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 99999999999987 477788999999999999 78899999999999999999886553211 0 Q ss_pred -----cc--cccccC----HHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 134 -----TV--NADITK----LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 134 -----~~--~~~~~~----~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) +. ....++ ++.|.+|.+.|.+.++ +.|+++++|++|..|+++..+-.......+.+...+|.++.++ T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 00 001123 3556788899998875 6799999999999999875432222222334557899999999 Q ss_pred cceEEEcCCCCcc---------------------------eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 201 GAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~v~~~---------------------------~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) |++|++|+++|.+ .+++|++.|++.+.-.++..+.+|+..++.+.|...+.|| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G 316 (402) T protein:vir:97 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) T ss_pred ceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhC Confidence 9999999999831 1368899999998888999999999999999999999999 Q ss_pred EEEEcCccEEEEEe-----cCCCCCC Q lcl|NC_010147. 254 AYLYDESKAVKITK-----GSGSLEM 274 (274) Q Consensus 254 ~~~~~~~~~v~~~~-----~~a~~~~ 274 (274) ....+|+.+.+++. ++=.+++ T Consensus 317 ~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 317 AIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred CcccCccceEEEEEecccccccCCcc Confidence 99999999988842 2223344 No 130 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.91 E-value=5.4e-27 Score=164.61 Aligned_cols=257 Identities=12% Similarity=0.081 Sum_probs=194.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.+++.|.+.+.+.+.+++++.+-. .++ .++|.... .+++.|++||+..+.++++++++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 333333334568999999999999999988888876532 122 45777653 35788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------cccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA----------KLTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a----------~~~~~~~~~~~d~i~~A~ 149 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++..+..++....+. .....+...++|.|+++. T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:94 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 9999999999999999999999999999999999999998777776544331 112223445699999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|..+......|+||+..+..|++.. .. +++.+..|.-.+++|.||++++.+++ ++|+.-...+.... T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~-----~~---~~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:94 272 ADLHEDYRDNATIYMRYADYVKIISVL-----SN---GTTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHH-----hc---CCCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 988877767778999998887665421 11 11223445667899999999998764 33332222222334 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.....+++..++...++...||++++++|+++++++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:94 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 556677888888899999999999999999999999998888888 No 131 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.91 E-value=5.4e-27 Score=164.61 Aligned_cols=257 Identities=12% Similarity=0.081 Sum_probs=194.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.+++.|.+.+.+.+.+++++.+-. .++ .++|.... .+++.|++||+..+.++++++++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 333333334568999999999999999988888876532 122 45777653 35788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------cccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA----------KLTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a----------~~~~~~~~~~~d~i~~A~ 149 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++..+..++....+. .....+...++|.|+++. T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:96 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 9999999999999999999999999999999999999998777776544331 112223445699999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|..+......|+||+..+..|++.. .. +++.+..|.-.+++|.||++++.+++ ++|+.-...+.... T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~-----~~---~~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:96 272 ADLHEDYRDNATIYMRYADYVKIISVL-----SN---GTTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHH-----hc---CCCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 988877767778999998887665421 11 11223445667899999999998764 33332222222334 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.....+++..++...++...||++++++|+++++++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:96 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 556677888888899999999999999999999999998888888 No 132 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.91 E-value=5.4e-27 Score=164.61 Aligned_cols=257 Identities=12% Similarity=0.081 Sum_probs=194.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.+++.|.+.+.+.+.+++++.+-. .++ .++|.... .+++.|++||+..+.++++++++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 333333334568999999999999999988888876532 122 45777653 35788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------cccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA----------KLTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a----------~~~~~~~~~~~d~i~~A~ 149 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++..+..++....+. .....+...++|.|+++. T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:26 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 9999999999999999999999999999999999999998777776544331 112223445699999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|..+......|+||+..+..|++.. .. +++.+..|.-.+++|.||++++.+++ ++|+.-...+.... T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~-----~~---~~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:26 272 ADLHEDYRDNATIYMRYADYVKIISVL-----SN---GTTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHH-----hc---CCCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 988877767778999998887665421 11 11223445667899999999998764 33332222222334 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.....+++..++...++...||++++++|+++++++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:26 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 556677888888899999999999999999999999998888888 No 133 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.91 E-value=4.9e-26 Score=159.35 Aligned_cols=261 Identities=10% Similarity=0.062 Sum_probs=200.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCc--cccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGD--AQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~--~~~~~eg~~i~~~~~t~~~ 78 (274) .+..++..+..++|+.+...|.+.+.+.+.+.+++.... ..+.++++|.+..... ..|++||..++.++++++. T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~ 189 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIP----VNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQP 189 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeee----ccCCceEEEEeecCCccceeeccccccccccccceeE Confidence 343444445668999999999999999999988876532 2344578887765433 4568999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+.+|++...++..++.+.+.+++++.+++.+|..+++...+.... +...+++.|+++...+..+... T Consensus 190 i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~--~~~~~~d~i~~~~~~l~~~~~~ 267 (421) T protein:vir:13 190 MAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLAE--ETINDYAGLVKTINSLVPNARK 267 (421) T ss_pred EEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcccc--ccccchHHHHHHHHHhhhhhcC Confidence 99999999999999999999999999999999999999999999999877664432 2345799999999999888888 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccc---eeccccceeccceEEEcCCCCcce-----EEEEeCC-eEEEEeec Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDI---IVKGAFGEALGAIIVRTNKLEAGT-----AILAKKG-AVKLILKR 229 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~---~~~g~ig~~~G~~Vv~s~~v~~~~-----~~~~~~~-a~~~~~~~ 229 (274) ...|+|||..+..|++... + .|..+ ...|..++++|+||++++++|.+. .++.+.. ++.++.+. T Consensus 268 ~a~~v~n~~~~~~l~~lkd------~-~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 340 (421) T protein:vir:13 268 RAIIVTNSDGRAYLDGLMD------K-QGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRK 340 (421) T ss_pred CCEEEEcHHHHHHHHHhhc------C-CCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEec Confidence 8899999999999875321 1 11211 234556789999999999998543 3444545 36677788 Q ss_pred Cceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +++++..++..+ +...++...||++++.+|++++.+.....+.-. T Consensus 341 ~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v 387 (421) T protein:vir:13 341 QYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIV 387 (421) T ss_pred ceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccceee Confidence 999999888765 445788889999999999987665544322222 No 134 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.91 E-value=1.3e-26 Score=162.46 Aligned_cols=257 Identities=12% Similarity=0.092 Sum_probs=193.0 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.+...|.+.+.+.+.+.+++.+-.. ++ .++|... ..+++.|++||+..+.++++++++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~----~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI----KG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 191 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeec----CC--ceEEEEeecCCccccccCccccccccccccee Confidence 4333333445689999999999999999888888765321 22 4577654 345788999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------ccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------LTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------~~~~~~~~~~d~i~~A~ 149 (274) .+.+++++..+++|++...++..|+.+.+.+++++.++++.+..++....+.. ....+....+|.|+++. T Consensus 192 ~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~ 271 (387) T protein:vir:93 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999987777775443321 11223345689999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|..+......|+||+..+..+++.. .. +++.+..|.-.+++|+||++++.++. .++.+.+.+ +.... T Consensus 272 ~~l~~~~~~~a~~~mn~~t~~~~~~~~-----~d---~~~~~~~~~~~~llG~PV~~~~~~~~--~~~GDf~~~-~~~~~ 340 (387) T protein:vir:93 272 ADLHEDYRDNATIYMRYADYVKIISVL-----SN---GTTNFFDTPAEKVFGKPVVFTDAAVK--PIVGDFNYF-GINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHH-----hc---CCCcccccCCccccccceEEecCCCc--eeeeehhhh-heehh Confidence 999887777778999998876654321 11 11122345556899999999998764 233333332 33344 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +..++.+++..++...++.+.||++++++|++++.++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~ 385 (387) T protein:vir:93 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSL 385 (387) T ss_pred hheeeecccccCCceeEEEEeeeCceeechhheEEEEeecCCCCC Confidence 566777788888888999999999999999999999988888777 No 135 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.91 E-value=9.4e-26 Score=157.80 Aligned_cols=263 Identities=12% Similarity=-0.026 Sum_probs=195.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) ++...+.-+..++|+.+...|.+.+.+.+.+.+++.+.+. .+....||.....+.+.|+.|+++++. .+++++.+ T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~----~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNT----TATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeec----CCceeEEEEEcCCcceeeeccccccCccccccceee Confidence 5555555667799999999999999999999888765432 345688999888888999999999875 68999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------ccccccccCHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------------LTVNADITKLN 143 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------------~~~~~~~~~~d 143 (274) .+.+++++..+.+|++...++..|+.+.+.+++++.+++++|+.++..-.+.. ....+...+++ T Consensus 160 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~ 239 (390) T protein:vir:40 160 QTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDL 239 (390) T ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccchh Confidence 99999999999999999999999999999999999999999999986432110 01112233444 Q ss_pred HHHHHHHHHhh----c---CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEE Q lcl|NC_010147. 144 GLQSAIDKFND----E---DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAI 216 (274) Q Consensus 144 ~i~~A~~~l~~----~---~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~ 216 (274) ...++...+.. . .....+|+|||..+..+++.. ......+ |.. +.. ....|+||++|++||+++.+ T Consensus 240 ~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~--~~~~d~~-G~~-v~~---~~~~g~pvv~~~~~p~~~i~ 312 (390) T protein:vir:40 240 TPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAA--TSYMTPQ-GVW-VTG---ILPVPLEIVQSVAVPVGKAV 312 (390) T ss_pred hHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHH--hhccCCC-Ccc-ccc---cCCCceeEEEcCCCCCCcEE Confidence 44444433322 1 234567999998764433211 1112211 222 111 22479999999999999988 Q ss_pred EEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCC--C Q lcl|NC_010147. 217 LAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLE--M 274 (274) Q Consensus 217 ~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~--~ 274 (274) +.+.+.+.++.+.++.++.+++..+ +.+.+++..|+++++.+|+++++++.+++... + T Consensus 313 ~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~ 374 (390) T protein:vir:40 313 AGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAI 374 (390) T ss_pred EEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCC Confidence 8888787777888888888876644 67889999999999999999999998777543 3 No 136 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.91 E-value=2.1e-25 Score=155.90 Aligned_cols=266 Identities=11% Similarity=0.026 Sum_probs=194.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc-cccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~-~~t~~~~ 79 (274) |...++..+..++|+.+...|.+.+.+.+++.+++.+... .+ +..+..+|+....+.+.|++||++++.. .++++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCccceeecccccccccccccceeE Confidence 5544455556689999999999999999999888765322 11 1224567776666788899999999865 6899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHH-HHHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~-~~l~~~~~~ 158 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..+++...+... ...++++.++++. ..+...... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cCccCHHHHHHHHHHhhhhhhcc Confidence 999999999999999999999899999999999999999999999877655433 3457899999987 466666667 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE-cCC-CC------cce-EEEEe-CC-eEEEEe Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR-TNK-LE------AGT-AILAK-KG-AVKLIL 227 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~-s~~-v~------~~~-~~~~~-~~-a~~~~~ 227 (274) +..|+|||..+..|++.. + ......-..-+..|..++++|.|+++ +++ .+ .++ .++++ .+ .+.++. T Consensus 261 ~a~~vm~~~~~~~L~~lk--d-~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLK--D-KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhh--c-cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 788999999999997631 1 01111111112345667899987655 333 22 122 24444 33 466677 Q ss_pred ecCceeeeecch--h--hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 228 KRDFFLEVARDA--S--TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 228 ~~~~~ve~~rd~--~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.++.++.++.. . ++...++...|+++++.+|+++++++++.+.+-- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 788888877643 2 3556788889999999999999999975544444 No 137 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.91 E-value=2.1e-25 Score=155.90 Aligned_cols=266 Identities=11% Similarity=0.026 Sum_probs=194.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc-cccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~-~~t~~~~ 79 (274) |...++..+..++|+.+...|.+.+.+.+++.+++.+... .+ +..+..+|+....+.+.|++||++++.. .++++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCccceeecccccccccccccceeE Confidence 5544455556689999999999999999999888765322 11 1224567776666788899999999865 6899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHH-HHHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~-~~l~~~~~~ 158 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..+++...+... ...++++.++++. ..+...... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cCccCHHHHHHHHHHhhhhhhcc Confidence 999999999999999999999899999999999999999999999877655433 3457899999987 466666667 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE-cCC-CC------cce-EEEEe-CC-eEEEEe Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR-TNK-LE------AGT-AILAK-KG-AVKLIL 227 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~-s~~-v~------~~~-~~~~~-~~-a~~~~~ 227 (274) +..|+|||..+..|++.. + ......-..-+..|..++++|.|+++ +++ .+ .++ .++++ .+ .+.++. T Consensus 261 ~a~~vm~~~~~~~L~~lk--d-~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLK--D-KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhh--c-cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 788999999999997631 1 01111111112345667899987655 333 22 122 24444 33 466677 Q ss_pred ecCceeeeecch--h--hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 228 KRDFFLEVARDA--S--TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 228 ~~~~~ve~~rd~--~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.++.++.++.. . ++...++...|+++++.+|+++++++++.+.+-- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 788888877643 2 3556788889999999999999999975544444 No 138 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.91 E-value=2.1e-25 Score=155.90 Aligned_cols=266 Identities=11% Similarity=0.026 Sum_probs=194.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc-cccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~-~~t~~~~ 79 (274) |...++..+..++|+.+...|.+.+.+.+++.+++.+... .+ +..+..+|+....+.+.|++||++++.. .++++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCccceeecccccccccccccceeE Confidence 5544455556689999999999999999999888765322 11 1224567776666788899999999865 6899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHH-HHHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~-~~l~~~~~~ 158 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..+++...+... ...++++.++++. ..+...... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cCccCHHHHHHHHHHhhhhhhcc Confidence 999999999999999999999899999999999999999999999877655433 3457899999987 466666667 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE-cCC-CC------cce-EEEEe-CC-eEEEEe Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR-TNK-LE------AGT-AILAK-KG-AVKLIL 227 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~-s~~-v~------~~~-~~~~~-~~-a~~~~~ 227 (274) +..|+|||..+..|++.. + ......-..-+..|..++++|.|+++ +++ .+ .++ .++++ .+ .+.++. T Consensus 261 ~a~~vm~~~~~~~L~~lk--d-~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLK--D-KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhh--c-cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 788999999999997631 1 01111111112345667899987655 333 22 122 24444 33 466677 Q ss_pred ecCceeeeecch--h--hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 228 KRDFFLEVARDA--S--TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 228 ~~~~~ve~~rd~--~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.++.++.++.. . ++...++...|+++++.+|+++++++++.+.+-- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 788888877643 2 3556788889999999999999999975544444 No 139 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.91 E-value=2.1e-25 Score=155.90 Aligned_cols=266 Identities=11% Similarity=0.026 Sum_probs=194.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcc-cccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~-~~t~~~~ 79 (274) |...++..+..++|+.+...|.+.+.+.+++.+++.+... .+ +..+..+|+....+.+.|++||++++.. .++++++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCccceeecccccccccccccceeE Confidence 5544455556689999999999999999999888765322 11 1224567776666788899999999865 6899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHH-HHHhhcCCC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~-~~l~~~~~~ 158 (274) ++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..+++...+... ...++++.++++. ..+...... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cCccCHHHHHHHHHHhhhhhhcc Confidence 999999999999999999999899999999999999999999999877655433 3457899999987 466666667 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE-cCC-CC------cce-EEEEe-CC-eEEEEe Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR-TNK-LE------AGT-AILAK-KG-AVKLIL 227 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~-s~~-v~------~~~-~~~~~-~~-a~~~~~ 227 (274) +..|+|||..+..|++.. + ......-..-+..|..++++|.|+++ +++ .+ .++ .++++ .+ .+.++. T Consensus 261 ~a~~vm~~~~~~~L~~lk--d-~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLK--D-KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhh--c-cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 788999999999997631 1 01111111112345667899987655 333 22 122 24444 33 466677 Q ss_pred ecCceeeeecch--h--hcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 228 KRDFFLEVARDA--S--TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 228 ~~~~~ve~~rd~--~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.++.++.++.. . ++...++...|+++++.+|+++++++++.+.+-- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 788888877643 2 3556788889999999999999999975544444 No 140 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.91 E-value=1.5e-26 Score=162.13 Aligned_cols=257 Identities=12% Similarity=0.081 Sum_probs=193.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |...++.-+..++|+.++..|.+.+.+.+.+.+++.+-. .++ .++|.+.. .+++.|++||+..+.++++++++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 332223333468999999999999999999988876532 122 45777653 35688999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------cccccccccCHHHHHHHH Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA----------KLTVNADITKLNGLQSAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a----------~~~~~~~~~~~d~i~~A~ 149 (274) ++.+++++..+++|++...++..|+.+.+.+++++.+++..+..++....+. .....+....+|+|+++. T Consensus 207 ~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~ 286 (402) T protein:vir:93 207 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 286 (402) T ss_pred eecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 9999999999999999999999999999999999999998777666544321 112223445689999999 Q ss_pred HHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeec Q lcl|NC_010147. 150 DKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~ 229 (274) ..|..+......|+||+..+..|++.. .. +++.+..|.-.+++|.||++++.+++ ++|+.-...+.... T Consensus 287 ~~l~~~y~~na~~imn~~t~~~~~~~~-----~d---~~~~~~~~~~~~llG~PV~~t~~~~~---i~~GDf~~~~~~~~ 355 (402) T protein:vir:93 287 ADLHEDYRDNATIYMRYADYVKIISVL-----SN---GTTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 355 (402) T ss_pred hccChhhhcCCEEEEechHHHHHHHHH-----hc---CCCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 988877767778999998877665421 11 11223345567899999999998764 33432222233334 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +..+..+|++..+...++...|+++++++|++++.++.++++.+. T Consensus 356 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~ 400 (402) T protein:vir:93 356 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 400 (402) T ss_pred hhhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecCCCCC Confidence 455677888888899999999999999999999999998888888 No 141 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.89 E-value=1.2e-24 Score=151.66 Aligned_cols=265 Identities=13% Similarity=0.048 Sum_probs=189.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) +...++.-...++|+.+...+.+ +.+...+.+++.+-.. .....++|.+.. .+.+.|+.|++.++. ++++++. T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESV----TTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeEee----ccCceeeEEeecccccccccccccccccccccccee Confidence 33333444446899999887765 4455556666544221 233567887753 356789999999984 6789999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHH-HHhhcCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~-~l~~~~~ 157 (274) +++.+++++..+.+|++...++..|+.+.+.+.+++.+++.+|..+++...++.... +...+++++.++.. .+..+.. T Consensus 231 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~-~~~~~~~~~~~~~~~~l~~~~~ 309 (437) T protein:vir:10 231 ILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKT-TSTYLLGDLKKVLNVTLKPQDS 309 (437) T ss_pred eeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc-ccccchhhHHHHHHhhhhhhhh Confidence 999999999999999999999999999999999999999999999999876654433 34556788888765 4555555 Q ss_pred CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCC--Ccc---e--EEEEeCC-eEEEEeec Q lcl|NC_010147. 158 EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAG---T--AILAKKG-AVKLILKR 229 (274) Q Consensus 158 ~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v--~~~---~--~~~~~~~-a~~~~~~~ 229 (274) .+..|+|||..+..|++... ..+......-+..|..++++|.||++++++ |.+ . .++.+.+ ++.++.+. T Consensus 310 ~~~~~~~~~~~~~~l~~lkd---~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~ 386 (437) T protein:vir:10 310 AAASIVMSQSAYNLFDMATD---AMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLT 386 (437) T ss_pred cCCEEEEcHHHHHHHHHhhc---cCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeee Confidence 66789999999998876421 011111111133566678999999998764 432 2 3333444 56677788 Q ss_pred CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 230 DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 230 ~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.++...+...+.+.+++..||++++++|+++++|+....+.-- T Consensus 387 ~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 387 EITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred ceEEEEecccccccceeeEEEEEccEEecccceEEEEeecccccc Confidence 888887766666778889999999999999999999854333222 No 142 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.89 E-value=1e-24 Score=152.12 Aligned_cols=265 Identities=9% Similarity=-0.008 Sum_probs=188.8 Q ss_pred CCCccce-eeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccc---cCCCcCCcccccc Q lcl|NC_010147. 1 MPQGITK-TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV---AEGEKIPTDILET 76 (274) Q Consensus 1 Ma~~~T~-~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~---~eg~~i~~~~~t~ 76 (274) ++..++. ..-.++|+.|+..|.+.+.+.+.+.+++.+... +..+++|.+...+.+.|. .+|+.++.+++++ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~-----~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f 215 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKT-----KENIKYPVLVKKAEAQGHKNERTNNEMPETDIEF 215 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceecc-----CCceEEEEEecCCcccceecccccccccccccce Confidence 3433322 233579999999999999999999998866332 224789988766666664 5577889999999 Q ss_pred ceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----------ccccccccCHHHH Q lcl|NC_010147. 77 KKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------LTVNADITKLNGL 145 (274) Q Consensus 77 ~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-----------~~~~~~~~~~d~i 145 (274) +++++.+++++..+.+|++...++..|+.+.+.+++++.+++++|+.++..-.+.. ....+...++|.| T Consensus 216 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l 295 (434) T protein:vir:62 216 DEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDAL 295 (434) T ss_pred eeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999996543211 1112344579999 Q ss_pred HHHHHHHhhcCCCceEEEEcHHHHHHHHhhcc--ccccccccccccceeccccceeccceEEEcCCCCcce-----EEEE Q lcl|NC_010147. 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT-----AILA 218 (274) Q Consensus 146 ~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~--~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~-----~~~~ 218 (274) +++...+..+...+..|+|||..+..|++... -.++-.. ..-...|...+++|+||++++.+|.+. .++| T Consensus 296 ~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~---~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~ 372 (434) T protein:vir:62 296 VKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRP---FNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYF 372 (434) T ss_pred HHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeecc---CCCccCCCCceecceeeEEecCccCccCCCceEEEE Confidence 99999998777777889999999999875321 1111000 001224556689999999999998543 1333 Q ss_pred -eCCeEEEEeec-Cceeeeecchhh--cceEEEEEEEEEEEEEc-CccEEEEEe--cCCCCC Q lcl|NC_010147. 219 -KKGAVKLILKR-DFFLEVARDAST--KTTALYSDKHYVAYLYD-ESKAVKITK--GSGSLE 273 (274) Q Consensus 219 -~~~a~~~~~~~-~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~-~~~~v~~~~--~~a~~~ 273 (274) +.+.+.++.+. .+.++..++..+ +...+++..|++++++. |..+.+++. .+|+.- T Consensus 373 Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 373 GDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred eeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 55555555544 455666555544 44557888999999774 887766642 333333 No 143 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.89 E-value=2.4e-25 Score=155.51 Aligned_cols=265 Identities=14% Similarity=0.154 Sum_probs=184.4 Q ss_pred CCCccc----------eeeeee-chHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEE----EeeccCCccccccC Q lcl|NC_010147. 1 MPQGIT----------KTSNQI-IPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTF----PAFVYSGDAQVVAE 65 (274) Q Consensus 1 Ma~~~T----------~~~~~~-~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~i----p~~~~~~~~~~~~e 65 (274) |.+.+- ++++++ .|+++..++.+.+.+ ..+.....+... ...+..+.+ |.+. .++++++.| T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~-~~iad~lf~~~~--a~~~~~v~f~~~~p~~~-~~d~e~VaE 76 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVN-QFISESLFRNGG--ANPNGVVAYNEGNPSFL-EDDVADVAE 76 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhc-cchhhhhhhccc--ccccceeEEEecccccc-cCcHhhccC Confidence 766531 223333 377777777665533 333332222111 112335555 3333 368999999 Q ss_pred CCcCCccccccceeEE-EeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---ccccccccC Q lcl|NC_010147. 66 GEKIPTDILETKKREA-KIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---LTVNADITK 141 (274) Q Consensus 66 g~~i~~~~~t~~~~~~-~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---~~~~~~~~~ 141 (274) |+++|....+++...+ +.+|++..+++|||+...+..++++...+++++.++|++|+.++..+..+. ..+.+.+.. T Consensus 77 ggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~ 156 (318) T protein:vir:10 77 FGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDN 156 (318) T ss_pred cccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCC Confidence 9999999999977665 568999999999999999999999999999999999999999999886532 222222221 Q ss_pred ----HHHHHHHHHHH------------h----hcCCCceEEEEcHHHHHHHHhhcccccccccc-ccccc----eecccc Q lcl|NC_010147. 142 ----LNGLQSAIDKF------------N----DEDLEPMVLFINPLDAGKLRGDASTNFTRATE-LGDDI----IVKGAF 196 (274) Q Consensus 142 ----~d~i~~A~~~l------------~----~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~-~~~~~----~~~g~i 196 (274) -.++++|.... + ..++.+..+||||..++.|+++.. +..... .++.. -..|.+ T Consensus 157 ~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~--~~~~y~~~a~~~~~~~~~tg~~ 234 (318) T protein:vir:10 157 GGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNEN--FMKVYERNANYVSTAPDWTGNF 234 (318) T ss_pred cccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchh--hhhhhhccchhhhhcccccccc Confidence 11333332211 1 124577899999999999998753 222211 11111 123444 Q ss_pred -ceeccceEEEcCCCCcceEEEEeCCeEEEEe-ecCceeeeecch-------hhcceEEEEEEEEEEEEEcCccEEEEEe Q lcl|NC_010147. 197 -GEALGAIIVRTNKLEAGTAILAKKGAVKLIL-KRDFFLEVARDA-------STKTTALYSDKHYVAYLYDESKAVKITK 267 (274) Q Consensus 197 -g~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~-~~~~~ve~~rd~-------~~~~~~v~~~~~yg~~~~~~~~~v~~~~ 267 (274) |+++|++|+.|+++|.+++|++.++++|+.. ..|+.++..|++ ...+..++.++.....|.+|.++++||. T Consensus 235 ~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itg 314 (318) T protein:vir:10 235 PGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTG 314 (318) T ss_pred cceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEee Confidence 6789999999999999999999999999763 567788888876 4566788888999999999999999996 Q ss_pred cCCC Q lcl|NC_010147. 268 GSGS 271 (274) Q Consensus 268 ~~a~ 271 (274) =..+ T Consensus 315 i~~~ 318 (318) T protein:vir:10 315 IVTP 318 (318) T ss_pred ccCC Confidence 5555 No 144 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=99.88 E-value=4e-24 Score=148.83 Aligned_cols=263 Identities=12% Similarity=0.019 Sum_probs=172.8 Q ss_pred Cccceeeee--echHHHHHHHHHHHHHHhhhhcccccc-cccccC--CCceEEEEeeccCCc--cccccCCCcCCccccc Q lcl|NC_010147. 3 QGITKTSNQ--IIPEVLAPMMQAQLEKKLRFASFAEVD-STLQGQ--PGDTLTFPAFVYSGD--AQVVAEGEKIPTDILE 75 (274) Q Consensus 3 ~~~T~~~~~--~~Pev~~~~v~~~~~~~~v~~~~~~~~-~~~~~~--~g~tv~ip~~~~~~~--~~~~~eg~~i~~~~~t 75 (274) |.+|..+|+ |.|.+...++ |++.+++..++.+..- -.+... .|+....|+|+..+. ..++...+++++.+++ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~-e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYL-ERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred CceeeecceeeehhhhhhhHH-hhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 558999995 6676666654 4455444443322111 112222 388999999972222 2355556779999999 Q ss_pred cceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHH---HHhhcc----cc---cccccccCHHH Q lcl|NC_010147. 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVL---EALMGA----KL---TVNADITKLNG 144 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~---~~~~~a----~~---~~~~~~~~~d~ 144 (274) ..+.......+ ..-+..+..+....+.||+..+..-....|.+.++..+. +.+.++ +. ....+..+... T Consensus 80 ~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~~~ 159 (315) T protein:vir:96 80 ADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEGKKV 159 (315) T ss_pred cccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccCHHH Confidence 98876654433 344666666666678889877665555555555554433 333221 11 11234468899 Q ss_pred HHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEE Q lcl|NC_010147. 145 LQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVK 224 (274) Q Consensus 145 i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~ 224 (274) |++|.++|||+......++|||.+|..|+++++++++.... +.+++.+..+ ++|.+|++||.||.+++|.+++||++ T Consensus 160 l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~--~~~~~~~~~~-~lGkrViVdD~~P~~~~~gl~~GAi~ 236 (315) T protein:vir:96 160 LTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEA--GVVVYGGTPG-TLGKPVLVTDQCPATKIFGLVAGAVM 236 (315) T ss_pred HHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhccccc--ceeEecCcCc-ccccEEEEECCCCcceeeeeecceee Confidence 99999999999999999999999999999988877665433 3344444444 55999999999999999999999999 Q ss_pred EEeecCc---eeeeecchhhcceEEEEE-EEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 225 LILKRDF---FLEVARDASTKTTALYSD-KHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 225 ~~~~~~~---~ve~~rd~~~~~~~v~~~-~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +....++ ..|.. +.+.+..+ ++.....++|.++-.-+.+..||-. T Consensus 237 ~~~~~~~~~~~~~~~-----g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~sPt~ 285 (315) T protein:vir:96 237 ITESQAPGMRSYQID-----DQENLAIGFRAEGTANVEVLGYKWKTKTNVNPAS 285 (315) T ss_pred ecCCCccccccccCC-----CcceeEEEEeeeeEeeeeeeeEEeecCCCcCCCh Confidence 9876662 23333 33444444 4445566777777665545555555 No 145 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.88 E-value=4.9e-24 Score=148.38 Aligned_cols=257 Identities=14% Similarity=0.091 Sum_probs=185.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCCCcCCc-cccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIPT-DILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg~~i~~-~~~t~~~ 78 (274) |....+.-....+|+.+...+.+.. +...+.+.+.... ..+...++|... ..+.+.|+.|++..+. ++++++. T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~~~-~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 206 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLEPK-DIVDLSKYVRSVP----VNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVE 206 (397) T ss_pred hhcccccccccchhHHHHHHHHHhh-hhhhHHHhhhhcc----ccccceeEEEEeccCCccccccccccccccccccccc Confidence 4433445555689999998888743 3334444443321 123345666654 3356778999998875 6899999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHHHHHHHHhhcCCC Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~~A~~~l~~~~~~ 158 (274) +++.+++++..+.+|++...++..|+.+.+.+++++.+++.+|..+++...... ....+++|+|.++....-+.. . T Consensus 207 i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~---~~~~~~~d~~~~~~~~~~~~~-~ 282 (397) T protein:vir:96 207 IDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT---AKSVVGVDGLKDLINKEIKKV-Y 282 (397) T ss_pred eeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccccchHHHHHHHHHhhhhh-c Confidence 999999999999999999999989999999999999999999999987765443 344578999999887654433 3 Q ss_pred ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcce-----EEEE-eCC-eEEEEeecCc Q lcl|NC_010147. 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT-----AILA-KKG-AVKLILKRDF 231 (274) Q Consensus 159 ~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~-----~~~~-~~~-a~~~~~~~~~ 231 (274) ...|+|||..+..|++.. + ..+...-.+-+..|..++++|.||+++++.+.+. .++| +.+ .+.++.+.++ T Consensus 283 ~a~~v~n~~~~~~l~~lk--d-~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 359 (397) T protein:vir:96 283 DVKLFISASMYSELDKLK--D-KNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQV 359 (397) T ss_pred CcEEEEcHHHHHHHHHhh--c-cCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecce Confidence 568999999999987632 1 1111111112345666799999999877643221 2444 444 3567778888 Q ss_pred eeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 232 FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 232 ~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++..+.. .+.+.+++..|+++++.+|+++++++++.| T Consensus 360 ~~~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 360 SVSWVDNN-IYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEeccc-ccceeEEEEEEEccEEecccceEEEEeecC Confidence 88776654 446778999999999999999999999999 No 146 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=99.88 E-value=3.3e-24 Score=149.28 Aligned_cols=265 Identities=12% Similarity=0.022 Sum_probs=189.9 Q ss_pred CCCcccee-eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQGITKT-SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~-~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) +....+.. ...++|+.+...|.+.+.+.+.+.+.+.+... .| ..++|.....+.+.|++||++++..+++++.+ T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~----~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i 222 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPL----KG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQI 222 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeec----Cc-eeEeeeecCCcceeecccccccccccccccce Confidence 22222221 23689999999999999988888887755332 22 46788877777889999999999999999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----------ccc--------ccc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-----------VNA--------DIT 140 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~-----------~~~--------~~~ 140 (274) .+.+++++..+.+|++...++..++.+.+.+.+++.+++.+|..++..-.+..+. ... ..+ T Consensus 223 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 302 (466) T protein:vir:80 223 EVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNL 302 (466) T ss_pred eecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeeccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999988642211000 000 001 Q ss_pred CHHH--------------HHHHHHHH---hhcCCC-ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccc Q lcl|NC_010147. 141 KLNG--------------LQSAIDKF---NDEDLE-PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) Q Consensus 141 ~~d~--------------i~~A~~~l---~~~~~~-~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~ 202 (274) +... +.++...+ -..... ..+|++|+..+..|++.... ..+. |......+.-..++|. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~---~~~~-g~~~~~~~~~~~i~G~ 378 (466) T protein:vir:80 303 STTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAIT---FNSA-GALVASLNNTMPIVGG 378 (466) T ss_pred chhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhccccc---ccCC-ccccccCCCccccccc Confidence 1111 22222111 222233 34689999999888754311 1111 1111111122358999 Q ss_pred eEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 203 IIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 203 ~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ||+++++||++..++.....+.++.+.++.++.+++..+ +.+.+++..|+++++++|+++++++++..++=- T Consensus 379 pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~ 452 (466) T protein:vir:80 379 DIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTT 452 (466) T ss_pred ceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCccc Confidence 999999999999888888888888889999988877764 667899999999999999999999987665433 No 147 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.87 E-value=2.8e-24 Score=149.68 Aligned_cols=269 Identities=18% Similarity=0.194 Sum_probs=202.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccc---cc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILE---TK 77 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t---~~ 77 (274) -+| +|-.+++++|.+++.-+.|..+.......+...- .| .-|....+|.+..+ .+.+++||++++...++ ++ T Consensus 72 e~m-tt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~-~L--~~Grsm~F~~~g~~-Ra~~IgEGgE~~~~sld~~T~d 146 (393) T protein:vir:79 72 EFM-ATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKI-RL--KSGQSMIFPSIGIM-RAYDVAEGQEIPEDSIDWQTHE 146 (393) T ss_pred hhh-cCCCcceechhhhhhhhhhcccchhHHHHHHHHH-hh--hcCcceeccchhee-eeccccccccccccchhhhcCC Confidence 222 3667889999999999988544432222221110 11 12566788887654 77889999999887665 66 Q ss_pred eeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------------cc Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV--------------------NA 137 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~--------------------~~ 137 (274) .++++.+|.+-.+.+|||...+++.|++....+++++.|+|+.|..++..+++...++ -. T Consensus 147 sv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qN 226 (393) T protein:vir:79 147 SPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQN 226 (393) T ss_pred ceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCcccccc Confidence 7788888889999999999999999999999999999999999999999987644321 23 Q ss_pred cccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceecc-----------ceEEE Q lcl|NC_010147. 138 DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-----------AIIVR 206 (274) Q Consensus 138 ~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G-----------~~Vv~ 206 (274) +.+++++|+|.....-.+...+.+++|||-.|..+.|++.........+|+-.-+.-.-.+.+| +.|++ T Consensus 227 GTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~ 306 (393) T protein:vir:79 227 DTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNL 306 (393) T ss_pred ccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeEEE Confidence 5578999999999988889999999999999999999877666655555532112112223344 89999 Q ss_pred cCCCCc------ceEEEEeCCeEEEEe-ecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCC------CCC Q lcl|NC_010147. 207 TNKLEA------GTAILAKKGAVKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG------SLE 273 (274) Q Consensus 207 s~~v~~------~~~~~~~~~a~~~~~-~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a------~~~ 273 (274) |+.+|- ...|.++++..++.+ +.++.++...|+-++-+.|+-+.|||+.++|..+.+...+.+. .+- T Consensus 307 sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~y~~P~ 386 (393) T protein:vir:79 307 SPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKSYAEPM 386 (393) T ss_pred ecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecceeecccccch Confidence 999983 345788998888764 6778889999999999999999999999999988776555443 222 Q ss_pred C Q lcl|NC_010147. 274 M 274 (274) Q Consensus 274 ~ 274 (274) + T Consensus 387 ~ 387 (393) T protein:vir:79 387 L 387 (393) T ss_pred h Confidence 2 No 148 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.87 E-value=2.9e-24 Score=149.63 Aligned_cols=265 Identities=18% Similarity=0.169 Sum_probs=174.1 Q ss_pred CCCcc--ceeeee--echHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccc Q lcl|NC_010147. 1 MPQGI--TKTSNQ--IIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILET 76 (274) Q Consensus 1 Ma~~~--T~~~~~--~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~ 76 (274) |+-+. |...++ .+.-.|...+...+.+-+-.-+.....+--+|....+.++|.|..++++++++||+.||.++++. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 66432 222222 11112344433333332222111111111123333344566667889999999999999999996 Q ss_pred c---eeEEEeeeecceeeeeHHHH-hhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---ccCHHHHHHHH Q lcl|NC_010147. 77 K---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNAD---ITKLNGLQSAI 149 (274) Q Consensus 77 ~---~~~~~~~~~~~~~~vtd~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~---~~~~d~i~~A~ 149 (274) . ..+++++|+.+.+ |||+. ..+..+++.+..+|+.+.+++++|+++++.+++++.+.... ..+++.|..|+ T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq~Al 158 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQGAL 158 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeecHHHHHHHH Confidence 4 5788899988865 99997 67789999999999999999999999999999998765443 35689999988 Q ss_pred HHHh------hcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeE Q lcl|NC_010147. 150 DKFN------DEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAV 223 (274) Q Consensus 150 ~~l~------~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~ 223 (274) ..+. +.+....+++|||..++++|+++.+. ..++..|...+. .++|+.||+|+++|+|+.|.-...++ T Consensus 159 ~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~-~~~t~fG~n~L~-----nfLG~~II~S~kv~~G~~~~T~~~Ni 232 (303) T protein:vir:10 159 SKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFIN-STGAQFGVNLLT-----PYVGVKIVEFADVPQGEVWMTVAENL 232 (303) T ss_pred HhhhhhccccccccccEEEEEchHHHHHHhhcCCcc-hhhhhhhhhhhh-----hhhcceEEEeccCCCceEEEeeccce Confidence 7663 22334569999999999999987654 455667777665 49999999999999999999999988 Q ss_pred EEEeecCceeeeecc--hhhcceEEEEEEE-------------E-EEEEE--cCccEEEEEe-cCCCCCC Q lcl|NC_010147. 224 KLILKRDFFLEVARD--ASTKTTALYSDKH-------------Y-VAYLY--DESKAVKITK-GSGSLEM 274 (274) Q Consensus 224 ~~~~~~~~~ve~~rd--~~~~~~~v~~~~~-------------y-g~~~~--~~~~~v~~~~-~~a~~~~ 274 (274) .+++..+ .=+..+- -....+-+.+..| + |..+. .+++|++.+. +.=+.|+ T Consensus 233 ~~ay~~~-~g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~~ 301 (303) T protein:vir:10 233 NVAYANP-RGELSRAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGEL 301 (303) T ss_pred EEEEecC-chhhhhhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCCC Confidence 8876433 1111110 0112222333322 1 11111 5678888886 3344566 No 149 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.87 E-value=1.1e-23 Score=146.46 Aligned_cols=255 Identities=22% Similarity=0.255 Sum_probs=172.3 Q ss_pred CCC-------ccceeeeee--chHHHHHHHHHHHHHHhhhhcccccccccccCCCceE-EEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQ-------GITKTSNQI--IPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTL-TFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~-------~~T~~~~~~--~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv-~ip~~~~~~~~~~~~eg~~i~ 70 (274) |.. ..|++.|+- +.-.|...+...+.+-+-.-+.. +...-..|.++ ++|.|..++++++++||+.|| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~---r~~pla~GstIkt~k~~~y~gda~dVaEGe~Ip 77 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVT---RKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhc---ccccccCCCEEeeccceeeeeccccccCCcccc Confidence 432 123343431 11234444444444333222211 11122238899 556799999999999999999 Q ss_pred ccccccc---eeEEEeeeecceeeeeHHHH-hhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHH Q lcl|NC_010147. 71 TDILETK---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQ 146 (274) Q Consensus 71 ~~~~t~~---~~~~~~~~~~~~~~vtd~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~ 146 (274) .++++.. ..+++++|+.+.+ |||+. ..+.++|+.+..+|+.+.+++++|+++++.+++++.+..+ +.+.|. T Consensus 78 lskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~---t~~~lQ 152 (296) T protein:vir:98 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDA---LGAGLQ 152 (296) T ss_pred hhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeee---chhhHH Confidence 9999976 4888888988884 99996 6788999999999999999999999999999998876543 334444 Q ss_pred --------HHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccc-eeccceEEEcCCCCcceEEE Q lcl|NC_010147. 147 --------SAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRTNKLEAGTAIL 217 (274) Q Consensus 147 --------~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig-~~~G~~Vv~s~~v~~~~~~~ 217 (274) ++..+|++.+....+++|||...+.++++..+ ...+..| +... .++|..||.|+++|+|+.|. T Consensus 153 ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~i--t~qt~fG------~tyl~nfLG~~II~S~kV~~G~~~~ 224 (296) T protein:vir:98 153 GALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--TTQTAFG------LTYLVDFTGTVIISTNDVTKGEIWA 224 (296) T ss_pred HHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCcc--chhheec------hhhhhhccccEEEEcCcCCCceEEE Confidence 44467888777789999999999999988743 2222223 3333 39999999999999999999 Q ss_pred EeCCeEEEEeecC--ceeeeecchhhcceEEEEEEE-------------E-EEEEE--cCccEEEEEecCCC Q lcl|NC_010147. 218 AKKGAVKLILKRD--FFLEVARDASTKTTALYSDKH-------------Y-VAYLY--DESKAVKITKGSGS 271 (274) Q Consensus 218 ~~~~a~~~~~~~~--~~ve~~rd~~~~~~~v~~~~~-------------y-g~~~~--~~~~~v~~~~~~a~ 271 (274) ....++.+++-.+ -++-....-....+-+.+..| + |..+. .+++|++.+.++|. T Consensus 225 T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 225 TVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred eeecceEEEeecccccchhhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 9999988876432 112111111222232333333 1 11111 67899999998888 No 150 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=99.86 E-value=2.8e-23 Score=144.21 Aligned_cols=254 Identities=15% Similarity=0.037 Sum_probs=190.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) +....+.-...++|+.+.+.|.+.+.+.+.+++++++.+. +| ..+||.-...+.+.|..|+.+++. ++++++++ T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~----~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceeEeecccccccccCccceeE Confidence 3322233334689999999999999999999999876432 22 478998877788999999988864 58999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------c--cc---------- Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------L--TV---------- 135 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~------------~--~~---------- 135 (274) .+..++++....+|++...++..|+.+.+.+++++.+++.+|..++..-.... . .. T Consensus 154 ~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:96 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeec Confidence 99999999999999999999999999999999999999999999886431100 0 00 Q ss_pred -----cccccCHHHHHHHHHHHhhc----C-------CCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 136 -----NADITKLNGLQSAIDKFNDE----D-------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 136 -----~~~~~~~d~i~~A~~~l~~~----~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) .....+.+.+++....|-.. + ....+|+|||..+..++... .+... +|...++ T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~--~~~~~---------~G~~~~~ 302 (377) T protein:vir:96 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF--TSRNQ---------FGEYVTV 302 (377) T ss_pred cccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccc--cccCC---------CCCceec Confidence 00113455555554443221 1 12346999999987764321 22221 2334456 Q ss_pred ccc--eEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh--hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 200 LGA--IIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 200 ~G~--~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) +|+ +|+.|+.+|++++++++.+.+.++.+.+++++..++.. .+.+.+++..|+++++++|+++++++.+.. T Consensus 303 l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 655 57889999999999889899988899999998877665 477899999999999999999999999888 No 151 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.86 E-value=5.5e-24 Score=148.11 Aligned_cols=270 Identities=13% Similarity=0.085 Sum_probs=199.9 Q ss_pred CCCccceee---------eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTS---------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~---------~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) |.+..+-.- ..+-=|+|+..+..++.+++++.++..+- ++ .+|+++++|+.+.. .++.+..|+++.. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vR-ti--~~gkS~qf~~~G~s-~~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ-TV--TGTNTVSNKYLGET-ELQVLAPGQSPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceee-ee--cccceEEEEEeeee-EeeeecCCCCcCC Confidence 776543111 11233779999999999999999876543 33 35899999998654 6777899999999 Q ss_pred cccccceeEEEeee-ecceeeeeHHHHhhcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhhccc----------c------ Q lcl|NC_010147. 72 DILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAK----------L------ 133 (274) Q Consensus 72 ~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------~------ 133 (274) +.+..++..++|+. ++..+.|.|++..++..| +-.++.+++++++|+..|+.++..+..+. . T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 99999999999987 478899999999999999 78899999999999999998876663211 0 Q ss_pred ---cccc----cccC----HHHHHHHHHHHhhcCC-CceE-EEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 134 ---TVNA----DITK----LNGLQSAIDKFNDEDL-EPMV-LFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 134 ---~~~~----~~~~----~d~i~~A~~~l~~~~~-~~~~-~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) .+.. ..++ .+.|.+|...|.+.++ ..++ ++++|..|+.|++.+.+-.......+.+...+|.+.+++ T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~va 236 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSY 236 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEe Confidence 0000 0112 3457788888988876 3344 455677777777643211112112233557789999999 Q ss_pred cceEEEcCCCCcc---------------------------eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 201 GAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~v~~~---------------------------~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) |++|++|+++|.+ .+.+|++.|++.+.-.++..|.+|+...+.+.|...+.|| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g 316 (401) T protein:vir:70 237 NCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEG 316 (401) T ss_pred ceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhC Confidence 9999999999831 1268899999998888899999999999999999999999 Q ss_pred EEEEcCccEEEEE--ecC---C--CCCC Q lcl|NC_010147. 254 AYLYDESKAVKIT--KGS---G--SLEM 274 (274) Q Consensus 254 ~~~~~~~~~v~~~--~~~---a--~~~~ 274 (274) ....+|+.++.++ ++. + +..+ T Consensus 317 ~g~~RPeaa~vv~~k~~~~~~~~~~~~~ 344 (401) T protein:vir:70 317 AIPDRWEAVSVVTTKRNTTTGAVEGTDG 344 (401) T ss_pred CcccchhheEEEeecCcccccccccCCc Confidence 9999999998874 221 1 1221 No 152 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=99.86 E-value=4.5e-23 Score=143.12 Aligned_cols=258 Identities=12% Similarity=0.054 Sum_probs=188.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC-cccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~-~~~~t~~~~ 79 (274) |...++--.-.++|+.+.+.|.+.+.+.+.+.+++.+-.. +| ...||.....+.+.|..|+.+++ ..+++++++ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~----~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNA----GI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccCccccccceee Confidence 2222233334589999999999999999999999865432 23 46899988888888988877775 568999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccc---------------ccccccC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---KLT---------------VNADITK 141 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---~~~---------------~~~~~~~ 141 (274) .+..++++..+.+|++...++..|+.+.+.+.+++.+++++|+.++..-... +.. ..+..++ T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t 240 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLT 240 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchhh Confidence 9999999999999999999999999999999999999999999988654221 100 0111223 Q ss_pred HHHHHHHHHHHhh--------------cCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceec--cceEE Q lcl|NC_010147. 142 LNGLQSAIDKFND--------------EDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL--GAIIV 205 (274) Q Consensus 142 ~d~i~~A~~~l~~--------------~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~--G~~Vv 205 (274) ++.+..+...+.+ .......++|||..+..+.... .|.+ . .|...+++ |+||+ T Consensus 241 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~--~~~~--~-------~G~~~~~lg~g~~v~ 309 (395) T protein:vir:95 241 FADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARY--TYLT--A-------NGGFVTVLPYNVTII 309 (395) T ss_pred hhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcc--eecc--C-------CCcceeccCCcceEE Confidence 3333332222211 1123456899999887664322 2222 1 23344554 66799 Q ss_pred EcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 206 ~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .|+.||++++++.+.+.+.++.+.+++++.+++..+ +...++.+.|+++++++++++++++.+.++.-- T Consensus 310 ~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 310 TSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPR 380 (395) T ss_pred EcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCC Confidence 999999999888888888788888888888777654 667899999999999999999998887655544 No 153 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.86 E-value=1.2e-23 Score=146.30 Aligned_cols=270 Identities=12% Similarity=0.089 Sum_probs=200.1 Q ss_pred CCCccc--eee-------eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGIT--KTS-------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T--~~~-------~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) |.+... .-. +.+-=|+|+..|..++.+++++.++..+- ++ .+|++++||+.+.. .++....|+++.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vR-tI--~~gkS~qf~~lG~s-~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ-TV--TGTNTVSNKYLGET-ELQVLAPGQSPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceee-ee--cccceEEEEEeeee-EEeeecCCCCcCC Confidence 766542 111 11333789999999999999999876553 33 35899999998654 6777899999999 Q ss_pred cccccceeEEEeee-ecceeeeeHHHHhhcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------- Q lcl|NC_010147. 72 DILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAK----------------- 132 (274) Q Consensus 72 ~~~t~~~~~~~~~~-~~~~~~vtd~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~~~~~~~~a~----------------- 132 (274) +.+..++..++++. +.....|.|++..++..| +-.++.+++++++|+..|+.++..+..+. T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 99999999999987 477889999999999999 89999999999999999998876543211 Q ss_pred --cccc----ccccCH----HHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 133 --LTVN----ADITKL----NGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 133 --~~~~----~~~~~~----d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) ..+. ...++. +.|.+|...|.+.++ ...+++++|..|+.|+....+-.......+.+....|.+.++. T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 0010 111232 245678888887775 3345666778887777643211111111123446788899999 Q ss_pred cceEEEcCCCCcc---------------------------eEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 201 GAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~v~~~---------------------------~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) |++|+.|+++|.+ .+.+|+++|++.+.-.++..|.+||+..+.+.|...+.|| T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G 316 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEG 316 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhC Confidence 9999999999831 1268899999998888899999999999999999999999 Q ss_pred EEEEcCccEEEEEecCCCCC-C Q lcl|NC_010147. 254 AYLYDESKAVKITKGSGSLE-M 274 (274) Q Consensus 254 ~~~~~~~~~v~~~~~~a~~~-~ 274 (274) ....+|+.+.+++.+--+.- . T Consensus 317 ~g~~RPeaa~vv~~~~~~~~~~ 338 (400) T protein:vir:10 317 AIPDRWEAVSVVTTKRQSTGAV 338 (400) T ss_pred CcccchhheEEEEecCCccccc Confidence 99999999999986332221 1 No 154 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=99.85 E-value=1e-23 Score=146.66 Aligned_cols=265 Identities=13% Similarity=0.018 Sum_probs=186.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC-cccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~-~~~~t~~~~ 79 (274) +....+.-...++|+.+.+.|.+.+.+.+.+.+.+++.. ..|. .++|.....+.+.|..|+++++ ..+.+++++ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~----~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----TSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEe----cCcc-eEEEEecCCcceeEeecccccCcccCccceeE Confidence 444434444668999999999999999999989886533 2243 6899888888899999988876 468899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------------cccccccC Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------------TVNADITK 141 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------------~~~~~~~~ 141 (274) ++..++++....+|++...++..|+.+.+.+++++.+++.+|..++..-....+ ...+.... T Consensus 154 ~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:98 154 DFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccch Confidence 999999998899999999999999999999999999999999998865322110 00011112 Q ss_pred HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhh--cccc--ccccc-c----cc--ccceeccccceeccce--EEEcC Q lcl|NC_010147. 142 LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGD--ASTN--FTRAT-E----LG--DDIIVKGAFGEALGAI--IVRTN 208 (274) Q Consensus 142 ~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~--~~~~--~~~~s-~----~~--~~~~~~g~ig~~~G~~--Vv~s~ 208 (274) .+.+.++...+.........++||......+++. ..-+ |+... + .. .....+|...+++|+| |+.|+ T Consensus 234 ~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~ 313 (377) T protein:vir:98 234 KEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESL 313 (377) T ss_pred hhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecC Confidence 2334444333322222333444444444433321 1000 10000 0 00 0001245666788776 78899 Q ss_pred CCCcceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 209 KLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 209 ~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .+|++++++.+.+.+.++.+.+++++..++..+ +.+.++++.|+++++++|+++++++.++. T Consensus 314 ~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 314 AVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CCCcccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 999999998888988888899999888776654 67889999999999999999999999988 No 155 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=99.85 E-value=4e-23 Score=143.37 Aligned_cols=258 Identities=13% Similarity=0.060 Sum_probs=190.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) |...++.-...++|+.+.+.|.+.+.+.+.+++++++.+. +| ..+||+....+.+.|..|+.+++. .+++++++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~----~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec----Cc-ceEEEEecCCcceeeecccccccccccccceee Confidence 3333333445789999999999999999999999876432 23 368998887788999999988864 48899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------cc---------cccc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------TV---------NADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~~---------~~~~ 139 (274) .+..++++..+.+|++...++..|+.+.+.+++++.+++.+|..++..-....+ .+ .... T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:95 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999988865321100 00 0011 Q ss_pred -------cCHHHHHHHHHHHhhc------C-CCceEEEEcHHHHHHHHhhccccccccccccccceecccccee--ccce Q lcl|NC_010147. 140 -------TKLNGLQSAIDKFNDE------D-LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA--LGAI 203 (274) Q Consensus 140 -------~~~d~i~~A~~~l~~~------~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~--~G~~ 203 (274) ..++.+.+....+... . ....+|+|||..+..|++.. .+.. + +|..-.. .|.+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~--~~~~-~--------~G~~v~~l~~g~~ 299 (381) T protein:vir:95 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--THLN-A--------NGVYVTALPFNLN 299 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc--ccCC-C--------CCceeecCCCCce Confidence 1244555555555322 1 23456899999999887543 1111 1 1111122 4677 Q ss_pred EEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh--hcceEEEEEEEEEEEEEcCccEEEEEecC--CCCCC Q lcl|NC_010147. 204 IVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKITKGS--GSLEM 274 (274) Q Consensus 204 Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~--a~~~~ 274 (274) |+.|+.||++++++.+.+.+.++.+.++.++..++.. .+.+.++++.|+++++++|+++++++.+. +.+-. T Consensus 300 vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:95 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred EEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 9999999999999888888888899999988877654 47789999999999999999999977544 33322 No 156 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=99.85 E-value=4e-23 Score=143.37 Aligned_cols=258 Identities=13% Similarity=0.060 Sum_probs=190.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCc-ccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~-~~~t~~~~ 79 (274) |...++.-...++|+.+.+.|.+.+.+.+.+++++++.+. +| ..+||+....+.+.|..|+.+++. .+++++++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~----~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec----Cc-ceEEEEecCCcceeeecccccccccccccceee Confidence 3333333445789999999999999999999999876432 23 368998887788999999988864 48899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------cc---------cccc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------TV---------NADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~~---------~~~~ 139 (274) .+..++++..+.+|++...++..|+.+.+.+++++.+++.+|..++..-....+ .+ .... T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999988865321100 00 0011 Q ss_pred -------cCHHHHHHHHHHHhhc------C-CCceEEEEcHHHHHHHHhhccccccccccccccceecccccee--ccce Q lcl|NC_010147. 140 -------TKLNGLQSAIDKFNDE------D-LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA--LGAI 203 (274) Q Consensus 140 -------~~~d~i~~A~~~l~~~------~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~--~G~~ 203 (274) ..++.+.+....+... . ....+|+|||..+..|++.. .+.. + +|..-.. .|.+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~--~~~~-~--------~G~~v~~l~~g~~ 299 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--THLN-A--------NGVYVTALPFNLN 299 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc--ccCC-C--------CCceeecCCCCce Confidence 1244555555555322 1 23456899999999887543 1111 1 1111122 4677 Q ss_pred EEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchh--hcceEEEEEEEEEEEEEcCccEEEEEecC--CCCCC Q lcl|NC_010147. 204 IVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKITKGS--GSLEM 274 (274) Q Consensus 204 Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~--a~~~~ 274 (274) |+.|+.||++++++.+.+.+.++.+.++.++..++.. .+.+.++++.|+++++++|+++++++.+. +.+-. T Consensus 300 vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:10 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred EEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 9999999999999888888888899999988877654 47789999999999999999999977544 33322 No 157 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.84 E-value=9.7e-23 Score=141.28 Aligned_cols=271 Identities=13% Similarity=0.093 Sum_probs=173.1 Q ss_pred CCCcccee--eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCc-cccccCCCc-----CCcc Q lcl|NC_010147. 1 MPQGITKT--SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGD-AQVVAEGEK-----IPTD 72 (274) Q Consensus 1 Ma~~~T~~--~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~-~~~~~eg~~-----i~~~ 72 (274) -+..+|.. ..++.|+.+.+.+.+.+.+.+++.+++.+.. +.+ .+..++||+....+. +.|.+||+. .+.+ T Consensus 155 ~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~-~~~-~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s 232 (477) T protein:vir:84 155 RDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEP-LPG-GTSSINIPKILTGTSTAIQAADNAALTAPSAHEV 232 (477) T ss_pred ccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceee-ecC-CcceeEEEEEecCcceeeeeccCccccccccccc Confidence 12222222 2356677778889999998888877654421 222 345689998754433 457888865 4567 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc-----------cccccc Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL-----------TVNADI 139 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--~~-----------~~~~~~ 139 (274) +++++.+++.+++++..+.+|++...++..++.+.+.+++++.+++++|..++..-.+. +. ...... T Consensus 233 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~ 312 (477) T protein:vir:84 233 DLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAG 312 (477) T ss_pred ccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccc Confidence 88999999999999999999999999999999999999999999999999998653221 10 011111 Q ss_pred c-------CHHHHHHHHHHHhhcCC-CceEEEEcHHHHHHHHhhc--ccccc--cccccc------ccceeccccceecc Q lcl|NC_010147. 140 T-------KLNGLQSAIDKFNDEDL-EPMVLFINPLDAGKLRGDA--STNFT--RATELG------DDIIVKGAFGEALG 201 (274) Q Consensus 140 ~-------~~d~i~~A~~~l~~~~~-~~~~~vv~p~~~~~L~k~~--~~~~~--~~s~~~------~~~~~~g~ig~~~G 201 (274) . .++.|+++...+..+.. ...+|+|||..+..|++.. ...++ +..... .+.+..|..|+++| T Consensus 313 ~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G 392 (477) T protein:vir:84 313 SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG 392 (477) T ss_pred cchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc Confidence 1 24456666666655443 4568999999998886532 11111 110000 12244566789999 Q ss_pred ceEEEcCCCCcce-------EEEE-eCCeEEEEeecCceeeeecchhhc--ceEEEEEEEEEEEEE-cCccEEEEEecCC Q lcl|NC_010147. 202 AIIVRTNKLEAGT-------AILA-KKGAVKLILKRDFFLEVARDASTK--TTALYSDKHYVAYLY-DESKAVKITKGSG 270 (274) Q Consensus 202 ~~Vv~s~~v~~~~-------~~~~-~~~a~~~~~~~~~~ve~~rd~~~~--~~~v~~~~~yg~~~~-~~~~~v~~~~~~a 270 (274) +||++|+.||.+. .++| +.+.+.++. .+..++.+++.... ...++...++.+..+ +|+++|.+|.++. T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred cceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecccc Confidence 9999999999642 2333 444444433 34555555444332 233333333444444 5999999997554 Q ss_pred C-CCC Q lcl|NC_010147. 271 S-LEM 274 (274) Q Consensus 271 ~-~~~ 274 (274) . +-. T Consensus 472 ~~~~~ 476 (477) T protein:vir:84 472 TAPTF 476 (477) T ss_pred ccccc Confidence 3 333 No 158 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=99.84 E-value=2e-22 Score=139.55 Aligned_cols=260 Identities=12% Similarity=0.033 Sum_probs=187.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC-cccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~-~~~~t~~~~ 79 (274) |...++.-...++|+.+.+.|.+.+.+.+.+++++.+... +| ..++|.....+.+.|..|+.+++ ..+++++++ T Consensus 76 ~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~----~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEec----Cc-ceEEEeecCCcceEEeecccccccccCccceeE Confidence 3333333345789999999999999999999999876432 23 46789888778888999888875 458899999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------cc---------cccc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------TV---------NADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-----------~~---------~~~~ 139 (274) .+..++++....+|++...++..|+.+.+.+++++.+++.+|..++..-.+..+ .+ .... T Consensus 151 ~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~ 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999988755322110 00 0111 Q ss_pred cCHHH-------HHHHHHHHhh-------cCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEE Q lcl|NC_010147. 140 TKLNG-------LQSAIDKFND-------EDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) Q Consensus 140 ~~~d~-------i~~A~~~l~~-------~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv 205 (274) +++.. +.+....+.. ......+|+|||..+..|++.. .+.. ++ |.. +.. .-.|.+|+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~--~~~~-~~-G~~-v~~----lp~g~~vv 301 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--THLN-AN-GVY-VTA----LPFNLNVI 301 (381) T ss_pred ccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcccc--ccCC-CC-Cce-eec----CCCCceeE Confidence 12222 2221111111 0123457899999999887643 2211 11 221 110 12588899 Q ss_pred EcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 206 ~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) .++.||++++++.+.+.+.++.+.++.++..++..+ +.+.+++..|+++++++|+++++++.+..-.+- T Consensus 302 ~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred EcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcc Confidence 999999999988888888888899999888776654 777899999999999999999997765433332 No 159 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=99.83 E-value=1.7e-22 Score=139.88 Aligned_cols=258 Identities=13% Similarity=0.051 Sum_probs=188.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCC-cccccccee Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~-~~~~t~~~~ 79 (274) |...++.-...++|+.+.+.|.+.+.+.+.+.+++++.. .+|. .+||+....+.+.|..|+.+++ ..+.++++. T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~----~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRT----TGLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEe----cCCc-eEEEEEcCCcceEEeecccccccccCcceeeE Confidence 444444445578999999999999999999999886542 2343 6899988888899999988875 568999999 Q ss_pred EEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc-c----------ccc---------cccc Q lcl|NC_010147. 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA-K----------LTV---------NADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a-~----------~~~---------~~~~ 139 (274) ++..++++..+.+|++...++..|+.+.+.+++++.+++.+|+.++..-... + ..+ .... T Consensus 158 ~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 237 (383) T protein:vir:78 158 ESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGT 237 (383) T ss_pred eecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccch Confidence 9999999999999999999999999999999999999999999988553211 0 000 0112 Q ss_pred cCHHHHHHHHHHHhh---c------C-----CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccc--e Q lcl|NC_010147. 140 TKLNGLQSAIDKFND---E------D-----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA--I 203 (274) Q Consensus 140 ~~~d~i~~A~~~l~~---~------~-----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~--~ 203 (274) ++++.+......+.. + . .....|++||..+..+... ..... .+|...+++|+ + T Consensus 238 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~~---------~~G~~~t~l~~~~~ 306 (383) T protein:vir:78 238 LTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQ--YTSLN---------ANGVYVTALPFNLN 306 (383) T ss_pred hhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccc--hhccC---------CCCceeeecCCCce Confidence 233333332222211 0 0 1223578888666544321 11111 12333455554 5 Q ss_pred EEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh--cceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 204 IVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 204 Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~--~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) |+.|++||++++++.+.+.+.++.+.+++++.+++..+ +.+.+++..|++++++||+++++++.+.+.+|- T Consensus 307 iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~ 379 (383) T protein:vir:78 307 IIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQ 379 (383) T ss_pred EEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCC Confidence 88899999999988888888888899999987766554 678999999999999999999999988888887 No 160 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=99.81 E-value=5.8e-21 Score=131.53 Aligned_cols=261 Identities=9% Similarity=0.111 Sum_probs=170.0 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhccccccc--ccccCCCceEEEEeeccCCccccccCCC-cCCccccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVVAEGE-KIPTDILETK 77 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~--~~~~~~g~tv~ip~~~~~~~~~~~~eg~-~i~~~~~t~~ 77 (274) ||.. . + ++.|++.+.+++.+.++++.|....+ .....+|++|+||+... ....+|..+. ......++.+ T Consensus 1 MA~~--n----~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~-~gl~DY~R~~~g~~~g~~~~~ 72 (299) T protein:vir:79 1 MAAL--N----Y-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTIST-TGRVDSNRDTIAVAQRNYDNA 72 (299) T ss_pred Cccc--h----h-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccc-ccccccccCCCcccccccCcc Confidence 6621 1 2 48999999999999999887764432 33445689999999965 5777898765 4566688889 Q ss_pred eeEEEeee-ecceeeeeHHHHhhcCcc--HHHHHHHHHHHHHHHHHHHHHHHHhhcccc----c----ccccccCHHHHH Q lcl|NC_010147. 78 KREAKIRK-IAKGTSITDEALLSGYGD--PQGEQVRQHGLAHANKVDNDVLEALMGAKL----T----VNADITKLNGLQ 146 (274) Q Consensus 78 ~~~~~~~~-~~~~~~vtd~~~~~~~~d--~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~----~----~~~~~~~~d~i~ 146 (274) ..++++.+ ++..|.+++.+..++... ......+.....++..+|+..++.+.+... . ..++..-|+.|. T Consensus 73 ~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~ 152 (299) T protein:vir:79 73 WEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFD 152 (299) T ss_pred eeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHH Confidence 99999976 799999996555554333 223334445566788899988876643221 1 122233478899 Q ss_pred HHHHHHhhcCC--CceEEEEcHHHHHHHHhhcccccccccccc-ccceeccccceeccceEEE--cCCCCc------c-- Q lcl|NC_010147. 147 SAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELG-DDIIVKGAFGEALGAIIVR--TNKLEA------G-- 213 (274) Q Consensus 147 ~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~~~~~g~ig~~~G~~Vv~--s~~v~~------~-- 213 (274) ++..+|.++++ ++|+++|+|+++..|+++..+ ......+ ....++|.+|++.|++|+. |+.+.. | T Consensus 153 ~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f--~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~ 230 (299) T protein:vir:79 153 KLMEKMTEARVPENGRILYVTPVVNTLIKNAKEI--QRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWK 230 (299) T ss_pred HHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhh--hcccccccccceeeeeeeeecceEEEEechhhcCccceeccCcc Confidence 99999999886 579999999999999987643 3333333 2357899999999999987 555541 1 Q ss_pred --------eEEEEeCCeEEEEeecCceeeeecc-hhhcce-EEEEEEEEEEEEEc-CccEEEEEecCCCC Q lcl|NC_010147. 214 --------TAILAKKGAVKLILKRDFFLEVARD-ASTKTT-ALYSDKHYVAYLYD-ESKAVKITKGSGSL 272 (274) Q Consensus 214 --------~~~~~~~~a~~~~~~~~~~ve~~rd-~~~~~~-~v~~~~~yg~~~~~-~~~~v~~~~~~a~~ 272 (274) ..++++++|..-..+.. .+..+.. .....| +...|.++.+-+++ -...+.+.+..|-. T Consensus 231 ~~~~ak~in~ii~~~~a~~~~~K~~-~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 231 VGAGAKQIFMSLVHPSAIITPVSYQ-FSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred ccCcccccceEEEcCCeeeeeEeee-eEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 13567777776555443 2222221 222223 44555556666553 33334444433333 No 161 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.81 E-value=9.3e-21 Score=130.41 Aligned_cols=264 Identities=14% Similarity=0.093 Sum_probs=194.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCC----ccccccCCCcCCcccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSG----DAQVVAEGEKIPTDILET 76 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~----~~~~~~eg~~i~~~~~t~ 76 (274) |... ....-.+.|+.+.. +.+.+.+.+.+.+++++..+.. ..+.+||++.... ...|.++.++.+.+++++ T Consensus 14 it~~-d~~gG~L~P~~~~~-~i~~l~e~s~i~~~a~vi~t~~---s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf 88 (314) T protein:vir:41 14 IDVP-DLGKGILAVQRFGE-FVREVRENSAIIKDARVLNALK---SYEVDISRISLGVELEPGRNTSGTKVAPTADEVTV 88 (314) T ss_pred cccc-cCCCceeChHHHHH-HHHHHHhccchhhheeeecccC---ccceeecccccCcccccccccccCCccCCcccccc Confidence 3211 11233589999865 6688999999999998754432 2357888875321 233556667788999999 Q ss_pred ceeEEEeeeecceeeeeHHHHhhcC--ccHHHHHHHHHHHHHHHHHHHHHHHHhhc-----------------cccc--- Q lcl|NC_010147. 77 KKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHANKVDNDVLEALMG-----------------AKLT--- 134 (274) Q Consensus 77 ~~~~~~~~~~~~~~~vtd~~~~~~~--~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-----------------a~~~--- 134 (274) ++..+..+++...+.++++...++. +|+.+.+.+++++.+++.++..+++.-.. +... T Consensus 89 ~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~ 168 (314) T protein:vir:41 89 STNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTD 168 (314) T ss_pred cceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceee Confidence 9999999999999999999999885 59999999999999999999888765321 1100 Q ss_pred --ccccccCHHHHHHHHHHHhhcCC---CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCC Q lcl|NC_010147. 135 --VNADITKLNGLQSAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK 209 (274) Q Consensus 135 --~~~~~~~~d~i~~A~~~l~~~~~---~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~ 209 (274) ..+...+.+.|.++...|..... ..-+|+||+..+.++++.... .....++..+..|.-.+++|+||+.++. T Consensus 169 ~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~---~~~~l~~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 169 AEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLV---RETGLGDSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred cCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhc---cCCcccchhhhCCCCceecceeeEeccc Confidence 11223456678888888877542 345799999999888764211 2233455566677778899999999998 Q ss_pred CC-----cceEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEe-cCCCC Q lcl|NC_010147. 210 LE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK-GSGSL 272 (274) Q Consensus 210 v~-----~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~-~~a~~ 272 (274) +| +..+++.++..+.++.+..++++.+|+...+...++.++|+++.+..++++|+... .+++. T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ccccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 85 55667777888888888899999999999999999999999999998877766654 33333 No 162 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=99.78 E-value=8.9e-20 Score=125.03 Aligned_cols=262 Identities=14% Similarity=0.078 Sum_probs=187.1 Q ss_pred CCCcccee-eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccC----CccccccCCCcCCccccc Q lcl|NC_010147. 1 MPQGITKT-SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~~-~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~----~~~~~~~eg~~i~~~~~t 75 (274) -++.++-. .-.+.|+.+.. +.+.+.+.+.+.+.+.+..... +.+.+|+..... ....|.+++++.+.++++ T Consensus 17 k~~t~~d~~Gg~l~P~~~~~-~i~~~~e~s~~l~~~~vi~~~~---~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~ 92 (315) T protein:vir:41 17 PKIDVPDLGRGVLSVDRFGE-FVKAVRDSAVIIPEARIDNALK---SYEKDISRLSLVLDVGPGRDETGQKLAPPESTAE 92 (315) T ss_pred hhcCCcCCCCceechHHHHH-HHHHHHhhhhhhhhceeeeccc---cccccccccccCcccccccccccCcCCCCCCccc Confidence 11111111 12478998876 5578999999999987643222 233445443211 123456677778889999 Q ss_pred cceeEEEeeeecceeeeeHHHHhhcC--ccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------ccc---- Q lcl|NC_010147. 76 TKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHANKVDNDVLEALMGA---------------KLT---- 134 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~vtd~~~~~~~--~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a---------------~~~---- 134 (274) +++..+..++++..+.++++...++. +|+.+.+..++++.+++..+..+++.-.++ ... T Consensus 93 f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~ 172 (315) T protein:vir:41 93 VKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTES 172 (315) T ss_pred cceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceeccccccccc Confidence 99999999999888999999998874 699999999999999999999888763211 000 Q ss_pred ---ccccccCHHHHHHHHHHHhhcC---CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC Q lcl|NC_010147. 135 ---VNADITKLNGLQSAIDKFNDED---LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN 208 (274) Q Consensus 135 ---~~~~~~~~d~i~~A~~~l~~~~---~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~ 208 (274) ..+...+.+.|.++...|.... ....+|+||+..+.++++... ......++..+..|.-.+++|+||+..+ T Consensus 173 ~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~---~~g~~lw~~~~~~g~~~tl~G~PV~~~~ 249 (315) T protein:vir:41 173 DVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALK---GRETGLGDQALTGANSILYDGRPVQYVP 249 (315) T ss_pred ccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhc---cCCCccccchhhcCCCceecccceEecc Confidence 1122345778888888887643 345689999999998877431 2333446666777888899999999999 Q ss_pred CCC-----cceEEEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 209 KLE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 209 ~v~-----~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) .|| ++.+++.+...+.++.+..++++.+|+.......++.+.|+++.+..+++.+.-..+. T Consensus 250 ~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 250 ALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 885 4555666677788888889999999999999999999999999887666633333222 No 163 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.76 E-value=1.8e-19 Score=123.35 Aligned_cols=254 Identities=12% Similarity=0.064 Sum_probs=180.2 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~ 80 (274) ||... -+.|++.+.+++...++.+.+.+.+. .+.+|++|+||+... ....+|..+..+...+++.+..+ T Consensus 1 Main~--------a~~~~~~Ld~~~~~~~~t~~l~~~~~--~~~ggktVkI~~i~~-~gl~DY~R~~g~~~g~v~~~~et 69 (290) T protein:vir:78 1 MAINY--------VDKYGKELDQKLVFGTYTNELETPNL--LWLDAKTFKIQTITT-TGLKAHTRNKGYNEGSASNTNKS 69 (290) T ss_pred CchhH--------HHHHHHHHHHHHHhhheeeeccccce--eeccCCEEEEeeecc-CcccccccCCCcccCccccceee Confidence 77543 26899999999999999988876554 455699999999974 57788999889988999999999 Q ss_pred EEeee-ecceeeee--HHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---c----cccccCHHHHHHHHH Q lcl|NC_010147. 81 AKIRK-IAKGTSIT--DEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT---V----NADITKLNGLQSAID 150 (274) Q Consensus 81 ~~~~~-~~~~~~vt--d~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~---~----~~~~~~~d~i~~A~~ 150 (274) .++.+ ++..|.++ |++..+....+.....+...+.++..+|+..++.+.+.... . .++..-|+.|.++.. T Consensus 70 ~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 70 YTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIR 149 (290) T ss_pred EEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHH Confidence 99866 79999998 87777766777778888888899999999988776443211 1 122334788888999 Q ss_pred HHhhcCCCceEEEEcHHHHHHHHhhccccccccc---cccccceeccccceeccceEEEcCC---C-C---------cc- Q lcl|NC_010147. 151 KFNDEDLEPMVLFINPLDAGKLRGDASTNFTRAT---ELGDDIIVKGAFGEALGAIIVRTNK---L-E---------AG- 213 (274) Q Consensus 151 ~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s---~~~~~~~~~g~ig~~~G~~Vv~s~~---v-~---------~~- 213 (274) +|.+...++|+++|+|.++..|+++.. |.... +.+.+ ..+|.++++.|++|+..+. + . ++ T Consensus 150 ~ldevp~~~rvl~vtp~~~~lL~~~~~--f~r~~~~~~~~~~-~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~ 226 (290) T protein:vir:78 150 KVKKYGTQNLVMYVSPDVMAALELSDD--FVRAINVQNIGPS-SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAA 226 (290) T ss_pred HHHhcCCCCeEEEECHHHHHHHhhChh--hhccccccccccc-cccceeeeecCcEEEEecccchhhhhhhhcccccccC Confidence 998877889999999999999988764 43332 22333 3488999999999987542 1 0 11 Q ss_pred -----eEEEEeCCeEEEEeecCceeeee---cchhhcceEEEEEEEEEEEEEcCccE-EEEEecC Q lcl|NC_010147. 214 -----TAILAKKGAVKLILKRDFFLEVA---RDASTKTTALYSDKHYVAYLYDESKA-VKITKGS 269 (274) Q Consensus 214 -----~~~~~~~~a~~~~~~~~~~ve~~---rd~~~~~~~v~~~~~yg~~~~~~~~~-v~~~~~~ 269 (274) ..++++++|..-..+.. .+... .+.......+.+|.++.+-+++-.+- +....+- T Consensus 227 ~ak~in~ii~~~~a~i~~~K~~-~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 227 GAKKLNFLLVNKGSVVGGAKHA-SIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CccceeEEEEcCCceeeeeeee-EEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 12456777765544443 23222 22223456888888888888744332 3322222 No 164 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.73 E-value=1.1e-18 Score=119.14 Aligned_cols=267 Identities=12% Similarity=0.081 Sum_probs=187.9 Q ss_pred CCCccce---eeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccccc-CC-CcCCccccc Q lcl|NC_010147. 1 MPQGITK---TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVA-EG-EKIPTDILE 75 (274) Q Consensus 1 Ma~~~T~---~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~-eg-~~i~~~~~t 75 (274) -++..+. ..-..+|.-+++.+.+++.+.+.|.+.+++... .....+||.+...+...+.. ++ ...+..+++ T Consensus 15 ~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v----~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~ 90 (321) T protein:vir:31 15 EKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETV----GAKKTRIPTLNIGERHRRPQDEGEWNENESDVS 90 (321) T ss_pred HhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeec----cCcceeeeeeccCCcccccccccccccccccce Confidence 1222221 112345555777778889999899888766432 23346788886555555554 33 345677889 Q ss_pred cceeEEEeeeecceeeeeHHHHhhc--CccHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------------c Q lcl|NC_010147. 76 TKKREAKIRKIAKGTSITDEALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------------L 133 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~vtd~~~~~~--~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~--------------------~ 133 (274) +++.++..++.+..+.++++...++ .+|+.+.+.+.+++.+++.++..++..-..+. . T Consensus 91 ~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~ 170 (321) T protein:vir:31 91 TGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETI 170 (321) T ss_pred eeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccc Confidence 9999999999999999999988876 46999999999999999999988775532211 1 Q ss_pred cccccccCHHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcCCCC Q lcl|NC_010147. 134 TVNADITKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE 211 (274) Q Consensus 134 ~~~~~~~~~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~ 211 (274) ...++.+++|.|.++...|..... ..-+++||++.+..+++.. .+ .....++..+..|...+++|+||+.++++| T Consensus 171 ~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l-~~--~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP 247 (321) T protein:vir:31 171 DAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTL-TD--RDTPLGDNVIMGEADVNPFSFPIIGSGLWP 247 (321) T ss_pred cccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHH-hc--CCCccccchhhccccccccceeEEEcCCCC Confidence 122344678999999999977653 3447899999887665421 11 222345555667777789999999999999 Q ss_pred cceEEEEeCCeEEEEeecCceeeeecchhh---cceEEE--EEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 212 AGTAILAKKGAVKLILKRDFFLEVARDAST---KTTALY--SDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 212 ~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~---~~~~v~--~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ++.+++.+...+.++..++++++..++... ..+.++ .+..+++.+-++++++.++-=.=+.|. T Consensus 248 ~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 248 DDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred CCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcchhc Confidence 999999999999888888888887777543 234444 344688889999999999843333444 No 165 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.68 E-value=1.8e-17 Score=112.39 Aligned_cols=264 Identities=10% Similarity=0.074 Sum_probs=164.7 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccc---cccccccCCCceEEEEeeccCCccccccCCCcCC-cccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAE---VDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILET 76 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~---~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~-~~~~t~ 76 (274) ||... -+.|.+.+.+++...++.+.+.. ....+...+|++|+||+..-.....+|+...... ...++. T Consensus 1 Mainy--------a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~ 72 (346) T protein:vir:10 1 MTINY--------AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSN 72 (346) T ss_pred Ccchh--------HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccc Confidence 66543 24688888888876655433221 1122344568999999995333566787766664 578898 Q ss_pred ceeEEEeee-ecceeeee--HHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccccccCHHH Q lcl|NC_010147. 77 KKREAKIRK-IAKGTSIT--DEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------TVNADITKLNG 144 (274) Q Consensus 77 ~~~~~~~~~-~~~~~~vt--d~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------~~~~~~~~~d~ 144 (274) +..+.++.+ ++..|.++ |++.......+.....+......+-.+|+..|+.+.+... ...+...-|+. T Consensus 73 ~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~ 152 (346) T protein:vir:10 73 DWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPA 152 (346) T ss_pred ceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHH Confidence 999999866 79999999 6554432233333333445555667889987776543211 11122334788 Q ss_pred HHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEE--cCCCC------cc- Q lcl|NC_010147. 145 LQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVR--TNKLE------AG- 213 (274) Q Consensus 145 i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~--s~~v~------~~- 213 (274) |.++..+|.++++ ++|+++|+|+++..|++... |......++....+|.++++.|++|+. |+.+. .| T Consensus 153 i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~--f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~ 230 (346) T protein:vir:10 153 FDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEA--MNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGS 230 (346) T ss_pred HHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchh--heeccccccccccceeeeeecCeEEEEcchhhcccchhhccCc Confidence 8999999999875 67999999999998887664 444444444334599999999999987 45553 11 Q ss_pred ---e------EEEEeCCeEEEEeecC-ceeeeecchhhcceEEEEEEEEEEEEEcCcc-EEEEEecCCC--------CCC Q lcl|NC_010147. 214 ---T------AILAKKGAVKLILKRD-FFLEVARDASTKTTALYSDKHYVAYLYDESK-AVKITKGSGS--------LEM 274 (274) Q Consensus 214 ---~------~~~~~~~a~~~~~~~~-~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~-~v~~~~~~a~--------~~~ 274 (274) + .++++++|..-..+.. +.+-.--....+...+.+|.++.+-+++-.+ .+.+....|. .|- T Consensus 231 ~~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~~~~~~ 310 (346) T protein:vir:10 231 KIIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQSGQDA 310 (346) T ss_pred cccCCccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecccccCccCccccc Confidence 1 2566777765444433 2332222334566688889888888874433 3433433332 111 No 166 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.61 E-value=2.1e-16 Score=106.53 Aligned_cols=261 Identities=15% Similarity=0.155 Sum_probs=170.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhccccccc--ccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~--~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) ||... -+.|.+.+.+++...+....+.+..+ .....+|++|+||+........+|..+...+...++.+. T Consensus 1 Main~--------~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~ 72 (285) T protein:vir:79 1 MTVVL--------DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGK 72 (285) T ss_pred Ccchh--------hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceee Confidence 66542 34688999999988888877654432 334456899999999644567789998889999999999 Q ss_pred eEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccc----ccccCHHHHHHHHHHH Q lcl|NC_010147. 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQH-GLAHANKVDNDVLEALMGAKLTVN----ADITKLNGLQSAIDKF 152 (274) Q Consensus 79 ~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~-a~~~a~~~d~~~~~~~~~a~~~~~----~~~~~~d~i~~A~~~l 152 (274) .+.++.+ ++..|.++..+...+..-.++.+.+++ ....+-.+|+..++.+.+...... ++..-++.|.++..+| T Consensus 73 et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~~l 152 (285) T protein:vir:79 73 ETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEAYM 152 (285) T ss_pred eEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 9999876 788998884333333222345555553 334457889887777654332222 2233477888999999 Q ss_pred hhcCC-CceEEEEcHHHHHHHHhhccccccccccccccce---eccccceecc-ceEEE--cCCCCcce------EEEEe Q lcl|NC_010147. 153 NDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDII---VKGAFGEALG-AIIVR--TNKLEAGT------AILAK 219 (274) Q Consensus 153 ~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~---~~g~ig~~~G-~~Vv~--s~~v~~~~------~~~~~ 219 (274) .++++ .+|+++|+|+++..|++... |....+..+... .++.++.+.| ++|+. |+.+..++ .++++ T Consensus 153 de~~vp~~rvl~vTp~~~~~Lk~s~~--~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~ 230 (285) T protein:vir:79 153 FDNEVPGGFVMFVSSAYYTALKQSAA--VTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTP 230 (285) T ss_pred HHcCCCCceEEEEChHHHHHHHhhhh--hheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEec Confidence 99876 78999999999999988764 344333222211 3456899998 89987 45665333 25677 Q ss_pred CCeEEEEeecCceeeeecc--hhhcceEEEEEEEEEEEEEcCc-cEEEEEecCCC Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARD--ASTKTTALYSDKHYVAYLYDES-KAVKITKGSGS 271 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd--~~~~~~~v~~~~~yg~~~~~~~-~~v~~~~~~a~ 271 (274) ++|..-..+.+..--.+.+ .......+.+|.++++-+++-. ..+.+...+|- T Consensus 231 ~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 231 LSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred CceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 7775444443321112222 2334568899999888888443 33444544444 No 167 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.56 E-value=1.6e-15 Score=101.73 Aligned_cols=263 Identities=11% Similarity=0.087 Sum_probs=166.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCc--CCccccccce Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK--IPTDILETKK 78 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~--i~~~~~t~~~ 78 (274) |||. +--.+.|.+.+.+++...++...+-.......-.+|++|+||+... ....+|..+.. .+..+++.+. T Consensus 1 Mant------l~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~-~gl~DY~R~~g~~~~~g~v~~~~ 73 (312) T protein:vir:10 1 MANT------LAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLST-DGLGDYSRGSANAYVGGDVKFEY 73 (312) T ss_pred CCcc------hhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeec-ccccccccccCCccccccccccc Confidence 7743 2235789999999999888776653333333335689999999874 46677887666 4555788888 Q ss_pred eEEEeee-ecceeeee--HHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------cccccCHHH Q lcl|NC_010147. 79 REAKIRK-IAKGTSIT--DEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-----------NADITKLNG 144 (274) Q Consensus 79 ~~~~~~~-~~~~~~vt--d~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~-----------~~~~~~~d~ 144 (274) .+.++.+ ++..|.++ |++.............+......+-.+|+..++.+.+..... .+...-|+. T Consensus 74 et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~ 153 (312) T protein:vir:10 74 ETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINK 153 (312) T ss_pred eeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHH Confidence 8888865 78999999 666555444455555555677788899999887765322111 122334778 Q ss_pred HHHHHHHHhhcCC-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCC------cce- Q lcl|NC_010147. 145 LQSAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLE------AGT- 214 (274) Q Consensus 145 i~~A~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~------~~~- 214 (274) |.++..+|.++++ .+|+++|.|.++..|++..... ......+.+ ..++.++++.|++|+.-+ .+- +|+ T Consensus 154 i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~-~~~~~~~~~-~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t 231 (312) T protein:vir:10 154 IKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEK-LTAVTFAQG-GIQTQVPSIDGCALIKTPQNRMYSSILLNDGTT 231 (312) T ss_pred HHHHHHHHHHccCCCceEEEeChHHHHHHhhhhhce-ecccccccc-eeeeeeeeecccEEEEchhhhccceeeeccCcc Confidence 8889999999876 5899999999998887653222 233333333 458899999999998733 231 121 Q ss_pred ------------------EEEEeCCeEEEEeecC-cee-eeecchhhcceEEEEEEEEEEEEEcCc-cEEEEEe--cCCC Q lcl|NC_010147. 215 ------------------AILAKKGAVKLILKRD-FFL-EVARDASTKTTALYSDKHYVAYLYDES-KAVKITK--GSGS 271 (274) Q Consensus 215 ------------------~~~~~~~a~~~~~~~~-~~v-e~~rd~~~~~~~v~~~~~yg~~~~~~~-~~v~~~~--~~a~ 271 (274) .++++++|..-..+.. +.+ +.+-+.......+..|.++.+-+++-. ..+.+.+ +-.. T Consensus 232 ~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~~ 311 (312) T protein:vir:10 232 SNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKPV 311 (312) T ss_pred cccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccCC Confidence 1345555544333322 121 111222334568999999988888443 2333333 3333 Q ss_pred C Q lcl|NC_010147. 272 L 272 (274) Q Consensus 272 ~ 272 (274) . T Consensus 312 ~ 312 (312) T protein:vir:10 312 G 312 (312) T ss_pred C Confidence 3 No 168 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.56 E-value=1.8e-16 Score=106.90 Aligned_cols=257 Identities=15% Similarity=0.143 Sum_probs=178.3 Q ss_pred CCCccc--eeee--eechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccc-------cCCCcC Q lcl|NC_010147. 1 MPQGIT--KTSN--QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV-------AEGEKI 69 (274) Q Consensus 1 Ma~~~T--~~~~--~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~-------~eg~~i 69 (274) |+...- ...| -++|.-|-.-+.+-+.+...+.++- ..|.- +|.|+.-|.......++.+ .||+.+ T Consensus 127 ~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf---~tLP~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L 202 (410) T protein:vir:83 127 YARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTL---GTLPL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTEL 202 (410) T ss_pred HHHhhccCcccccccccchhHhhhHHHHHhhccchhhhh---hhCCC-CCCeeEEeeecccccccccccccccccccccc Confidence 333221 1122 2344446554444444443333322 22333 4788888776554454433 499999 Q ss_pred CccccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccccCH----HH Q lcl|NC_010147. 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-VNADITKL----NG 144 (274) Q Consensus 70 ~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~-~~~~~~~~----d~ 144 (274) +..|++++..++.++.+++...+|+...+.+....++...+.+..+-|+..++..-+.|...... +....++. .. T Consensus 203 ~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~ 282 (410) T protein:vir:83 203 DSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASA 282 (410) T ss_pred cccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHH Confidence 99999999999999999999999999999999999999999998888888887766666433221 22223343 35 Q ss_pred HHHHHHHHhhc--CCCceEEEEcHHHHHHHHhhccccccccccc-------cccceeccccceeccceEEEcCCCCcceE Q lcl|NC_010147. 145 LQSAIDKFNDE--DLEPMVLFINPLDAGKLRGDASTNFTRATEL-------GDDIIVKGAFGEALGAIIVRTNKLEAGTA 215 (274) Q Consensus 145 i~~A~~~l~~~--~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~-------~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~ 215 (274) |.|+..++.++ +...+++.|+|++...+-+ .|...+.. +.+.+-+|.-|.++|+||+++++.+.|++ T Consensus 283 i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~----~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA 358 (410) T protein:vir:83 283 IWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGP----LFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDA 358 (410) T ss_pred HHHHHHHHhhhhccceeeeEEechhhhhhccc----eeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCee Confidence 67888888887 7788999999999765543 23333222 22333377888999999999999999999 Q ss_pred EEEeCCeEEEEeec--CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 216 ILAKKGAVKLILKR--DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 216 ~~~~~~a~~~~~~~--~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) +++.+.|+.+++.. |+++ ++-++..-...++ -||++..++|.+++-+... T Consensus 359 ~f~~~~Ai~~~eS~~gp~qL-~d~~i~nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 359 YLFSTAAIECFEQRVGTLQV-VEPSVFGLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred eEeccceeeeeecCCceeEe-eCCchhhhhhhhe--eeeeeccccccceeeeccC Confidence 99999999998855 3444 4444444333333 6789999999999998776 No 169 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.56 E-value=1e-15 Score=102.70 Aligned_cols=259 Identities=13% Similarity=0.060 Sum_probs=168.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCccccccCCCcC--Cccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~eg~~i--~~~~~t 75 (274) ||+..+++..+ . + +.+++.++..+|+.+++.+.+.+. .+.|++|+||......... |.+. .++.+. T Consensus 1 Ma~~~~~~lti---~-~-~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~----G~~~t~~~~~~~ 71 (430) T protein:vir:21 1 MALNEGQIVTL---A-V-DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE----GWDLTDKATGLL 71 (430) T ss_pred CccccchhhHH---H-H-HHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccc----cccccCCCccce Confidence 98775444332 2 2 778889999999999866544332 3579999999875432221 2211 134677 Q ss_pred cceeEEEeeee-cceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccCHHHHH Q lcl|NC_010147. 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------VNADITKLNGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------~~~~~~~~d~i~ 146 (274) .++..+++.+. ...|.+++.+ +...+...++.+...+.+|.++|.+|++.+...... .......+.++. T Consensus 72 e~~v~~~~~~~~~V~~~~~~kE--l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:21 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred eeeEeEEEeeeccceEEeehhH--hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHH Confidence 78888888764 5667888665 467888899999999999999999999886543211 122234578899 Q ss_pred HHHHHHhhcCC---CceEEEEcHHHHHHHHhhccccccccccccccceeccccce-eccce-EEEcCCCCc--------- Q lcl|NC_010147. 147 SAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA--------- 212 (274) Q Consensus 147 ~A~~~l~~~~~---~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~-~~G~~-Vv~s~~v~~--------- 212 (274) ++.+.|.+.+. .+|.++++|..++.|..... .+......+...+++|+|++ +.|++ ++.++++|. T Consensus 150 ~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~-~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~ 228 (430) T protein:vir:21 150 DAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLT-KRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhc-cccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCc Confidence 99999998875 35899999999988754321 12222222334556777775 66775 344444431 Q ss_pred ------------------------------------------------c------------------------------- Q lcl|NC_010147. 213 ------------------------------------------------G------------------------------- 213 (274) Q Consensus 213 ------------------------------------------------~------------------------------- 213 (274) | T Consensus 229 tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I 308 (430) T protein:vir:21 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) T ss_pred eeccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEE Confidence 0 Q ss_pred ------------------------------------e-----EEEEeCCeEEEEeecC---------------------c Q lcl|NC_010147. 214 ------------------------------------T-----AILAKKGAVKLILKRD---------------------F 231 (274) Q Consensus 214 ------------------------------------~-----~~~~~~~a~~~~~~~~---------------------~ 231 (274) . .++||++||.++.+.- + T Consensus 309 ~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) T protein:vir:21 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) T ss_pred eecccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccce Confidence 0 0455666666654321 1 Q ss_pred e--eeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 232 F--LEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 232 ~--ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) . +-++.|....++..+.+.-||++.++|+-.+++=..+++ T Consensus 389 sirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 1 223345556778888889999999999997555555555 No 170 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.53 E-value=4.8e-15 Score=99.09 Aligned_cols=259 Identities=14% Similarity=0.065 Sum_probs=168.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCccccccCCCcCC--ccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVAEGEKIP--TDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~eg~~i~--~~~~t 75 (274) |||..++.. +++.+.+.+-+++.+++.+.+.+.+.+. .+.|++|++|....... .+|..++ .+++. T Consensus 1 MAn~l~~~~-----~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~----~~G~~~t~~~~~i~ 71 (430) T protein:vir:10 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT----QEGWDLTDKATGLL 71 (430) T ss_pred CccchhhHH-----HHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccc----ccCcccCCCCCccc Confidence 998765553 3688888899999999999866544332 35699999998754321 2233222 34677 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccCHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------VNADITKLNGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------~~~~~~~~d~i~ 146 (274) .++..+++.+ ....|.+++.+ +...+...+..+...+.+|.++|.++++....-... .......+..+. T Consensus 72 e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:10 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred cceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHH Confidence 7788888876 46678888766 356777788889999999999999999886442221 122233468899 Q ss_pred HHHHHHhhcCCC---ceEEEEcHHHHHHHHhhccccccccccccccceeccccce-eccce-EEEcCCCCc--------- Q lcl|NC_010147. 147 SAIDKFNDEDLE---PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA--------- 212 (274) Q Consensus 147 ~A~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~-~~G~~-Vv~s~~v~~--------- 212 (274) ++.+.|.+.+.. +|.++++|..++.|..... .+..........+++|+|++ +.|++ ++.++.+|. T Consensus 150 ~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~-~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~ 228 (430) T protein:vir:10 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLT-KRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhc-cccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCc Confidence 999999988763 5899999999988753221 11122222334456666665 66664 333333321 Q ss_pred ------------------------------------------------c------------------------------- Q lcl|NC_010147. 213 ------------------------------------------------G------------------------------- 213 (274) Q Consensus 213 ------------------------------------------------~------------------------------- 213 (274) | T Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I 308 (430) T protein:vir:10 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) T ss_pred eeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEE Confidence 0 Q ss_pred -----------------------------------------eEEEEeCCeEEEEeecC---------------------- Q lcl|NC_010147. 214 -----------------------------------------TAILAKKGAVKLILKRD---------------------- 230 (274) Q Consensus 214 -----------------------------------------~~~~~~~~a~~~~~~~~---------------------- 230 (274) ..++||++||.++.+.- T Consensus 309 ~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Gl 388 (430) T protein:vir:10 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) T ss_pred eccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceE Confidence 01456677777665321 Q ss_pred -ceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 231 -FFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 231 -~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) +.+-++.|....++..|.+.-||++.++|+-.+++=..+++ T Consensus 389 sirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 11223455666777888889999999999998555555555 No 171 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.53 E-value=4.8e-15 Score=99.09 Aligned_cols=259 Identities=14% Similarity=0.065 Sum_probs=168.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccc---cCCCceEEEEeeccCCccccccCCCcCC--ccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---GQPGDTLTFPAFVYSGDAQVVAEGEKIP--TDILE 75 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~---~~~g~tv~ip~~~~~~~~~~~~eg~~i~--~~~~t 75 (274) |||..++.. +++.+.+.+-+++.+++.+.+.+.+.+. .+.|++|++|....... .+|..++ .+++. T Consensus 1 MAn~l~~~~-----~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~----~~G~~~t~~~~~i~ 71 (430) T protein:vir:92 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT----QEGWDLTDKATGLL 71 (430) T ss_pred CccchhhHH-----HHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccc----ccCcccCCCCCccc Confidence 998765553 3688888899999999999866544332 35699999998754321 2233222 34677 Q ss_pred cceeEEEeee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccCHHHHH Q lcl|NC_010147. 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------VNADITKLNGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------~~~~~~~~d~i~ 146 (274) .++..+++.+ ....|.+++.+ +...+...+..+...+.+|.++|.++++....-... .......+..+. T Consensus 72 e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:92 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred cceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHH Confidence 7788888876 46678888766 356777788889999999999999999886442221 122233468899 Q ss_pred HHHHHHhhcCCC---ceEEEEcHHHHHHHHhhccccccccccccccceeccccce-eccce-EEEcCCCCc--------- Q lcl|NC_010147. 147 SAIDKFNDEDLE---PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA--------- 212 (274) Q Consensus 147 ~A~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~-~~G~~-Vv~s~~v~~--------- 212 (274) ++.+.|.+.+.. +|.++++|..++.|..... .+..........+++|+|++ +.|++ ++.++.+|. T Consensus 150 ~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~-~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~ 228 (430) T protein:vir:92 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLT-KRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhc-cccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCc Confidence 999999988763 5899999999988753221 11122222334456666665 66664 333333321 Q ss_pred ------------------------------------------------c------------------------------- Q lcl|NC_010147. 213 ------------------------------------------------G------------------------------- 213 (274) Q Consensus 213 ------------------------------------------------~------------------------------- 213 (274) | T Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I 308 (430) T protein:vir:92 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) T ss_pred eeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEE Confidence 0 Q ss_pred -----------------------------------------eEEEEeCCeEEEEeecC---------------------- Q lcl|NC_010147. 214 -----------------------------------------TAILAKKGAVKLILKRD---------------------- 230 (274) Q Consensus 214 -----------------------------------------~~~~~~~~a~~~~~~~~---------------------- 230 (274) ..++||++||.++.+.- T Consensus 309 ~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Gl 388 (430) T protein:vir:92 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) T ss_pred eccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceE Confidence 01456677777665321 Q ss_pred -ceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 231 -FFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 231 -~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) +.+-++.|....++..|.+.-||++.++|+-.+++=..+++ T Consensus 389 sirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 11223455666777888889999999999998555555555 No 172 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.49 E-value=2.2e-14 Score=95.47 Aligned_cols=265 Identities=11% Similarity=0.098 Sum_probs=167.1 Q ss_pred CCCc-cceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccccccee Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~ 79 (274) |+.. +||.-+ --+.|.+.+.+++...+..+.+.+.+..+ -.+|++|+||+... ....+|..+......+++.+.. T Consensus 1 ~~~~an~mAln--ya~~~~~~Ld~~~~~~~~t~~l~~~~~~~-~~Gak~VkIp~i~~-~gl~dY~R~~g~~~g~v~~~~e 76 (311) T protein:vir:99 1 MPTDAETRGFN--YVTKDGNLLDQKITAGLFTAALGTPEVDL-VNGGRSFTLKTIST-SGLKDHTRGKGFNSGTISDEKT 76 (311) T ss_pred CCCcchhhHHH--HHHHHHHHHHHHHHhhhcccceecCchhe-eecCCEEEEEeeee-ccccccccccCccccceeeeee Confidence 6532 233311 14678888999998888777676555544 23689999999974 5777888888888899999999 Q ss_pred EEEeee-ecceeeee--HHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------------ccccccC Q lcl|NC_010147. 80 EAKIRK-IAKGTSIT--DEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT---------------VNADITK 141 (274) Q Consensus 80 ~~~~~~-~~~~~~vt--d~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~---------------~~~~~~~ 141 (274) +.++.+ ++..|.++ |++.......+-.-..+......+-.+|+..++.+.+.... .....++ T Consensus 77 t~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt 156 (311) T protein:vir:99 77 IYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLD 156 (311) T ss_pred EEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccC Confidence 999866 78999998 54432222222233334445556778898877666321110 0111222 Q ss_pred ----HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccc-ccccccccccceeccccceeccceEEEc---CCCC-- Q lcl|NC_010147. 142 ----LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAIIVRT---NKLE-- 211 (274) Q Consensus 142 ----~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~-~~~~s~~~~~~~~~g~ig~~~G~~Vv~s---~~v~-- 211 (274) ++.|..+...+.+...++|+++|+|.++..|++...+. .+...+.+.+. .++.++.+.|++|+.. +.+. T Consensus 157 ~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~-i~~~V~~lDgv~Ii~V~ps~r~~t~ 235 (311) T protein:vir:99 157 ETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTA-LESRITSIDGVQLIEVYESNRFMTK 235 (311) T ss_pred HHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeecccccccc-cccccceecCeEEEEecCchhhcch Confidence 56677788888887788999999999999887654332 22333334332 4677999999998754 2231 Q ss_pred ----cce----------EEEEeCCeEEEEeecC-cee-eeecchhhcceEEEEEEEEEEEEEc-CccEEEEEecCC Q lcl|NC_010147. 212 ----AGT----------AILAKKGAVKLILKRD-FFL-EVARDASTKTTALYSDKHYVAYLYD-ESKAVKITKGSG 270 (274) Q Consensus 212 ----~~~----------~~~~~~~a~~~~~~~~-~~v-e~~rd~~~~~~~v~~~~~yg~~~~~-~~~~v~~~~~~a 270 (274) +|. .++++++|..-..+.. +.+ +..-+.......+.+|.++.+-+++ ....+.+....| T Consensus 236 ~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 236 YDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred hhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 221 2566777765544432 111 1112223346788888888888874 444566676666 No 173 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.47 E-value=1.2e-15 Score=102.48 Aligned_cols=179 Identities=15% Similarity=0.161 Sum_probs=113.7 Q ss_pred eee-ecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------cc-cccc----CH Q lcl|NC_010147. 83 IRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------------VN-ADIT----KL 142 (274) Q Consensus 83 ~~~-~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~--------------~~-~~~~----~~ 142 (274) ++. +...+.|+|.+..|+..|+++++.+++++++|+.+|+.++..+..+... .. +..+ -+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 544 3567899999999999999999999999999999999998776533111 00 1112 25 Q ss_pred HHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhh-cccccccc-ccccccceecc-ccceeccceEEEcCCCCc--ceE Q lcl|NC_010147. 143 NGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGD-ASTNFTRA-TELGDDIIVKG-AFGEALGAIIVRTNKLEA--GTA 215 (274) Q Consensus 143 d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~-~~~~~~~~-s~~~~~~~~~g-~ig~~~G~~Vv~s~~v~~--~~~ 215 (274) +.|++|..+|.++++ ..||++++|+.|..|++. ..+ +... ...+++.+++| .+++++|++|++|+++|. |+. T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~-~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~ 159 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTN-ILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN 159 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcc-eeeeecccccccccccceeeeecCcEEEEeccCCcccccc Confidence 788899999999886 789999999877777652 211 1111 11233456777 599999999999999996 444 Q ss_pred EEEeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEe--cCCCCCC Q lcl|NC_010147. 216 ILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK--GSGSLEM 274 (274) Q Consensus 216 ~~~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~--~~a~~~~ 274 (274) +....+.+... .-+.+..|-. +++.+ ..+.+|+++..++. ...-+.+ T Consensus 160 ~~~~ag~~~~~---~~~~~~yr~~--fs~~~-------glv~~~~Avgtvkl~~~~~~~~~ 208 (221) T protein:vir:17 160 LVTDPGDATTS---GENNGSYRPA--ITDRA-------GLVFHKEAADTVEVLLPPSRPPL 208 (221) T ss_pred cccCCcccccc---cccccccccc--ccceE-------EEEEcchheeeeeeecCCCCCce Confidence 44444433221 1112222222 11111 34556666655553 2332333 No 174 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.45 E-value=3.3e-14 Score=94.47 Aligned_cols=260 Identities=10% Similarity=0.086 Sum_probs=159.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc----CCccccccCCCcCCcccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY----SGDAQVVAEGEKIPTDILET 76 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~----~~~~~~~~eg~~i~~~~~t~ 76 (274) |||.. . -.+.|.+.+.+++...++.+.+-.........+|++|+||++.- +....+|.++.......++. T Consensus 1 Mantl-~-----ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~ 74 (302) T protein:vir:78 1 MANSL-A-----LAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTL 74 (302) T ss_pred CCchh-H-----HHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceee Confidence 77532 1 14679999999998888777664333344566799999999962 33566788888777788888 Q ss_pred ceeEEEeee-ecceeeee--HHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cccccccCH----H Q lcl|NC_010147. 77 KKREAKIRK-IAKGTSIT--DEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------TVNADITKL----N 143 (274) Q Consensus 77 ~~~~~~~~~-~~~~~~vt--d~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------~~~~~~~~~----d 143 (274) +..+.+..+ ++..|.++ |++........-....+......+-.+|+..++.+-+... ...+...+. + T Consensus 75 ~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~ 154 (302) T protein:vir:78 75 AWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMG 154 (302) T ss_pred eeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHH Confidence 888888765 78888888 4433222222223333335556677899988876643221 111222344 4 Q ss_pred HHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccc-cccccccceeccccceeccceEEEcC--CCC-c------- Q lcl|NC_010147. 144 GLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTR-ATELGDDIIVKGAFGEALGAIIVRTN--KLE-A------- 212 (274) Q Consensus 144 ~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~-~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~-~------- 212 (274) .|..+...|+++ ++++++|.|.++..|++...+.-.. ....+.+ ..++.++.+.|++|+.-+ .+. + T Consensus 155 ~i~~~~~~~~e~--~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~-~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~ 231 (302) T protein:vir:78 155 DIATAMELVDDS--NQLILVTSPTTLAGLLNTALIRESKNTQVLRRG-EVDTKITFIQDVEVLQVPSEYLYDKVAPKVGV 231 (302) T ss_pred HHHHHHHHhhcc--CCeEEEEChHHHHHHhcchhhccceeccccccc-cccceeeeecccEEEEchhhhcccceeccCCc Confidence 555666777765 5899999999999988654333221 2222222 236789999999998743 331 1 Q ss_pred --c------eEEEEeCCeEEEEeecC-ceeeeecchhhc--ceEEEEEEEEEEEEEcCc-cE--EEEEecCC Q lcl|NC_010147. 213 --G------TAILAKKGAVKLILKRD-FFLEVARDASTK--TTALYSDKHYVAYLYDES-KA--VKITKGSG 270 (274) Q Consensus 213 --~------~~~~~~~~a~~~~~~~~-~~ve~~rd~~~~--~~~v~~~~~yg~~~~~~~-~~--v~~~~~~a 270 (274) + ..+++++++..-..+.. +.+ ...+.... ...+.+|.++.+-+++-. .. +.++.+.| T Consensus 232 ~~~~~ak~INfiiv~~~a~ia~~K~~~~~i-f~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 232 PDYTGAKKIPYMIFKRDAPTGIVKTDKVRV-FEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred cccCCccceeEEEECCCeeeeeeeeeeeEe-eCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 0 12456666665444433 222 12223333 458999999988888554 22 44455566 No 175 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=99.44 E-value=2e-14 Score=95.72 Aligned_cols=259 Identities=10% Similarity=-0.009 Sum_probs=154.4 Q ss_pred CCCcc--ceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccce Q lcl|NC_010147. 1 MPQGI--TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) Q Consensus 1 Ma~~~--T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~ 78 (274) +.... .-......|..+...+...+...+.+.+.+.+. +.....+|....-..+.++.||...|.+++++.. T Consensus 237 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~------~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~ 310 (517) T protein:vir:97 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE------NLPTLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) T ss_pred eeeecccccccccccchHHHHHHHHhhhhhccceeeeeec------cccceeeecccccceeeeeecCCcccccccceee Confidence 11000 011223456655555555454443333433221 1223455554443456678999999999999999 Q ss_pred eEEEeeeecceeeeeHHHHhhcCcc----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cc----cccccCHHH Q lcl|NC_010147. 79 REAKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAKL------TV----NADITKLNG 144 (274) Q Consensus 79 ~~~~~~~~~~~~~vtd~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------~~----~~~~~~~d~ 144 (274) .++.+++++..+.+|++...++..| +.+.+.+++++.++++.+..++..-.+... .. .......+. T Consensus 311 ~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~ 390 (517) T protein:vir:97 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) T ss_pred EEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccch Confidence 9999999999999999998887776 788999999999999999999865332111 00 111122345 Q ss_pred HHHHHHHHhhcCC--CceEEEEcHHHHHHHHhh--ccccccccccccccceeccccceeccceEEEcCCCCcceE-EEEe Q lcl|NC_010147. 145 LQSAIDKFNDEDL--EPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA-ILAK 219 (274) Q Consensus 145 i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~--~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~-~~~~ 219 (274) +.++...+..+.. ....|+|||..++.|++. ..-+++- ...+..+...+++|+.-+++ .++.+.. +.+. T Consensus 391 ~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~-----~~~~~~~~~~~l~G~~~~~~-~~~~~~~~~~~~ 464 (517) T protein:vir:97 391 IQELLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVF-----PVGVSNQTIATHFGFNRLVQ-SVAVDEKTAVSL 464 (517) T ss_pred HHHHHHHHHHHhhhccCCEEEECHHHHHHHHHhhcCCCCeec-----cCcCCcccccccCCcccccc-ccccCceeEeec Confidence 5555555554432 456799999999988753 3223322 12233445566777543332 3333433 3333 Q ss_pred CCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec-CCCC Q lcl|NC_010147. 220 KGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG-SGSL 272 (274) Q Consensus 220 ~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~-~a~~ 272 (274) . .+.++-+.+..+..+-+....++.+....+.|..|..|++++..++. .++. T Consensus 465 ~-~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 465 S-GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred c-ccEEEeecceeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 3 33343444433333333344567778888999999999999888752 2233 No 176 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.16 E-value=2.2e-11 Score=79.00 Aligned_cols=262 Identities=11% Similarity=0.040 Sum_probs=157.7 Q ss_pred CCCccceeee--eechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCcccc-ccc Q lcl|NC_010147. 1 MPQGITKTSN--QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDIL-ETK 77 (274) Q Consensus 1 Ma~~~T~~~~--~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~-t~~ 77 (274) |+ .-++++ -+.|.-....+.|.+.+.+-+..... +....|+..+.++...++.+.|...++.++++.. |+. T Consensus 25 m~--alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lp----f~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~ 98 (330) T protein:vir:94 25 MP--TVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMP----FTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFT 98 (330) T ss_pred hh--hhhhhHHhhcCchhhHHHHHHhhhccchHHhhcc----cccccCCcceeeeeecCCcceeeeccccccccCcceee Confidence 55 333443 25677777778887776644443322 2122245677788778899999888888888754 567 Q ss_pred eeEEEeeeecceeeeeHHHHhhcC--ccHHHHHHHHHHHHHHHHHHHHHHHHhhc------------ccccc----cccc Q lcl|NC_010147. 78 KREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHANKVDNDVLEALMG------------AKLTV----NADI 139 (274) Q Consensus 78 ~~~~~~~~~~~~~~vtd~~~~~~~--~d~~~~~~~~~a~~~a~~~d~~~~~~~~~------------a~~~~----~~~~ 139 (274) +.+.....++..+.|+.......+ .|...+..+...+++.++.+..+|..-.+ ..+.. .+++ T Consensus 99 q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~ 178 (330) T protein:vir:94 99 KVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGT 178 (330) T ss_pred eeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCC Confidence 888888888888888866543322 24555556677788999999888874211 11211 3466 Q ss_pred cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccc--cccccccccccceeccccceeccceEEEcCCCCcc---- Q lcl|NC_010147. 140 TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAST--NFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG---- 213 (274) Q Consensus 140 ~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~---- 213 (274) .+.|.+-+++.+.-....++.+++||.....++++-.+- .+..... ..+ +..-.+-+|.|+||+.+|.+|.+ T Consensus 179 ~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~-~~~-~~G~~v~~~~GvPi~~~d~ip~~~~~~ 256 (330) T protein:vir:94 179 LTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEV-MTL-PSGRQIPTYRGVPWFVNDFIPSNMTQG 256 (330) T ss_pred CCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCc-ccc-cCCCEEeeeCCeEEEecccccCCCCcc Confidence 777777666666644455788999866655544432210 0000000 000 12224567999999999999863 Q ss_pred ------eEEEEeCC-------eEEEEee--cCceeeeec-chhhcceEEEEEEEEEEEEEcCccEEEEEe-cCC Q lcl|NC_010147. 214 ------TAILAKKG-------AVKLILK--RDFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVKITK-GSG 270 (274) Q Consensus 214 ------~~~~~~~~-------a~~~~~~--~~~~ve~~r-d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~-~~a 270 (274) ++|++..| -.++-.. .++.|+..- ...+..-..+...+||.++.+|+++.+|+. ... T Consensus 257 ~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 257 TATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred cCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 23555432 1222211 135554322 223333344556899999999999999984 333 No 177 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.05 E-value=3.7e-11 Score=77.81 Aligned_cols=268 Identities=17% Similarity=0.133 Sum_probs=158.9 Q ss_pred CCCc--------cce---eeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccc-ccCCCc Q lcl|NC_010147. 1 MPQG--------ITK---TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGEK 68 (274) Q Consensus 1 Ma~~--------~T~---~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~-~~eg~~ 68 (274) |-+. .+. .++.+.--.|-+..+....+.+++.+++... .+....|.||++.++..++++.. ..||-+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~-piPkn~GkTIk~r~y~pl~~~~~pl~eGv~ 79 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVT-NMPKHYGKTIKVYEYVPLLDDRNINDQGID 79 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhccccc-ccccccCCeEEEEecccccccccchhcCCC Confidence 3221 111 1112222245555666666779999998653 45566799999999988776543 344431 Q ss_pred -----C----------C-------------------ccccccceeEEEeeeecceeeeeHHHHhhcCccHH-HHHHHHHH Q lcl|NC_010147. 69 -----I----------P-------------------TDILETKKREAKIRKIAKGTSITDEALLSGYGDPQ-GEQVRQHG 113 (274) Q Consensus 69 -----i----------~-------------------~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~-~~~~~~~a 113 (274) + + -.+++-.++...++|+++..++||+.......+.+ ..+.+.+. T Consensus 80 a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell 159 (401) T protein:vir:95 80 ASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELM 159 (401) T ss_pred cccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHh Confidence 1 1 11233345666788999999999998777655544 43333322 Q ss_pred H-HH---HHHHHHHHHHHhh-----cc-c-------ccccccccCHHHHHHHHHHHhhcC-------------C------ Q lcl|NC_010147. 114 L-AH---ANKVDNDVLEALM-----GA-K-------LTVNADITKLNGLQSAIDKFNDED-------------L------ 157 (274) Q Consensus 114 ~-~~---a~~~d~~~~~~~~-----~a-~-------~~~~~~~~~~d~i~~A~~~l~~~~-------------~------ 157 (274) . .- -+.+..++++... ++ + .+......+++.+..+...|.++. . T Consensus 160 ~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~ 239 (401) T protein:vir:95 160 NGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIG 239 (401) T ss_pred hhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccc Confidence 2 22 2233445554431 11 1 111122357899998888887521 1 Q ss_pred CceEEEEcHHHHHHH------Hhhccccccccccccc-cceeccccceeccceEEEcCCCC------------------- Q lcl|NC_010147. 158 EPMVLFINPLDAGKL------RGDASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLE------------------- 211 (274) Q Consensus 158 ~~~~~vv~p~~~~~L------~k~~~~~~~~~s~~~~-~~~~~g~ig~~~G~~Vv~s~~v~------------------- 211 (274) .-++.+|||+....| ..++ .|++.-++++ +.+.+|+||.+.++|+|+++.+. T Consensus 240 ~s~va~~h~~L~~di~a~~D~~~~~--~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~ 317 (401) T protein:vir:95 240 ATRVMYVGSELVPELKAMKDLFGNK--AFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTS 317 (401) T ss_pred cceEEEEecCchhHHHHHHHhcCCC--CceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccc Confidence 236789999554444 4444 5788877776 67899999999999999998742 Q ss_pred ---------cceEEEEeCCeEEEEe-ecC-----ceeeeec---------chhhcceEEEEEEEEEEEEEcCccEEEEEe Q lcl|NC_010147. 212 ---------AGTAILAKKGAVKLIL-KRD-----FFLEVAR---------DASTKTTALYSDKHYVAYLYDESKAVKITK 267 (274) Q Consensus 212 ---------~~~~~~~~~~a~~~~~-~~~-----~~ve~~r---------d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~ 267 (274) .|...+++..||+... +.+ .++-+.+ |++.+.=.+.-...|++.+++|+..++|+- T Consensus 318 ~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies 397 (401) T protein:vir:95 318 MVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKT 397 (401) T ss_pred cccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEe Confidence 1233577888887653 111 1222222 233333344445778999999999999875 Q ss_pred cCCCC Q lcl|NC_010147. 268 GSGSL 272 (274) Q Consensus 268 ~~a~~ 272 (274) .+- + T Consensus 398 ~a~-~ 401 (401) T protein:vir:95 398 VAP-L 401 (401) T ss_pred ecC-C Confidence 433 3 No 178 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.03 E-value=1.8e-11 Score=79.45 Aligned_cols=266 Identities=17% Similarity=0.169 Sum_probs=176.6 Q ss_pred Cccc-eeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeEE Q lcl|NC_010147. 3 QGIT-KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREA 81 (274) Q Consensus 3 ~~~T-~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~~ 81 (274) |..| ..-..+..|+|++.|...+.+.+.=-.++..-..+ ..|++++||..+. +....-.|.++...+.+.+++.+. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF--~~G~~L~I~tiGs-~~~~~~~E~~~~~~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDF--GSGETLHIKTIGS-VTLQEAEEDTPLIYNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccC--CCCCEEEecccCc-eeeeccccCCCeeecccccceEEE Confidence 3333 33346788999999988887765433333211112 2489999998753 355666788899999999999999 Q ss_pred Eeeee-cceeeeeHHHHhhc--CccHHHHHHHHHHHHHHHHHHHHHHHHhhc------cccccc-----------ccccC Q lcl|NC_010147. 82 KIRKI-AKGTSITDEALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMG------AKLTVN-----------ADITK 141 (274) Q Consensus 82 ~~~~~-~~~~~vtd~~~~~~--~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~------a~~~~~-----------~~~~~ 141 (274) .+..+ +.+|.++|....++ ..+++++.....++++....+.++++.-.. .+..+. ..... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 88765 77899999877765 456788888888899999999988866432 111111 12346 Q ss_pred HHHHHHHHHHHhhcCC--CceEEEEcHHHHHHHHhhccccccccccccccceecc------ccceeccceEEEcCCCC-- Q lcl|NC_010147. 142 LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG------AFGEALGAIIVRTNKLE-- 211 (274) Q Consensus 142 ~d~i~~A~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g------~ig~~~G~~Vv~s~~v~-- 211 (274) +..|+.....|..++. ++++.++.|.+...|...--+. ...++.+.=++-+| .+-.++|..+++|+.+- T Consensus 158 ~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It-~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 158 LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTIT-HDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheee-cccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 7788999999988775 7899999999999886432221 11233333344444 23458999999999773 Q ss_pred ---c------ceE---EEE--eCCe--EEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 212 ---A------GTA---ILA--KKGA--VKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 212 ---~------~~~---~~~--~~~a--~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) + |.+ |+. +-+- |..+-++-.+.|..|+.++..+.-..++|||..+.+.+-++.+-..+-+- T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSATAY 313 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccccC Confidence 1 111 221 1111 11122333456788888888888889999999999888776655433333 No 179 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.99 E-value=6.2e-10 Score=71.04 Aligned_cols=264 Identities=10% Similarity=0.069 Sum_probs=150.1 Q ss_pred CCCccceee-eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC-----CcCCcccc Q lcl|NC_010147. 1 MPQGITKTS-NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-----EKIPTDIL 74 (274) Q Consensus 1 Ma~~~T~~~-~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg-----~~i~~~~~ 74 (274) |+ ..|-.- ....+......|.|.+.+.+-+..... +-...|+..+..+....+++...+-+ ...+++.. T Consensus 1 mp-altLaea~k~~~d~l~~~ViE~~~~~s~lL~~Lp----F~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~ 75 (310) T protein:vir:97 1 MA-SVTLAESAKLAQDELVAGVIENIITVNRMFDVLP----FDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAA 75 (310) T ss_pred Cc-ccchHHHhhcCcchHHHHHHHHHhccchHHHhCC----cccccCCcceeeEeeccCCcccccccccccCCCcccccc Confidence 76 233221 235666677777777765544333221 11112445666665555554433222 33456777 Q ss_pred ccceeEEEeeeecceeeeeHHHHhhcCccHHHHH---HHHHHHHHHHHHHHHHHHHhh---------c---ccccc---- Q lcl|NC_010147. 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQ---VRQHGLAHANKVDNDVLEALM---------G---AKLTV---- 135 (274) Q Consensus 75 t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~---~~~~a~~~a~~~d~~~~~~~~---------~---a~~~~---- 135 (274) ++.+.+...+-.+..+.|+......-+.++++++ .++..+++.++.+..+|+.-. . .++.. T Consensus 76 t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~ 155 (310) T protein:vir:97 76 TFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGA 155 (310) T ss_pred ccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCC Confidence 7888888888888888888654433334455554 566778889999888887321 1 11222 Q ss_pred cccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccc-ccccccccceeccccceeccceEEEcCCCCcc- Q lcl|NC_010147. 136 NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFT-RATELGDDIIVKGAFGEALGAIIVRTNKLEAG- 213 (274) Q Consensus 136 ~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~-~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~- 213 (274) .+++.++|.+-+++...-....++.++++||+.+.+++.-.+--.. ...... .-...-.+.+|.|+||+.++.+|.+ T Consensus 156 ~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~-~~~~G~~v~~~~GiPi~~~d~ip~~~ 234 (310) T protein:vir:97 156 TGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVV-ELPSGAEVPAYSGTPIFRNDYIPTNQ 234 (310) T ss_pred CCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCcc-ccCCCCEEeeeCCeEEEEeCccCCCc Confidence 2356677777766666655566888999999765444322110000 000000 0112234568999999999999853 Q ss_pred ---------eEEEEeCCe-------EEEEe-e-cCceeeeec-chhhcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 214 ---------TAILAKKGA-------VKLIL-K-RDFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 214 ---------~~~~~~~~a-------~~~~~-~-~~~~ve~~r-d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .+|++..|- .++.. + .++.|+... -..+..-..++..+||.++.+|+++.+|..-.= T Consensus 235 ~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 235 TKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred cccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 235554432 22211 1 235565543 223333345556789999999999999985333 No 180 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=98.98 E-value=9.2e-12 Score=81.08 Aligned_cols=251 Identities=11% Similarity=-0.004 Sum_probs=125.6 Q ss_pred CCC-------------c--------------------------cceeeeeechHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_010147. 1 MPQ-------------G--------------------------ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTL 41 (274) Q Consensus 1 Ma~-------------~--------------------------~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~ 41 (274) |.. . ....+...+|..+...+.....+........ T Consensus 171 ~~~~~~~~~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 244 (480) T protein:vir:40 171 REASIPSEKPEDAERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGL------ 244 (480) T ss_pred hhhhccccchhhhhhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcc------ Confidence 000 0 0000000111111111111111111100000 Q ss_pred ccCCCceEEEEeeccCCccccccCC----CcCCccccccceeEEE---eeeecceeeeeHHHHhhcCccHHHHHHHHHHH Q lcl|NC_010147. 42 QGQPGDTLTFPAFVYSGDAQVVAEG----EKIPTDILETKKREAK---IRKIAKGTSITDEALLSGYGDPQGEQVRQHGL 114 (274) Q Consensus 42 ~~~~g~tv~ip~~~~~~~~~~~~eg----~~i~~~~~t~~~~~~~---~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~ 114 (274) ++..+ ......|.+++ ...++...+ +..+. +++++....++.+...++ .++.+.+.+++++ T Consensus 245 ------~~~~~---g~~~~~~~~e~~~~~~~~~~~~~~--~~~~~~~~v~~l~~~~k~t~~lLDDa-~~l~~~i~~~l~~ 312 (480) T protein:vir:40 245 ------TLAED---GVDDTFISGTFKAGTDKNKSQTAT--KRSLRPQMAEAYLQMDKATVRGVNDS-GALSEYVMSEMVN 312 (480) T ss_pred ------eeeec---cccceeeeeeeecccccccccccc--cchhhHHHHHHHHHhHHHHHHHhhhh-HHHHHHHHHHHHH Confidence 00000 00012233222 222222221 22222 233444445554444344 4799999999999 Q ss_pred HHHHHHHHHHHHHhhcccc----------cccccccCHHHHHHHHHHHhhcCCCce-EEEEcHHHHHHHHhhcccccccc Q lcl|NC_010147. 115 AHANKVDNDVLEALMGAKL----------TVNADITKLNGLQSAIDKFNDEDLEPM-VLFINPLDAGKLRGDASTNFTRA 183 (274) Q Consensus 115 ~~a~~~d~~~~~~~~~a~~----------~~~~~~~~~d~i~~A~~~l~~~~~~~~-~~vv~p~~~~~L~k~~~~~~~~~ 183 (274) .++++.+..++..-.+.+. ......+..+.|.++...|......+. .|+|||..++.|++... ..+ T Consensus 313 ~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD---~~G 389 (480) T protein:vir:40 313 RVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKG---TDG 389 (480) T ss_pred HHHHHHHHHhhccCCCCccccccceeecccccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhc---CCC Confidence 9999999999877322111 111111233444456666665554555 68999999998876421 111 Q ss_pred ccccccceeccccceeccceEEEcC-CCCcceEEE-EeCCeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCcc Q lcl|NC_010147. 184 TELGDDIIVKGAFGEALGAIIVRTN-KLEAGTAIL-AKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) Q Consensus 184 s~~~~~~~~~g~ig~~~G~~Vv~s~-~v~~~~~~~-~~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~ 261 (274) .-..++.+..|...+++|+||++++ .+|++...+ .+..++.++.+ .++...+.+...-...+....++|.++.+|++ T Consensus 390 ~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~ 468 (480) T protein:vir:40 390 HSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDL-NVENYNDFDLRYNVEQWLSETLVGGSIRGKNR 468 (480) T ss_pred CeeccCcccccCcceecccceeeeeccccCCcceeeeCCccEEEEec-ccceecccccccchhhhhhhhhhceeeEcccc Confidence 1122334556778899999988764 566665433 33444555543 34444444555666677778899999999999 Q ss_pred EEEEEecCCCCC Q lcl|NC_010147. 262 AVKITKGSGSLE 273 (274) Q Consensus 262 ~v~~~~~~a~~~ 273 (274) ++.+++.+.=.- T Consensus 469 ~~~~~~~~~~~~ 480 (480) T protein:vir:40 469 SAYLKKKGSLGV 480 (480) T ss_pred EEEEEeccCcCC Confidence 999887543333 No 181 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.93 E-value=2e-10 Score=73.78 Aligned_cols=271 Identities=15% Similarity=0.150 Sum_probs=162.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhc-c--------cccccccccCCCceEEEEeeccCCccccccCCCcC-- Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFAS-F--------AEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI-- 69 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~-~--------~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i-- 69 (274) ||-+.+-..+-.....|+..+.....+++-|.+ + ..+..++....|++|+++....+... .+..++.+ T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~-gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGK-PTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccC-CcccCceeec Confidence 886666666655566899887777766665544 2 22344577778999999988766422 22223332 Q ss_pred CccccccceeEEEeeeecceeeeeH-HHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_010147. 70 PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------- 133 (274) Q Consensus 70 ~~~~~t~~~~~~~~~~~~~~~~vtd-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--------------- 133 (274) -++.+++.+..++|.+.-..+.... .....+-.|+.....+.++.+|++..|..++-.+.++.. T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~ 159 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYA 159 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccccc Confidence 4678999999999998766664433 334456788999999999999999999988877654210 Q ss_pred -------c----------------ccccccCHHHHHHHHHHHhhcC----------------CCceEEEEcHHHHHHHHh Q lcl|NC_010147. 134 -------T----------------VNADITKLNGLQSAIDKFNDED----------------LEPMVLFINPLDAGKLRG 174 (274) Q Consensus 134 -------~----------------~~~~~~~~d~i~~A~~~l~~~~----------------~~~~~~vv~p~~~~~L~k 174 (274) + ..++..+++.|-.|...+...+ .+..++++||..+..|+. T Consensus 160 ~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) T protein:vir:93 160 GNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMRT 239 (364) T ss_pred ccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhhh Confidence 0 0123456777877777654321 134589999999999986 Q ss_pred hcccc---ccc---cccccccceeccccceeccceEEEcCCCCc------------ceEEEEeCCeEEEEeec--Cce-- Q lcl|NC_010147. 175 DASTN---FTR---ATELGDDIIVKGAFGEALGAIIVRTNKLEA------------GTAILAKKGAVKLILKR--DFF-- 232 (274) Q Consensus 175 ~~~~~---~~~---~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~------------~~~~~~~~~a~~~~~~~--~~~-- 232 (274) +..-. +.. .+....+.+..|.+|.|.|+.|+...+++. ..+++.+..|..++... +.+ T Consensus 240 ~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~~ 319 (364) T protein:vir:93 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) T ss_pred cCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCCCce Confidence 44212 222 233344678999999999999998887752 12356666665554322 222 Q ss_pred -eeeecchhhcceEEEEEEEEEEEEE----cCccEEEEEecCCCCC Q lcl|NC_010147. 233 -LEVARDASTKTTALYSDKHYVAYLY----DESKAVKITKGSGSLE 273 (274) Q Consensus 233 -ve~~rd~~~~~~~v~~~~~yg~~~~----~~~~~v~~~~~~a~~~ 273 (274) .|...|-... -.|......|.+-+ ..-+++.|--++..-. T Consensus 320 w~Ee~~D~gn~-~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 320 WEETVKDYGNE-PAIAAGFIAGMKKARFNNKDFGVISIDTAAKKHS 364 (364) T ss_pred eeecccCCCCc-hhhhhhhHhhhhhcccCCccceEEEecccccccC Confidence 2222221111 11222211121111 2334444432222222 No 182 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.92 E-value=6.2e-10 Score=71.06 Aligned_cols=255 Identities=11% Similarity=0.091 Sum_probs=161.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHH-----hhhhccccccccccc-CCCceEEEEeeccCCccccccCCCcCCcccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-----LRFASFAEVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKIPTDIL 74 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~-----~v~~~~~~~~~~~~~-~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~ 74 (274) +|..+| .+| -|.++.+-++..+.+. .-|..++.. .++.- ++. +-..++..|+-..+.||.++....+ T Consensus 359 ~A~~hs-TsD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~-~~~~DFk~~---~~~~lg~~~~L~~V~E~gEyk~~t~ 431 (652) T protein:vir:79 359 AAFTHS-TSD--FGNILLDVANKAILQGWEDAPETYEQWTRK-GQLSDFKIA---HRVGMGGFSALRQVREGAEYKYVTT 431 (652) T ss_pred HHhhcC-cch--HHHHHHHHHHHHHHHHHhhhHHHHHHHhcc-CCCcccccc---ceeecCCCCCccccCCCCccceeee Confidence 443222 344 3666666554444322 122233221 11110 112 2223344566667899999999888 Q ss_pred ccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------------cccc Q lcl|NC_010147. 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT----------------VNAD 138 (274) Q Consensus 75 t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~----------------~~~~ 138 (274) +....++.+..+|+.|.+|.++..-..-+.+..+-+.++++-++.+.+.+++.+.+.+.- ..++ T Consensus 432 ~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~a 511 (652) T protein:vir:79 432 GDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESA 511 (652) T ss_pred cCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccc Confidence 888889999999999999999988877889999999999999999999988887654211 1234 Q ss_pred ccCHHHHHHHHHHHhhc-------CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccc-eEEEcCCC Q lcl|NC_010147. 139 ITKLNGLQSAIDKFNDE-------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA-IIVRTNKL 210 (274) Q Consensus 139 ~~~~d~i~~A~~~l~~~-------~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~-~Vv~s~~v 210 (274) .++.+.+-.|+..|... +..+++++|+|+.....++......+..++. -+|.+.-+.|+ +||+++.+ T Consensus 512 a~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~-----~~~~~Np~~~~~~~i~eprL 586 (652) T protein:vir:79 512 AMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADI-----NAGIINPVKDFATVIAEPRL 586 (652) T ss_pred cCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCccccc-----cccccccccccccccccccc Confidence 46777788877666432 2367899999988765544321122222221 12334445664 88999999 Q ss_pred Ccce--E-EEEeCC---eEEEE--eec-CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEe Q lcl|NC_010147. 211 EAGT--A-ILAKKG---AVKLI--LKR-DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK 267 (274) Q Consensus 211 ~~~~--~-~~~~~~---a~~~~--~~~-~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~ 267 (274) +... . |++.+. .+.++ .+. ...+|+........-.+++|+-||++++|=.+++|.|. T Consensus 587 ~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 587 DDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCCCcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 6533 3 445433 34443 332 23466554444455677888889999999999999887 No 183 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.83 E-value=2e-09 Score=68.28 Aligned_cols=256 Identities=14% Similarity=0.124 Sum_probs=160.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHH----H-hhhhccccccccccc-CCCceEEEEeeccCCccccccCCCcCCcccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEK----K-LRFASFAEVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKIPTDIL 74 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~----~-~v~~~~~~~~~~~~~-~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~ 74 (274) ||..+ ..+|. |.++.+-.+..+.+ . .-|..++.. .++.- ++...+ .++..++-..+.||.++....+ T Consensus 394 ~a~~h-tTSDF--p~IL~~~~nk~l~~~y~~a~~t~~~~~~~-~~~~DFk~~~~~---~lg~~~~L~~V~E~gEyk~~t~ 466 (693) T protein:vir:95 394 LAFTH-TSSDF--GLILLDVANKSVLAGWEEAEETFPLWTKS-GILTDFKPARRV---GLGEFSSLRQVREGAEYKYVTL 466 (693) T ss_pred HHHhc-Ccchh--HHHHHHHHHHHHHHHHHhhhhHHHHHhcc-CCCCccccccee---ecCCCCChhhcCCCCceeeeec Confidence 44322 23342 66665554443332 1 122222211 01100 112222 2344455667899999998888 Q ss_pred ccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------------ccccc Q lcl|NC_010147. 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL----------------TVNAD 138 (274) Q Consensus 75 t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~----------------~~~~~ 138 (274) .....++.+..+++.|.+|.++..-..-+.+..+-++++++-++.+++.+++.+.+.+. +..+. T Consensus 467 ~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~s 546 (693) T protein:vir:95 467 GERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAAS 546 (693) T ss_pred CCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccccccccccc Confidence 88888899999999999999999888888999999999999999999999988865421 11234 Q ss_pred ccCHHHHHHHHHHHhhc------------CCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccc-eEE Q lcl|NC_010147. 139 ITKLNGLQSAIDKFNDE------------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA-IIV 205 (274) Q Consensus 139 ~~~~d~i~~A~~~l~~~------------~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~-~Vv 205 (274) .++.+.+-.++..|... +..+++++|+|+.....++......++..+.. +|.+.-+.|+ +|| T Consensus 547 als~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~-----~~~~NP~~~~~~vi 621 (693) T protein:vir:95 547 ALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVN-----SGIVNPIRAFAQVI 621 (693) T ss_pred ccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccc-----cccccchhcccccc Confidence 56788888877666331 13678999999888766553222223322211 2333446664 788 Q ss_pred EcCCCCc--ceE-EEEeC-C--eEEEEe--ec-CceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 206 RTNKLEA--GTA-ILAKK-G--AVKLIL--KR-DFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 206 ~s~~v~~--~~~-~~~~~-~--a~~~~~--~~-~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) +++.++. ++. |++.. + .+.++. +. ...+|+...-....-.+++|+-||++++|=.+++|=..+ T Consensus 622 ~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 622 GEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred ccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 8999863 444 44433 2 344443 32 234555555455556788888899999999998874444 No 184 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.51 E-value=2.8e-07 Score=56.47 Aligned_cols=263 Identities=12% Similarity=-0.006 Sum_probs=149.3 Q ss_pred CCCccceee---eeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec--cCCcccc--ccCCCcCCccc Q lcl|NC_010147. 1 MPQGITKTS---NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV--YSGDAQV--VAEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~~---~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~--~~~~~~~--~~eg~~i~~~~ 73 (274) ||.+..... ..-.-|.+++.|..- .+.-+...++.++..-+=++..|. .+.++.. -.||.+.+... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~i-------sp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNI-------APYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeeeeechhhhheec-------CCccCcceeeecCceecccEEEEEeeecCCccccccccCccccccc Confidence 888654332 222333344443221 111111112222211111344453 3333332 34777766555 Q ss_pred cccceeEEEeeee-cceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc-----ccc----------- Q lcl|NC_010147. 74 LETKKREAKIRKI-AKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG-----AKL----------- 133 (274) Q Consensus 74 ~t~~~~~~~~~~~-~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-----a~~----------- 133 (274) .......-..-++ .+.++||.-+... +..|.++.-..+-...+.|.++..++..-+. ++. T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i 153 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYY 153 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHh Confidence 5444444444443 5667777655443 2356666666666777889999988865321 000 Q ss_pred ----------------------cccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcc--ccccc--ccccc Q lcl|NC_010147. 134 ----------------------TVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAS--TNFTR--ATELG 187 (274) Q Consensus 134 ----------------------~~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~--~~~~~--~s~~~ 187 (274) .....+++-+.|.++.+++=+++.....++|+|.....|-+.-. ...+. ..+.- T Consensus 154 ~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~ 233 (317) T protein:vir:88 154 KTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNR 233 (317) T ss_pred ccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeE Confidence 00112367888999999999999998999999988876643210 01111 11100 Q ss_pred ccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh-cceEEEEEEEEEEEEEcCccEEEEE Q lcl|NC_010147. 188 DDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST-KTTALYSDKHYVAYLYDESKAVKIT 266 (274) Q Consensus 188 ~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~-~~~~v~~~~~yg~~~~~~~~~v~~~ 266 (274) -+...+-....|.=++++.+.++|.++.++++++.+.+..-+++..| ..++. .++......-|+.++.||.+.++|+ T Consensus 234 ~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e--~laKtGd~~k~~i~~E~tLe~~N~~a~a~i~ 311 (317) T protein:vir:88 234 IAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQH--ELAKTGDSEKRQLLVEYTFRVNNEKSGALIR 311 (317) T ss_pred EEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceee--ccCCCcccceeEEEEEEEEEEcCccceeEEE Confidence 01111112222333699999999999999999999988654554433 22222 3344555566999999999999999 Q ss_pred ecCCCC Q lcl|NC_010147. 267 KGSGSL 272 (274) Q Consensus 267 ~~~a~~ 272 (274) --.++. T Consensus 312 ~l~~~~ 317 (317) T protein:vir:88 312 DVVAQL 317 (317) T ss_pred EecccC Confidence 988888 No 185 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.43 E-value=9.3e-08 Score=59.14 Aligned_cols=271 Identities=14% Similarity=0.074 Sum_probs=152.0 Q ss_pred CCCccceee--eeechHHHHHHHHHHHHHHhhh----hc---------------------ccccccccccCCCceEEEEe Q lcl|NC_010147. 1 MPQGITKTS--NQIIPEVLAPMMQAQLEKKLRF----AS---------------------FAEVDSTLQGQPGDTLTFPA 53 (274) Q Consensus 1 Ma~~~T~~~--~~~~Pev~~~~v~~~~~~~~v~----~~---------------------~~~~~~~~~~~~g~tv~ip~ 53 (274) |.-..|... +..-...|+..+-....+++-| .+ ...+..+|+...|++|+++. T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L 80 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFHF 80 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEeE Confidence 665555443 3455678988876555443222 11 02234457677899999999 Q ss_pred eccCCccccccCCCc--CCccccccceeEEEeeeecceeeeeHH-HHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010147. 54 FVYSGDAQVVAEGEK--IPTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG 130 (274) Q Consensus 54 ~~~~~~~~~~~eg~~--i~~~~~t~~~~~~~~~~~~~~~~vtd~-~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~ 130 (274) ...+....... ++. =-.+.+++.+..++|++.-..+..-.- ....+-.|+.....+.++.+|++..|..+|-.+.+ T Consensus 81 ~~~L~g~gv~G-d~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laG 159 (430) T protein:vir:10 81 VQPANAFPIMG-SEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAG 159 (430) T ss_pred eeccccCceec-CceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 87764333222 222 246788999999999987666655433 33446788999999999999999999887766543 Q ss_pred cc----------------------------cc------------------------ccccccCHHHHHHHHHHHhhcC-- Q lcl|NC_010147. 131 AK----------------------------LT------------------------VNADITKLNGLQSAIDKFNDED-- 156 (274) Q Consensus 131 a~----------------------------~~------------------------~~~~~~~~d~i~~A~~~l~~~~-- 156 (274) +. ++ ..++..+++.|-+|...+.... T Consensus 160 arg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~ 239 (430) T protein:vir:10 160 ARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELP 239 (430) T ss_pred hhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCC Confidence 20 00 0112235566666766665421 Q ss_pred --------CC------ceEEEEcHHHHHHHHhhccccccc------cccccccceeccccceeccceEEEcCCC------ Q lcl|NC_010147. 157 --------LE------PMVLFINPLDAGKLRGDASTNFTR------ATELGDDIIVKGAFGEALGAIIVRTNKL------ 210 (274) Q Consensus 157 --------~~------~~~~vv~p~~~~~L~k~~~~~~~~------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v------ 210 (274) .+ ..++++||.++..|+.+....... +.....+.+.+|..|+|.|+.|+.-.++ T Consensus 240 i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf~~g 319 (430) T protein:vir:10 240 PPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRFYAG 319 (430) T ss_pred CcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeeecCC Confidence 12 378999999999999887532111 1222347889999999999998875322 Q ss_pred -------------------C--------cceEEEEeCCeEEEEeec----Cce---eeeecchhh----cceEEEEE--E Q lcl|NC_010147. 211 -------------------E--------AGTAILAKKGAVKLILKR----DFF---LEVARDAST----KTTALYSD--K 250 (274) Q Consensus 211 -------------------~--------~~~~~~~~~~a~~~~~~~----~~~---ve~~rd~~~----~~~~v~~~--~ 250 (274) | ...+.+.+..|...+... +.+ .|...|-.. ....|.+. . T Consensus 320 ~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~G~kK~ 399 (430) T protein:vir:10 320 DTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAILGCSKI 399 (430) T ss_pred CccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHhcccee Confidence 0 011235555555554332 111 233222211 11222211 2 Q ss_pred EEEEE-----EEcCccEEEEEecCCCCCC Q lcl|NC_010147. 251 HYVAY-----LYDESKAVKITKGSGSLEM 274 (274) Q Consensus 251 ~yg~~-----~~~~~~~v~~~~~~a~~~~ 274 (274) ||..+ -+..-++ |....|.+-+ T Consensus 400 rF~~~~~~~~~~~DfGv--i~idtaa~~~ 426 (430) T protein:vir:10 400 RFAVEATNGLEYTDHGV--MAIDTAVKII 426 (430) T ss_pred eecCCCCCCceeeeeEE--EEhhhhhhhh Confidence 22111 1122222 2233333333 No 186 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.39 E-value=2.6e-07 Score=56.70 Aligned_cols=261 Identities=10% Similarity=0.033 Sum_probs=149.1 Q ss_pred CCCccceeeeeechHHHH---HHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC-CcCCcccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLA---PMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDILET 76 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~---~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg-~~i~~~~~t~ 76 (274) |.+.--..+-.|.-+.|. +.+.+...+.++.+.+..+...+.. .-.+++.+.+...|.+++++.+ +++|..+... T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPG-YAKYFEYPVFDGVGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCC-ceeEEEeeeeeccCceeEeCCCccccceeeccc Confidence 444322222234443333 2344444444555555444333222 2357788888878889998865 4588999999 Q ss_pred ceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc--------cc---c-ccccccc- Q lcl|NC_010147. 77 KKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG--------AK---L-TVNADIT- 140 (274) Q Consensus 77 ~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~--------a~---~-~~~~~~~- 140 (274) ......++.++..|.++..+.+. .+.++-..-...+++.+++..|+-++-.-.. .+ . ....++. T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccC Confidence 99999999988888777655443 4677888888889999999999866533221 11 1 1111222 Q ss_pred ---CHHHHHHHHHHHhhc--C-CCceEEEEcHHHHHHHHhhccccccccccccc-cce-eccccceeccceEEEcCCCC- Q lcl|NC_010147. 141 ---KLNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRGDASTNFTRATELGD-DII-VKGAFGEALGAIIVRTNKLE- 211 (274) Q Consensus 141 ---~~d~i~~A~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~-~~~-~~g~ig~~~G~~Vv~s~~v~- 211 (274) -+++|..+...+-.. + ..+..++++|..+..|.+-. ..++... ..+ .+....++.+.|.+.+.+-. T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~-----~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~~~g 234 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLV-----PGTSVSYGEFFRQNNSGVTVEFVQYLNDYNGTG 234 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhcc-----CCCCccHHHHHHHhcCCceEEEeeeeccCCCCc Confidence 267777887766432 3 36678999999999885321 1111100 111 12222233333333332221 Q ss_pred cceEEEE--eCCeEEEEeecCceeeeecchhhcceEEEEEEEE-EEEEEcCccEEEE---Eec Q lcl|NC_010147. 212 AGTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKHY-VAYLYDESKAVKI---TKG 268 (274) Q Consensus 212 ~~~~~~~--~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~y-g~~~~~~~~~v~~---~~~ 268 (274) +...+++ ++..+.+....+.++.. -........+....++ |+-+.+|.+++++ |++ T Consensus 235 ~~~~v~~~~~~~~~~~~v~~~~~~~~-~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 235 TSAAIAYEKDPNNMAIEIPEATNALP-AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred ceEEEEEEcCCceEEEEcCcceeeec-ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 2223443 35566666555544432 1223344566666665 6999999999999 555 No 187 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.26 E-value=4.3e-07 Score=55.48 Aligned_cols=223 Identities=13% Similarity=0.049 Sum_probs=128.3 Q ss_pred CCCccceeeee--e------------ch--HHHHHHHHHHHHHHhhhhcc--------cccccccccCCCceEEEEeecc Q lcl|NC_010147. 1 MPQGITKTSNQ--I------------IP--EVLAPMMQAQLEKKLRFASF--------AEVDSTLQGQPGDTLTFPAFVY 56 (274) Q Consensus 1 Ma~~~T~~~~~--~------------~P--ev~~~~v~~~~~~~~v~~~~--------~~~~~~~~~~~g~tv~ip~~~~ 56 (274) |++ +..++. . .| .+|+..+.....+..-+..+ ..+..+|+...|++|+++.... T Consensus 1 mt~--~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~ 78 (318) T protein:vir:27 1 MTT--VTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) T ss_pred CCc--cCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeec Confidence 443 222220 0 11 24666544333333222111 2334567777899999998876 Q ss_pred CCccccccCCCc--CCccccccceeEEEeeeecceeeeeHH-HHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_010147. 57 SGDAQVVAEGEK--IPTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) Q Consensus 57 ~~~~~~~~eg~~--i~~~~~t~~~~~~~~~~~~~~~~vtd~-~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~ 133 (274) +......+ ++. --++.+++.+..++|.+.-..+..-.. ....+-.|+.....+.++.+|++..|.-+|-.+.++.. T Consensus 79 L~g~gv~G-d~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg 157 (318) T protein:vir:27 79 LSKRPTMG-DERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (318) T ss_pred cccCcccc-CceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 64333222 222 246788999999999887665543332 33346688999999999999999999988776644321 Q ss_pred -------------------------cc-------------------cccccCHHHHHHHHHHHhh--c--------CCC- Q lcl|NC_010147. 134 -------------------------TV-------------------NADITKLNGLQSAIDKFND--E--------DLE- 158 (274) Q Consensus 134 -------------------------~~-------------------~~~~~~~d~i~~A~~~l~~--~--------~~~- 158 (274) .+ .++..+++.|-++...+.. . +.+ T Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~ 237 (318) T protein:vir:27 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (318) T ss_pred ccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccc Confidence 00 0122234445455555532 1 111 Q ss_pred -----ceEEEEcHHHHHHHHhhccc-cccc-------cccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEE Q lcl|NC_010147. 159 -----PMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) Q Consensus 159 -----~~~~vv~p~~~~~L~k~~~~-~~~~-------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~ 225 (274) ..++++||.++..|+++... +|.. .+.+..+.+..|..|.|.|+-+..-.++|--- +.-+.+.+ T Consensus 238 ~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf---~~G~~v~~ 314 (318) T protein:vir:27 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF---YQGQRFWY 314 (318) T ss_pred cCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEE---cCCCeeee Confidence 37899999999999987421 1222 12234567899999999999998888887210 01111211 Q ss_pred EeecCc Q lcl|NC_010147. 226 ILKRDF 231 (274) Q Consensus 226 ~~~~~~ 231 (274) +. -+ T Consensus 315 ~~--~~ 318 (318) T protein:vir:27 315 QR--IT 318 (318) T ss_pred ee--cC Confidence 11 00 No 188 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.24 E-value=1.8e-06 Score=52.07 Aligned_cols=259 Identities=13% Similarity=0.096 Sum_probs=144.2 Q ss_pred CCCccceeeeeechHHH---HHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCC-cCCcccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVL---APMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGE-KIPTDILET 76 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~---~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~-~i~~~~~t~ 76 (274) |-+ ...-.|.-+.| .+.+.+.+.+.++.+.+..+...+. -...++..+.....|.+++++++. +++..+... T Consensus 1 ~~~---~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~-~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~ 76 (301) T protein:vir:80 1 MQG---KITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVN-EGAESYSFDVMTRSGAAKIIANGADDLPLVDVDM 76 (301) T ss_pred CCc---cccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCC-CceEEEEEeeeccceeEEEecCcccccccccccc Confidence 332 22223444433 3345666666666666543332221 123566777777778888887755 589999999 Q ss_pred ceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc--------ccc-----c------ Q lcl|NC_010147. 77 KKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG--------AKL-----T------ 134 (274) Q Consensus 77 ~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~--------a~~-----~------ 134 (274) ......+..++..|.++..+.+. .+.++...-...+++.+++..|+.+|-.... .+. + T Consensus 77 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~ 156 (301) T protein:vir:80 77 VRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVG 156 (301) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccc Confidence 99999999888777776554443 4778888888999999999999876644221 000 0 Q ss_pred cccccc--C----HHHHHHHHHHHhhc--C-CCceEEEEcHHHHHHHHhhcccccccccccc-ccce-eccccceeccce Q lcl|NC_010147. 135 VNADIT--K----LNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRGDASTNFTRATELG-DDII-VKGAFGEALGAI 203 (274) Q Consensus 135 ~~~~~~--~----~d~i~~A~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~~~-~~g~ig~~~G~~ 203 (274) ..+++. + +++|..+..++-.. + ..+..++++|..|..|..-.. ....+.. -+.+ .+....++.+.| T Consensus 157 ~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~---~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 157 NVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRY---SNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred cccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccc---cCCCCeeHHHHHHHHcCcceEEEcc Confidence 011111 2 56777788777542 2 366789999999998853110 0111100 0111 122223344444 Q ss_pred EEEcCCCC-cceEEEEe--CCeEEEEeecCceeeeecchhhc-ceEEEEEEE-EEEEEEcCccEEEEEec Q lcl|NC_010147. 204 IVRTNKLE-AGTAILAK--KGAVKLILKRDFFLEVARDASTK-TTALYSDKH-YVAYLYDESKAVKITKG 268 (274) Q Consensus 204 Vv~s~~v~-~~~~~~~~--~~a~~~~~~~~~~ve~~rd~~~~-~~~v~~~~~-yg~~~~~~~~~v~~~~~ 268 (274) .+.+.... ++..+++. +..+.+....+.+.. ..+.+. ...+....+ .|+-+.+|.+++++..= T Consensus 234 ~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~--~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 234 DLAGMGTAGSDSFAVIHDSNETAELIIPMDITRH--PEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeccCCCCcccEEEEEecCCcEEEEEecCceeee--cceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 44333221 22234443 334444444443321 112222 223333334 47889999999998865 No 189 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.19 E-value=7.9e-07 Score=54.02 Aligned_cols=261 Identities=13% Similarity=0.060 Sum_probs=142.7 Q ss_pred CCCccceee----eeechHHHH---HHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC-CcCCcc Q lcl|NC_010147. 1 MPQGITKTS----NQIIPEVLA---PMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTD 72 (274) Q Consensus 1 Ma~~~T~~~----~~~~Pev~~---~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg-~~i~~~ 72 (274) |.+.+..-+ -.|.-+.|. +.+.+...+.++.+.+..+...+. -.-.+++.+.+...|.+++++++ ++++.. T Consensus 19 ~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~-~~~~~~~~~~~~~~G~a~~~~d~~~dip~v 97 (319) T protein:vir:10 19 IQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELS-PTDKTFEYMTFDKVGTAQIIADYTDDLPLV 97 (319) T ss_pred hhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCC-CceEEEEeeeeccccceeeecCccccccce Confidence 222211111 135454443 223333344444444443332221 12356788888888899999775 458999 Q ss_pred ccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc--------ccc---cccc- Q lcl|NC_010147. 73 ILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG--------AKL---TVNA- 137 (274) Q Consensus 73 ~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~--------a~~---~~~~- 137 (274) +.........+..++..|.++..+... .+.++...-...+++.+++..|+-++-.-.. .+. ...+ T Consensus 98 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~ 177 (319) T protein:vir:10 98 DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGK 177 (319) T ss_pred eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCC Confidence 999999999999888888777654443 4677888888889999999999866533211 110 0011 Q ss_pred ----cccC----HHHHHHHHHHHhhc--C-CCceEEEEcHHHHHHHHhhcccccccccccc-ccce-eccccceeccceE Q lcl|NC_010147. 138 ----DITK----LNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRGDASTNFTRATELG-DDII-VKGAFGEALGAII 204 (274) Q Consensus 138 ----~~~~----~d~i~~A~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~~~-~~g~ig~~~G~~V 204 (274) +.-+ +++|..+..++-.. + ..+..++++|+.+..|..-. ..++.. -..+ .++...++.+.|. T Consensus 178 ~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~-----~~~~~t~l~~lk~~~~~l~I~~~pe 252 (319) T protein:vir:10 178 WIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRM-----PETTMSYLDYFKSQNSGIEIDSIAE 252 (319) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhccc-----CCCCeeHHHHHHHhcCCceEEEeee Confidence 1112 45566676666432 2 36779999999999884211 111110 0111 2222233444444 Q ss_pred EEcCCCC-cceEEEE--eCCeEEEEeecCceeeeecchhhcceEEEEEEE-EEEEEEcCccEEEEEec Q lcl|NC_010147. 205 VRTNKLE-AGTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKH-YVAYLYDESKAVKITKG 268 (274) Q Consensus 205 v~s~~v~-~~~~~~~--~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~-yg~~~~~~~~~v~~~~~ 268 (274) +..-+-. +...+++ ++..+.+....+.++..- ........+....+ .|+-+..|.++++++.= T Consensus 253 l~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 253 LEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ecccCCCcceEEEEEecCCceEEEecCcceeeeee-eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 4432221 1122333 455566655455443321 12223344444444 46888999999998865 No 190 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=98.14 E-value=3.6e-06 Score=50.40 Aligned_cols=263 Identities=13% Similarity=0.093 Sum_probs=133.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccc--cccC-CCcCCccccccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ--VVAE-GEKIPTDILETK 77 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~--~~~e-g~~i~~~~~t~~ 77 (274) +.-... -.-.+.|+++...+ +.+.+...|.+.+.+... .-.+.+|+++.. |.-- -..| |+.-.....+.. T Consensus 23 it~~~l-~~g~L~p~~a~~Fl-~~v~~~t~iL~~~r~~~~----~s~~~ei~kig~-G~r~~r~~~e~~~~~~~~~~~~~ 95 (360) T protein:vir:99 23 IGLAEL-DGFQLPVDVTEEFL-ERMQKGVQILGMADTMTL----ARLEMEVPQFGV-PRLSGHTRDEEGSRTENSEAESG 95 (360) T ss_pred cccccc-CceeecHHHHHHHH-HHHhhccchhhhcceeec----cccccccccccc-ceeeccccccCCCCCcCCcCccc Confidence 221111 13457788776665 446666666666644321 122334444422 1100 0011 111111122222 Q ss_pred eeEE-EeeeecceeeeeHHHHhhcCc----cHHHHHHHHHHHHHHHHHHHHHHHH------------------------- Q lcl|NC_010147. 78 KREA-KIRKIAKGTSITDEALLSGYG----DPQGEQVRQHGLAHANKVDNDVLEA------------------------- 127 (274) Q Consensus 78 ~~~~-~~~~~~~~~~vtd~~~~~~~~----d~~~~~~~~~a~~~a~~~d~~~~~~------------------------- 127 (274) .+.. ..++.-..+.+..+...+... .+.+.+.++++..+++.+....+.. T Consensus 96 ~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlK 175 (360) T protein:vir:99 96 SVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIA 175 (360) T ss_pred cCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHH Confidence 2222 122322334555554444322 3345666666666666554332221 Q ss_pred -hhccccccc------------------------c--------cccCHHHHHHHHHHHhhcCCC----ceEEEEcHHHHH Q lcl|NC_010147. 128 -LMGAKLTVN------------------------A--------DITKLNGLQSAIDKFNDEDLE----PMVLFINPLDAG 170 (274) Q Consensus 128 -~~~a~~~~~------------------------~--------~~~~~d~i~~A~~~l~~~~~~----~~~~vv~p~~~~ 170 (274) +.+....+. + .+.+-+-|.+++..|-...-. .-++++||..+. T Consensus 176 ka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~ 255 (360) T protein:vir:99 176 RAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQ 255 (360) T ss_pred HhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHH Confidence 111110000 0 001223356777777665321 227899998776 Q ss_pred HHHhhccccccccccccccceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecCceeeeecchhh----cceEE Q lcl|NC_010147. 171 KLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST----KTTAL 246 (274) Q Consensus 171 ~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~ve~~rd~~~----~~~~v 246 (274) ..++... -+.+..|+.++..+..-++.|+||+..+.+|++.+++.++..+.++....++++...++.+ ..... T Consensus 256 ~yr~~L~---~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~ 332 (360) T protein:vir:99 256 SYTMSLT---EREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSR 332 (360) T ss_pred HHHHHHh---ccCcccchhheecccccccceeeeEEcCCCCCCceEEeccCceeEEeeeeeEEeecccchhhhhhceeee Confidence 5554321 2344567766666655578999999999999999999999999999888887754333322 11122 Q ss_pred EE-EEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 247 YS-DKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 247 ~~-~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) +. +..+-+.+-+++++|.++--.=++- T Consensus 333 ~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 333 NWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EEEEEEeeEEEEecccEEEEecCCCCCC Confidence 22 3346777778889999884222222 No 191 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=98.13 E-value=2.4e-06 Score=51.43 Aligned_cols=253 Identities=14% Similarity=0.055 Sum_probs=138.7 Q ss_pred CccceeeeeechHHHHHHHHHHHHHH-hhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeEE Q lcl|NC_010147. 3 QGITKTSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREA 81 (274) Q Consensus 3 ~~~T~~~~~~~Pev~~~~v~~~~~~~-~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~~ 81 (274) |.+|.-.=..+-+-+.+.+.+.+... .-+..++.+.. ......+....+..+...++ ..+.....++....++ T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~----sdf~~~~~~~lg~~p~l~e~--~Ge~~~~~l~~~~~~i 74 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVP----SNTSSNDYKWLSTFPKMRRW--IGAKVVKNLKAYKYVV 74 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecC----CCcceeeceecCCCCCcccc--ccceeeccccccceeE Confidence 33332110001111222233333222 12333332211 12222333333333443333 3667888899888999 Q ss_pred EeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------------- Q lcl|NC_010147. 82 KIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------------------------- 135 (274) Q Consensus 82 ~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~-------------------------- 135 (274) +.++++..+.++.++.....-.....+.++++++.++..|+.+++.+.+..... T Consensus 75 ~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~ 154 (302) T protein:vir:10 75 ENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAP 154 (302) T ss_pred EeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchh Confidence 999999999999999999888899999999999999999999999887532110 Q ss_pred ---cccccCHHHHHHHHHHHh----hcC----CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceE Q lcl|NC_010147. 136 ---NADITKLNGLQSAIDKFN----DED----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAII 204 (274) Q Consensus 136 ---~~~~~~~d~i~~A~~~l~----~~~----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~V 204 (274) ....++.+.+-.++..|. +.+ ..++.++|+|......++.-..... .+...+..+ .-+.+ T Consensus 155 ~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~--~~g~~Np~~-------g~~~~ 225 (302) T protein:vir:10 155 LSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKL--ADNTPNPYV-------GTAEL 225 (302) T ss_pred hhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcccc--CCCCcceec-------cceEE Confidence 001233444555555442 222 3678999999877655442111111 111122211 12689 Q ss_pred EEcCCCCcceE-EEEe-CCeEEEE---eecCceeeeecchhhcceEEEEEEEEEE------EEEcCccEEEEEecCCC Q lcl|NC_010147. 205 VRTNKLEAGTA-ILAK-KGAVKLI---LKRDFFLEVARDASTKTTALYSDKHYVA------YLYDESKAVKITKGSGS 271 (274) Q Consensus 205 v~s~~v~~~~~-~~~~-~~a~~~~---~~~~~~ve~~rd~~~~~~~v~~~~~yg~------~~~~~~~~v~~~~~~a~ 271 (274) |+++.+..++. |++. +..+... .+++..+|..-+.....-..+.++.||+ ....+..+.. .++.|| T Consensus 226 vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~-s~g~~~ 302 (302) T protein:vir:10 226 VVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYG-STGTGA 302 (302) T ss_pred EEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhc-cCccCC Confidence 99999987776 4553 3333322 1334556665555555556666666665 3334433334 444444 No 192 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=97.96 E-value=4e-06 Score=50.21 Aligned_cols=271 Identities=13% Similarity=0.035 Sum_probs=139.9 Q ss_pred CCCccceee------eee------ch--HHHHHHHHHHHHHHhhh----h----cccccccccccCCCceEEEEeeccCC Q lcl|NC_010147. 1 MPQGITKTS------NQI------IP--EVLAPMMQAQLEKKLRF----A----SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) Q Consensus 1 Ma~~~T~~~------~~~------~P--ev~~~~v~~~~~~~~v~----~----~~~~~~~~~~~~~g~tv~ip~~~~~~ 58 (274) |..-..--+ .+| .| ..|+..+.......+-+ . ....+..++....|++|+++....+. T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 221110000 000 12 12333222211111111 1 11233456777789999999887663 Q ss_pred ccccccCCCcC--CccccccceeEEEeeeecceeeeeH-HHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Q lcl|NC_010147. 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) Q Consensus 59 ~~~~~~eg~~i--~~~~~t~~~~~~~~~~~~~~~~vtd-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-- 133 (274) .. .+..++.+ -++.+++.+..++|.+.-..+.... .....+-.|+.......++++|++..|+.+|-.+.++.. T Consensus 81 g~-gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:32 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred cC-CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 23323333 4678999999999998766654443 334456788999999999999999999988866654321 Q ss_pred -----------------------cc-------------------cccccCHHHHHHHHHHHhh--c--------CC---- Q lcl|NC_010147. 134 -----------------------TV-------------------NADITKLNGLQSAIDKFND--E--------DL---- 157 (274) Q Consensus 134 -----------------------~~-------------------~~~~~~~d~i~~A~~~l~~--~--------~~---- 157 (274) .+ .++.++++.|-++...+.. . +. T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:32 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 00 0122334445555555532 1 11 Q ss_pred --CceEEEEcHHHHHHHHhhccc-cccc-------cccccccceeccccceeccceEEEcCCCCc--------------- Q lcl|NC_010147. 158 --EPMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA--------------- 212 (274) Q Consensus 158 --~~~~~vv~p~~~~~L~k~~~~-~~~~-------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~--------------- 212 (274) ...++++||.++..|+++... +|.. .+.+.++.+..|..|.|.|+.|..-.++|- T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:32 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 247899999999999998531 1222 222445788999999999998887665530 Q ss_pred -------------ceEEEEeCCeEEEEeecC--ce---eeeecchhhcceEEEEEEEEEEEEEc-C------ccEEEEEe Q lcl|NC_010147. 213 -------------GTAILAKKGAVKLILKRD--FF---LEVARDASTKTTALYSDKHYVAYLYD-E------SKAVKITK 267 (274) Q Consensus 213 -------------~~~~~~~~~a~~~~~~~~--~~---ve~~rd~~~~~~~v~~~~~yg~~~~~-~------~~~v~~~~ 267 (274) .-+++.+..|..++.+++ .+ .|...|-... -.|......|.+-++ + .-.-+|.. T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~i 398 (404) T protein:vir:32 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEe Confidence 122667777765553321 11 2322221110 111111111111111 0 11222222 Q ss_pred cCCCCC Q lcl|NC_010147. 268 GSGSLE 273 (274) Q Consensus 268 ~~a~~~ 273 (274) ..|.+= T Consensus 399 dta~~~ 404 (404) T protein:vir:32 399 DTAVKL 404 (404) T ss_pred cccccC Confidence 222222 No 193 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=97.96 E-value=4e-06 Score=50.21 Aligned_cols=271 Identities=13% Similarity=0.035 Sum_probs=139.9 Q ss_pred CCCccceee------eee------ch--HHHHHHHHHHHHHHhhh----h----cccccccccccCCCceEEEEeeccCC Q lcl|NC_010147. 1 MPQGITKTS------NQI------IP--EVLAPMMQAQLEKKLRF----A----SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) Q Consensus 1 Ma~~~T~~~------~~~------~P--ev~~~~v~~~~~~~~v~----~----~~~~~~~~~~~~~g~tv~ip~~~~~~ 58 (274) |..-..--+ .+| .| ..|+..+.......+-+ . ....+..++....|++|+++....+. T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 221110000 000 12 12333222211111111 1 11233456777789999999887663 Q ss_pred ccccccCCCcC--CccccccceeEEEeeeecceeeeeH-HHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Q lcl|NC_010147. 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) Q Consensus 59 ~~~~~~eg~~i--~~~~~t~~~~~~~~~~~~~~~~vtd-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-- 133 (274) .. .+..++.+ -++.+++.+..++|.+.-..+.... .....+-.|+.......++++|++..|+.+|-.+.++.. T Consensus 81 g~-gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:81 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred cC-CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 23323333 4678999999999998766654443 334456788999999999999999999988866654321 Q ss_pred -----------------------cc-------------------cccccCHHHHHHHHHHHhh--c--------CC---- Q lcl|NC_010147. 134 -----------------------TV-------------------NADITKLNGLQSAIDKFND--E--------DL---- 157 (274) Q Consensus 134 -----------------------~~-------------------~~~~~~~d~i~~A~~~l~~--~--------~~---- 157 (274) .+ .++.++++.|-++...+.. . +. T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:81 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 00 0122334445555555532 1 11 Q ss_pred --CceEEEEcHHHHHHHHhhccc-cccc-------cccccccceeccccceeccceEEEcCCCCc--------------- Q lcl|NC_010147. 158 --EPMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA--------------- 212 (274) Q Consensus 158 --~~~~~vv~p~~~~~L~k~~~~-~~~~-------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~--------------- 212 (274) ...++++||.++..|+++... +|.. .+.+.++.+..|..|.|.|+.|..-.++|- T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:81 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 247899999999999998531 1222 222445788999999999998887665530 Q ss_pred -------------ceEEEEeCCeEEEEeecC--ce---eeeecchhhcceEEEEEEEEEEEEEc-C------ccEEEEEe Q lcl|NC_010147. 213 -------------GTAILAKKGAVKLILKRD--FF---LEVARDASTKTTALYSDKHYVAYLYD-E------SKAVKITK 267 (274) Q Consensus 213 -------------~~~~~~~~~a~~~~~~~~--~~---ve~~rd~~~~~~~v~~~~~yg~~~~~-~------~~~v~~~~ 267 (274) .-+++.+..|..++.+++ .+ .|...|-... -.|......|.+-++ + .-.-+|.. T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~i 398 (404) T protein:vir:81 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEe Confidence 122667777765553321 11 2322221110 111111111111111 0 11222222 Q ss_pred cCCCCC Q lcl|NC_010147. 268 GSGSLE 273 (274) Q Consensus 268 ~~a~~~ 273 (274) ..|.+= T Consensus 399 dta~~~ 404 (404) T protein:vir:81 399 DTAVKL 404 (404) T ss_pred cccccC Confidence 222222 No 194 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=97.96 E-value=4e-06 Score=50.21 Aligned_cols=271 Identities=13% Similarity=0.035 Sum_probs=139.9 Q ss_pred CCCccceee------eee------ch--HHHHHHHHHHHHHHhhh----h----cccccccccccCCCceEEEEeeccCC Q lcl|NC_010147. 1 MPQGITKTS------NQI------IP--EVLAPMMQAQLEKKLRF----A----SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) Q Consensus 1 Ma~~~T~~~------~~~------~P--ev~~~~v~~~~~~~~v~----~----~~~~~~~~~~~~~g~tv~ip~~~~~~ 58 (274) |..-..--+ .+| .| ..|+..+.......+-+ . ....+..++....|++|+++....+. T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 221110000 000 12 12333222211111111 1 11233456777789999999887663 Q ss_pred ccccccCCCcC--CccccccceeEEEeeeecceeeeeH-HHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Q lcl|NC_010147. 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) Q Consensus 59 ~~~~~~eg~~i--~~~~~t~~~~~~~~~~~~~~~~vtd-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-- 133 (274) .. .+..++.+ -++.+++.+..++|.+.-..+.... .....+-.|+.......++++|++..|+.+|-.+.++.. T Consensus 81 g~-gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:10 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred cC-CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 23323333 4678999999999998766654443 334456788999999999999999999988866654321 Q ss_pred -----------------------cc-------------------cccccCHHHHHHHHHHHhh--c--------CC---- Q lcl|NC_010147. 134 -----------------------TV-------------------NADITKLNGLQSAIDKFND--E--------DL---- 157 (274) Q Consensus 134 -----------------------~~-------------------~~~~~~~d~i~~A~~~l~~--~--------~~---- 157 (274) .+ .++.++++.|-++...+.. . +. T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:10 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 00 0122334445555555532 1 11 Q ss_pred --CceEEEEcHHHHHHHHhhccc-cccc-------cccccccceeccccceeccceEEEcCCCCc--------------- Q lcl|NC_010147. 158 --EPMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA--------------- 212 (274) Q Consensus 158 --~~~~~vv~p~~~~~L~k~~~~-~~~~-------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~--------------- 212 (274) ...++++||.++..|+++... +|.. .+.+.++.+..|..|.|.|+.|..-.++|- T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:10 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 247899999999999998531 1222 222445788999999999998887665530 Q ss_pred -------------ceEEEEeCCeEEEEeecC--ce---eeeecchhhcceEEEEEEEEEEEEEc-C------ccEEEEEe Q lcl|NC_010147. 213 -------------GTAILAKKGAVKLILKRD--FF---LEVARDASTKTTALYSDKHYVAYLYD-E------SKAVKITK 267 (274) Q Consensus 213 -------------~~~~~~~~~a~~~~~~~~--~~---ve~~rd~~~~~~~v~~~~~yg~~~~~-~------~~~v~~~~ 267 (274) .-+++.+..|..++.+++ .+ .|...|-... -.|......|.+-++ + .-.-+|.. T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~i 398 (404) T protein:vir:10 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEe Confidence 122667777765553321 11 2322221110 111111111111111 0 11222222 Q ss_pred cCCCCC Q lcl|NC_010147. 268 GSGSLE 273 (274) Q Consensus 268 ~~a~~~ 273 (274) ..|.+= T Consensus 399 dta~~~ 404 (404) T protein:vir:10 399 DTAVKL 404 (404) T ss_pred cccccC Confidence 222222 No 195 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=97.96 E-value=4e-06 Score=50.21 Aligned_cols=271 Identities=13% Similarity=0.035 Sum_probs=139.9 Q ss_pred CCCccceee------eee------ch--HHHHHHHHHHHHHHhhh----h----cccccccccccCCCceEEEEeeccCC Q lcl|NC_010147. 1 MPQGITKTS------NQI------IP--EVLAPMMQAQLEKKLRF----A----SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) Q Consensus 1 Ma~~~T~~~------~~~------~P--ev~~~~v~~~~~~~~v~----~----~~~~~~~~~~~~~g~tv~ip~~~~~~ 58 (274) |..-..--+ .+| .| ..|+..+.......+-+ . ....+..++....|++|+++....+. T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 221110000 000 12 12333222211111111 1 11233456777789999999887663 Q ss_pred ccccccCCCcC--CccccccceeEEEeeeecceeeeeH-HHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Q lcl|NC_010147. 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) Q Consensus 59 ~~~~~~eg~~i--~~~~~t~~~~~~~~~~~~~~~~vtd-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~-- 133 (274) .. .+..++.+ -++.+++.+..++|.+.-..+.... .....+-.|+.......++++|++..|+.+|-.+.++.. T Consensus 81 g~-gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:10 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred cC-CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 23323333 4678999999999998766654443 334456788999999999999999999988866654321 Q ss_pred -----------------------cc-------------------cccccCHHHHHHHHHHHhh--c--------CC---- Q lcl|NC_010147. 134 -----------------------TV-------------------NADITKLNGLQSAIDKFND--E--------DL---- 157 (274) Q Consensus 134 -----------------------~~-------------------~~~~~~~d~i~~A~~~l~~--~--------~~---- 157 (274) .+ .++.++++.|-++...+.. . +. T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:10 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 00 0122334445555555532 1 11 Q ss_pred --CceEEEEcHHHHHHHHhhccc-cccc-------cccccccceeccccceeccceEEEcCCCCc--------------- Q lcl|NC_010147. 158 --EPMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA--------------- 212 (274) Q Consensus 158 --~~~~~vv~p~~~~~L~k~~~~-~~~~-------~s~~~~~~~~~g~ig~~~G~~Vv~s~~v~~--------------- 212 (274) ...++++||.++..|+++... +|.. .+.+.++.+..|..|.|.|+.|..-.++|- T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:10 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 247899999999999998531 1222 222445788999999999998887665530 Q ss_pred -------------ceEEEEeCCeEEEEeecC--ce---eeeecchhhcceEEEEEEEEEEEEEc-C------ccEEEEEe Q lcl|NC_010147. 213 -------------GTAILAKKGAVKLILKRD--FF---LEVARDASTKTTALYSDKHYVAYLYD-E------SKAVKITK 267 (274) Q Consensus 213 -------------~~~~~~~~~a~~~~~~~~--~~---ve~~rd~~~~~~~v~~~~~yg~~~~~-~------~~~v~~~~ 267 (274) .-+++.+..|..++.+++ .+ .|...|-... -.|......|.+-++ + .-.-+|.. T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~i 398 (404) T protein:vir:10 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEe Confidence 122667777765553321 11 2322221110 111111111111111 0 11222222 Q ss_pred cCCCCC Q lcl|NC_010147. 268 GSGSLE 273 (274) Q Consensus 268 ~~a~~~ 273 (274) ..|.+= T Consensus 399 dta~~~ 404 (404) T protein:vir:10 399 DTAVKL 404 (404) T ss_pred cccccC Confidence 222222 No 196 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=97.93 E-value=6.3e-06 Score=49.10 Aligned_cols=262 Identities=10% Similarity=0.047 Sum_probs=138.1 Q ss_pred CCCccc------------ee-----ee-eechHHHHHHHHHHHH----HHhhhhcccccccccccCCCceEEEEeeccCC Q lcl|NC_010147. 1 MPQGIT------------KT-----SN-QIIPEVLAPMMQAQLE----KKLRFASFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) Q Consensus 1 Ma~~~T------------~~-----~~-~~~Pev~~~~v~~~~~----~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~ 58 (274) |||..- +- +. .|.-+.|. +|..++. ..+..+.+..+.... +..-.+++.+.+...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~-~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEI-PGHAKYFEYPEFDGVG 78 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCC-CCceeEEEeeeecccc Confidence 444221 00 10 12221111 2222222 223333333322221 1113478888888889 Q ss_pred ccccccCC-CcCCccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc---- Q lcl|NC_010147. 59 DAQVVAEG-EKIPTDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG---- 130 (274) Q Consensus 59 ~~~~~~eg-~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~---- 130 (274) .+.+|+++ +++|..+........+++.++..+.++..+... .+.++-..-...+++.+++..|+-++-.-.+ T Consensus 79 ~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~ 158 (314) T protein:vir:10 79 IAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIV 158 (314) T ss_pred ceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccce Confidence 99999875 458999999999999999988888887655444 3677888888888899999888765533211 Q ss_pred ----cc----ccccccccC----HHHHHHHHHHHhhc--C-CCceEEEEcHHHHHHHHhhcccccccccccc-ccce-ec Q lcl|NC_010147. 131 ----AK----LTVNADITK----LNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRGDASTNFTRATELG-DDII-VK 193 (274) Q Consensus 131 ----a~----~~~~~~~~~----~d~i~~A~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~~~-~~ 193 (274) .+ ....+++.+ +++|+.+..++-.. + ..+..++++|..+..|..- ...++.. -..+ .+ T Consensus 159 GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~-----~~~~~~tvl~~l~~n 233 (314) T protein:vir:10 159 SVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGL-----VPQTNLSYGELFTRN 233 (314) T ss_pred eEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhccc-----ccCCCccHHHHHHHh Confidence 01 112234444 45666677776543 2 3567899999999877421 1111110 0111 22 Q ss_pred cccceeccceEEEcCCCCcceE-EEE--eCCeEEEEeecCceeeeecchhhcceEEEEEEE-EEEEEEcCccEEEEEe-c Q lcl|NC_010147. 194 GAFGEALGAIIVRTNKLEAGTA-ILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKH-YVAYLYDESKAVKITK-G 268 (274) Q Consensus 194 g~ig~~~G~~Vv~s~~v~~~~~-~~~--~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~-yg~~~~~~~~~v~~~~-~ 268 (274) +..-++.++|.+.+........ +++ ++..+.+....+.+.... ........+....+ .|+-+..|.+++++.. + T Consensus 234 ~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~ 312 (314) T protein:vir:10 234 NPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA-QPKDLHFRYPVTSKATGLIVYRPLTMAVIKGIT 312 (314) T ss_pred CCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecc-eecCceEEEcceeeeEEEEEECcceeEeeeeee Confidence 2223344444444333222222 223 333444443333332111 11223334444444 4788999999997662 3 Q ss_pred CC Q lcl|NC_010147. 269 SG 270 (274) Q Consensus 269 ~a 270 (274) =| T Consensus 313 ~~ 314 (314) T protein:vir:10 313 FA 314 (314) T ss_pred cC Confidence 33 No 197 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=97.93 E-value=9.4e-06 Score=48.14 Aligned_cols=262 Identities=12% Similarity=0.041 Sum_probs=139.7 Q ss_pred CCCccceeee---eechHH---HHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccCC-CcCCccc Q lcl|NC_010147. 1 MPQGITKTSN---QIIPEV---LAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~~~---~~~Pev---~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg-~~i~~~~ 73 (274) |.. +|+.++ .|.-+. +.+.+.+...+.++...+..+..... -.-.+++.+.+...|.+++++.+ +++|..+ T Consensus 26 ~~~-~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~-~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd 103 (329) T protein:vir:79 26 LRG-AKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELS-DTDKTFEYQTFDKVGHAKIIADYTDDLSTVD 103 (329) T ss_pred ccc-ceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCC-CceeEEEeeeeecceeeeeecCcccccceee Confidence 332 233332 233221 22233333333444444433322211 12357788888888899999865 5788889 Q ss_pred cccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc--------cc---ccccc-- Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG--------AK---LTVNA-- 137 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~--------a~---~~~~~-- 137 (274) ........++..++..|.++..+... .+.++...-...+++.+++..|+-++-.-.. .+ ....+ T Consensus 104 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~ 183 (329) T protein:vir:79 104 ALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGW 183 (329) T ss_pred cccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCC Confidence 99988888898888877776654443 4677888888888999999998765533211 00 00111 Q ss_pred ---ccc--C----HHHHHHHHHHHhhc--C-CCceEEEEcHHHHHHHHh-hcccccccccccc-ccce-eccccceeccc Q lcl|NC_010147. 138 ---DIT--K----LNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRG-DASTNFTRATELG-DDII-VKGAFGEALGA 202 (274) Q Consensus 138 ---~~~--~----~d~i~~A~~~l~~~--~-~~~~~~vv~p~~~~~L~k-~~~~~~~~~s~~~-~~~~-~~g~ig~~~G~ 202 (274) .+. + +++|.++..++-.. + ..+..++++|..+..|.. .+ .++.. -..+ .++..-++.+. T Consensus 184 ~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~------~~~~tvl~~lk~~~~~l~I~~~ 257 (329) T protein:vir:79 184 NNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMP------ETTMSYLDYFKQQNGGITIESI 257 (329) T ss_pred CCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccC------CCCccHHHHHHHhCCCcEEEEc Confidence 111 2 56677777777543 2 356789999999988842 21 11100 0111 12222234444 Q ss_pred eEEEcCCCC-cceEEEE--eCCeEEEEeecCceeeeecchhhcceEEEEEEE-EEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 203 IIVRTNKLE-AGTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKH-YVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 203 ~Vv~s~~v~-~~~~~~~--~~~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~-yg~~~~~~~~~v~~~~~~a~ 271 (274) |.+.+.... +...+++ ++..+.+....+.+... -........+....+ .|+-+..|.+++++..=--. T Consensus 258 ~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 258 SELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLT-AQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred ccccccCCCCceEEEEEecCCceEEEecCcceeeee-ceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 444333221 2223333 33444444434433321 112223334444444 57888999999998742111 No 198 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=97.12 E-value=0.00016 Score=41.34 Aligned_cols=257 Identities=14% Similarity=0.135 Sum_probs=116.1 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHH-hh----hhcccccc-cc---cccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-LR----FASFAEVD-ST---LQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~-~v----~~~~~~~~-~~---~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) ||+ +.|+|.|..+..++.+.-... .. |++...+. .+ +.+..+..+-.|.. +.+.+-+. T Consensus 1 M~~----i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v---------~~~~~~~~ 67 (348) T protein:vir:27 1 MGL----IYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAA---------AFDTNVTI 67 (348) T ss_pred Ccc----hhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeee---------cCCCCcce Confidence 775 578999999999887543222 11 22211111 11 12222222222222 22211111 Q ss_pred c-ccccceeEEEeeeecceeeeeHHH--H---hhcCcc--HHHHHHH-------HHHHHHHHHHHHHHHHHhhcccc--- Q lcl|NC_010147. 72 D-ILETKKREAKIRKIAKGTSITDEA--L---LSGYGD--PQGEQVR-------QHGLAHANKVDNDVLEALMGAKL--- 133 (274) Q Consensus 72 ~-~~t~~~~~~~~~~~~~~~~vtd~~--~---~~~~~d--~~~~~~~-------~~a~~~a~~~d~~~~~~~~~a~~--- 133 (274) . .-........+-+++-...++-.+ . ..+..+ ....+.+ .+.+.+.+.++..+...+.+... T Consensus 68 ~~r~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~ 147 (348) T protein:vir:27 68 RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFT 147 (348) T ss_pred ecccceeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEe Confidence 1 111122222222222222332111 1 111111 1222323 33344555555555555542211 Q ss_pred ------------------cccccc-----cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccccccccc-cc Q lcl|NC_010147. 134 ------------------TVNADI-----TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELG-DD 189 (274) Q Consensus 134 ------------------~~~~~~-----~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~ 189 (274) +..+.+ ..+++|.+....+.+.+..+..++|+++++..|+++..+.-.-....+ .. T Consensus 148 ~~~~~~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~ 227 (348) T protein:vir:27 148 SDGVNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGS 227 (348) T ss_pred cCCeeEEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCcccc Confidence 111111 124567777777777777888999999999999876543322211111 11 Q ss_pred ce----eccccceeccceEEEcC------------CCCcceEEEEeCCeEEEEeecCceeeee-----cc---------- Q lcl|NC_010147. 190 II----VKGAFGEALGAIIVRTN------------KLEAGTAILAKKGAVKLILKRDFFLEVA-----RD---------- 238 (274) Q Consensus 190 ~~----~~g~ig~~~G~~Vv~s~------------~v~~~~~~~~~~~a~~~~~~~~~~ve~~-----rd---------- 238 (274) .+ ....++++.|++|++=+ .+|+++.+++..+..+...-.+ ..|.. +. T Consensus 228 ~i~~~~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~-~~e~~~~~~~~~~~~~~~~~~~ 306 (348) T protein:vir:27 228 AVTKAELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGT-TPEESDLFADNTVNAEVEIVDN 306 (348) T ss_pred ccCHHHHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEecc-CcchhhhhhccccccceeeeCC Confidence 22 22344566788776522 2467777787777655332111 11110 00 Q ss_pred ---------hhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 239 ---------ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 239 ---------~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ..-..-.+.+-.+.--.+.+|+++.+++.-+|. T Consensus 307 ~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 307 GIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 000111222333334445588888888888887 No 199 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=96.99 E-value=0.00022 Score=40.63 Aligned_cols=255 Identities=11% Similarity=0.059 Sum_probs=135.6 Q ss_pred CCC-------ccceeeeeechHHHHHHHHHH----HHHHhhhhcccccccccccCCC-ceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQ-------GITKTSNQIIPEVLAPMMQAQ----LEKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~-------~~T~~~~~~~Pev~~~~v~~~----~~~~~v~~~~~~~~~~~~~~~g-~tv~ip~~~~~~~~~~~~eg~~ 68 (274) ||| ..+++.+..||-....+|... ..+.....++..+... +.-+ .+++++.+...|.+.+|+++++ T Consensus 35 ~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~--g~w~~~t~~y~~~e~~G~a~~ygd~ad 112 (339) T protein:vir:94 35 YAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKK--GDWTTTYGVFIIAEPVGQVATYSDWSA 112 (339) T ss_pred hhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccC--CCCcccEEEEeeeecccceEEcccccC Confidence 333 234556677887666666543 3334444444433222 2222 5799999999999999999999 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc-------------cc Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG-------------AK 132 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-------------a~ 132 (274) +|..+........++....-++.++.++... .+.++.++-...+.+++.+.+|+-.+-.-.+ +. T Consensus 113 ~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~ 192 (339) T protein:vir:94 113 NGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAP 192 (339) T ss_pred CCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCcccc Confidence 9888877666665555554455555544332 4577888888888888888888643322111 01 Q ss_pred ccccccc--cC----HHHHHHHHHHHhhcC------CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 133 LTVNADI--TK----LNGLQSAIDKFNDED------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 133 ~~~~~~~--~~----~d~i~~A~~~l~~~~------~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) .+..+.+ -+ +++|..+...+-... ..+..++++|..+..|-+-+.... .. -..+.. ++- T Consensus 193 v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~~~~----Tv-l~~lk~----n~p 263 (339) T protein:vir:94 193 VAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNNFGL----SA-GAKIAQ----TYP 263 (339) T ss_pred ccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCcCCc----cH-HHHHHH----hcC Confidence 1111222 23 455666666664321 134579999999998854221100 00 011111 133 Q ss_pred cceEEEcCCCCc---ceEEEEe-----CCeEEEEeecCceeeeec-chhhcceEEEEEEE-EEEEEEcCccEEEEEec Q lcl|NC_010147. 201 GAIIVRTNKLEA---GTAILAK-----KGAVKLILKRDFFLEVAR-DASTKTTALYSDKH-YVAYLYDESKAVKITKG 268 (274) Q Consensus 201 G~~Vv~s~~v~~---~~~~~~~-----~~a~~~~~~~~~~ve~~r-d~~~~~~~v~~~~~-yg~~~~~~~~~v~~~~~ 268 (274) +++|+..+.+.. +...++. +.-..+....+.+ .-. ........+....+ .|+-+..|.++++++.= T Consensus 264 nl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~--~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 264 NIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLR--SHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhh--ccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 566665554421 1222221 1111111111111 100 11222334444444 68888999999998755 No 200 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=96.63 E-value=7.8e-05 Score=43.12 Aligned_cols=268 Identities=13% Similarity=0.007 Sum_probs=128.0 Q ss_pred CCCcc-----ceeeeeec------------------hHHHHHHHHHHHH-----HHhhhhc-----cccccccc-cc--- Q lcl|NC_010147. 1 MPQGI-----TKTSNQII------------------PEVLAPMMQAQLE-----KKLRFAS-----FAEVDSTL-QG--- 43 (274) Q Consensus 1 Ma~~~-----T~~~~~~~------------------Pev~~~~v~~~~~-----~~~v~~~-----~~~~~~~~-~~--- 43 (274) |+..+ |....... +..+... .+... ....+.+ ........ .+ T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s-~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYP-LPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cccceeeeeccccccccccccccccccccccccccccccccch-hhccccccccccccccccccccccccccccCCCccc Confidence 22110 00000000 0000000 00000 0000000 00000000 00 Q ss_pred CCCc--eEEEEeeccCCcccc-------ccCCCcCCccccccceeEEEeeeecceeeeeHHHHhh-----cCccHHHHHH Q lcl|NC_010147. 44 QPGD--TLTFPAFVYSGDAQV-------VAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS-----GYGDPQGEQV 109 (274) Q Consensus 44 ~~g~--tv~ip~~~~~~~~~~-------~~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~-----~~~d~~~~~~ 109 (274) ..+. ..++-.-..+..++. -..+..+++-..+..+++++.+-|+..-++|=|..-+ .+-|..+++. T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELa 320 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIV 320 (523) T ss_pred ccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHH Confidence 0000 000000001111111 1223445666667777888877776655666543222 3688999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccc---------------cc--cc-------CHHHHHHHHHHHhh-cC-------- Q lcl|NC_010147. 110 RQHGLAHANKVDNDVLEALMGAKLTVN---------------AD--IT-------KLNGLQSAIDKFND-ED-------- 156 (274) Q Consensus 110 ~~~a~~~a~~~d~~~~~~~~~a~~~~~---------------~~--~~-------~~d~i~~A~~~l~~-~~-------- 156 (274) +-|+..|...|+++++..+.+.+.... .+ +. ..+.+-.+..++.+ ++ T Consensus 321 nILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~ 400 (523) T protein:vir:59 321 TLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAV 400 (523) T ss_pred HHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhccc Confidence 999999999999999998865432210 00 00 12233333333332 11 Q ss_pred CCceEEEEcHHHHHHHHhhcccccccccc-ccccceecccccee-ccceEEEcCCCCcceEEEEeCCeEE-----EEeec Q lcl|NC_010147. 157 LEPMVLFINPLDAGKLRGDASTNFTRATE-LGDDIIVKGAFGEA-LGAIIVRTNKLEAGTAILAKKGAVK-----LILKR 229 (274) Q Consensus 157 ~~~~~~vv~p~~~~~L~k~~~~~~~~~s~-~~~~~~~~g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a~~-----~~~~~ 229 (274) ...++++++|++.+.|...+......... ...+.. ..|.+ .|++|+++++.|..-..+.-++..+ ++... T Consensus 401 ~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~---~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~P 477 (523) T protein:vir:59 401 AGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIF---YVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAP 477 (523) T ss_pred ccccEEEEchhHHHHHHhccccccCCccccccccce---eEEEecCceEEEecCCCCcceEEEEecccCCcccccceecc Confidence 25789999999999997655432211111 111111 23555 3469999999886544444334221 11111 Q ss_pred Cceee---eecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCC Q lcl|NC_010147. 230 DFFLE---VARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) Q Consensus 230 ~~~ve---~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~ 272 (274) =+++. .-.||.+++-.+-...|||.+|.||.....+-..---+ T Consensus 478 y~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 478 YVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred cchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 12222 22378999999999999999999998775544332222 No 201 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=96.57 E-value=0.00034 Score=39.60 Aligned_cols=211 Identities=9% Similarity=-0.000 Sum_probs=120.8 Q ss_pred CCCccce---eee---eechHHHHHHHHHHHHHHhhh-hcccccccccccCCCceEEEEeeccCCccccccCCCcCCccc Q lcl|NC_010147. 1 MPQGITK---TSN---QIIPEVLAPMMQAQLEKKLRF-ASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~---~~~---~~~Pev~~~~v~~~~~~~~v~-~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~ 73 (274) ||..-+. +.+ .+.|.-....|.|.+.+.+-+ .-+-. .++..|..........++.+.|..-++.+++++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf----~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~ 76 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPF----VEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSK 76 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcce----eecccCCcceeeEeeccCCceeeecCCccCccc Confidence 6654222 222 144555555566666554332 22221 123223334444556788999999899999999 Q ss_pred cccceeEEEeeeecceeeeeHHHHhhcCccHHHH---HHHHHHHHHHHHHHHHHHHH---------------hhc----- Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQGE---QVRQHGLAHANKVDNDVLEA---------------LMG----- 130 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~---~~~~~a~~~a~~~d~~~~~~---------------~~~----- 130 (274) .++..++.....++..+.|+...... .++..+. -.++..+++.+++...+|.. +.+ T Consensus 77 ~tt~q~t~~l~ilgg~~eVDr~la~~-~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~ 155 (328) T protein:vir:95 77 STTVQVTDSVGMLETYAEVDKSLADL-NGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGN 155 (328) T ss_pred ceeEEEEEEEEEEecceeechHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccccc Confidence 99999999999999999999755544 3454433 44457777777777776632 100 Q ss_pred cccc---------------------------------------------------------------------------- Q lcl|NC_010147. 131 AKLT---------------------------------------------------------------------------- 134 (274) Q Consensus 131 a~~~---------------------------------------------------------------------------- 134 (274) +.+. T Consensus 156 a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvr 235 (328) T protein:vir:95 156 AQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVR 235 (328) T ss_pred ccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 0000 Q ss_pred ---cccccc----C----HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccce Q lcl|NC_010147. 135 ---VNADIT----K----LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAI 203 (274) Q Consensus 135 ---~~~~~~----~----~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~ 203 (274) .+.+.. . .+.+++|..++-.......+++||-.+...|+++.... ......-.-.-...+-.|.|+| T Consensus 236 I~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~--~n~~~~~~~~~g~~~t~~~gip 313 (328) T protein:vir:95 236 IANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEK--TSLAISVKETEGEWWTSFRGVP 313 (328) T ss_pred EecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcC--cceeeeeeccCCcceeEECCeE Confidence 000000 0 12345555555434446678999999999998764211 1111111111223455689999 Q ss_pred EEEcCCCCcceEEEE Q lcl|NC_010147. 204 IVRTNKLEAGTAILA 218 (274) Q Consensus 204 Vv~s~~v~~~~~~~~ 218 (274) |..++.+-..+.-++ T Consensus 314 ir~~dai~~tE~~vv 328 (328) T protein:vir:95 314 IRETDALLETEARVV 328 (328) T ss_pred EEEEeeeecCccccC Confidence 999988765554444 No 202 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=96.57 E-value=2.8e-05 Score=45.57 Aligned_cols=107 Identities=14% Similarity=0.145 Sum_probs=76.5 Q ss_pred EEcHHHHHHHHhhccccccccccccccceeccccc-eeccceEEEcCCCCcceEEEEeCCeEE-----------EEe--e Q lcl|NC_010147. 163 FINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRTNKLEAGTAILAKKGAVK-----------LIL--K 228 (274) Q Consensus 163 vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig-~~~G~~Vv~s~~v~~~~~~~~~~~a~~-----------~~~--~ 228 (274) +++-.+++++..+..++.-..-+ ..+++.+|.++ +.+|++++.|+++|-+++++++...++ |+. . T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE-~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~~~~ 79 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPRE-QANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAPAGN 79 (123) T ss_pred CcchhhHHHHhcchhcccccccc-cCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccCCCC Confidence 66666677776554332222222 23556666666 599999999999999899988865443 322 2 Q ss_pred cCceeeeecchh--hcceEEEEEEEEEEEEEcCccEEEEEecCC Q lcl|NC_010147. 229 RDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) Q Consensus 229 ~~~~ve~~rd~~--~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a 270 (274) .++++.+.|... .+...+|+|+.-..-++.|.+.++|+...- T Consensus 80 ~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 80 TGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred cceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 246677888877 777889999999999999999999997666 No 203 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=96.55 E-value=0.00018 Score=41.11 Aligned_cols=255 Identities=9% Similarity=0.021 Sum_probs=139.1 Q ss_pred CCC-------ccceeeeeechHHHHHHHHHHHHHH----hhhhcccccccccccCC-CceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQ-------GITKTSNQIIPEVLAPMMQAQLEKK----LRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~-------~~T~~~~~~~Pev~~~~v~~~~~~~----~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~ 68 (274) |+| ..++.++.-+|..+..|+..++.+. .+...+.-+.. .|.- -.++.++.....|.+..|+.+++ T Consensus 31 ~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D 108 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) T ss_pred hhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCC Confidence 333 2445566778988888875444332 22222222211 1211 24678888888899999999999 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHh---hcCccHHHHHHHHHHHHHHHHHHHHHHHH---------hhc----cc Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEA---------LMG----AK 132 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~---~~~~d~~~~~~~~~a~~~a~~~d~~~~~~---------~~~----a~ 132 (274) +|..+........+++.++.++.++.++.. ..+.|+.++-....++++.+++++..+-. ++. +. T Consensus 109 ~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~ 188 (336) T protein:vir:10 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccc Confidence 999999888888888888888888865433 34677778888888888888887643311 110 01 Q ss_pred ccccc---cccC----HHHHHHHHHHHhhc--C----CCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 133 LTVNA---DITK----LNGLQSAIDKFNDE--D----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 133 ~~~~~---~~~~----~d~i~~A~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) .+..+ +..+ +++|..+...|-.. + .....++++|..+..|-+-+ ..|..+ .+-.--++ T Consensus 189 ~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~n--------~~g~Tv-l~~lk~n~ 259 (336) T protein:vir:10 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAA-AAKLKDIF 259 (336) T ss_pred cccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCCC--------ccCccH-HHHHHHhc Confidence 11111 1123 45666666666542 2 24678999999988774321 111111 11011124 Q ss_pred ccceEEEcCCCC---cceEEEEeC-----CeEEEEeecCceeeeec-chhhcceEEEEE-EEEEEEEEcCccEEEEEec Q lcl|NC_010147. 200 LGAIIVRTNKLE---AGTAILAKK-----GAVKLILKRDFFLEVAR-DASTKTTALYSD-KHYVAYLYDESKAVKITKG 268 (274) Q Consensus 200 ~G~~Vv~s~~v~---~~~~~~~~~-----~a~~~~~~~~~~ve~~r-d~~~~~~~v~~~-~~yg~~~~~~~~~v~~~~~ 268 (274) -+++++..+.+. .+..+++-+ .-..... |..+..-. ........+... ...|+-+..|.++++++.= T Consensus 260 Pnl~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~--p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGF--TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CccEEEEccccccCCCceEEEEEEecCCCcceeeec--chhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 456666655542 112222211 1111111 11110000 011122233333 3468888899999998755 No 204 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.43 E-value=0.00024 Score=40.47 Aligned_cols=252 Identities=10% Similarity=0.027 Sum_probs=138.2 Q ss_pred CCCc-------cceeeeeechHHHHHHHHHHHHHH----hhhhcccccccccccCC-CceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQG-------ITKTSNQIIPEVLAPMMQAQLEKK----LRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~~-------~T~~~~~~~Pev~~~~v~~~~~~~----~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~ 68 (274) |||- .++.++.=+|..+..+|.-++-+. .+...+.-+.. .|.- -.+++++.....|.+..|+.+++ T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D 108 (336) T protein:vir:78 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhccccc--CCCccccEEEEeeeecceeeEEeecccC Confidence 4442 344455557888888875443322 22223322221 1222 24788888888899999999999 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHH---------hhc----cc Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEA---------LMG----AK 132 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~---------~~~----a~ 132 (274) +|..+........+++.++..+.++.++... .+.++.++-....++++.+.+++-.+-. ++. +. T Consensus 109 ~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~ 188 (336) T protein:vir:78 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred CCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcc Confidence 9999999999999999988888888655443 4677888888888888888777533211 110 01 Q ss_pred cccccc---ccC----HHHHHHHHHHHhhcC------CCceEEEEcHHHHHHHHhhcccccccccccccc---ceecccc Q lcl|NC_010147. 133 LTVNAD---ITK----LNGLQSAIDKFNDED------LEPMVLFINPLDAGKLRGDASTNFTRATELGDD---IIVKGAF 196 (274) Q Consensus 133 ~~~~~~---~~~----~d~i~~A~~~l~~~~------~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~---~~~~g~i 196 (274) .+..+. ..+ +++|..+...+...- ..+..++++|..+..|-+-+. .+.. .+.. T Consensus 189 ~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~n~--------~g~tv~~~lk~--- 257 (336) T protein:vir:78 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ--------YGLSAAAKLKE--- 257 (336) T ss_pred cccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCc--------cCccHHHHHHH--- Confidence 111111 123 445555666553321 134589999999998854221 1111 1111 Q ss_pred ceeccceEEEcCCCC---cceEEEEeCC-----eEEEEeecCcee-eeecchhhcceEEEEE-EEEEEEEEcCccEEEEE Q lcl|NC_010147. 197 GEALGAIIVRTNKLE---AGTAILAKKG-----AVKLILKRDFFL-EVARDASTKTTALYSD-KHYVAYLYDESKAVKIT 266 (274) Q Consensus 197 g~~~G~~Vv~s~~v~---~~~~~~~~~~-----a~~~~~~~~~~v-e~~rd~~~~~~~v~~~-~~yg~~~~~~~~~v~~~ 266 (274) ++=+++|+.-+.+. .+..+++.+. ...+....+.+. -.. .......+... ...|+-+..|.++++++ T Consensus 258 -n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq--~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~ 334 (336) T protein:vir:78 258 -IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIE--RYSSYFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) T ss_pred -hcCccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhcccee--ecCceeEeccccceeeeeeeccchheeec Confidence 13345666555442 1223333222 111111111110 011 11122333333 34688888999999877 Q ss_pred ec Q lcl|NC_010147. 267 KG 268 (274) Q Consensus 267 ~~ 268 (274) .= T Consensus 335 GI 336 (336) T protein:vir:78 335 GV 336 (336) T ss_pred cC Confidence 54 No 205 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=96.27 E-value=0.0008 Score=37.57 Aligned_cols=258 Identities=14% Similarity=0.134 Sum_probs=112.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHH-----hhhhcccccc-cc---cccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-----LRFASFAEVD-ST---LQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~-----~v~~~~~~~~-~~---~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) ||+ +.|.|.|..+..++.+..... ..|++...+. .+ +.+..|..+--|.. +++.+-+. T Consensus 1 M~~----l~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v---------~~~~~~~~ 67 (348) T protein:vir:49 1 MGL----IYDKVTASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAA---------AFDTNVTV 67 (348) T ss_pred Ccc----hhhhcCHHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeee---------cCCCCcce Confidence 664 478899999999887543221 1122221111 11 22333333322222 21111111 Q ss_pred c-ccccceeEEEeeeecceeeee--HHHHh---hcCc--cHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccc--- Q lcl|NC_010147. 72 D-ILETKKREAKIRKIAKGTSIT--DEALL---SGYG--DPQGEQVRQH-------GLAHANKVDNDVLEALMGAKL--- 133 (274) Q Consensus 72 ~-~~t~~~~~~~~~~~~~~~~vt--d~~~~---~~~~--d~~~~~~~~~-------a~~~a~~~d~~~~~~~~~a~~--- 133 (274) . .-........+-+++....++ |.... .+.. ...+.+.+++ .+.+.+.++..+...+.+... T Consensus 68 ~~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~ 147 (348) T protein:vir:49 68 RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFT 147 (348) T ss_pred ecccceeeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEe Confidence 1 111222222332232223332 21111 1111 1122333333 334455555555555543211 Q ss_pred ------------------cccccc-----cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccc-cccccccc Q lcl|NC_010147. 134 ------------------TVNADI-----TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFT-RATELGDD 189 (274) Q Consensus 134 ------------------~~~~~~-----~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~-~~s~~~~~ 189 (274) +....+ .-+.+|-+....+.+.+..+..++|+++++..|+++..+.-. ........ T Consensus 148 ~~g~~~~vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~ 227 (348) T protein:vir:49 148 SDGVNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGS 227 (348) T ss_pred cCCceEEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccc Confidence 111111 124566666667777777888999999999999876543321 11111111 Q ss_pred ce----eccccceeccceEEEcC------------CCCcceEEEEeCCeEEEEeecCc--------------eeeeec-- Q lcl|NC_010147. 190 II----VKGAFGEALGAIIVRTN------------KLEAGTAILAKKGAVKLILKRDF--------------FLEVAR-- 237 (274) Q Consensus 190 ~~----~~g~ig~~~G~~Vv~s~------------~v~~~~~~~~~~~a~~~~~~~~~--------------~ve~~r-- 237 (274) .+ ....++++.|++|++=+ .+|+++.+++..+..+...-.++ .++..+ T Consensus 228 ~i~~~~~~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~ 307 (348) T protein:vir:49 228 SVTKAELDNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNG 307 (348) T ss_pred cccHHHHHHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCe Confidence 22 12234456777776522 23566677776655442211111 011000 Q ss_pred -------chhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 238 -------DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 238 -------d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ...-..-.+.+-.+.-..+.+|+++.+.++-+|. T Consensus 308 ~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 308 IAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EEEeeeecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 0000011222222233344578888888888887 No 206 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=96.18 E-value=0.00033 Score=39.66 Aligned_cols=255 Identities=9% Similarity=0.020 Sum_probs=136.8 Q ss_pred CCCc-------cceeeeeechHHHHHHHHHHHHHH----hhhhcccccccccccCC-CceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQG-------ITKTSNQIIPEVLAPMMQAQLEKK----LRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~~-------~T~~~~~~~Pev~~~~v~~~~~~~----~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~ 68 (274) |+|- .++.++.=+|..+..+|..++.+. .....+.-+.. .|.- -.++.++.....|.+..|+.+++ T Consensus 31 ~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D 108 (336) T protein:vir:36 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) T ss_pred hhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCC Confidence 3432 223345557888888875443322 22222222211 1211 24678888888899999999999 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHh---hcCccHHHHHHHHHHHHHHHHHHHHHHH---------Hhhc----cc Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLE---------ALMG----AK 132 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~---~~~~d~~~~~~~~~a~~~a~~~d~~~~~---------~~~~----a~ 132 (274) +|..+........+++.++.++.++.++.. ..+.|+.++-....++++.+++++..+- .++. +. T Consensus 109 ~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~ 188 (336) T protein:vir:36 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccc Confidence 999999988888889888888888754433 3467777887888888888888763331 1110 01 Q ss_pred ccccc---cccC----HHHHHHHHHHHhhc--C----CCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 133 LTVNA---DITK----LNGLQSAIDKFNDE--D----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 133 ~~~~~---~~~~----~d~i~~A~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) .+..+ +..+ +++|..+...+... + .....++++|..+..|-+-+ ..|..++ +-.--++ T Consensus 189 ~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~n--------~~g~Tvl-~~lk~n~ 259 (336) T protein:vir:36 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIF 259 (336) T ss_pred cccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCCC--------ccCccHH-HHHHHhc Confidence 11111 1123 55666666666542 1 24678999999988774321 1111110 1011124 Q ss_pred ccceEEEcCCCC---cceEEEEeC-----CeEEEEeecCceeeeec-chhhcceEEEEE-EEEEEEEEcCccEEEEEec Q lcl|NC_010147. 200 LGAIIVRTNKLE---AGTAILAKK-----GAVKLILKRDFFLEVAR-DASTKTTALYSD-KHYVAYLYDESKAVKITKG 268 (274) Q Consensus 200 ~G~~Vv~s~~v~---~~~~~~~~~-----~a~~~~~~~~~~ve~~r-d~~~~~~~v~~~-~~yg~~~~~~~~~v~~~~~ 268 (274) -+++++..+.+. .+..+++-+ .-..... |..+..-. ........+... ...|+-+..|.++++++.= T Consensus 260 Pnl~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~--p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGF--TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CccEEEEccccccCCCceEEEEEEecCCCcceeeec--chhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 456666655442 112222211 1111111 11110000 011122233333 3468888899999998755 No 207 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=259 Identities=12% Similarity=0.103 Sum_probs=139.0 Q ss_pred CCCccceee-eeechHHHHHHHHHHHHHHhhhhcccccccccccCC-CceEEEEeeccCCc-cccc--------cCCCcC Q lcl|NC_010147. 1 MPQGITKTS-NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGD-AQVV--------AEGEKI 69 (274) Q Consensus 1 Ma~~~T~~~-~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~-~~~~--------~eg~~i 69 (274) |+...-.++ -.+.+| |...+..-+..+.+|.+.---.-.+.|.. .++.-.-+-++++- ...| +.|+.. T Consensus 1 m~t~N~n~avr~Y~Kq-f~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~ 79 (286) T protein:vir:94 1 MATTNNDLPVRVYSKE-FLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSN 79 (286) T ss_pred CCCCccccceeehhHH-HHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCccccccCCcc Confidence 776554333 334454 88888887888887754210011111211 12221111112110 1112 222221 Q ss_pred CccccccceeEEEee-----eecceeeeeH-HHHhhcCccHHHHHHHH---HHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_010147. 70 PTDILETKKREAKIR-----KIAKGTSITD-EALLSGYGDPQGEQVRQ---HGLAHANKVDNDVLEALMGAKLTVNADIT 140 (274) Q Consensus 70 ~~~~~t~~~~~~~~~-----~~~~~~~vtd-~~~~~~~~d~~~~~~~~---~a~~~a~~~d~~~~~~~~~a~~~~~~~~~ 140 (274) .--++...-.+. .+...|.+-. ++...-+.|+.+.+++| .+.+|++.+|..+-..+..+... .. T Consensus 80 ---SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~----t~ 152 (286) T protein:vir:94 80 ---SSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTD----LG 152 (286) T ss_pred ---ccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hh Confidence 111221111110 0111222221 22333445555555554 67899999998776666443322 23 Q ss_pred CHHHHHHHHHHHhhcCC-----CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC-CCCcce Q lcl|NC_010147. 141 KLNGLQSAIDKFNDEDL-----EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN-KLEAGT 214 (274) Q Consensus 141 ~~d~i~~A~~~l~~~~~-----~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~-~v~~~~ 214 (274) ++|.+..+...+.+... .+-...|||++|..|...++-+....|. .++ -+.-+-++.|+.+...+ ++-.|. T Consensus 153 ~~D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~TsaK~Ss--aNi-Dengi~~FkGf~i~e~P~~~~~g~ 229 (286) T protein:vir:94 153 AVDDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTAKNSA--VNI-DTNGMLSFRGIAITKVPTQYMGGK 229 (286) T ss_pred hhhhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccce--eee-ccCCcceecceEEeecchhhccCc Confidence 34666666555544332 3455899999999998776543333322 122 23336789999988876 344588 Q ss_pred EEEEeCCeEEEEeecCcee-eeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 215 AILAKKGAVKLILKRDFFL-EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 215 ~~~~~~~a~~~~~~~~~~v-e~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) ..+|.+..++..- .++++ .+-..+.+....+.+---||-.+.+..+.+.++.+.-+ T Consensus 230 ~aifs~dnig~af-tGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 230 AVIFAPDNVARVF-TGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred eEEEccccceeee-ccceeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 8899888887753 33343 34445677888999999999999988887777654333 No 208 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=95.99 E-value=0.00043 Score=39.06 Aligned_cols=252 Identities=10% Similarity=0.029 Sum_probs=136.9 Q ss_pred CCCc-------cceeeeeechHHHHHHHHHHHHHH----hhhhcccccccccccCC-CceEEEEeeccCCccccccCCCc Q lcl|NC_010147. 1 MPQG-------ITKTSNQIIPEVLAPMMQAQLEKK----LRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) Q Consensus 1 Ma~~-------~T~~~~~~~Pev~~~~v~~~~~~~----~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~ 68 (274) |||- .++.++.=+|..+..++.-++.+. .....+.-++. .|.- -.++.++.....|.+..|+++++ T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t--~g~w~~~~~~~~~~e~~G~a~~ygd~~d 108 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhccccc--CCCcceeeEEEEeeeeeeeEEEccccCC Confidence 4442 344455557888888875443322 12222222222 1221 25678888888899999999999 Q ss_pred CCccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHH---------hhc----cc Q lcl|NC_010147. 69 IPTDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEA---------LMG----AK 132 (274) Q Consensus 69 i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~---------~~~----a~ 132 (274) +|..+........+++.++.++.++.++... .+.++.++-.....+++.+++++-.+-. ++. +. T Consensus 109 ~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~ 188 (336) T protein:vir:10 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred CcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcc Confidence 9999999999999999888888888655443 4667777777778888887777532211 110 01 Q ss_pred cccccc---ccC----HHHHHHHHHHHhhcC------CCceEEEEcHHHHHHHHhhcccccccccccccc---ceecccc Q lcl|NC_010147. 133 LTVNAD---ITK----LNGLQSAIDKFNDED------LEPMVLFINPLDAGKLRGDASTNFTRATELGDD---IIVKGAF 196 (274) Q Consensus 133 ~~~~~~---~~~----~d~i~~A~~~l~~~~------~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~---~~~~g~i 196 (274) .+..+. ..+ +++|..+...+...- ..+..++++|..+..|.+-+ ..|.. .+.. T Consensus 189 ~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~n--------~~g~tv~~~lk~--- 257 (336) T protein:vir:10 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAAKLKE--- 257 (336) T ss_pred cccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCC--------ccCccHHHHHHH--- Confidence 111111 123 445666666663321 13457999999999885422 11111 1111 Q ss_pred ceeccceEEEcCCCC---cceEEEEeCCe-----EEEEeecCcee-eeecchhhcceEEEEEE-EEEEEEEcCccEEEEE Q lcl|NC_010147. 197 GEALGAIIVRTNKLE---AGTAILAKKGA-----VKLILKRDFFL-EVARDASTKTTALYSDK-HYVAYLYDESKAVKIT 266 (274) Q Consensus 197 g~~~G~~Vv~s~~v~---~~~~~~~~~~a-----~~~~~~~~~~v-e~~rd~~~~~~~v~~~~-~yg~~~~~~~~~v~~~ 266 (274) ++=+++|+..+.+. .+..+++.+.. .......+.+. ... .......+.... ..|+-+..|-+++++. T Consensus 258 -n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq--~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~ 334 (336) T protein:vir:10 258 -IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIE--RYSSYFRQKKSAGTWGAVIFRPFAVAQML 334 (336) T ss_pred -hCCccEEEEcccccccCCceEEEEEecccCCcceeeecChhhhcccee--ecCceeEeccccceeeeeeeccchheeec Confidence 13356666655543 12234433221 11111111110 011 111222333333 3578888999998877 Q ss_pred ec Q lcl|NC_010147. 267 KG 268 (274) Q Consensus 267 ~~ 268 (274) .= T Consensus 335 GI 336 (336) T protein:vir:10 335 GV 336 (336) T ss_pred cC Confidence 54 No 209 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=95.98 E-value=0.0012 Score=36.67 Aligned_cols=258 Identities=14% Similarity=0.103 Sum_probs=114.4 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHH-----hhhhcccccc-cc---cccCCCceEEEEeeccCCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-----LRFASFAEVD-ST---LQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~-----~v~~~~~~~~-~~---~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~ 71 (274) ||+ +.|.|.|..+..++.+..... ..|++-..+. .+ +.+..+..+-.|+ ++++.+-+. T Consensus 1 M~~----i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~---------v~~~~~~~~ 67 (348) T protein:vir:96 1 MGL----IYDKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKA---------AAFDTNVTI 67 (348) T ss_pred Ccc----hhhccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeee---------ecCCCCcce Confidence 664 477899998888886543221 1122221111 11 1222222222222 222222111 Q ss_pred c-ccccceeEEEeeeecceeeee--HHHHh----hc-CccHHHHHHHHHH-------HHHHHHHHHHHHHHhhccc---- Q lcl|NC_010147. 72 D-ILETKKREAKIRKIAKGTSIT--DEALL----SG-YGDPQGEQVRQHG-------LAHANKVDNDVLEALMGAK---- 132 (274) Q Consensus 72 ~-~~t~~~~~~~~~~~~~~~~vt--d~~~~----~~-~~d~~~~~~~~~a-------~~~a~~~d~~~~~~~~~a~---- 132 (274) . .-........+-+++-...++ |.... .+ .......+.++++ +.+.++++..+...+.+.. T Consensus 68 ~~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~ 147 (348) T protein:vir:96 68 RDRVSAEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFT 147 (348) T ss_pred ecccceeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEee Confidence 1 111222222332222222222 21111 11 1122233333333 3444455555555554211 Q ss_pred ---------------c--cccccc-----cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccccc-cccccc Q lcl|NC_010147. 133 ---------------L--TVNADI-----TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRA-TELGDD 189 (274) Q Consensus 133 ---------------~--~~~~~~-----~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~-s~~~~~ 189 (274) + +..+.+ ..+++|-++...+.+.+..+..++|+++++..|++++.+.-.-. .....+ T Consensus 148 ~~~~~~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~ 227 (348) T protein:vir:96 148 SDGVNKDIDYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGS 227 (348) T ss_pred cCCeeEEEeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccc Confidence 0 111111 12456666666777777788899999999999987654332211 111111 Q ss_pred ce----eccccceeccceEEEcC------------CCCcceEEEEeCCeEEEEeecCce----eeeecch---------- Q lcl|NC_010147. 190 II----VKGAFGEALGAIIVRTN------------KLEAGTAILAKKGAVKLILKRDFF----LEVARDA---------- 239 (274) Q Consensus 190 ~~----~~g~ig~~~G~~Vv~s~------------~v~~~~~~~~~~~a~~~~~~~~~~----ve~~rd~---------- 239 (274) .+ ....++.+.|++|++=+ .+|+++.+++..+..+...-.++. ...++.. T Consensus 228 ~~~~~~~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~ 307 (348) T protein:vir:96 228 SVTKAELQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSG 307 (348) T ss_pred cccHHHHHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCe Confidence 11 22344566788776522 245667777776654433211110 0000010 Q ss_pred ---------hhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 240 ---------STKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 240 ---------~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) .-....+.+-.+.--.+.+|+++.++++-+|. T Consensus 308 ~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 308 IAVTTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred eEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 00011222223333445578888888887777 No 210 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=95.91 E-value=0.0013 Score=36.46 Aligned_cols=257 Identities=14% Similarity=0.089 Sum_probs=137.9 Q ss_pred eeeeeechHHHHHHHHHHHHHHhhhhcc-cccccccccC-CCceEEEEeeccCCc-cccc--------cCCCcCCccccc Q lcl|NC_010147. 7 KTSNQIIPEVLAPMMQAQLEKKLRFASF-AEVDSTLQGQ-PGDTLTFPAFVYSGD-AQVV--------AEGEKIPTDILE 75 (274) Q Consensus 7 ~~~~~~~Pev~~~~v~~~~~~~~v~~~~-~~~~~~~~~~-~g~tv~ip~~~~~~~-~~~~--------~eg~~i~~~~~t 75 (274) |..-.+.+| |..++..-+..++.|... +-..-.+.|. ..++.-.-+-++++- ...| +.|+... -- T Consensus 1 ~avr~y~Kq-~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~s---sR 76 (287) T protein:vir:39 1 MAIKYFTKQ-YAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNT---SR 76 (287) T ss_pred CCcccccHH-HHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCcc---cc Confidence 233345554 788888888888887542 1100111111 112221112122110 1122 2222111 11 Q ss_pred cceeEEEe--e-e--ecceeeeeH-HHHhhcCccHHHHHH---HHHHHHHHHHHHHHHHHHhhcccccccccccCHHHHH Q lcl|NC_010147. 76 TKKREAKI--R-K--IAKGTSITD-EALLSGYGDPQGEQV---RQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQ 146 (274) Q Consensus 76 ~~~~~~~~--~-~--~~~~~~vtd-~~~~~~~~d~~~~~~---~~~a~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~ 146 (274) ++...-.+ . . +...|.+-. .+...-+.|+.+.++ ...+.+|++.+|..+-..+........+-.++-|.+. T Consensus 77 FG~rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~~~t~d~V~ 156 (287) T protein:vir:39 77 FGQRKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTVKLDEDSVT 156 (287) T ss_pred ccceeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheeeeecccchH Confidence 12111111 0 0 111222221 123333445555554 4568999999999887777655443333336666665 Q ss_pred HHHHHHhhc----CCC---ceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCCcceEEE Q lcl|NC_010147. 147 SAIDKFNDE----DLE---PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEAGTAIL 217 (274) Q Consensus 147 ~A~~~l~~~----~~~---~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~~~~~~~ 217 (274) .+...+.+. +.+ +-...|||++|..|...++-+....|. .++ =+..+-++.|+.+...+ ....|...+ T Consensus 157 ~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~Ss--aNi-Den~i~kFkGf~l~e~P~~~~q~g~~a~ 233 (287) T protein:vir:39 157 KLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSS--ANV-DEQTLYKFKGFILSELPDEKFQLNEGAY 233 (287) T ss_pred HHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccccce--eee-ccCCcceecceEEEecchHhhccCcEEE Confidence 555554432 332 346789999999998776543333322 122 23346789999998876 567889999 Q ss_pred EeCCeEEEEeecCcee-eeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCC Q lcl|NC_010147. 218 AKKGAVKLILKRDFFL-EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) Q Consensus 218 ~~~~a~~~~~~~~~~v-e~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~ 271 (274) |.+..++.+- .++++ .+-..+.+....+.+---||-.+.+..+.+.++.+.-- T Consensus 234 fs~dnig~af-~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 234 FAADNVGVAG-VGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred Eccccceeec-ccceeEEeeecccccceeeecccccccccccccceEEEEEecCC Confidence 9999888753 33443 34456677888999999999999977777666543322 No 211 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=95.91 E-value=0.001 Score=36.94 Aligned_cols=257 Identities=13% Similarity=0.052 Sum_probs=136.0 Q ss_pred CCC-----------------ccc-eeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCcccc Q lcl|NC_010147. 1 MPQ-----------------GIT-KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV 62 (274) Q Consensus 1 Ma~-----------------~~T-~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~ 62 (274) |-+ +.| +=.+-+.|+-+-..|...+.....+.+.-.+.+. |+--+..|. +..-.+.- T Consensus 101 ~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~----p~l~V~~~~-dt~~qa~g 175 (400) T protein:vir:93 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV----GALLVSRSF-DSANEAQV 175 (400) T ss_pred HhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecC----Cceeeecch-hhhcccce Confidence 221 122 1122245655555555555554444443222110 111111111 11111112 Q ss_pred ccCCCcCCccccccceeEEEeeeecceeeeeHHHHhh--cCccHHHHHHHHHHHHHHHH-HHHHHHHHhhcccc------ Q lcl|NC_010147. 63 VAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS--GYGDPQGEQVRQHGLAHANK-VDNDVLEALMGAKL------ 133 (274) Q Consensus 63 ~~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~--~~~d~~~~~~~~~a~~~a~~-~d~~~~~~~~~a~~------ 133 (274) ...|++=..+.++....++++...++..++.+.-... +++-+...+..++..++-++ ++.+++-.-.+... T Consensus 176 Hk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~ 255 (400) T protein:vir:93 176 HKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKE 255 (400) T ss_pred eccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcch Confidence 4456667788888888888888888777775433222 23456888889999888865 68776644111110 Q ss_pred ------------cccccccCHHHHHHH-HHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceec Q lcl|NC_010147. 134 ------------TVNADITKLNGLQSA-IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) Q Consensus 134 ------------~~~~~~~~~d~i~~A-~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~ 200 (274) +-.+..+.+.++.+- ..-......+...+|++|..++.|+.....+...--..++ ..-.|.+-. T Consensus 256 t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n---~d~~IA~~f 332 (400) T protein:vir:93 256 ADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKN---DDTEIASEV 332 (400) T ss_pred hhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeecc---ccchhhhhc Confidence 001222333333322 2211112235566788889898887643211110000111 123456667 Q ss_pred cc-eEEEcCCCCcce-EEEEeC-CeEEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 201 GA-IIVRTNKLEAGT-AILAKK-GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 201 G~-~Vv~s~~v~~~~-~~~~~~-~a~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) |+ ++|+...+|... .++++. .++ ...+...-.+++...-+..+-++.+.+.++--|.+.++++.+ T Consensus 333 Gv~~Lv~~Tr~~~~kp~V~VDek~~i---~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 333 GVDEIIVYTGSKALKPTVLVDQKYHI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ccceeeeeccCCCCCceeeeehhhhc---cccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 76 577777776433 233322 222 234444556777788888999999999999999999999998 No 212 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=94.98 E-value=0.003 Score=34.39 Aligned_cols=255 Identities=11% Similarity=0.058 Sum_probs=129.5 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhh---hcccccccccccCCCceEEEEeeccCC-ccc--cccCCC---cCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRF---ASFAEVDSTLQGQPGDTLTFPAFVYSG-DAQ--VVAEGE---KIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~---~~~~~~~~~~~~~~g~tv~ip~~~~~~-~~~--~~~eg~---~i~~ 71 (274) |++......| | +++++.+.-..+..+- ++.+. .+....++|.|+... ... ..+.+. .+++ T Consensus 2 ~~~~~~~~~d---p-~LT~~A~gy~n~~~ia~~l~P~vp-------v~~~~~k~~~f~~eaF~~~~t~r~~~~~~~~v~~ 70 (307) T protein:vir:10 2 GRLSKLRIVD---P-VLTNLAIGYTNAEFIGQSLMPVVE-------VEKEGGKIPKFGKESFRLYKTERALRARSNRMNP 70 (307) T ss_pred CCCCCCcccC---h-hHHHHHHhhcchhhhhhhcCCccc-------ccccccceeeECcccccchhhhcccCCCcceeec Confidence 4444344433 4 3555443222222111 12221 123345667764211 000 111221 1222 Q ss_pred cccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccccccc--- Q lcl|NC_010147. 72 DILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------LTVNADI--- 139 (274) Q Consensus 72 ~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------~~~~~~~--- 139 (274) ...+. ....+...+-...+++.+...+..|+.+...+.+.+.|.+..+..+...+.... ++.+..+ T Consensus 71 ~~~~~--~~~~~~~~~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~ 148 (307) T protein:vir:10 71 EDLGS--IDIVLDEHDLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAA 148 (307) T ss_pred ccccc--cccccccccccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCC Confidence 22222 223334444456677766667778899999999999888888877665543211 1111111 Q ss_pred --cCHHHHHHHHHHHhh-cCCCceEEEEcHHHHHHHHhhccccc-cccccccccceeccccceeccce-EEEcCCC---- Q lcl|NC_010147. 140 --TKLNGLQSAIDKFND-EDLEPMVLFINPLDAGKLRGDASTNF-TRATELGDDIIVKGAFGEALGAI-IVRTNKL---- 210 (274) Q Consensus 140 --~~~d~i~~A~~~l~~-~~~~~~~~vv~p~~~~~L~k~~~~~~-~~~s~~~~~~~~~g~ig~~~G~~-Vv~s~~v---- 210 (274) -....|.++..++.+ -+..+..++++++++..|++++.+.- +..+ +.+.+..-.+..++|+. |++.... T Consensus 149 ~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~--~~g~it~~~la~ll~v~~i~vg~a~~~~~ 226 (307) T protein:vir:10 149 GSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYS--MKGIVTVDLLKEIFEVENIAVGEAIYADD 226 (307) T ss_pred CCCcHHHHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCc--cccccCHHHHHHHhCceeEEEeeeeeecc Confidence 224566667777654 45789999999999999988764321 2221 23445555667788864 4442221 Q ss_pred -------CcceEEEEe--------C-----CeEEEEee-cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 211 -------EAGTAILAK--------K-----GAVKLILK-RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 211 -------~~~~~~~~~--------~-----~a~~~~~~-~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) ..+.+++.. . ..|||-.+ .+-.+...|....++..+++..++--.++-|++-..|+-+- T Consensus 227 ~~~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~ 306 (307) T protein:vir:10 227 KDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGIN 306 (307) T ss_pred CCccceeCCCceEEEecccccCCCCCcccccccceeEEEcCCeEeeceecCCceeEEeccccccceeecccccceeccCC Confidence 111222211 1 13555432 33334344556677777877766655555555555554333 Q ss_pred C Q lcl|NC_010147. 270 G 270 (274) Q Consensus 270 a 270 (274) . T Consensus 307 ~ 307 (307) T protein:vir:10 307 G 307 (307) T ss_pred C Confidence 3 No 213 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=94.04 E-value=0.0056 Score=32.94 Aligned_cols=211 Identities=11% Similarity=0.051 Sum_probs=114.6 Q ss_pred CC---Cccceeee---eechHHHHHHHHHHHHHHhh-hhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccc Q lcl|NC_010147. 1 MP---QGITKTSN---QIIPEVLAPMMQAQLEKKLR-FASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) Q Consensus 1 Ma---~~~T~~~~---~~~Pev~~~~v~~~~~~~~v-~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~ 73 (274) || ++.-++.+ .+.|.-....|.|.+.+.+- +.-+-..... ...|..-.+ ...+|++.|..-++.+++++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N--~~tg~~t~v--rt~LP~~~fR~lN~g~~~s~ 76 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGN--LPTGHRTSV--RTGLPTPTWRKLYGGVLPNK 76 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhcc--CCcccceeE--EeecCCchhhhcCCcccccc Confidence 65 33333434 24454444456666655333 3332222111 111222222 24578888988889999999 Q ss_pred cccceeEEEeeeecceeeeeHHHHhhcCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------hhc------c-- Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA-----------LMG------A-- 131 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~~~~~-----------~~~------a-- 131 (274) .++.+++.....++..+.|+... .+..++.. ....+...+.+.+++...+|.. |.. + T Consensus 77 ~tt~qvt~~l~ilgg~~eVDr~l-a~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~ 155 (330) T protein:vir:10 77 SSTAQVTDNCGMLEAYAEVDKAL-ADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) T ss_pred ceEEEEEEEeEEecchhhhhhHH-HhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCc Confidence 99999999999999999998754 44445554 3344557777777777666632 100 0 Q ss_pred ------------cc-------------------------c-----------ccc-------------------------- Q lcl|NC_010147. 132 ------------KL-------------------------T-----------VNA-------------------------- 137 (274) Q Consensus 132 ------------~~-------------------------~-----------~~~-------------------------- 137 (274) .. . .++ T Consensus 156 ~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~v 235 (330) T protein:vir:10 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) T ss_pred hhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccE Confidence 00 0 000 Q ss_pred --------cc----cCHHHHHH----HHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccceecc Q lcl|NC_010147. 138 --------DI----TKLNGLQS----AIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG 201 (274) Q Consensus 138 --------~~----~~~d~i~~----A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G 201 (274) .. ...++|++ |..++-.......+++||-.+...|+++.... ....++..-.-.-.+-.+.| T Consensus 236 vRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k--~n~~l~~~~~~g~~~t~~~g 313 (330) T protein:vir:10 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) T ss_pred EEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhc--ccceeeeeecCCeeeEEECC Confidence 00 01223333 44444333335568999999999998864211 11111111111112356899 Q ss_pred ceEEEcCCCCcceEEEE Q lcl|NC_010147. 202 AIIVRTNKLEAGTAILA 218 (274) Q Consensus 202 ~~Vv~s~~v~~~~~~~~ 218 (274) +||..++.+=....-++ T Consensus 314 ipir~~Dail~tE~~vv 330 (330) T protein:vir:10 314 IPVQRTDALLNTESRVV 330 (330) T ss_pred eEEEEEeeeecCccccC Confidence 99999888755554444 No 214 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=256 Identities=12% Similarity=-0.015 Sum_probs=112.5 Q ss_pred eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeec-cCCccccccCC---CcCCccccccceeEEEee Q lcl|NC_010147. 9 SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEG---EKIPTDILETKKREAKIR 84 (274) Q Consensus 9 ~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~~~~~~eg---~~i~~~~~t~~~~~~~~~ 84 (274) -|+|.+..+..++.+......-+...-..... ..+..+|.+=... ...-+..+.++ ..+.... ......++- T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~d~~fp~~~--~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~--~~~~~~~~p 76 (341) T protein:vir:34 1 MSMYTTAQLLAANEQKFKFDPLFLRLFFRESY--PFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRG--GSTSEFTPG 76 (341) T ss_pred CCCcCHHHHHHHHHhccCccchhHHhcCCccc--ccccceEEEEEeeCCeeEEEeecCCCCcceeccCc--eeeeEEecC Confidence 67788888877776543322222222111100 0011222221111 01111122222 2222222 222233333 Q ss_pred eecceeeee--HHHHhhc------CccHHHHHHHHHHH-------HHHHHHHHHHHHHhhccc----------------- Q lcl|NC_010147. 85 KIAKGTSIT--DEALLSG------YGDPQGEQVRQHGL-------AHANKVDNDVLEALMGAK----------------- 132 (274) Q Consensus 85 ~~~~~~~vt--d~~~~~~------~~d~~~~~~~~~a~-------~~a~~~d~~~~~~~~~a~----------------- 132 (274) ++.....++ |...... ..++.+.+.+.+.+ .+.+.++..+...+.+.. T Consensus 77 ~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDfg~ 156 (341) T protein:vir:34 77 YVKPKHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGR 156 (341) T ss_pred ccCccceeCHHHHHHHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEeCC Confidence 333333333 3322221 12233333333333 455555555666664211 Q ss_pred ---cc----ccccc-----cCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccccc-ccccccce-------e Q lcl|NC_010147. 133 ---LT----VNADI-----TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRA-TELGDDII-------V 192 (274) Q Consensus 133 ---~~----~~~~~-----~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~-s~~~~~~~-------~ 192 (274) +. ....+ ..++.+-+....+...+..+..++|+++++..|++++.+.-.-. .....+.+ . T Consensus 157 ~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 236 (341) T protein:vir:34 157 SEENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLG 236 (341) T ss_pred CCccceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhcccccccccccccccc Confidence 00 11111 23455666566666677788899999999999987654321111 11111101 1 Q ss_pred cc--ccceeccceEEEcC-----------CCCcceEEEEeCCeEEE-EeecCcee--------eeec------ch-hhcc Q lcl|NC_010147. 193 KG--AFGEALGAIIVRTN-----------KLEAGTAILAKKGAVKL-ILKRDFFL--------EVAR------DA-STKT 243 (274) Q Consensus 193 ~g--~ig~~~G~~Vv~s~-----------~v~~~~~~~~~~~a~~~-~~~~~~~v--------e~~r------d~-~~~~ 243 (274) .| ..+++.|++|++=+ .+|++.++++..+..+. ..+..... +..+ .. .-.. T Consensus 237 ~~~~~~~~~~g~~i~~y~~~y~ddG~~~~~ip~~~v~l~p~g~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~ 316 (341) T protein:vir:34 237 KAVSYKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAR 316 (341) T ss_pred cceeeeeecCCceEEEEcCEEEECCcEEeeecCCeEEEeeCCCcceEEEeecccccccccceeeeeEeeeeeeecCCCcE Confidence 11 23456788775422 26888888887765432 11111111 1111 00 0112 Q ss_pred eEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 244 TALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 244 ~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) -.+.+-.+--..+.+|+++++.+++ T Consensus 317 ~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 317 EFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred EEEEEcccceeeeeCCCcEEEEEeC Confidence 2233444444566789999999988 No 215 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=93.33 E-value=0.0079 Score=32.10 Aligned_cols=257 Identities=9% Similarity=-0.011 Sum_probs=131.8 Q ss_pred CCCc-----cceeeeeechHHHHHHHHHHHHHHhh----hhcccccccccccCC-CceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQG-----ITKTSNQIIPEVLAPMMQAQLEKKLR----FASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~-----~T~~~~~~~Pev~~~~v~~~~~~~~v----~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) +||- -++..+.=+|-.+..++..++.+-+. ..++.-+.. .|.- -.+++++.....|.+..|+.++++| T Consensus 61 ~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t--~g~W~~~t~ty~~~e~~G~A~~ygd~~D~P 138 (382) T protein:vir:96 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDT--VGSWEDQEIVQGIVEPAGTAVEYGDHTNIP 138 (382) T ss_pred cccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccc--cCCccceEEEEeeeecccceEEeecccCCC Confidence 3332 12333444588888887766554322 223332222 1222 2578899988889999999999999 Q ss_pred ccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHh----h----c--------c Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEAL----M----G--------A 131 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~----~----~--------a 131 (274) ..+........++....-.+.+.+++... .+.++.+.-.....+++.+.+++-.+-.- . + + T Consensus 139 l~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a 218 (382) T protein:vir:96 139 LTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPP 218 (382) T ss_pred ccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCccc Confidence 98888777777777766677776554443 46778888778888888888876544211 1 0 0 Q ss_pred cc-ccccccc--C----HHHHHHHHHHHhhcC-------CCceEEEEcHHHHHHHHhhccccccccccccccceeccccc Q lcl|NC_010147. 132 KL-TVNADIT--K----LNGLQSAIDKFNDED-------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG 197 (274) Q Consensus 132 ~~-~~~~~~~--~----~d~i~~A~~~l~~~~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig 197 (274) .. .+...+. + +++|..+...+...- .....++++|..+..|-+-+ ..+..+ .+-.-- T Consensus 219 ~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n--------~~g~Tv-l~~lk~ 289 (382) T protein:vir:96 219 FQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTT--------PYGISV-SDWIEQ 289 (382) T ss_pred ccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccccC--------ccCccH-HHHHHH Confidence 01 1112222 2 455666766664321 12447899999887774321 111111 000001 Q ss_pred eeccceEEEcCCCC---------cceEEEEeCCeEEEEee---cCcee-eeecc------h--hhcceEEEEE-EEEEEE Q lcl|NC_010147. 198 EALGAIIVRTNKLE---------AGTAILAKKGAVKLILK---RDFFL-EVARD------A--STKTTALYSD-KHYVAY 255 (274) Q Consensus 198 ~~~G~~Vv~s~~v~---------~~~~~~~~~~a~~~~~~---~~~~v-e~~rd------~--~~~~~~v~~~-~~yg~~ 255 (274) ++-+++|+.-+.+. ..-.|++.+..-..... .+... |.-|. . .......... ...|+- T Consensus 290 n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~ 369 (382) T protein:vir:96 290 TYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGAL 369 (382) T ss_pred hcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeE Confidence 13355555544441 01112222211000000 00000 00000 0 0011111111 347888 Q ss_pred EEcCccEEEEEec Q lcl|NC_010147. 256 LYDESKAVKITKG 268 (274) Q Consensus 256 ~~~~~~~v~~~~~ 268 (274) +..|.++++++.= T Consensus 370 i~~P~ai~~~~GI 382 (382) T protein:vir:96 370 CKRPWAVVRYLGI 382 (382) T ss_pred EEcchhhhhccCC Confidence 8899999987754 No 216 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=93.30 E-value=0.0081 Score=32.06 Aligned_cols=256 Identities=11% Similarity=0.019 Sum_probs=132.6 Q ss_pred CCCc-----c------ceeeeeechHHHHHHHHHHHHHHhh----hhcccccccccccCC-CceEEEEeeccCCcccccc Q lcl|NC_010147. 1 MPQG-----I------TKTSNQIIPEVLAPMMQAQLEKKLR----FASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVA 64 (274) Q Consensus 1 Ma~~-----~------T~~~~~~~Pev~~~~v~~~~~~~~v----~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~ 64 (274) |... . ...++.=+|..+..++ -.+.+-+. ..++.-+.. .|.- -.++.++.....|.+..|+ T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~-p~~i~~~tap~~a~~l~pv~t--~g~W~~~~~~~~v~e~~G~A~~yg 132 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWL-PGHVRILTAVREADEFLGLST--VGQWDDEQIVQRVLEGLGTAQPYT 132 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhc-chHHHHHhhhhhhhhhccccc--CCCceeeeEEEeeeeeeeeeEEec Confidence 2211 0 1122334566666665 23333221 112221211 1211 2577888888889999999 Q ss_pred CCCcCCccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc----------- Q lcl|NC_010147. 65 EGEKIPTDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMG----------- 130 (274) Q Consensus 65 eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~----------- 130 (274) .++++|..+........+++.+...+.+++++... .+.++.++-.....+++.+.+++-.+-.... T Consensus 133 d~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNd 212 (379) T protein:vir:10 133 DGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLND 212 (379) T ss_pred cccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeC Confidence 99999999988888888888777777777764443 4677888888888888888888754433110 Q ss_pred ----cccccc------cccc--C----HHHHHHHHHHHhhc--C-----CCceEEEEcHHHHHHHHhhcccccccccccc Q lcl|NC_010147. 131 ----AKLTVN------ADIT--K----LNGLQSAIDKFNDE--D-----LEPMVLFINPLDAGKLRGDASTNFTRATELG 187 (274) Q Consensus 131 ----a~~~~~------~~~~--~----~d~i~~A~~~l~~~--~-----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~ 187 (274) +..+.. ..+. + +++|..+...+-.. + .....++++|..+..|-+-+. .+ T Consensus 213 P~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n~--------~g 284 (379) T protein:vir:10 213 PNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPTE--------LG 284 (379) T ss_pred CCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccccc--------cC Confidence 000110 1111 2 34555555554321 1 233479999999998854221 11 Q ss_pred ccceeccccceeccceEEEcCCCCc-----ceEEEEeCCeEEEEeecCcee-----eeec----chhhcceEEEEE-EEE Q lcl|NC_010147. 188 DDIIVKGAFGEALGAIIVRTNKLEA-----GTAILAKKGAVKLILKRDFFL-----EVAR----DASTKTTALYSD-KHY 252 (274) Q Consensus 188 ~~~~~~g~ig~~~G~~Vv~s~~v~~-----~~~~~~~~~a~~~~~~~~~~v-----e~~r----d~~~~~~~v~~~-~~y 252 (274) ..+ .+-.--+|-+++|+..+.+.. ...+++.+..-+.-...+-.+ |..| ........+... ... T Consensus 285 ~Tv-l~~lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~ 363 (379) T protein:vir:10 285 YSV-AQYMRESYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATA 363 (379) T ss_pred ccH-HHHHHHhcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccceecCceeEecccccee Confidence 111 000011244566666655531 123444332111000000000 0001 011122223333 346 Q ss_pred EEEEEcCccEEEEEec Q lcl|NC_010147. 253 VAYLYDESKAVKITKG 268 (274) Q Consensus 253 g~~~~~~~~~v~~~~~ 268 (274) |+-+..|.+++++..+ T Consensus 364 Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 364 GAMLKRPFATYRQTGA 379 (379) T ss_pred eeeeecchhhheecCC Confidence 8888999999999888 No 217 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=93.17 E-value=0.0085 Score=31.93 Aligned_cols=210 Identities=10% Similarity=0.056 Sum_probs=111.5 Q ss_pred CCCccc---eeeee---echH-HHHHHHHHHHHHHhh-hhcccccccccccC--CCceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGIT---KTSNQ---IIPE-VLAPMMQAQLEKKLR-FASFAEVDSTLQGQ--PGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T---~~~~~---~~Pe-v~~~~v~~~~~~~~v-~~~~~~~~~~~~~~--~g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |+..-+ ++.+. +.|. .....|.|.+.+.+- +.-+-.. ++. .|... .....++.+.|..-++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~----e~N~~t~~~~--~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKT--TVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceee----eccCCcccee--eEEeccCCchhhccCCccC Confidence 775322 23232 2232 122334444444322 2222221 221 12222 2335678899988889999 Q ss_pred ccccccceeEEEeeeecceeeeeHHHHhhcCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------hhc------ Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA-----------LMG------ 130 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~~~~~-----------~~~------ 130 (274) +++.++..++.....++..+.|+...... .++.. ....++..+.+.+++...+|.. |.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~-~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~ 153 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccc Confidence 99999999999999999999998754444 34444 3344556777777777666632 000 Q ss_pred c---ccc------------------------------------c-----------cc----------------------- Q lcl|NC_010147. 131 A---KLT------------------------------------V-----------NA----------------------- 137 (274) Q Consensus 131 a---~~~------------------------------------~-----------~~----------------------- 137 (274) + .+. + .+ T Consensus 154 a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~ 233 (331) T protein:vir:10 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) T ss_pred cccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCccc Confidence 0 000 0 00 Q ss_pred ----cccC--------------HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 138 ----DITK--------------LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 138 ----~~~~--------------~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) ..++ .+.+++|..++-.......+++||-.+...|+++....... +-...+-.-.-.+-.+ T Consensus 234 v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~-~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:10 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA-STLTMEEIAGKKVVAF 312 (331) T ss_pred EEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce-eeeeeeecCCcceeEE Confidence 0010 12234444444433345578999999999998864221110 0000000111134568 Q ss_pred ccceEEEcCCCCcceEEEE Q lcl|NC_010147. 200 LGAIIVRTNKLEAGTAILA 218 (274) Q Consensus 200 ~G~~Vv~s~~v~~~~~~~~ 218 (274) .|+||..++.+-....-++ T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 9999998888755554444 No 218 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=93.17 E-value=0.0085 Score=31.93 Aligned_cols=210 Identities=10% Similarity=0.056 Sum_probs=111.5 Q ss_pred CCCccc---eeeee---echH-HHHHHHHHHHHHHhh-hhcccccccccccC--CCceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGIT---KTSNQ---IIPE-VLAPMMQAQLEKKLR-FASFAEVDSTLQGQ--PGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T---~~~~~---~~Pe-v~~~~v~~~~~~~~v-~~~~~~~~~~~~~~--~g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |+..-+ ++.+. +.|. .....|.|.+.+.+- +.-+-.. ++. .|... .....++.+.|..-++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~----e~N~~t~~~~--~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKT--TVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceee----eccCCcccee--eEEeccCCchhhccCCccC Confidence 775322 23232 2232 122334444444322 2222221 221 12222 2335678899988889999 Q ss_pred ccccccceeEEEeeeecceeeeeHHHHhhcCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------hhc------ Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA-----------LMG------ 130 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~~~~~-----------~~~------ 130 (274) +++.++..++.....++..+.|+...... .++.. ....++..+.+.+++...+|.. |.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~-~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~ 153 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccc Confidence 99999999999999999999998754444 34444 3344556777777777666632 000 Q ss_pred c---ccc------------------------------------c-----------cc----------------------- Q lcl|NC_010147. 131 A---KLT------------------------------------V-----------NA----------------------- 137 (274) Q Consensus 131 a---~~~------------------------------------~-----------~~----------------------- 137 (274) + .+. + .+ T Consensus 154 a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~ 233 (331) T protein:vir:10 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) T ss_pred cccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCccc Confidence 0 000 0 00 Q ss_pred ----cccC--------------HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 138 ----DITK--------------LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 138 ----~~~~--------------~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) ..++ .+.+++|..++-.......+++||-.+...|+++....... +-...+-.-.-.+-.+ T Consensus 234 v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~-~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:10 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA-STLTMEEIAGKKVVAF 312 (331) T ss_pred EEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce-eeeeeeecCCcceeEE Confidence 0010 12234444444433345578999999999998864221110 0000000111134568 Q ss_pred ccceEEEcCCCCcceEEEE Q lcl|NC_010147. 200 LGAIIVRTNKLEAGTAILA 218 (274) Q Consensus 200 ~G~~Vv~s~~v~~~~~~~~ 218 (274) .|+||..++.+-....-++ T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 9999998888755554444 No 219 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=93.17 E-value=0.0085 Score=31.93 Aligned_cols=210 Identities=10% Similarity=0.056 Sum_probs=111.5 Q ss_pred CCCccc---eeeee---echH-HHHHHHHHHHHHHhh-hhcccccccccccC--CCceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGIT---KTSNQ---IIPE-VLAPMMQAQLEKKLR-FASFAEVDSTLQGQ--PGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T---~~~~~---~~Pe-v~~~~v~~~~~~~~v-~~~~~~~~~~~~~~--~g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |+..-+ ++.+. +.|. .....|.|.+.+.+- +.-+-.. ++. .|... .....++.+.|..-++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~----e~N~~t~~~~--~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKT--TVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceee----eccCCcccee--eEEeccCCchhhccCCccC Confidence 775322 23232 2232 122334444444322 2222221 221 12222 2335678899988889999 Q ss_pred ccccccceeEEEeeeecceeeeeHHHHhhcCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------hhc------ Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA-----------LMG------ 130 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~~~~~-----------~~~------ 130 (274) +++.++..++.....++..+.|+...... .++.. ....++..+.+.+++...+|.. |.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~-~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~ 153 (331) T protein:vir:98 75 PEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccc Confidence 99999999999999999999998754444 34444 3344556777777777666632 000 Q ss_pred c---ccc------------------------------------c-----------cc----------------------- Q lcl|NC_010147. 131 A---KLT------------------------------------V-----------NA----------------------- 137 (274) Q Consensus 131 a---~~~------------------------------------~-----------~~----------------------- 137 (274) + .+. + .+ T Consensus 154 a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~ 233 (331) T protein:vir:98 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) T ss_pred cccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCccc Confidence 0 000 0 00 Q ss_pred ----cccC--------------HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceecccccee Q lcl|NC_010147. 138 ----DITK--------------LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) Q Consensus 138 ----~~~~--------------~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~ 199 (274) ..++ .+.+++|..++-.......+++||-.+...|+++....... +-...+-.-.-.+-.+ T Consensus 234 v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~-~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:98 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAA-STLTMEEIAGKKVVAF 312 (331) T ss_pred EEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccce-eeeeeeecCCcceeEE Confidence 0010 12234444444433345578999999999998864221110 0000000111134568 Q ss_pred ccceEEEcCCCCcceEEEE Q lcl|NC_010147. 200 LGAIIVRTNKLEAGTAILA 218 (274) Q Consensus 200 ~G~~Vv~s~~v~~~~~~~~ 218 (274) .|+||..++.+-....-++ T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 9999998888755554444 No 220 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=92.95 E-value=0.0093 Score=31.71 Aligned_cols=268 Identities=11% Similarity=0.050 Sum_probs=141.6 Q ss_pred CCCccceee---eeechHHHHHHHHHHHHHHhhhhcc-cccccccccC-CCceEEEEeeccCCccc--cccCCCcCCcc- Q lcl|NC_010147. 1 MPQGITKTS---NQIIPEVLAPMMQAQLEKKLRFASF-AEVDSTLQGQ-PGDTLTFPAFVYSGDAQ--VVAEGEKIPTD- 72 (274) Q Consensus 1 Ma~~~T~~~---~~~~Pev~~~~v~~~~~~~~v~~~~-~~~~~~~~~~-~g~tv~ip~~~~~~~~~--~~~eg~~i~~~- 72 (274) .|..+|.-. -.+.+| |...+..-+..+.+|... +-..-.+.|. ..++.-.-+-++++-+- .|.-++..... T Consensus 17 ~~~~t~N~n~avr~Y~Kq-f~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGt 95 (314) T protein:vir:98 17 FASGTANQNKAARSYQKE-FRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGE 95 (314) T ss_pred eeeccccCccceeeecHH-HHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCccccc Confidence 333332211 123444 777777777888877542 1100111121 11222111112211110 12111111111 Q ss_pred ----ccccceeEEEee-----eecceeeeeH-HHHhhcCccHHHHHHHH---HHHHHHHHHHHHHHHHhhcccc-ccccc Q lcl|NC_010147. 73 ----ILETKKREAKIR-----KIAKGTSITD-EALLSGYGDPQGEQVRQ---HGLAHANKVDNDVLEALMGAKL-TVNAD 138 (274) Q Consensus 73 ----~~t~~~~~~~~~-----~~~~~~~vtd-~~~~~~~~d~~~~~~~~---~a~~~a~~~d~~~~~~~~~a~~-~~~~~ 138 (274) .--++...-.+. .+...|.+-. ++...-+.|+.+.+++| .+.+|++.+|..+-..+..... +.... T Consensus 96 GTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~lt 175 (314) T protein:vir:98 96 GTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLT 175 (314) T ss_pred CCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh Confidence 111222111110 0111222221 22333455565555554 6789999999887766654332 23333 Q ss_pred ccCHHHHHHHHHHHhhcCC-----CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCC Q lcl|NC_010147. 139 ITKLNGLQSAIDKFNDEDL-----EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLE 211 (274) Q Consensus 139 ~~~~d~i~~A~~~l~~~~~-----~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~ 211 (274) .++.|.+..+...+.+... .+-...|||++|..|...++-+....|. .++ -+.-+-++.|+.+...+ .+. T Consensus 176 d~~~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~Ss--aNI-Dengi~~FkGf~i~e~P~~~~q 252 (314) T protein:vir:98 176 DYSADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDHPLTTSAKSSS--ANI-DQNGIVNFKGFAIQEIPESMLQ 252 (314) T ss_pred hcchhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhccccccccccce--eee-ccCCcceecceEEEecchhhcC Confidence 4556777666666654333 3456789999999998776543333322 122 23336789999988765 577 Q ss_pred cceEEEEeCCeEEEEeecCcee-eeecchhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 212 AGTAILAKKGAVKLILKRDFFL-EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 212 ~~~~~~~~~~a~~~~~~~~~~v-e~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~ 273 (274) .+...++....++..- .++++ .+-..+.+....+.+---||-.+.+..+.+.+++++-++- T Consensus 253 ~g~ia~~s~dnig~af-tGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~tp~~ 314 (314) T protein:vir:98 253 SGDVAYTYITNIGKAF-TGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTSTPEG 314 (314) T ss_pred CCcEEEEccccceeec-ccceeeeeeecccccceeeecccccccccccccceeeEEEecCCCC Confidence 7887777777777642 23333 3444567788899999999999998888888777766655 No 221 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=92.87 E-value=0.0097 Score=31.63 Aligned_cols=257 Identities=11% Similarity=-0.001 Sum_probs=105.2 Q ss_pred CCCccce---------eeeeechHHHHHHHHHHHHHHhh---hhcccccccccccCCCceEEEEeec-cCCc-cccccCC Q lcl|NC_010147. 1 MPQGITK---------TSNQIIPEVLAPMMQAQLEKKLR---FASFAEVDSTLQGQPGDTLTFPAFV-YSGD-AQVVAEG 66 (274) Q Consensus 1 Ma~~~T~---------~~~~~~Pev~~~~v~~~~~~~~v---~~~~~~~~~~~~~~~g~tv~ip~~~-~~~~-~~~~~eg 66 (274) |.|..++ +.|.|.+..+..++.+.-.+... |++-..+ .+..+.+.... ..+. +...+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~~~~l~~~~Fp~~~~-------~~~~~~~~~~~~~~~~~a~~v~~~ 73 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTRNRQYPEMLGDTLFPAVKV-------PTLEVDILKAGSRVPTIASVSAFD 73 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHHhcCcchhhHhhcCCcccc-------ccceeEEEeeccCcceeeeeecCC Confidence 7776543 34556777777777643221111 1111111 11112111111 0111 1122333 Q ss_pred CcCCccccccceeEEEeeeecceeeee--HHHHhhcC--ccHHHHHHHH-------HHHHHHHHHHHHHHHHhhccc--- Q lcl|NC_010147. 67 EKIPTDILETKKREAKIRKIAKGTSIT--DEALLSGY--GDPQGEQVRQ-------HGLAHANKVDNDVLEALMGAK--- 132 (274) Q Consensus 67 ~~i~~~~~t~~~~~~~~~~~~~~~~vt--d~~~~~~~--~d~~~~~~~~-------~a~~~a~~~d~~~~~~~~~a~--- 132 (274) .+.+..+-+.......+-.++....++ |.....+. ......+.+. +.+.+.+.++-.+...+.++. T Consensus 74 ~~~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~ 153 (349) T protein:vir:10 74 AEAEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITD 153 (349) T ss_pred CCcceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEE Confidence 332222222222222222333333333 32222222 2222233333 334444555555566554321 Q ss_pred --------------c----ccccccc--C---HHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcccccc-ccccccc Q lcl|NC_010147. 133 --------------L----TVNADIT--K---LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFT-RATELGD 188 (274) Q Consensus 133 --------------~----~~~~~~~--~---~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~-~~s~~~~ 188 (274) + +....+. + +++|.+...+ .+..+..++|+++++..|++++.+.-. ..+..+. T Consensus 154 ~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~---~g~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~ 230 (349) T protein:vir:10 154 KKNGIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWSDS---LDVTPTRALTSKKVLRILMRSTEIKEAIFGKDTGR 230 (349) T ss_pred cCCcEEEecccCccceeEecCcccCCCCCCCHHHHHHHHHHH---hCCCccEEEeCHHHHHHHhcCHHHHHHhccccccc Confidence 1 1111122 2 3344444433 356778999999999999876543221 1122111 Q ss_pred c---ceeccccceeccceEEEcC----------------CCCcceEEEEeCCeEEEEeecCceeeeec------------ Q lcl|NC_010147. 189 D---IIVKGAFGEALGAIIVRTN----------------KLEAGTAILAKKGAVKLILKRDFFLEVAR------------ 237 (274) Q Consensus 189 ~---~~~~g~ig~~~G~~Vv~s~----------------~v~~~~~~~~~~~a~~~~~~~~~~ve~~r------------ 237 (274) . ......++.+.|++|++-+ .+|++.++++..+..+...-..+ .|... T Consensus 231 ~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~-~e~~~~~~g~~~~~~~~ 309 (349) T protein:vir:10 231 VVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPT-PEENRLISSNAQVSNVG 309 (349) T ss_pred ccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeecc-chhhhhcccccceeecc Confidence 0 1123455666776665532 24667777766655443221111 11000 Q ss_pred ---------chhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 238 ---------DASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 238 ---------d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) +..-..-.+.+..+.-..+.+|++++++++= T Consensus 310 ~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 310 NIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred ceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 0000111222223333444567777776655 No 222 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=273 Identities=9% Similarity=-0.051 Sum_probs=122.8 Q ss_pred CCCccceee--------------------eeechHHHHH--HHHHHHHHHhhhhcccccc---ccc-cc-CCCceEEEEe Q lcl|NC_010147. 1 MPQGITKTS--------------------NQIIPEVLAP--MMQAQLEKKLRFASFAEVD---STL-QG-QPGDTLTFPA 53 (274) Q Consensus 1 Ma~~~T~~~--------------------~~~~Pev~~~--~v~~~~~~~~v~~~~~~~~---~~~-~~-~~g~tv~ip~ 53 (274) +.+..|..+ +.+....... ..........++....... ..+ .+ ..+...++.. T Consensus 147 e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~ 226 (521) T protein:vir:72 147 MYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAE 226 (521) T ss_pred hcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeec Confidence 111111111 1111100000 0000000000110000000 000 00 0012222222 Q ss_pred eccCCcccc---ccC--CCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010147. 54 FVYSGDAQV---VAE--GEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDV 124 (274) Q Consensus 54 ~~~~~~~~~---~~e--g~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~ 124 (274) .-....++- +.. +..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-|+..|...|++++ T Consensus 227 gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINRei 306 (521) T protein:vir:72 227 GMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREV 306 (521) T ss_pred ccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHH Confidence 111112221 111 123455555667777777777655566654322 23688999999999999999999999 Q ss_pred HHHhhcc--------ccc--cccccc------C-------HHHHHHHHHHHhh-c-------C-CCceEEEEcHHHHHHH Q lcl|NC_010147. 125 LEALMGA--------KLT--VNADIT------K-------LNGLQSAIDKFND-E-------D-LEPMVLFINPLDAGKL 172 (274) Q Consensus 125 ~~~~~~a--------~~~--~~~~~~------~-------~d~i~~A~~~l~~-~-------~-~~~~~~vv~p~~~~~L 172 (274) +..+.-. +.+ .....+ + .+.+-.+..++.. + . ...++++++|++.+.| T Consensus 307 i~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L 386 (521) T protein:vir:72 307 VDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVL 386 (521) T ss_pred hhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHH Confidence 9776311 111 111111 1 1222223333321 1 2 4668999999999998 Q ss_pred Hhhcccccccccccccccee----cccccee-ccceEEEcCCCCcceEEEEeCCeE------EEEeecCceeeeecchhh Q lcl|NC_010147. 173 RGDASTNFTRATELGDDIIV----KGAFGEA-LGAIIVRTNKLEAGTAILAKKGAV------KLILKRDFFLEVARDAST 241 (274) Q Consensus 173 ~k~~~~~~~~~s~~~~~~~~----~g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a~------~~~~~~~~~ve~~rd~~~ 241 (274) .......+..+.....+... +-..|.+ .|++|+++++.|..-..+.-+|.- -|+-=.+...-.--|+.+ T Consensus 387 ~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~s 466 (521) T protein:vir:72 387 ASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKN 466 (521) T ss_pred hhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCcc Confidence 75443333333321112111 1123443 457999999988544433322321 122111222223458899 Q ss_pred cceEEEEEEEEEEEEEcCcc-------EEEEEecCC-----CCCC Q lcl|NC_010147. 242 KTTALYSDKHYVAYLYDESK-------AVKITKGSG-----SLEM 274 (274) Q Consensus 242 ~~~~v~~~~~yg~~~~~~~~-------~v~~~~~~a-----~~~~ 274 (274) ++-.+-...|||..+ ||-. ..+|+-.-. ..+| T Consensus 467 fqP~~g~~tRY~l~~-NP~~~~~~~~~a~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:72 467 FQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred ccceeeeeeeeceee-cCcccccCcccceeecCcChhhhcCcccc Confidence 999999999999874 6632 233332221 1223 No 223 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=92.69 E-value=0.01 Score=31.46 Aligned_cols=254 Identities=11% Similarity=0.068 Sum_probs=129.0 Q ss_pred CCCccceeeeeechHHHHHHHHH----HHHHHhhhhcccccccccccCCCceEEEEeeccCC-ccc--cccCCCcCCccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQA----QLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSG-DAQ--VVAEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~----~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~-~~~--~~~eg~~i~~~~ 73 (274) |.+...... .| +++++... .+-...+ ++.+. .+....+++.|+... ... ..+.+.. +.. T Consensus 2 ~~~~~~~~~---dp-~LT~~A~gy~n~~~Iad~l-fP~vp-------V~~~~~k~~~f~~e~f~~~~t~ra~~~~--~~~ 67 (307) T protein:vir:79 2 GRLSKLRIV---DP-VLTNLAIGYTNAEFIGQTL-MPVVE-------VEKEGGKIPKFGKESFRLYQTERALRAK--SNR 67 (307) T ss_pred CCCCCCccc---CH-HHHHHHhhccchhhhhhhc-CCccc-------ccccccceeeeccccccccccccccCCC--cce Confidence 333323332 24 35444332 2211111 12221 133445666664211 000 1122221 112 Q ss_pred cc---cceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccccccc-- Q lcl|NC_010147. 74 LE---TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------LTVNADI-- 139 (274) Q Consensus 74 ~t---~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~---------~~~~~~~-- 139 (274) ++ .+.....+.+.+-...+++.....+..|+.+...+.+.+.+.+..+..+.+.+.... ++.+..+ T Consensus 68 v~~~~~~~~~~~~~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd 147 (307) T protein:vir:79 68 MNPEDIDSVDVNLDEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTA 147 (307) T ss_pred eeeeccccccccccccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCC Confidence 22 222333344444455666666666777898999999999888888887776664321 1111111 Q ss_pred ---cCHHHHHHHHHHHhhc-CCCceEEEEcHHHHHHHHhhcccc-ccccccccccceeccccceeccce-EEEcCCC--- Q lcl|NC_010147. 140 ---TKLNGLQSAIDKFNDE-DLEPMVLFINPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAI-IVRTNKL--- 210 (274) Q Consensus 140 ---~~~d~i~~A~~~l~~~-~~~~~~~vv~p~~~~~L~k~~~~~-~~~~s~~~~~~~~~g~ig~~~G~~-Vv~s~~v--- 210 (274) -....|.++..++.+. +..+..++++++++..|++++.+. .+..+ ..+.+..-.+..++|+. |++-... T Consensus 148 ~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~--~~g~it~~~la~l~~v~~V~vg~a~y~~ 225 (307) T protein:vir:79 148 ANSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYS--MKGIVTVDLLKEIFEVENIAVGEAIYAD 225 (307) T ss_pred CCCCcHHHHHHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCc--cccccCHHHHHHHhCceeEEEeeeeeec Confidence 2245666677777654 578999999999999998875432 12222 23455555667788886 5553322 Q ss_pred C--------cceEEEEe------C-------CeEEEEee-cCceeeeecchhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 211 E--------AGTAILAK------K-------GAVKLILK-RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 211 ~--------~~~~~~~~------~-------~a~~~~~~-~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) . .+.+++.. . ..++|..+ .+-.+...|....++..+++...+--.++-|+.-..|+-+ T Consensus 226 ~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~ 305 (307) T protein:vir:79 226 DKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGI 305 (307) T ss_pred ccccchhcCCCceEEEecccccCCCCCcccccccceeEEecCceEEecccCCCceeEEeecccccceeeccccchhhccC Confidence 1 11222221 0 13455433 2333333455566777777777666666666655555544 Q ss_pred CC Q lcl|NC_010147. 269 SG 270 (274) Q Consensus 269 ~a 270 (274) .. T Consensus 306 v~ 307 (307) T protein:vir:79 306 NG 307 (307) T ss_pred CC Confidence 44 No 224 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=92.46 E-value=0.011 Score=31.26 Aligned_cols=269 Identities=10% Similarity=-0.020 Sum_probs=124.2 Q ss_pred CCCccceeee-e-----echHHHHH--HHHHHHHHHhhhhccccccccccc-------CCCceEEEEeeccCCcccccc- Q lcl|NC_010147. 1 MPQGITKTSN-Q-----IIPEVLAP--MMQAQLEKKLRFASFAEVDSTLQG-------QPGDTLTFPAFVYSGDAQVVA- 64 (274) Q Consensus 1 Ma~~~T~~~~-~-----~~Pev~~~--~v~~~~~~~~v~~~~~~~~~~~~~-------~~g~tv~ip~~~~~~~~~~~~- 64 (274) .+...+.... . ..++.... ....-.....+....+ +....+ ..+...++-.--....++... T Consensus 174 ~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~--~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~ 251 (534) T protein:vir:10 174 FVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPA--DQTEAGLAYKWLLANGYAVETSSAMATAFAELQQG 251 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccC--CccccccccccccccccceecccccchhhHhhhcc Confidence 1111111100 0 00000000 0000000000000000 000000 001111111111111222111 Q ss_pred ----CCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_010147. 65 ----EGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN 136 (274) Q Consensus 65 ----eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~ 136 (274) .+..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-|+..|...|+++++..+.+.+.... T Consensus 252 ~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k 331 (534) T protein:vir:10 252 FNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGK 331 (534) T ss_pred CCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheee Confidence 1223566677777888887777655555544322 23688999999999999999999999988865332211 Q ss_pred ----------ccc------cC-------HHHHHHHHHHHhhc-C--------CCceEEEEcHHHHHHHHhhccccccccc Q lcl|NC_010147. 137 ----------ADI------TK-------LNGLQSAIDKFNDE-D--------LEPMVLFINPLDAGKLRGDASTNFTRAT 184 (274) Q Consensus 137 ----------~~~------~~-------~d~i~~A~~~l~~~-~--------~~~~~~vv~p~~~~~L~k~~~~~~~~~s 184 (274) ... .+ .+.+-.+..++... + ...++++++|++.+.|.-....++.+.. T Consensus 332 ~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~ 411 (534) T protein:vir:10 332 TGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVM 411 (534) T ss_pred cccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccc Confidence 111 11 22333333334322 2 2567999999999999765433332222 Q ss_pred ccccc----ceeccccceec-cceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 185 ELGDD----IIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 185 ~~~~~----~~~~g~ig~~~-G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) ..+.+ ....-..|.+. |++|+++++.+..-..+.-+|. +-|+-=.+......-|+.+++=.+-...||| T Consensus 412 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 491 (534) T protein:vir:10 412 GANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYG 491 (534) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 11111 11111345553 5799999998854433332232 1122112222334468899999999999999 Q ss_pred EEEEcCccE-------EEEEecCCCCCC Q lcl|NC_010147. 254 AYLYDESKA-------VKITKGSGSLEM 274 (274) Q Consensus 254 ~~~~~~~~~-------v~~~~~~a~~~~ 274 (274) ..+ ||-.. .+|.- +.+.| T Consensus 492 l~~-NP~~~~~~~~~~~~i~~--g~~~~ 516 (534) T protein:vir:10 492 VKL-HPMADATQNKGFAKISN--GMPQH 516 (534) T ss_pred eee-cCcccccCCcccccccc--CCcch Confidence 875 66322 11111 11222 No 225 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=92.06 E-value=0.013 Score=30.92 Aligned_cols=258 Identities=13% Similarity=0.059 Sum_probs=130.0 Q ss_pred CCCccceeeeeechHHHHHHHHHH-------HHHHhhh-hcccccccccccCCCceEEEEeecc-CCccccccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQ-------LEKKLRF-ASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~-------~~~~~v~-~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~i~~ 71 (274) |+-++ ++ |+...-+.++ +.++..+ .-+........-.+|.+|..|---. .+...|+.--+.+.. T Consensus 1 mp~~~--ls-----el~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~ 73 (321) T protein:vir:34 1 MPFPN--IS-----DIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPT 73 (321) T ss_pred CCCch--HH-----HHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeecc Confidence 55321 22 2222211111 1111111 1111110101112467887776543 567788875455444 Q ss_pred ccc-ccceeEEEeeeecceeeeeHHHHhhcC-----ccHHHHHHHHHHHHHHHHHHHHHHHHhhc-----------c--- Q lcl|NC_010147. 72 DIL-ETKKREAKIRKIAKGTSITDEALLSGY-----GDPQGEQVRQHGLAHANKVDNDVLEALMG-----------A--- 131 (274) Q Consensus 72 ~~~-t~~~~~~~~~~~~~~~~vtd~~~~~~~-----~d~~~~~~~~~a~~~a~~~d~~~~~~~~~-----------a--- 131 (274) ..- .+.......++.+.++.++-+...+.. .|+|.+-.+.+-+.+++.+|..|..--.+ . T Consensus 74 ~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~ 153 (321) T protein:vir:34 74 APQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPV 153 (321) T ss_pred chhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhccc Confidence 332 344466677888889999987766644 45778888888888899998877642211 0 Q ss_pred -ccc--c-----------------cccccCHHHHHHHHHH----HhhcCCCceEEEEcHHHHHHHHhhcccccccccccc Q lcl|NC_010147. 132 -KLT--V-----------------NADITKLNGLQSAIDK----FNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELG 187 (274) Q Consensus 132 -~~~--~-----------------~~~~~~~d~i~~A~~~----l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~ 187 (274) +.+ + .++..+...+..+..+ +...+..+..++.+.+.|..-++.-.-.---.++ T Consensus 154 ~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~-- 231 (321) T protein:vir:34 154 DPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSA-- 231 (321) T ss_pred CCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeeccc-- Confidence 000 0 0111244455554443 3444567889999988887655421100001111 Q ss_pred ccceeccccc-eeccceEEEcC----CCCcceEEEEeCCeEEEEeecC---ceeeeecchhhcceEEEEEEEEE--EEEE Q lcl|NC_010147. 188 DDIIVKGAFG-EALGAIIVRTN----KLEAGTAILAKKGAVKLILKRD---FFLEVARDASTKTTALYSDKHYV--AYLY 257 (274) Q Consensus 188 ~~~~~~g~ig-~~~G~~Vv~s~----~v~~~~~~~~~~~a~~~~~~~~---~~ve~~rd~~~~~~~v~~~~~yg--~~~~ 257 (274) +. ..-|..+ .|.|+.|+.++ .+|.+++|.+....+.+..-++ +.+-..|-.-..+|.+.-...+- ..+- T Consensus 232 ~~-a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~~s 310 (321) T protein:vir:34 232 EE-ANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLTCS 310 (321) T ss_pred cc-ccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchhHHhhhhhhhheeeee Confidence 11 2223333 58999999998 5789999999999998874332 22322221112233333322222 2222 Q ss_pred cCccEEEEEec Q lcl|NC_010147. 258 DESKAVKITKG 268 (274) Q Consensus 258 ~~~~~v~~~~~ 268 (274) ||..=.+++-- T Consensus 311 n~~~~~vL~~~ 321 (321) T protein:vir:34 311 GAQFQGRLIAE 321 (321) T ss_pred cccceeEEeeC Confidence 33333333322 No 226 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=92.03 E-value=0.013 Score=30.89 Aligned_cols=253 Identities=9% Similarity=0.012 Sum_probs=124.1 Q ss_pred CCCccceeeeeechH-HHHHHHH----HHHHHHhhhhcccccccccccCCCceEEEEeeccCCcc-----ccccCCCcCC Q lcl|NC_010147. 1 MPQGITKTSNQIIPE-VLAPMMQ----AQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDA-----QVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~T~~~~~~~Pe-v~~~~v~----~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~-----~~~~eg~~i~ 70 (274) |++. .|.+. +++++.. ..+....+| +.+.+ +..+.++|.|... ++ ...+.+.... T Consensus 1 ~~~~------~~~~dp~LT~~A~gy~n~~~Ia~~l~-P~vpV-------~~~~~~~~~f~~~-e~F~~~~t~r~~~~~~~ 65 (309) T protein:vir:99 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVL-PRVPV-------GKQEFKFWKYDLA-QGFTVPETLVGRKSKPN 65 (309) T ss_pred CCCC------CcCcCHhHHHHHhhccChhhhhhhcC-Ccccc-------Cccccceeeechh-hcccccchhhccCCCcc Confidence 5543 34333 4555443 222222222 33322 3345667776421 11 1234444433 Q ss_pred ccccccceeEEEeeeecceeeeeHHHH--hhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccc-- Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEAL--LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------TVNA-- 137 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~--~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~---------~~~~-- 137 (274) .-+.+..+.+...+..+-...+..+.. ..+..|+.+...+.+.+.+....+..+.+.+..+.. +.+. T Consensus 66 ~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~w 145 (309) T protein:vir:99 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQW 145 (309) T ss_pred eEeecccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCcccc Confidence 333344445555555555555665554 345678999999999999888888766655433211 1111 Q ss_pred ---cccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccc-cccccccccceeccccceeccc-eEEEcCCC-- Q lcl|NC_010147. 138 ---DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNF-TRATELGDDIIVKGAFGEALGA-IIVRTNKL-- 210 (274) Q Consensus 138 ---~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~-~~~s~~~~~~~~~g~ig~~~G~-~Vv~s~~v-- 210 (274) ..-....|-++...+ +..+..++++..++..|++.+.+.- +..+....+.+..-++..++|+ .|++.... T Consensus 146 sd~~SDPi~~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n 222 (309) T protein:vir:99 146 SDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLN 222 (309) T ss_pred CCCCCCcHHHHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceee Confidence 111234555565555 5688999999999999987654322 2222222345555667788998 57764432 Q ss_pred -C-----cceEEEEeCCeE----------------EEEe----ecCceeeeecchhhcceEEEEEEEEEEEEEcCccEEE Q lcl|NC_010147. 211 -E-----AGTAILAKKGAV----------------KLIL----KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVK 264 (274) Q Consensus 211 -~-----~~~~~~~~~~a~----------------~~~~----~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~v~ 264 (274) . ..-.++++...+ +|-. +..=..+..+-...+...|++..++--.++-++.-.. T Consensus 223 ~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~l 302 (309) T protein:vir:99 223 IARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) T ss_pred ccccccccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchh Confidence 0 011133333222 1110 1110112122222333445555555545555555555 Q ss_pred EEecCCC Q lcl|NC_010147. 265 ITKGSGS 271 (274) Q Consensus 265 ~~~~~a~ 271 (274) |+-+.|. T Consensus 303 i~~~va~ 309 (309) T protein:vir:99 303 FENAVAA 309 (309) T ss_pred hhhcccC Confidence 5544444 No 227 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=91.47 E-value=0.016 Score=30.47 Aligned_cols=257 Identities=9% Similarity=-0.043 Sum_probs=132.4 Q ss_pred CCCcc-----ceeeeeechHHHHHHHHHHHHHHh----hhhcccccccccccCC-CceEEEEeeccCCccccccCCCcCC Q lcl|NC_010147. 1 MPQGI-----TKTSNQIIPEVLAPMMQAQLEKKL----RFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) Q Consensus 1 Ma~~~-----T~~~~~~~Pev~~~~v~~~~~~~~----v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~~~~~~eg~~i~ 70 (274) |||-- .+.+++=+|-.+..|+.-++.+-+ +..++.-++. .|.- -.+++++.....|.+..|+.++++| T Consensus 65 ~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t--~g~W~~~~~~f~v~e~~G~A~~ygd~~D~P 142 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKT--VGSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) T ss_pred cccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccc--cCCccceeEEEeeeecceeEEEeecccCCC Confidence 45531 233344367778887665543321 1122222221 1211 2478888888889999999999999 Q ss_pred ccccccceeEEEeeeecceeeeeHHHHhh---cCccHHHHHHHHHHHHHHHHHHHHHHHH------------hhc----c Q lcl|NC_010147. 71 TDILETKKREAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEA------------LMG----A 131 (274) Q Consensus 71 ~~~~t~~~~~~~~~~~~~~~~vtd~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~~~~~------------~~~----a 131 (274) ..+.......-+++...-.+.+.+++... .+.|+.+.-+..+.+++.+..++-.|-. ++- + T Consensus 143 l~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a 222 (388) T protein:vir:99 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) T ss_pred ceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCccc Confidence 99888888777777777777777665443 4677888888888888888777654411 110 0 Q ss_pred ccccc--c---cc--cC----HHHHHHHHHHHhhc--C-----CCceEEEEcHHHHHHHHhhccccccccccccccceec Q lcl|NC_010147. 132 KLTVN--A---DI--TK----LNGLQSAIDKFNDE--D-----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) Q Consensus 132 ~~~~~--~---~~--~~----~d~i~~A~~~l~~~--~-----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~ 193 (274) ...++ . .+ -+ +++|..+...+... + ..+..++++|..+..|-+-+ ..+..+ .+ T Consensus 223 ~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~n--------~~g~Tv-l~ 293 (388) T protein:vir:99 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISV-RD 293 (388) T ss_pred ccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccccC--------cCCccH-HH Confidence 00111 1 11 12 45566666666332 1 12347999999998884321 111111 00 Q ss_pred cccceeccceEEEcCCCC------cc-eEEEEeCCeEE-----------EEeecCceeeeecch-hhcceEEEEE-EEEE Q lcl|NC_010147. 194 GAFGEALGAIIVRTNKLE------AG-TAILAKKGAVK-----------LILKRDFFLEVARDA-STKTTALYSD-KHYV 253 (274) Q Consensus 194 g~ig~~~G~~Vv~s~~v~------~~-~~~~~~~~a~~-----------~~~~~~~~ve~~rd~-~~~~~~v~~~-~~yg 253 (274) -.--++-+++++.-+.+. .+ ..+++.+.--. +...-+.++..-..+ ......+... ..+| T Consensus 294 ~lk~n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~G 373 (388) T protein:vir:99 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAG 373 (388) T ss_pred HHHHhcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceee Confidence 001124455655554432 11 12333221100 000001111110001 1112222222 3468 Q ss_pred EEEEcCccEEEEEec Q lcl|NC_010147. 254 AYLYDESKAVKITKG 268 (274) Q Consensus 254 ~~~~~~~~~v~~~~~ 268 (274) +-+..|.++++++.= T Consensus 374 v~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 374 VMLKRPWAVVRLIGL 388 (388) T ss_pred eEEeccchhheeccC Confidence 888899999997754 No 228 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=91.00 E-value=0.018 Score=30.15 Aligned_cols=267 Identities=15% Similarity=0.032 Sum_probs=117.9 Q ss_pred CCCc-cceeeeeechHHHHHHHHHHHHHHhhhhcccc--c--cc---ccccCCCceEEEEeeccCCcccccc-------- Q lcl|NC_010147. 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAE--V--DS---TLQGQPGDTLTFPAFVYSGDAQVVA-------- 64 (274) Q Consensus 1 Ma~~-~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~--~--~~---~~~~~~g~tv~ip~~~~~~~~~~~~-------- 64 (274) |.-. .+..+... ++.-.-.+.....|...+. . .. .++...|+++++-+=-..-.++.++ T Consensus 61 l~~~~~~~ta~~~-----a~~T~i~V~~~~~f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~i 135 (418) T protein:vir:96 61 MVFASAVVTAEAL-----ADATVLTVENSDGLTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVI 135 (418) T ss_pred eeeeeEEEEEEEe-----cCceEEEecCCcccccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEe Confidence 2211 11111110 0000000111111111111 0 01 1122347777665421111222333 Q ss_pred -----CCCcCCcccc-ccceeEEEeee-ecceeeeeHHHHhh----cCccHHHHHHHHHHHHHHHHHHHHHHHHhh---- Q lcl|NC_010147. 65 -----EGEKIPTDIL-ETKKREAKIRK-IAKGTSITDEALLS----GYGDPQGEQVRQHGLAHANKVDNDVLEALM---- 129 (274) Q Consensus 65 -----eg~~i~~~~~-t~~~~~~~~~~-~~~~~~vtd~~~~~----~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~---- 129 (274) ||.+.+...- ...... .+-+ +..++++|+-+... +..|......+++-.. ..++++.++..-+ T Consensus 136 g~~~eEGsd~~ta~~~k~~~vs-N~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~ 213 (418) T protein:vir:96 136 GTAFEEGSQRPTARSIQPVYVP-NFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGT 213 (418) T ss_pred ecCcccccccCCcceecceecc-chhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCC Confidence 3333322210 000011 1122 23456666654332 4445544443333333 3455555543321 Q ss_pred --ccc-----------------cccc---ccccCHHHHHHHHHHHh--hcC--C-C---ceEEEEcHHHHHHHHhhcccc Q lcl|NC_010147. 130 --GAK-----------------LTVN---ADITKLNGLQSAIDKFN--DED--L-E---PMVLFINPLDAGKLRGDASTN 179 (274) Q Consensus 130 --~a~-----------------~~~~---~~~~~~d~i~~A~~~l~--~~~--~-~---~~~~vv~p~~~~~L~k~~~~~ 179 (274) +.+ .... ...++.|.++++....= ..+ . . ...+.|++++...+-+....- T Consensus 214 ~ng~p~~~t~R~m~gI~~f~~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~~I 293 (418) T protein:vir:96 214 YNGQPLHTTQGIVDAIRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFGEV 293 (418) T ss_pred CCCcccccccchhHHHHhhccccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhcee Confidence 100 0111 12457899998876631 111 1 1 156899999988775432211 Q ss_pred ccccccccccceeccccceeccceEEEcCCC-----CcceEEEEeCCeEEEEe--ecCceeeee-------------cch Q lcl|NC_010147. 180 FTRATELGDDIIVKGAFGEALGAIIVRTNKL-----EAGTAILAKKGAVKLIL--KRDFFLEVA-------------RDA 239 (274) Q Consensus 180 ~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~~v-----~~~~~~~~~~~a~~~~~--~~~~~ve~~-------------rd~ 239 (274) ....++..-+...+.....+.=++|+..+++ |+|+.++++++++.+.. .++...|.. ..- T Consensus 294 ~~~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~ 373 (418) T protein:vir:96 294 TVTQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSY 373 (418) T ss_pred EeccccceeceEEEEEEeeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCccccccccccc Confidence 1122222223333333333323589888865 56778999999988764 244333322 111 Q ss_pred hhcceEEEEE--EEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 240 STKTTALYSD--KHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 240 ~~~~~~v~~~--~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) ..+.|...+. .-|..++.||.+.++|+.=.-++|- T Consensus 374 ~~~~D~~~G~l~~Eltle~~N~~a~a~itgl~~~~~~ 410 (418) T protein:vir:96 374 GHGVDAQGGSLTSEWALELLNPQGCAVITGLQKAKER 410 (418) T ss_pred ccccccccCEEEEEEEEEeecccccEEeecccccccc Confidence 1223555444 3388999999999999976666665 No 229 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=90.39 E-value=0.021 Score=29.77 Aligned_cols=267 Identities=11% Similarity=0.006 Sum_probs=122.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhh-h--hccccc---ccc---ccc-CCCceEEEEeec---cCCccccccCCC Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLR-F--ASFAEV---DST---LQG-QPGDTLTFPAFV---YSGDAQVVAEGE 67 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v-~--~~~~~~---~~~---~~~-~~g~tv~ip~~~---~~~~~~~~~eg~ 67 (274) +...... --+|.+-......+..... . ...... ... ... ....+.+...+. ....++.+.+++ T Consensus 114 q~~~~~a----~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~ 189 (457) T protein:vir:10 114 ERNPAAA----GYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDST 189 (457) T ss_pred ccccccc----cccceeeeccCcccCcccccccccccccccccccccccccCccccccccccccccchhhhhhhccCCCC Confidence 1111000 0122221111101100000 0 000000 000 000 000001110000 112233333332 Q ss_pred ---cCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---- Q lcl|NC_010147. 68 ---KIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---- 136 (274) Q Consensus 68 ---~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~---- 136 (274) .+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-++..|...|+++++..+.+.+.... T Consensus 190 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~ 269 (457) T protein:vir:10 190 ANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNT 269 (457) T ss_pred CccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeecccc Confidence 3455555667777777777666666655322 24688999999999999999999999999865432211 Q ss_pred --c------cccC----HHHHHHHHHHH--------hh-cCCCceEEEEcHHHHHHHHhhccccccccccccc-----cc Q lcl|NC_010147. 137 --A------DITK----LNGLQSAIDKF--------ND-EDLEPMVLFINPLDAGKLRGDASTNFTRATELGD-----DI 190 (274) Q Consensus 137 --~------~~~~----~d~i~~A~~~l--------~~-~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~-----~~ 190 (274) . ...+ .+.+-.+..++ .+ .-...++++++|.+.+.|-...-.++.+..+.-. +. T Consensus 270 ~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~ 349 (457) T protein:vir:10 270 ATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDD 349 (457) T ss_pred ccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhcccccccccc Confidence 1 1111 22222222222 11 1246789999999999987644344444332111 11 Q ss_pred eecccccee-ccceEEEcCCC----Cc-ceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEEEEEEEEEEc Q lcl|NC_010147. 191 IVKGAFGEA-LGAIIVRTNKL----EA-GTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYD 258 (274) Q Consensus 191 ~~~g~ig~~-~G~~Vv~s~~v----~~-~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~ 258 (274) ......|.+ .|++|++++.. |. |-.+.++ |. +-|+-=.+...-.--|+.+++-.+-...|||. .+| T Consensus 350 ~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~K-G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~N 427 (457) T protein:vir:10 350 TSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYK-GTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGM-VSN 427 (457) T ss_pred ccceeEEEecCCeEEEEecccccCCccceEEEEEe-CCcceecceeecccccccccCccCCccccceeeeeeeeee-eec Confidence 223345665 45799999644 22 2222222 21 11211111111112288899999999999999 678 Q ss_pred CccEEEEEecCCC--CCC Q lcl|NC_010147. 259 ESKAVKITKGSGS--LEM 274 (274) Q Consensus 259 ~~~~v~~~~~~a~--~~~ 274 (274) |-..-. +-+.+. ..| T Consensus 428 P~~~~~-~~~~~~~~~~~ 444 (457) T protein:vir:10 428 PFAGGL-TQGSGALTVNA 444 (457) T ss_pred cccccc-ccccccccccc Confidence 874422 211111 111 No 230 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=89.89 E-value=0.024 Score=29.48 Aligned_cols=271 Identities=8% Similarity=-0.040 Sum_probs=118.8 Q ss_pred CCCccceeeeee----chHHHH--HHHH-HH------HHHHhhhh---c-c-cccccccccCC----------CceEEEE Q lcl|NC_010147. 1 MPQGITKTSNQI----IPEVLA--PMMQ-AQ------LEKKLRFA---S-F-AEVDSTLQGQP----------GDTLTFP 52 (274) Q Consensus 1 Ma~~~T~~~~~~----~Pev~~--~~v~-~~------~~~~~v~~---~-~-~~~~~~~~~~~----------g~tv~ip 52 (274) |--..|..+-.. .+..-. .+.. .. ......+. . . +.....+.+.. +...++. T Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~ 221 (514) T protein:vir:56 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEID 221 (514) T ss_pred ccccCcCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhh Confidence 222222211000 000000 0000 00 00000000 0 0 00000000000 1111111 Q ss_pred eeccCCcccc---cc--CCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010147. 53 AFVYSGDAQV---VA--EGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDND 123 (274) Q Consensus 53 ~~~~~~~~~~---~~--eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~ 123 (274) .--....++- +. .+..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-++..|...|+++ T Consensus 222 ~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINRe 301 (514) T protein:vir:56 222 AGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNRE 301 (514) T ss_pred hhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 1111111121 11 1234566666777777777777655555544322 2368899999999999999999999 Q ss_pred HHHHhhccccc--------cc-cccc-------------CHHHHHHHHHHHhh-cC--------CCceEEEEcHHHHHHH Q lcl|NC_010147. 124 VLEALMGAKLT--------VN-ADIT-------------KLNGLQSAIDKFND-ED--------LEPMVLFINPLDAGKL 172 (274) Q Consensus 124 ~~~~~~~a~~~--------~~-~~~~-------------~~d~i~~A~~~l~~-~~--------~~~~~~vv~p~~~~~L 172 (274) ++..+...... +. ...+ ..+.+-.+..++.. ++ ...++++++|.+.+.| T Consensus 302 ii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L 381 (514) T protein:vir:56 302 IVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSAL 381 (514) T ss_pred HHHHHHhheeehhcccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHH Confidence 98776432111 11 1011 12223333333332 22 3678999999999998 Q ss_pred Hhhccccccccccccc--------cceeccccceeccceEEEcCCCCcceEEEEeCCeE------EEEeecCceeeeecc Q lcl|NC_010147. 173 RGDASTNFTRATELGD--------DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAV------KLILKRDFFLEVARD 238 (274) Q Consensus 173 ~k~~~~~~~~~s~~~~--------~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~------~~~~~~~~~ve~~rd 238 (274) .....+.+......++ ..+..|.++ .|++|+++++.|..-..+.-+|.- -|+-=.+...-..-| T Consensus 382 ~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~--~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~d 459 (514) T protein:vir:56 382 SMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLG--GRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSD 459 (514) T ss_pred HhhhhhccccccCccccccccccCcceEEEEec--CceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccC Confidence 6433222222211111 112222222 567999999988544333322321 121111222223348 Q ss_pred hhhcceEEEEEEEEEEEEEcCccEEEEEecCCCCCC Q lcl|NC_010147. 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) Q Consensus 239 ~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~a~~~~ 274 (274) +.+++-.+-...|||..+ ||-.=..-.....+..| T Consensus 460 p~sfqP~~g~~tRY~l~~-NPy~~~~~~~~~~~~~~ 494 (514) T protein:vir:56 460 SKNFQPVIGFKTRYGVQV-NPFADPTASATKVGNGA 494 (514) T ss_pred Cccccceeeeeeeeceee-CCCCCccccccccCCcc Confidence 899999999999999875 66320000001112222 No 231 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=89.32 E-value=0.027 Score=29.18 Aligned_cols=256 Identities=13% Similarity=0.012 Sum_probs=108.2 Q ss_pred eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccC---CCcCCccccccceeEEEee Q lcl|NC_010147. 9 SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAE---GEKIPTDILETKKREAKIR 84 (274) Q Consensus 9 ~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~e---g~~i~~~~~t~~~~~~~~~ 84 (274) -|+|.+..+..++.+.-....-+...-..... ..+..+|.+=..+. ..-+..+.+ |..+.... ......++- T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~~~~Fp~~~--~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~--~~~~~~~~p 76 (341) T protein:vir:39 1 MSVYTTAQLLAVNEKKFKFDPLFLRIFFRETY--PFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRG--GSTSEFTPG 76 (341) T ss_pred CCccCHHHHHHHHHhhcCccchhHhhcCCccc--ccCcceEEEEEecCCceeeEEecCCCCcceecccc--eeeeeEecc Confidence 67788887877776654332222221111000 00112222211110 011112222 22222222 222333333 Q ss_pred eecceeee--eHHHHhhc------CccHHHHHHHH-------HHHHHHHHHHHHHHHHhhcccc---------------- Q lcl|NC_010147. 85 KIAKGTSI--TDEALLSG------YGDPQGEQVRQ-------HGLAHANKVDNDVLEALMGAKL---------------- 133 (274) Q Consensus 85 ~~~~~~~v--td~~~~~~------~~d~~~~~~~~-------~a~~~a~~~d~~~~~~~~~a~~---------------- 133 (274) ++.....+ .|....+. ..++.+...+. +.+.+.+.++..+...+.+... T Consensus 77 ~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~ 156 (341) T protein:vir:39 77 YVKPKHEVNPLMTLRRLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGR 156 (341) T ss_pred ccCcccccCHHHHHHHhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccC Confidence 33322223 33322111 12233333333 3334444445445555532110 Q ss_pred ----c----cccccc-----CHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhccccccccc-ccccccee------- Q lcl|NC_010147. 134 ----T----VNADIT-----KLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRAT-ELGDDIIV------- 192 (274) Q Consensus 134 ----~----~~~~~~-----~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s-~~~~~~~~------- 192 (274) . ....+. ..+.+-+....+.+.+..+..++|+++++..|++++.+.-.-.. ....+.+. T Consensus 157 ~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 236 (341) T protein:vir:39 157 SAGNNIVQAGAAAWSSRDKETYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKDLG 236 (341) T ss_pred CccceeEecCCccCCCCCCchHHHHHHHHHHHHhcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhhhh Confidence 0 111111 24555555555555677788999999999999876543211111 11111111 Q ss_pred cc--ccceeccceEEEcC-----------CCCcceEEEEeCCeEEE-EeecCceee--------eecch-------hhcc Q lcl|NC_010147. 193 KG--AFGEALGAIIVRTN-----------KLEAGTAILAKKGAVKL-ILKRDFFLE--------VARDA-------STKT 243 (274) Q Consensus 193 ~g--~ig~~~G~~Vv~s~-----------~v~~~~~~~~~~~a~~~-~~~~~~~ve--------~~rd~-------~~~~ 243 (274) .| .++++.|++|++=+ .+|++.++++..+..+. ..+.....+ ..|-+ .-.. T Consensus 237 ~~~~~~~~~~g~~i~~y~~~y~d~g~~~~~ip~~~~~l~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~ 316 (341) T protein:vir:39 237 KAVSYKGMYGDVAIVVYSGQYIENDVKKNYLPDLTMVLGNTQARGLRTYGCILDADAQREGINASTRYPKNWVQTGDPAR 316 (341) T ss_pred hHhhhhhhhcCceEEEEccEEEecCcEEeeecCCeEEEeeCCCcceEEEecccchhhcccceeeeeeeeeeeeecCCCcE Confidence 11 34567788876622 26788888877655332 112111111 11100 0012 Q ss_pred eEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 244 TALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 244 ~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) -.+.+-.+--..+.+|++++++|++ T Consensus 317 ~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 317 EFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred EEEEEeccccceeeCCCcEEEEEeC Confidence 2233333344455689999998888 No 232 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=87.89 E-value=0.036 Score=28.51 Aligned_cols=212 Identities=13% Similarity=-0.023 Sum_probs=111.1 Q ss_pred CCCcccee---ee---eechHHHHHHHHHHHHHHhh-hhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccc Q lcl|NC_010147. 1 MPQGITKT---SN---QIIPEVLAPMMQAQLEKKLR-FASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) Q Consensus 1 Ma~~~T~~---~~---~~~Pev~~~~v~~~~~~~~v-~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~ 73 (274) ||..-+.+ .+ .+.|.-....|.|.+.+.+- +.-+-..... ...|..-.+ ...++++.|..-++.+++++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N--~~tg~~~~v--rt~LP~~~fR~lN~g~~~s~ 76 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCN--DGSKHKTTI--RAGIPEPVWRRYNQGVQPTK 76 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhccc--CCcccceeE--EEecCCchhhhcCCcccccc Confidence 76543322 22 12233333335555554332 2222222111 111222222 24578888988889999999 Q ss_pred cccceeEEEeeeecceeeeeHHHHhhcCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------h-------hc-- Q lcl|NC_010147. 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA-----------L-------MG-- 130 (274) Q Consensus 74 ~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~~~~~-----------~-------~~-- 130 (274) .++.+++.....++..+.|+. ...+..+|.. ....+...+.+.+++...+|.. | .+ T Consensus 77 ~tt~qvt~~l~ilgg~~eVDr-~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~ 155 (335) T protein:vir:73 77 TQTVPVTDTTGMLYDLGFVDK-ALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSK 155 (335) T ss_pred ceEEEEEEEEEEecchhhhhH-HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccc Confidence 999999999999999999986 4555555653 3344456777777777666632 1 00 Q ss_pred c---cccc------------------------------------------------------------------------ Q lcl|NC_010147. 131 A---KLTV------------------------------------------------------------------------ 135 (274) Q Consensus 131 a---~~~~------------------------------------------------------------------------ 135 (274) + .+.. T Consensus 156 a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~ 235 (335) T protein:vir:73 156 AASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRS 235 (335) T ss_pred cCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCccc Confidence 0 0000 Q ss_pred -------c-----ccccCHHHHHH----HHH--HHhhcCCCceEEEEcHHHHHHHHhhccccccccccccccceeccccc Q lcl|NC_010147. 136 -------N-----ADITKLNGLQS----AID--KFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG 197 (274) Q Consensus 136 -------~-----~~~~~~d~i~~----A~~--~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig 197 (274) + .+..+...|++ |.. .+-..+....++.||-.+...|+++.... ....+.-+-...-.+- T Consensus 236 vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~--~n~~l~~~~~~g~~~t 313 (335) T protein:vir:73 236 ISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNA--KNVNLTIEEYGGKKIV 313 (335) T ss_pred EEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhcc--CceeeeeeccCCceeE Confidence 0 00111123333 321 12222234468999999999998864321 1111111111111234 Q ss_pred eeccceEEEcCCCCcceEEEEe Q lcl|NC_010147. 198 EALGAIIVRTNKLEAGTAILAK 219 (274) Q Consensus 198 ~~~G~~Vv~s~~v~~~~~~~~~ 219 (274) .+.|+||..++.+=....-+.. T Consensus 314 ~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 314 SFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred EECCeEEEEEeeeecCcccccC Confidence 6889999988887554443333 No 233 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=87.09 E-value=0.041 Score=28.19 Aligned_cols=270 Identities=9% Similarity=-0.028 Sum_probs=124.6 Q ss_pred CCCccce-eeeeechHHHHHH--HHHHHHHHhhhhcccccccc-----cc--cCCCceEEEEeeccCCccccc---c--C Q lcl|NC_010147. 1 MPQGITK-TSNQIIPEVLAPM--MQAQLEKKLRFASFAEVDST-----LQ--GQPGDTLTFPAFVYSGDAQVV---A--E 65 (274) Q Consensus 1 Ma~~~T~-~~~~~~Pev~~~~--v~~~~~~~~v~~~~~~~~~~-----~~--~~~g~tv~ip~~~~~~~~~~~---~--e 65 (274) .+-.++. ..+.+... +... +.-+.....-+.... .+.. +. ...+...+++.--.+..++-. . - T Consensus 166 ~~~~~~~~~Gd~~~~~-~~~~g~~~~~~~~~~t~~~t~-~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss 243 (521) T protein:vir:10 166 LAASTQTTVGDIYTHF-FQDTGTVYLQASAQVTISSTA-DDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGST 243 (521) T ss_pred cccccccccccccccc-ccccccceecccccccCCCcc-cccccccccccccccccceeecccccchhhHhhhccCCCCc Confidence 2221111 11211110 0000 000000000000000 0000 00 001222233222111122211 1 1 Q ss_pred CCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------cc Q lcl|NC_010147. 66 GEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--------KL 133 (274) Q Consensus 66 g~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a--------~~ 133 (274) +..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-|+..|...|+++++..+.-. +. T Consensus 244 ~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~ 323 (521) T protein:vir:10 244 DNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTL 323 (521) T ss_pred cccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeee Confidence 234566677777788877777655556644322 236889999999999999999999999776321 11 Q ss_pred c--cccccc------C-------HHHHHHHHHHHhh-c-------C-CCceEEEEcHHHHHHHHhhcccccccccccccc Q lcl|NC_010147. 134 T--VNADIT------K-------LNGLQSAIDKFND-E-------D-LEPMVLFINPLDAGKLRGDASTNFTRATELGDD 189 (274) Q Consensus 134 ~--~~~~~~------~-------~d~i~~A~~~l~~-~-------~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~ 189 (274) + .....+ + .+.+-.+..++.. + . ...++++++|++.+.|.......+..+.....+ T Consensus 324 ~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g 403 (521) T protein:vir:10 324 TPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATG 403 (521) T ss_pred ccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccccc Confidence 1 111111 1 1222223333321 1 2 466899999999999875443344333322222 Q ss_pred ceecc----cccee-ccceEEEcCCCCcceEEEEeCCeE------EEEeecCceeeeecchhhcceEEEEEEEEEEEEEc Q lcl|NC_010147. 190 IIVKG----AFGEA-LGAIIVRTNKLEAGTAILAKKGAV------KLILKRDFFLEVARDASTKTTALYSDKHYVAYLYD 258 (274) Q Consensus 190 ~~~~g----~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a~------~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~ 258 (274) ..... ..|.+ .|++|+++++.|..-..+.-+|.- -|+-=.+...-.--|+.+++-.+-...|||..+ | T Consensus 404 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-N 482 (521) T protein:vir:10 404 FNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-N 482 (521) T ss_pred ccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-c Confidence 22111 23554 347999999988544433322321 122111222223458899999999999999874 6 Q ss_pred CccEEEEEecC-------------CCCCC Q lcl|NC_010147. 259 ESKAVKITKGS-------------GSLEM 274 (274) Q Consensus 259 ~~~~v~~~~~~-------------a~~~~ 274 (274) |-.. ..+-+. ...+| T Consensus 483 P~~~-~~~~~~~~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:10 483 PFAE-SAAQAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred Cccc-ccCCccceeecccchhhhcccccc Confidence 6322 211111 11222 No 234 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=87.00 E-value=0.042 Score=28.15 Aligned_cols=187 Identities=12% Similarity=0.068 Sum_probs=107.4 Q ss_pred CCCccceeeeeechHHHHHH---HHHHHHHHhh-----hhcccccccccccCCCceEEEEeeccCCccc-cccCCCcCCc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPM---MQAQLEKKLR-----FASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIPT 71 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~---v~~~~~~~~v-----~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~-~~~eg~~i~~ 71 (274) |. |.|+.+... +...+.+... +..+|.+-. +.+..-+..-.+..+... |+ .+... T Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~~~~~a~~~~~~iA~~vp----St~~~~tY~wLg~fP~lrewi---Ger~i 64 (305) T protein:vir:19 1 MI---------VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVN----SSTRSNTYGWLGKFPTLKEWV---GKRTI 64 (305) T ss_pred Cc---------cCHHHHHHHHHHHHHHHHHHHhhcCcccceEEeEec----CCCCcccccccccCCccchhh---cceee Confidence 32 233333222 1222222111 122221111 111112222223333333 44 45788 Q ss_pred cccccceeEEEeeeecceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------------------ Q lcl|NC_010147. 72 DILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL------------------ 133 (274) Q Consensus 72 ~~~t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~------------------ 133 (274) .++....-.++-+.+...+.|++.+.++..........+++++..+..=|..+++.|+.+.. T Consensus 65 ~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~ 144 (305) T protein:vir:19 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) T ss_pred eeccccceeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCccc Confidence 88999989999999999999999999999888999999999999999888888877642100 Q ss_pred ---------cc--------------------------------------------------------------------- Q lcl|NC_010147. 134 ---------TV--------------------------------------------------------------------- 135 (274) Q Consensus 134 ---------~~--------------------------------------------------------------------- 135 (274) ++ T Consensus 145 ~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq 224 (305) T protein:vir:19 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQ 224 (305) T ss_pred CCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchh Confidence 00 Q ss_pred ----cccccCHHHHHHHHHHHhhc----C----CCceEEEEcHHHHHHHHhhccccccccccccccceeccccceecc-c Q lcl|NC_010147. 136 ----NADITKLNGLQSAIDKFNDE----D----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-A 202 (274) Q Consensus 136 ----~~~~~~~d~i~~A~~~l~~~----~----~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G-~ 202 (274) ..++++.+.+-.|+.+|... + ..++.++|+|.....-++.-.-+.+.+.. ++..--+.| + T Consensus 225 ~a~gS~~~Ls~~nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~-------~~~~Np~~g~~ 297 (305) T protein:vir:19 225 MAVAVKGDLTLDNLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGN-------TTVSNEMKGKL 297 (305) T ss_pred heecCCCCCCHHHHHHHHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCcc-------ccccceecceE Confidence 01234566677777777532 2 25678999998876554432112232221 112233566 6 Q ss_pred eEEEcCCC Q lcl|NC_010147. 203 IIVRTNKL 210 (274) Q Consensus 203 ~Vv~s~~v 210 (274) .+++++.+ T Consensus 298 eliV~P~L 305 (305) T protein:vir:19 298 QLVVADYL 305 (305) T ss_pred EEEecccC Confidence 89999999 No 235 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=86.31 E-value=0.046 Score=27.89 Aligned_cols=254 Identities=9% Similarity=-0.039 Sum_probs=121.1 Q ss_pred eeeeeechHH---HHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccc--ccc-CCCcCCccccccceeE Q lcl|NC_010147. 7 KTSNQIIPEV---LAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ--VVA-EGEKIPTDILETKKRE 80 (274) Q Consensus 7 ~~~~~~~Pev---~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~--~~~-eg~~i~~~~~t~~~~~ 80 (274) |++=.|.-.. +.+.+.+.-.+.+.+..+..++....- .-.+++...+...|.++ |++ .++++|..+....+.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~-~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~ 79 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAV-GITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTR 79 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCc-ccceEEEeeeeccCcccccccCCcCCccceeecccceeE Confidence 2222222211 112222222222333333333322111 13477888888888888 655 4578999999999999 Q ss_pred EEeeeecceeeeeHH--HHhh-cCccHHHHHHHHHHHHHHHHHHHHHHHHhh---c------ccc-----cc----cccc Q lcl|NC_010147. 81 AKIRKIAKGTSITDE--ALLS-GYGDPQGEQVRQHGLAHANKVDNDVLEALM---G------AKL-----TV----NADI 139 (274) Q Consensus 81 ~~~~~~~~~~~vtd~--~~~~-~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~---~------a~~-----~~----~~~~ 139 (274) .+++.++..+.++-. ...+ .+.++.+.-.+.+.+++...+++-.+-.-. + .+. .+ ..++ T Consensus 80 ~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w 159 (304) T protein:vir:52 80 SYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKV 159 (304) T ss_pred EEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCcc Confidence 999988777666543 3333 355566666666667888877764332211 0 000 00 0111 Q ss_pred --cCHH----HHHHHHHHHhhc-C--CCceEEEEcHHHHHHHHhhcccccccccccc-ccce-eccccceeccceEEEc- Q lcl|NC_010147. 140 --TKLN----GLQSAIDKFNDE-D--LEPMVLFINPLDAGKLRGDASTNFTRATELG-DDII-VKGAFGEALGAIIVRT- 207 (274) Q Consensus 140 --~~~d----~i~~A~~~l~~~-~--~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~-~~~~-~~g~ig~~~G~~Vv~s- 207 (274) -+.+ +|.++...+-.. + ..+..++++|..+..|..-.. ...+.. -..+ .+....+-.++.|..- T Consensus 160 ~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~----~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~ 235 (304) T protein:vir:52 160 QAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQR----ANTDTTALEFLTKHLSAAAGRQVAIKALP 235 (304) T ss_pred ccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccC----CCCCchHHHHHHHhcccccCCcceEEEec Confidence 1333 445565555432 2 356789999999998843111 111100 0111 1222112122333221 Q ss_pred -CCCCcc-----eEEEEeC--CeEEEEeecCceeeeecchhhcc-e-EEEEEE-EEEEEEEcCccEEEEEe Q lcl|NC_010147. 208 -NKLEAG-----TAILAKK--GAVKLILKRDFFLEVARDASTKT-T-ALYSDK-HYVAYLYDESKAVKITK 267 (274) Q Consensus 208 -~~v~~~-----~~~~~~~--~a~~~~~~~~~~ve~~rd~~~~~-~-~v~~~~-~yg~~~~~~~~~v~~~~ 267 (274) .....| ..+++.+ ..+.+.. |.+++..-...++. . .+-+.. .-|+-+..|..++++-. T Consensus 236 ~~~~~~g~~g~~r~vvY~~d~~~~~~~v--P~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 236 SNYGTRVTDGKTRAMVYVNSKEHVIFDV--PMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccccccCCCCceEEEEEecChhheEEec--CccccccchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 122211 1233333 2333322 22222222222222 1 222333 34888889999999988 No 236 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=79.68 E-value=0.1 Score=26.02 Aligned_cols=258 Identities=12% Similarity=0.050 Sum_probs=106.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHH-------hhhhcccccccccccCCCceEEEEeec---cCCc-cccccCCCcC Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-------LRFASFAEVDSTLQGQPGDTLTFPAFV---YSGD-AQVVAEGEKI 69 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~-------~v~~~~~~~~~~~~~~~g~tv~ip~~~---~~~~-~~~~~eg~~i 69 (274) |+ .|.-.+.|.|..+..++.+..... ..|++-..+ ..+++-... ..+- +...+.+.+. T Consensus 1 M~--~~~~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~---------~~~~~~~~~~~~~~~~~a~~~~~~~~~ 69 (348) T protein:vir:98 1 MS--WTLDTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDV---------DDITFEFLRGGGGLAETASYRSWDTES 69 (348) T ss_pred Cc--chhhhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCccc---------cceEEEEEeccCCceeeeeeecCCCcc Confidence 44 566678899999999987654321 112221111 111111110 0000 1122333322 Q ss_pred Ccccc-ccceeEEEeeeecceeeeeHHHHhhcCccHHHHHH-------HHHHHHHHHHHHHHHHHHhhccc--------- Q lcl|NC_010147. 70 PTDIL-ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQV-------RQHGLAHANKVDNDVLEALMGAK--------- 132 (274) Q Consensus 70 ~~~~~-t~~~~~~~~~~~~~~~~vtd~~~~~~~~d~~~~~~-------~~~a~~~a~~~d~~~~~~~~~a~--------- 132 (274) +..+- ........+-.++....++..+.......+.+++. +++.+.+.+.++-.+...+.+.. T Consensus 70 ~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~ 149 (348) T protein:vir:98 70 KIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQT 149 (348) T ss_pred ceeecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceE Confidence 22221 23333444444444444544333322222222332 23344444455544455554311 Q ss_pred ----------cccccc------ccCHHHHHHHHHHHhhc-CCCceEEEEcHHHHHHHHhhcccccc-ccccc--ccccee Q lcl|NC_010147. 133 ----------LTVNAD------ITKLNGLQSAIDKFNDE-DLEPMVLFINPLDAGKLRGDASTNFT-RATEL--GDDIIV 192 (274) Q Consensus 133 ----------~~~~~~------~~~~d~i~~A~~~l~~~-~~~~~~~vv~p~~~~~L~k~~~~~~~-~~s~~--~~~~~~ 192 (274) .+.... ...+++|.+....+.+. +..+..++|+++++..|+++..+.-. ..... ....+. T Consensus 150 vDyg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~ 229 (348) T protein:vir:98 150 VDFGRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVS 229 (348) T ss_pred EccccCcccccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccC Confidence 111111 12345677777766664 67889999999999999876543211 11111 111222 Q ss_pred cccc----ceeccceEEEcCC-----------CCcceEEEEeCCeEEEEee--------cCceeeeec------------ Q lcl|NC_010147. 193 KGAF----GEALGAIIVRTNK-----------LEAGTAILAKKGAVKLILK--------RDFFLEVAR------------ 237 (274) Q Consensus 193 ~g~i----g~~~G~~Vv~s~~-----------v~~~~~~~~~~~a~~~~~~--------~~~~ve~~r------------ 237 (274) .+.+ +.+.+.+|.+-+. +|++..+++..+....... -+...|... T Consensus 230 ~~~~~~~~~~~g~~~i~~~d~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~ 309 (348) T protein:vir:98 230 VEQLNTVLSSMGLPPIEVYDAKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPG 309 (348) T ss_pred HHHHHHHHHhhCCeEEEEeeeEEEcCCceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCc Confidence 2222 2232334444221 2455555443221110000 000111111 Q ss_pred -------chhhcceEEEEEEEEEEEEEcCccEEEEEecC Q lcl|NC_010147. 238 -------DASTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) Q Consensus 238 -------d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~~ 269 (274) +..--.-.+.+-.+.-..+.+|+++++.++=+ T Consensus 310 i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 310 IVAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 00001112222233333445777777777655 No 237 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=79.18 E-value=0.11 Score=25.91 Aligned_cols=273 Identities=8% Similarity=-0.023 Sum_probs=119.8 Q ss_pred CCCcc----ceeeeeechHHHHH-HHHHHH-HH-Hhhhhcccccc-----ccccc-----------CCCceEEEEeeccC Q lcl|NC_010147. 1 MPQGI----TKTSNQIIPEVLAP-MMQAQL-EK-KLRFASFAEVD-----STLQG-----------QPGDTLTFPAFVYS 57 (274) Q Consensus 1 Ma~~~----T~~~~~~~Pev~~~-~v~~~~-~~-~~v~~~~~~~~-----~~~~~-----------~~g~tv~ip~~~~~ 57 (274) +.... +..+..=.+..... -..+.+ .+ ...+...+.-. ....+ ..+...++..--.. T Consensus 160 ~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsT 239 (529) T protein:vir:10 160 KGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccch Confidence 11000 00000000000000 000000 00 00000000000 00000 00111111111111 Q ss_pred Cccccc-----cCCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010147. 58 GDAQVV-----AEGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEAL 128 (274) Q Consensus 58 ~~~~~~-----~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~ 128 (274) ..++.. ..+..+++-..+..+++++.+-|+..-+.|=|..- .-+-|..+++.+-|+..|...|+++++..+ T Consensus 240 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i 319 (529) T protein:vir:10 240 SIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHh Confidence 112211 11234566666777777777777655566654322 236889999999999999999999999865 Q ss_pred hcccc--------cc--ccccc-------------CHHHHHHHHHHHhhc-C--------CCceEEEEcHHHHHHHHhhc Q lcl|NC_010147. 129 MGAKL--------TV--NADIT-------------KLNGLQSAIDKFNDE-D--------LEPMVLFINPLDAGKLRGDA 176 (274) Q Consensus 129 ~~a~~--------~~--~~~~~-------------~~d~i~~A~~~l~~~-~--------~~~~~~vv~p~~~~~L~k~~ 176 (274) ..... +. ..+.+ ..+.+-.+..++.+. + ....+++++|.+.+.|--.. T Consensus 320 ~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVD 399 (529) T ss_pred hhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhc Confidence 42211 00 11111 122233333333321 1 24678999999999886322 Q ss_pred ccccccccccccccee----cccccee-ccceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcceE Q lcl|NC_010147. 177 STNFTRATELGDDIIV----KGAFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTA 245 (274) Q Consensus 177 ~~~~~~~s~~~~~~~~----~g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~ 245 (274) .+.+........+... +-..|.+ .|++|+++++.|..-..+.-+|. +-|+-=.+.....--|+.+++=. T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 AGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPV 479 (529) T ss_pred cccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 2222222211222111 1123454 34799999998854433332232 11221122222334588999999 Q ss_pred EEEEEEEEEEEEcCccEEE-------EEecCCCCC------C Q lcl|NC_010147. 246 LYSDKHYVAYLYDESKAVK-------ITKGSGSLE------M 274 (274) Q Consensus 246 v~~~~~yg~~~~~~~~~v~-------~~~~~a~~~------~ 274 (274) +-...|||..+ ||-...+ +....--.. | T Consensus 480 ~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 480 MGFKTRYAIGV-NPFAESRTQAPTSRISNGMPGAHSVGKNAY 520 (529) T ss_pred eeeeeeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 99999999874 6633211 110000001 1 No 238 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=74.30 E-value=0.16 Score=24.95 Aligned_cols=264 Identities=10% Similarity=0.015 Sum_probs=121.6 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcc-----cccccc---------ccc-CCCceEEEEeeccCCccccc-- Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASF-----AEVDST---------LQG-QPGDTLTFPAFVYSGDAQVV-- 63 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~-----~~~~~~---------~~~-~~g~tv~ip~~~~~~~~~~~-- 63 (274) -++..+-. .......+.... .... .++.+. +..... +.. ..|...+++.--....++-+ T Consensus 167 ~s~~~~g~-~~~~g~~~~~~~--~~~g-~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~ 242 (524) T protein:vir:98 167 FAKITTGT-AIATGAIVYHIF--QETG-IAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQEN 242 (524) T ss_pred cccccccc-cccccccccccc--cccc-ceeccccccCcccccccccccccccccccccccceeecccccchhhhhhhcc Confidence 11110000 000000000000 0000 000000 000000 000 01222233322222222222 Q ss_pred ---cCCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Q lcl|NC_010147. 64 ---AEGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--- 133 (274) Q Consensus 64 ---~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~--- 133 (274) ..+..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-|+..|...|+++++..+.-... T Consensus 243 ~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~ 322 (524) T protein:vir:98 243 FNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGK 322 (524) T ss_pred CCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheece Confidence 11334666677777888887777655555544322 23688999999999999999999999977642111 Q ss_pred -----cc--ccccc-------------CHHHHHHHHHHHhhc--------C-CCceEEEEcHHHHHHHHhh-cccccccc Q lcl|NC_010147. 134 -----TV--NADIT-------------KLNGLQSAIDKFNDE--------D-LEPMVLFINPLDAGKLRGD-ASTNFTRA 183 (274) Q Consensus 134 -----~~--~~~~~-------------~~d~i~~A~~~l~~~--------~-~~~~~~vv~p~~~~~L~k~-~~~~~~~~ 183 (274) .+ .++.+ ..+.+-.+..++.+. . ...++++++|++.+.|-.. .. +++. T Consensus 323 ~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g--~~~~ 400 (524) T protein:vir:98 323 SGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSG--ITPA 400 (524) T ss_pred eecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcc--cccc Confidence 10 11111 123333333333321 1 2478999999999988642 22 2222 Q ss_pred ccccc--------cceeccccceeccceEEEcCCCCcceEEEEeCCeE------EEEeecCceeeeecchhhcceEEEEE Q lcl|NC_010147. 184 TELGD--------DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAV------KLILKRDFFLEVARDASTKTTALYSD 249 (274) Q Consensus 184 s~~~~--------~~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~------~~~~~~~~~ve~~rd~~~~~~~v~~~ 249 (274) +..-+ ..+..|.++ .|++|+++++.|..-..+.-+|.- -|+-=.+...-.--|+.+++=.+-.. T Consensus 401 s~~~~~~~~~d~~~~~~~G~l~--~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 478 (524) T protein:vir:98 401 SQGLQKTLNVDTTKAVFAGVLG--GTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 478 (524) T ss_pred cchhhcccccCCccceEEEEec--CceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeee Confidence 21111 122334433 468999999988544433322321 12111122222345889999999999 Q ss_pred EEEEEEEEcCccEEEEEecCC--------CCCC Q lcl|NC_010147. 250 KHYVAYLYDESKAVKITKGSG--------SLEM 274 (274) Q Consensus 250 ~~yg~~~~~~~~~v~~~~~~a--------~~~~ 274 (274) .|||..+ ||-.. ..+.+.+ -..| T Consensus 479 tRY~l~~-NP~~~-~~~~~~~~ri~~g~~~~~~ 509 (524) T protein:vir:98 479 TRYGIGI-NPFAN-SRSQAPADRITSGMISKEM 509 (524) T ss_pred eeeceee-cCccc-ccCCccccccccCcchHhh Confidence 9999874 66332 1111111 0111 No 239 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=73.78 E-value=0.17 Score=24.86 Aligned_cols=270 Identities=9% Similarity=-0.037 Sum_probs=120.7 Q ss_pred CCCccceeeeeechHHHHH-------HHHHHH-HH-Hhhhhccccc---ccc--ccc-----CC-----------CceEE Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAP-------MMQAQL-EK-KLRFASFAEV---DST--LQG-----QP-----------GDTLT 50 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~-------~v~~~~-~~-~~v~~~~~~~---~~~--~~~-----~~-----------g~tv~ 50 (274) |..++-.+=.+ - -.|.. --.|.+ .+ ...|+..+.. ... ..+ .. ..+.+ T Consensus 97 MTgPTGLIFAm-R-srY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~ 174 (462) T protein:vir:10 97 MTGPTGLIFAM-R-SFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYE 174 (462) T ss_pred CCcchhhhhee-e-eeccCCccccccccchhhhccCCcCccccccccccccccccccccccccccccceeecCCCcccee Confidence 33322100000 0 00000 000111 01 1111110000 000 000 00 00011 Q ss_pred EEeec---cCCccccccCC---CcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHH Q lcl|NC_010147. 51 FPAFV---YSGDAQVVAEG---EKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKV 120 (274) Q Consensus 51 ip~~~---~~~~~~~~~eg---~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~ 120 (274) +-... ....++-...+ ..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-++..|...| T Consensus 175 ~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEI 254 (462) T protein:vir:10 175 VTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEI 254 (462) T ss_pred cccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHh Confidence 10000 01112222221 23555566677777777766655555544322 2468899999999999999999 Q ss_pred HHHHHHHhhccccccc------cc------ccC----HHHHHHHHHHHh-hc--------CCCceEEEEcHHHHHHHHhh Q lcl|NC_010147. 121 DNDVLEALMGAKLTVN------AD------ITK----LNGLQSAIDKFN-DE--------DLEPMVLFINPLDAGKLRGD 175 (274) Q Consensus 121 d~~~~~~~~~a~~~~~------~~------~~~----~d~i~~A~~~l~-~~--------~~~~~~~vv~p~~~~~L~k~ 175 (274) +++++..+.+.+.... .. +.+ ++.+..+..++. ++ -...++++++|++.+.|.-. T Consensus 255 NReii~~l~~~a~~~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~s 334 (462) T protein:vir:10 255 NREVVRTIYVNAVKGAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMA 334 (462) T ss_pred hHHHHhhhhhhheeeecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhc Confidence 9999999866443211 11 111 222333333342 11 24678999999999988543 Q ss_pred cccccccccc---ccc--cceecccccee-ccceEEEcCCC----C-cceEEEEeC-----CeEEEEeecCceeeeecch Q lcl|NC_010147. 176 ASTNFTRATE---LGD--DIIVKGAFGEA-LGAIIVRTNKL----E-AGTAILAKK-----GAVKLILKRDFFLEVARDA 239 (274) Q Consensus 176 ~~~~~~~~s~---~~~--~~~~~g~ig~~-~G~~Vv~s~~v----~-~~~~~~~~~-----~a~~~~~~~~~~ve~~rd~ 239 (274) .-.++.++-+ .+. +-.-....|.+ .|++|++++.. | +|-.+.++. +.+-|+-=.+.....-.|+ T Consensus 335 G~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp 414 (462) T protein:vir:10 335 GVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINP 414 (462) T ss_pred cchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCC Confidence 3333333211 110 11112235554 45799999743 2 222222221 1111221122222233488 Q ss_pred hhcceEEEEEEEEEEEEEcCccEEEEEe----cCCCCCC Q lcl|NC_010147. 240 STKTTALYSDKHYVAYLYDESKAVKITK----GSGSLEM 274 (274) Q Consensus 240 ~~~~~~v~~~~~yg~~~~~~~~~v~~~~----~~a~~~~ 274 (274) .+++-.+-...|||..+ ||-..-. +- -.....| T Consensus 415 ~sfqP~~g~~tRY~l~~-NP~t~~~-~~~~~~~~~~~n~ 451 (462) T protein:vir:10 415 NTFQPKIGFKTRYGMVS-NPFSGGL-TQGSGALTANANK 451 (462) T ss_pred ccccceeeeeeeeeeee-cCCCCCc-CCccccccccCcc Confidence 89999999999999874 5552211 11 1123333 No 240 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=70.06 E-value=0.21 Score=24.25 Aligned_cols=262 Identities=14% Similarity=0.039 Sum_probs=117.4 Q ss_pred CCCccce-eeee---------echHHHHHHHHHHHHHHhhhhc-ccccccccccCCCceEEEEeeccCCccccc------ Q lcl|NC_010147. 1 MPQGITK-TSNQ---------IIPEVLAPMMQAQLEKKLRFAS-FAEVDSTLQGQPGDTLTFPAFVYSGDAQVV------ 63 (274) Q Consensus 1 Ma~~~T~-~~~~---------~~Pev~~~~v~~~~~~~~v~~~-~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~------ 63 (274) |.-.+.. .+.. -.++. +.+..+|.. ...-.-.++...|+++++-+=...-.++-+ T Consensus 61 ~~~~~~~~ta~a~a~~T~l~ve~~~~--------f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~ 132 (418) T protein:vir:10 61 MVFASAVVTAEAAADATVLTVENSDG--------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKL 132 (418) T ss_pred EeeeeEEEEEEEecCceEEEEcCcce--------eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceE Confidence 2221111 1110 01111 222111100 000000112224667766543211122222 Q ss_pred -------cCCCcCCccccccceeEEEeee-ecceeeeeHHHHhh----cCccHHHHHHHHHHHHHHHHHHHHHHHHhhc- Q lcl|NC_010147. 64 -------AEGEKIPTDILETKKREAKIRK-IAKGTSITDEALLS----GYGDPQGEQVRQHGLAHANKVDNDVLEALMG- 130 (274) Q Consensus 64 -------~eg~~i~~~~~t~~~~~~~~~~-~~~~~~vtd~~~~~----~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~- 130 (274) .||.+.+...-......-.+.+ +..++++|+-+... ...|+.+.-.++.... +..+++.++..... T Consensus 133 ~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~-av~iEkalI~G~~~~ 211 (418) T protein:vir:10 133 IVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFM 211 (418) T ss_pred EEeccccccccccCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHH-HHHHHHHHhcccccC Confidence 3444433221111111111223 24567777754442 3345554444443333 34678887766421 Q ss_pred ----cc------------------ccccc---cccCHHHHHHHHHHH--hhcC--CC----ceEEEEcHHHHHHHHhhcc Q lcl|NC_010147. 131 ----AK------------------LTVNA---DITKLNGLQSAIDKF--NDED--LE----PMVLFINPLDAGKLRGDAS 177 (274) Q Consensus 131 ----a~------------------~~~~~---~~~~~d~i~~A~~~l--~~~~--~~----~~~~vv~p~~~~~L~k~~~ 177 (274) ++ ..+.+ ...++|.++++.... .+.+ .. ...+.|++++...+=+... T Consensus 212 ~~~~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~~ 291 (418) T protein:vir:10 212 GTYNGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG 291 (418) T ss_pred CCcCCcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhhh Confidence 00 11111 246899999987664 1222 21 2668889998877644322 Q ss_pred cccccc--ccccccceeccccceecc-c-----eEEEcCCCCcceEEEEeCCeEEEEe--ecCceeeee----------- Q lcl|NC_010147. 178 TNFTRA--TELGDDIIVKGAFGEALG-A-----IIVRTNKLEAGTAILAKKGAVKLIL--KRDFFLEVA----------- 236 (274) Q Consensus 178 ~~~~~~--s~~~~~~~~~g~ig~~~G-~-----~Vv~s~~v~~~~~~~~~~~a~~~~~--~~~~~ve~~----------- 236 (274) .+.. ...+-+...... -...| + ||+..=+||++...++++.++.+.. .+....|.. T Consensus 292 --~I~~~~~e~~~G~vv~~~-~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~ 368 (418) T protein:vir:10 292 --EVTVTQRETSYGMVFTEW-KFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGA 368 (418) T ss_pred --heeecccceeeeEEEEEE-EcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccc Confidence 1211 111112221111 11112 2 4444447999999999999887753 233333322 Q ss_pred --cchhhcceEEEEEE--EEEEEEEcCccEEEEEe-cCCCCCC Q lcl|NC_010147. 237 --RDASTKTTALYSDK--HYVAYLYDESKAVKITK-GSGSLEM 274 (274) Q Consensus 237 --rd~~~~~~~v~~~~--~yg~~~~~~~~~v~~~~-~~a~~~~ 274 (274) ..-..+.|...+.. -|..++.||.+.++|+. ..|-++. T Consensus 369 ~~~~~~~~~D~~kG~iv~E~tLe~~N~~a~avitgl~~~~~~~ 411 (418) T protein:vir:10 369 TDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKAKERV 411 (418) T ss_pred cccccccccccccceEEEEeeeeeecccceEEeeccceecccc Confidence 11122335555553 38999999999999995 3333333 No 241 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=65.86 E-value=0.28 Score=23.64 Aligned_cols=272 Identities=8% Similarity=-0.017 Sum_probs=121.5 Q ss_pred CCCccceeeeeechHHHHHH-HHHHH--HHHhhhhcccccc-----ccccc-----------CCCceEEEEeeccCCccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPM-MQAQL--EKKLRFASFAEVD-----STLQG-----------QPGDTLTFPAFVYSGDAQ 61 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~-v~~~~--~~~~v~~~~~~~~-----~~~~~-----------~~g~tv~ip~~~~~~~~~ 61 (274) .-...| ....=.+..++.- ..+.+ +....|.+.+.-. ....+ ..+...++..--....++ T Consensus 165 ~~~~~~-~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aE 243 (529) T protein:vir:10 165 TTDGTP-FAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAE 243 (529) T ss_pred ccCccc-cccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchhhhh Confidence 111000 0000000000000 00000 0111121111000 00000 011122221111111122 Q ss_pred ccc-----CCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010147. 62 VVA-----EGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK 132 (274) Q Consensus 62 ~~~-----eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~ 132 (274) -.. .+..+++-..+..+++++.+-|+..-+.|=|..- .-+-|..+++.+-++..|...|+++++..+.+.+ T Consensus 244 aL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a 323 (529) T protein:vir:10 244 LRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTA 323 (529) T ss_pred ccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhh Confidence 111 1223566666777777777777655556644322 2368899999999999999999999998886533 Q ss_pred ccc----------cccccC-------------HHHHHHHHHHHhhc-C--------CCceEEEEcHHHHHHHHhhccccc Q lcl|NC_010147. 133 LTV----------NADITK-------------LNGLQSAIDKFNDE-D--------LEPMVLFINPLDAGKLRGDASTNF 180 (274) Q Consensus 133 ~~~----------~~~~~~-------------~d~i~~A~~~l~~~-~--------~~~~~~vv~p~~~~~L~k~~~~~~ 180 (274) ... ....++ .+.+-.+..++.+. + ....+++++|.+.+.|-....+.+ T Consensus 324 ~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~ 403 (529) T protein:vir:10 324 QVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNIS 403 (529) T ss_pred hhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhcc Confidence 211 011111 12233333333321 1 246789999999998864322222 Q ss_pred cccccccc----cceecccccee-ccceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEE Q lcl|NC_010147. 181 TRATELGD----DIIVKGAFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSD 249 (274) Q Consensus 181 ~~~s~~~~----~~~~~g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~ 249 (274) ........ +...+...|.+ .|++|+++++.|..-..+.-+|. +-|+-=.+...-.--|+.+++-.+-.. T Consensus 404 ~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~ 483 (529) T protein:vir:10 404 PAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFK 483 (529) T ss_pred ccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeeee Confidence 22111111 11112233454 34799999998854433332222 112211222222345889999999999 Q ss_pred EEEEEEEEcCccEEE-------E-EecCC-----CCCC Q lcl|NC_010147. 250 KHYVAYLYDESKAVK-------I-TKGSG-----SLEM 274 (274) Q Consensus 250 ~~yg~~~~~~~~~v~-------~-~~~~a-----~~~~ 274 (274) .|||..+ ||-..-. + ....+ .-+| T Consensus 484 tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 484 TRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAY 520 (529) T ss_pred eeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 9999874 6643211 1 11111 0111 No 242 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=64.83 E-value=0.29 Score=23.50 Aligned_cols=242 Identities=10% Similarity=0.051 Sum_probs=116.9 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccccccCC-CceEEEEeeccCCc-cccc---------cCCCcC Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGD-AQVV---------AEGEKI 69 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~-g~tv~ip~~~~~~~-~~~~---------~eg~~i 69 (274) |+-..-...-.+.+| |...+..-+..+.+|.+.---.-.+.|.. .++.-.-+-++++- ...| +.|+.. T Consensus 1 mp~N~n~avr~Y~Kq-f~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFGtGTg~ 79 (295) T protein:vir:47 1 MPSNQNNAVRRYEKQ-YAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGDNSGA 79 (295) T ss_pred CCCCCCccchhhhHH-HHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccccCCcc Confidence 776544444455565 77777777788877754210011111211 12221111111110 0112 222221 Q ss_pred CccccccceeEEEee-----eecceeeeeH-HHHhhcCccHHHHHHHH---HHHHHHHHHHHHHHHHhhccccc-ccccc Q lcl|NC_010147. 70 PTDILETKKREAKIR-----KIAKGTSITD-EALLSGYGDPQGEQVRQ---HGLAHANKVDNDVLEALMGAKLT-VNADI 139 (274) Q Consensus 70 ~~~~~t~~~~~~~~~-----~~~~~~~vtd-~~~~~~~~d~~~~~~~~---~a~~~a~~~d~~~~~~~~~a~~~-~~~~~ 139 (274) .--++...-.+. .+...|.+-. .+...-+.|+.+.+++| .+.+|++.+|..+-..+...... ..... T Consensus 80 ---SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te~~td 156 (295) T protein:vir:47 80 ---QSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTEALAD 156 (295) T ss_pred ---ccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc Confidence 111222111110 1111232221 22333455565555554 68899999998877776654332 22334 Q ss_pred cCHHHHHHHHHHHhhcC----C-CceEEEEcHHHHHHHHhhccccccccccccccceeccccceeccceEEEcC--CCCc Q lcl|NC_010147. 140 TKLNGLQSAIDKFNDED----L-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEA 212 (274) Q Consensus 140 ~~~d~i~~A~~~l~~~~----~-~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~~g~ig~~~G~~Vv~s~--~v~~ 212 (274) ++.|.+.....++.+.. + .+-...|||++|..|...+.-+....|. .++ =+.-+-++.|+.+...+ .+.. T Consensus 157 ~t~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~Ss--aNi-Dengi~~FkGf~i~e~P~~~~q~ 233 (295) T protein:vir:47 157 FTDDKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAKGAT--ISL-DENGLPKYKGFTLEETPAQYFET 233 (295) T ss_pred ccchhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccce--eee-ccCCcceecceEEEeccHhhccC Confidence 55566666555554432 2 4456899999999998776543333322 122 23346789999988865 5778 Q ss_pred ceEEEEeCCeEEEEeecCceee-e------------------------ecchhhcceEEEEEE Q lcl|NC_010147. 213 GTAILAKKGAVKLILKRDFFLE-V------------------------ARDASTKTTALYSDK 250 (274) Q Consensus 213 ~~~~~~~~~a~~~~~~~~~~ve-~------------------------~rd~~~~~~~v~~~~ 250 (274) |...+|.+..++.+- .++++. + -|........+-+|+ T Consensus 234 G~~aifs~dnig~af-tGIn~aR~IesEdF~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 234 GVIAIFSPNGIIIPF-VGISTARVIEAENFDGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred CcEEEEccccceeec-ccceeeeeeecccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 999999888876642 122211 0 011111111111122 No 243 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=63.43 E-value=0.32 Score=23.32 Aligned_cols=255 Identities=9% Similarity=-0.043 Sum_probs=105.4 Q ss_pred eeeechHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeecc-CCccccccCCCc---CCccccccceeEEEee Q lcl|NC_010147. 9 SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEK---IPTDILETKKREAKIR 84 (274) Q Consensus 9 ~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~-~~~~~~~~eg~~---i~~~~~t~~~~~~~~~ 84 (274) -|+|.+..+..++.+.-.... +.+.-.... ...+..+|.|=..+. ..-+..+.++.. +.... .......+- T Consensus 1 ~d~f~~~~l~~~i~~~p~~~~-l~~~~fp~~--~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~g--~~~~~~~~p 75 (346) T protein:vir:63 1 MEIFDTLTLAGVIQSGPALSM-YWQGFYPNE--ITFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAARG--YTTKTFRPA 75 (346) T ss_pred CCccCHHHHHHHHHhcCCccc-hhhhcCccc--cccccceEEEEEecCceeeeeeecCCCCcceecccc--eeeeEeecC Confidence 678888888887765432111 111111100 011122332221111 001122222221 21111 111222222 Q ss_pred eec--ceeeeeHHHHhh-------cCccHHHHHHH-------HHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_010147. 85 KIA--KGTSITDEALLS-------GYGDPQGEQVR-------QHGLAHANKVDNDVLEALMGAKLT-------------- 134 (274) Q Consensus 85 ~~~--~~~~vtd~~~~~-------~~~d~~~~~~~-------~~a~~~a~~~d~~~~~~~~~a~~~-------------- 134 (274) .+. ..+...|....+ +..++.+.+.+ .+.+.+.+.++..+...+.+.... T Consensus 76 ~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg 155 (346) T protein:vir:63 76 YVKPKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFG 155 (346) T ss_pred ccCccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeC Confidence 222 233333332221 22334433333 334445555566666666542111 Q ss_pred ----------ccccc-----cCHHHHHHHHHHHhhc-CCCceEEEEcHHHHHHHHhhcccccc----ccccccc---cce Q lcl|NC_010147. 135 ----------VNADI-----TKLNGLQSAIDKFNDE-DLEPMVLFINPLDAGKLRGDASTNFT----RATELGD---DII 191 (274) Q Consensus 135 ----------~~~~~-----~~~d~i~~A~~~l~~~-~~~~~~~vv~p~~~~~L~k~~~~~~~----~~s~~~~---~~~ 191 (274) ....+ ..+++|.++...+.++ +..+..++|+++++..|++++.+.-. .....+. ..+ T Consensus 156 ~~~~~~~~lt~~~~W~~~~adp~~di~~~~~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~l 235 (346) T protein:vir:63 156 RDPALTVQLTGGAAWDQATSDPLGNIQTMRTTAWKKSNSTITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNRSRL 235 (346) T ss_pred CCccceeeecccccCCCCCCCHHHHHHHHHHHHHHccCCceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccchhhc Confidence 00111 1245666666666554 46788999999999999876432211 1101010 011 Q ss_pred ec-------cccc---eeccceEEEc------------CCCCcceEEEEeCCeEEEE-eecCceeee-----------ec Q lcl|NC_010147. 192 VK-------GAFG---EALGAIIVRT------------NKLEAGTAILAKKGAVKLI-LKRDFFLEV-----------AR 237 (274) Q Consensus 192 ~~-------g~ig---~~~G~~Vv~s------------~~v~~~~~~~~~~~a~~~~-~~~~~~ve~-----------~r 237 (274) .. |.+. .+.|++|+.= ..+|++.++++..+..+.. .+.....+. +. T Consensus 236 ~~~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~~G~~~~~ip~~~v~~~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~ 315 (346) T protein:vir:63 236 DDGSPVQYQGTIGGYNGMGTLELYTYHDTYTGDDNTEQEILGSYDVVGTGPGLQGTQCFGAIMDFKNGLVPTRMFPKMWE 315 (346) T ss_pred ccchhhhhhhhHhhhhccCCeEEEEeccEEEcCCCceeccccCCeEEEEecCCcceEEEeeccccccCcccceeeeEEEE Confidence 11 1111 2346676541 1256777777766543221 111111110 01 Q ss_pred chhhcceEEEEEEEEEEEEEcCccEEEEEec Q lcl|NC_010147. 238 DASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) Q Consensus 238 d~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~ 268 (274) ...-..-.+.+-.+--..+.+|++++++++. T Consensus 316 ~~dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 316 EEDPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred ecCCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 1111112233333333445688888888877 No 244 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=63.35 E-value=0.32 Score=23.31 Aligned_cols=266 Identities=11% Similarity=0.006 Sum_probs=123.6 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhccccc------cc--cccc------CCCceEEEEeeccCCcccc--- Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV------DS--TLQG------QPGDTLTFPAFVYSGDAQV--- 62 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~------~~--~~~~------~~g~tv~ip~~~~~~~~~~--- 62 (274) .+... |...+.+.-. ..+ ....++...+.. +. .+.+ ..|...++..=-....++- T Consensus 167 ~~~~~~t~~G~~~~~~-----~~~--~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~ 239 (522) T protein:vir:69 167 LAASTQTKVGDIYTHF-----FQE--TGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEG 239 (522) T ss_pred cccccccccccccccc-----ccc--ccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhccc Confidence 22111 1111222210 000 000000000000 00 0000 0112222221101111221 Q ss_pred cc--CCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc- Q lcl|NC_010147. 63 VA--EGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-LT- 134 (274) Q Consensus 63 ~~--eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~-~~- 134 (274) +. -+..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-|+..|...|+++++..+.-.. .. T Consensus 240 lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 319 (522) T protein:vir:69 240 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGK 319 (522) T ss_pred CCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeec Confidence 11 1234666677778888888777655556644322 2368899999999999999999999997763111 00 Q ss_pred --------cccc------ccC-------HHHHHHHHHHHhhc--------C-CCceEEEEcHHHHHHHHhhccccccccc Q lcl|NC_010147. 135 --------VNAD------ITK-------LNGLQSAIDKFNDE--------D-LEPMVLFINPLDAGKLRGDASTNFTRAT 184 (274) Q Consensus 135 --------~~~~------~~~-------~d~i~~A~~~l~~~--------~-~~~~~~vv~p~~~~~L~k~~~~~~~~~s 184 (274) ..+. +.+ .+.+-.+..++... . ...++++++|++.+.|.......+..+. T Consensus 320 ~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~ 399 (522) T protein:vir:69 320 SGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQ 399 (522) T ss_pred cccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccc Confidence 0011 111 22233333333221 2 2578999999999998754433333332 Q ss_pred cccccceecc----cccee-ccceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEEEEEE Q lcl|NC_010147. 185 ELGDDIIVKG----AFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYV 253 (274) Q Consensus 185 ~~~~~~~~~g----~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg 253 (274) +...+..... ..|.+ .|++|+++++.|..-..+.-+|. +-|+-=.+...-.--|+.+++-.+-...||| T Consensus 400 ~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 479 (522) T protein:vir:69 400 GLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 479 (522) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 2222222111 23554 34799999998854443332232 1122112222223458899999999999999 Q ss_pred EEEEcCccE-------EEEEecC------CCCCC Q lcl|NC_010147. 254 AYLYDESKA-------VKITKGS------GSLEM 274 (274) Q Consensus 254 ~~~~~~~~~-------v~~~~~~------a~~~~ 274 (274) ..+ ||-.. ++|.-+. +-+-. T Consensus 480 l~v-NP~~~~~~~~~~~ri~~g~p~~~~~~~~n~ 512 (522) T protein:vir:69 480 IGV-NPFAESSLQAPGARIQSGMPSILNSLGKNA 512 (522) T ss_pred eee-cCcccccCCcccceeecccchhhcccCCcc Confidence 874 55322 1222111 11111 No 245 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=62.68 E-value=0.33 Score=23.22 Aligned_cols=272 Identities=8% Similarity=-0.032 Sum_probs=121.3 Q ss_pred CCCccceeeeeechHHHHHH-HHHHH-HH-Hhhhhccccc----------cccccc------CCCceEEEEeeccCCccc Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPM-MQAQL-EK-KLRFASFAEV----------DSTLQG------QPGDTLTFPAFVYSGDAQ 61 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~-v~~~~-~~-~~v~~~~~~~----------~~~~~~------~~g~tv~ip~~~~~~~~~ 61 (274) ....+ ...+.=.+..++.- ..+.+ .+ ..+|.+.+.- +..+.+ ..+...++..--....++ T Consensus 165 ~~~~t-~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aE 243 (529) T protein:vir:10 165 TTDGT-PFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAE 243 (529) T ss_pred ccccc-ccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhhhhhh Confidence 11111 01000000000000 00000 00 1111111100 000000 011222222111111222 Q ss_pred cc-----cCCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010147. 62 VV-----AEGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK 132 (274) Q Consensus 62 ~~-----~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~ 132 (274) -. ..+..+++-..+..+++++.+-|+..-+.|=|..- .-+-|..+++.+-|+..|...|+++++..+.+.+ T Consensus 244 aL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a 323 (529) T protein:vir:10 244 LRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTA 323 (529) T ss_pred ccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhh Confidence 11 11233556666777777777777655556644322 2368899999999999999999999998886533 Q ss_pred ccc----------cccccC-------------HHHHHHHHHHHhhc-C--------CCceEEEEcHHHHHHHHhhccccc Q lcl|NC_010147. 133 LTV----------NADITK-------------LNGLQSAIDKFNDE-D--------LEPMVLFINPLDAGKLRGDASTNF 180 (274) Q Consensus 133 ~~~----------~~~~~~-------------~d~i~~A~~~l~~~-~--------~~~~~~vv~p~~~~~L~k~~~~~~ 180 (274) ... ....++ .+.+-.+..++.+. + ....+++++|.+.+.|-....+.+ T Consensus 324 ~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~ 403 (529) T protein:vir:10 324 QVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNIS 403 (529) T ss_pred hhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhccccc Confidence 211 011111 12233333333321 1 246789999999998864222211 Q ss_pred ccc----ccccccceecccccee-ccceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEE Q lcl|NC_010147. 181 TRA----TELGDDIIVKGAFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSD 249 (274) Q Consensus 181 ~~~----s~~~~~~~~~g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~ 249 (274) ... +....+...+...|.+ .|++|+++++.|..-..+.-+|. +-|+-=.+...-.-.|+.+++-.+-.. T Consensus 404 ~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~ 483 (529) T protein:vir:10 404 PAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVMGFK 483 (529) T ss_pred ccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeeee Confidence 111 1110111112233454 34799999998854433332222 112211222222345899999999999 Q ss_pred EEEEEEEEcCccEEEEEe--------cCC-----CCCC Q lcl|NC_010147. 250 KHYVAYLYDESKAVKITK--------GSG-----SLEM 274 (274) Q Consensus 250 ~~yg~~~~~~~~~v~~~~--------~~a-----~~~~ 274 (274) .|||..+ ||-..-.--+ ..+ .-.| T Consensus 484 tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 484 TRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAY 520 (529) T ss_pred eeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 9999874 6643211111 111 0111 No 246 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=60.80 E-value=0.37 Score=22.98 Aligned_cols=250 Identities=11% Similarity=0.035 Sum_probs=94.2 Q ss_pred eeechHHHHHHHHHHH-HHHhhhhcccccccccccCCCceEEEEeeccCCccccccCCCcCCccccccceeEEEeee--e Q lcl|NC_010147. 10 NQIIPEVLAPMMQAQL-EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRK--I 86 (274) Q Consensus 10 ~~~~Pev~~~~v~~~~-~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~eg~~i~~~~~t~~~~~~~~~~--~ 86 (274) =.++|..|.... .-+ .+. +..+..=.++-..|..--+|.... +......+++ +-+. ...++-+ . T Consensus 1 i~~~P~~~g~~~-glff~~~----~v~T~~V~ie~~~~~l~lip~v~r-g~~g~~~~~~-----~~~~--~~f~~p~~~~ 67 (320) T protein:vir:10 1 MNLLPVNYGDSR-ALFAREK----KVRTRTILVEEKNGVLTLIQSREP-GSTENVAKRG-----KRKV--RSFVIPHLPL 67 (320) T ss_pred CCcCCchhhhhh-hhccCCC----CcccceEEEEEecCceeeeeccCC-CCCceeecCC-----cceE--EEEecceecc Confidence 114576665321 111 111 111111112222333333443222 1111111111 1111 1111111 1 Q ss_pred cceeeeeHHHHhh--------cCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------------- Q lcl|NC_010147. 87 AKGTSITDEALLS--------GYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---------------------- 136 (274) Q Consensus 87 ~~~~~vtd~~~~~--------~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~---------------------- 136 (274) ...+..++..... ...+.+......+.+.+....+-..+..+++.-...+ T Consensus 68 ~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~l 147 (320) T protein:vir:10 68 EDVILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFGL 147 (320) T ss_pred CCccCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEec Confidence 1112222221111 1112233333344444555555555666654211100 Q ss_pred -ccccCHH-HHHHHHHHH----hhcCCCceEEEEcHHHHHHHHhhcccccc-ccccccccceeccccc--eeccceEEEc Q lcl|NC_010147. 137 -ADITKLN-GLQSAIDKF----NDEDLEPMVLFINPLDAGKLRGDASTNFT-RATELGDDIIVKGAFG--EALGAIIVRT 207 (274) Q Consensus 137 -~~~~~~d-~i~~A~~~l----~~~~~~~~~~vv~p~~~~~L~k~~~~~~~-~~s~~~~~~~~~g~ig--~~~G~~Vv~s 207 (274) ++..+.. .+.+.+..+ +......-.++++|+++.+|.+++...-. .....+...++....+ ++.|+.+..= T Consensus 148 ~~a~~dv~~~~~~~~~~i~~~l~g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~l~~~~~~~f~~gGi~~~~Y 227 (320) T protein:vir:10 148 DNKDANVAESCRQVLRHVEDNLRGDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNRLGGDTRKGFKFGGLIFNEN 227 (320) T ss_pred CCCCccHHHHHHHHHHHHHHHhccCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhhccccccceEEecCEEEEEc Confidence 1112221 222333333 33334455789999999999765432111 0011111112211111 4667766551 Q ss_pred --------C----CCCcceEEEEeCCeEEE-----Eeec--------Ccee--eeecch-hhcceEEEEEEEEEEEEEcC Q lcl|NC_010147. 208 --------N----KLEAGTAILAKKGAVKL-----ILKR--------DFFL--EVARDA-STKTTALYSDKHYVAYLYDE 259 (274) Q Consensus 208 --------~----~v~~~~~~~~~~~a~~~-----~~~~--------~~~v--e~~rd~-~~~~~~v~~~~~yg~~~~~~ 259 (274) . .+|.++++++..|+-+. +-.. +.+. ..+.++ .++-+ +..-..--.-..+| T Consensus 228 ~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~g~~-l~~qS~PLpi~~rP 306 (320) T protein:vir:10 228 RARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGRGFD-LHSQSNVLPMCCRP 306 (320) T ss_pred ccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCcccccccccccccCCCeEE-EEeeecccccccCc Confidence 1 27888888776554221 1111 1110 001111 11122 22222222345689 Q ss_pred ccEEEEEecCCCCC Q lcl|NC_010147. 260 SKAVKITKGSGSLE 273 (274) Q Consensus 260 ~~~v~~~~~~a~~~ 273 (274) +.++++|.++++.- T Consensus 307 ~~lv~~~~~a~~~~ 320 (320) T protein:vir:10 307 GVLVELDAAAQPAG 320 (320) T ss_pred ceEEEEEecCCCCC Confidence 99999999888888 No 247 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=60.62 E-value=0.37 Score=22.96 Aligned_cols=271 Identities=12% Similarity=0.084 Sum_probs=114.5 Q ss_pred CCCccc--eeeeeechHHH----------------HHHHH-HHHHHHhhhhccccccc----ccccCC-CceE----EEE Q lcl|NC_010147. 1 MPQGIT--KTSNQIIPEVL----------------APMMQ-AQLEKKLRFASFAEVDS----TLQGQP-GDTL----TFP 52 (274) Q Consensus 1 Ma~~~T--~~~~~~~Pev~----------------~~~v~-~~~~~~~v~~~~~~~~~----~~~~~~-g~tv----~ip 52 (274) |...-- ...+-..|+.. +..++ |-|.+.+..-.....+. ++..++ ..|+ ..- T Consensus 1 ~~~~~n~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~ 80 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYL 80 (464) T ss_pred CCcchhhHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheee Confidence 221100 00000111111 11111 11221111100010000 011111 0111 112 Q ss_pred eeccCCccccccCCCcCCccccccceeEEEeeeecceeeeeHHH--HhhcCccHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010147. 53 AFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEA--LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG 130 (274) Q Consensus 53 ~~~~~~~~~~~~eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~--~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~ 130 (274) .++..|...+..|+..++.++.+......+++.+...- ..+.+ ..++..|++....+...-.+++.++-.+|=.-+. T Consensus 81 ~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r-~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~a~FyGds~ 159 (464) T protein:vir:80 81 AHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTK-NMSIATGLVNNIEDPMRILTDDAISVVAKTIEWASFYGDSD 159 (464) T ss_pred ccCccccccccccccccccCCCceEEEEEEeeeeecce-eeeeehhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 33444566677888889999999999988888653322 22232 3566789999888888888999998777633221 Q ss_pred ---cc--------------------ccccccccCHHHHHHHHHHHhhcCCCceEEEEcHHHHHHHHhhcc---ccccccc Q lcl|NC_010147. 131 ---AK--------------------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAS---TNFTRAT 184 (274) Q Consensus 131 ---a~--------------------~~~~~~~~~~d~i~~A~~~l~~~~~~~~~~vv~p~~~~~L~k~~~---~~~~~~s 184 (274) .+ ....+...+-+.|..|....+.+....+-+.||+.+.+.++.+.. ..+..+ T Consensus 160 l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~~~- 238 (464) T protein:vir:80 160 LSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVISD- 238 (464) T ss_pred cCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEEcC- Confidence 10 112234456677788888888777788999999999998854321 122221 Q ss_pred ccccc----ceeccccceeccceEEEcCCCCcceEEEEeCCeEEEEeecC---ceeeeecchhhcceEEEEEEEEEEEEE Q lcl|NC_010147. 185 ELGDD----IIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRD---FFLEVARDASTKTTALYSDKHYVAYLY 257 (274) Q Consensus 185 ~~~~~----~~~~g~ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~---~~ve~~rd~~~~~~~v~~~~~yg~~~~ 257 (274) .+.+ .-.+|.++...-++.--|.-|......-...++.--+-..| ..+++.-..+++...+.+...|.+.+. T Consensus 239 -n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~v 317 (464) T protein:vir:80 239 -NGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVVV 317 (464) T ss_pred -CCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeEEEEEEE Confidence 1211 11222222211112222222211111100100000000000 112222222233333333334444444 Q ss_pred cCcc---E-EEEEecCCCCCC Q lcl|NC_010147. 258 DESK---A-VKITKGSGSLEM 274 (274) Q Consensus 258 ~~~~---~-v~~~~~~a~~~~ 274 (274) +..+ . -.++.+.++.+- T Consensus 318 n~~GeS~ps~~~~~ti~~~~~ 338 (464) T protein:vir:80 318 SDDAESAPSDVASVVIDDKKK 338 (464) T ss_pred CCCCccccceeeeeeecCccc Confidence 3221 0 112222333332 No 248 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=58.71 E-value=0.41 Score=22.72 Aligned_cols=271 Identities=12% Similarity=0.016 Sum_probs=126.8 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccc------------------------------------ccc- Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDST------------------------------------LQG- 43 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~------------------------------------~~~- 43 (274) +.-.+|...+.+-|.+++ ++ ++...++++..+.-+.+. +.+ T Consensus 79 ~es~~t~~v~~~~P~Li~-lv-RRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea 156 (528) T protein:vir:66 79 AAGQTTGAITNVGPAVIG-MV-RRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSL 156 (528) T ss_pred cccccccccccCchhHHH-HH-HHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccc Confidence 222334444557776432 22 223344444333211100 000 Q ss_pred ------CCCce---------------EEEEeecc---------------------------------C--Ccccccc--- Q lcl|NC_010147. 44 ------QPGDT---------------LTFPAFVY---------------------------------S--GDAQVVA--- 64 (274) Q Consensus 44 ------~~g~t---------------v~ip~~~~---------------------------------~--~~~~~~~--- 64 (274) .+|.+ -+..+++. . +....++ T Consensus 157 ~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm 236 (528) T protein:vir:66 157 AAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGM 236 (528) T ss_pred ccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceeccccc Confidence 00000 00000100 0 0000010 Q ss_pred -----C---------CCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010147. 65 -----E---------GEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLE 126 (274) Q Consensus 65 -----e---------g~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~ 126 (274) | +..+++-..+..+++++.+-|+..-++|=|..- .-+-|..+++.+-++..|...|+++++. T Consensus 237 ~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~ 316 (528) T protein:vir:66 237 ATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) T ss_pred chhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHh Confidence 0 111334444555666666666555555544322 2368899999999999999999999987 Q ss_pred Hhhcccc--------cc--cccc------c-------CHHHHHHHHHHHhhc--------C-CCceEEEEcHHHHHHHHh Q lcl|NC_010147. 127 ALMGAKL--------TV--NADI------T-------KLNGLQSAIDKFNDE--------D-LEPMVLFINPLDAGKLRG 174 (274) Q Consensus 127 ~~~~a~~--------~~--~~~~------~-------~~d~i~~A~~~l~~~--------~-~~~~~~vv~p~~~~~L~k 174 (274) .+..... .+ .+.. + ..+.+-.+..++.+. . ....+++++|++.+.|.- T Consensus 317 ~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~ 396 (528) T protein:vir:66 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) T ss_pred hhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhh Confidence 7632111 00 0111 1 123333333333321 1 245899999999999965 Q ss_pred hccccccccccccc----cceeccccceec-cceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcc Q lcl|NC_010147. 175 DASTNFTRATELGD----DIIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKT 243 (274) Q Consensus 175 ~~~~~~~~~s~~~~----~~~~~g~ig~~~-G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~ 243 (274) .....+.+...... +.--.-..|.+. |++|+++++.|..-..+.-+|. +-|+-=.+.....-.|+.+++ T Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfq 476 (528) T protein:vir:66 397 ADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFH 476 (528) T ss_pred ccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCcccc Confidence 43222222111110 110111245554 5799999998854433332232 112211233334567999999 Q ss_pred eEEEEEEEEEEEEEcCccEE-------EEEecCCCCCC Q lcl|NC_010147. 244 TALYSDKHYVAYLYDESKAV-------KITKGSGSLEM 274 (274) Q Consensus 244 ~~v~~~~~yg~~~~~~~~~v-------~~~~~~a~~~~ 274 (274) =.+-...|||..+ ||-..- ++....--..| T Consensus 477 P~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~~~~~~ 513 (528) T protein:vir:66 477 PVLGFKTRYGIGI-NPFADSKSQEPSARITSGMLSKDS 513 (528) T ss_pred ceeeeeeeeceee-cCcccccCccccccccccchhhhh Confidence 9999999999874 663321 11111111111 No 249 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=58.13 E-value=0.42 Score=22.65 Aligned_cols=271 Identities=12% Similarity=0.027 Sum_probs=126.3 Q ss_pred CCCccceeeeeechHHHHHHHHHHHHHHhhhhcccccccc------------------------------------cccC Q lcl|NC_010147. 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDST------------------------------------LQGQ 44 (274) Q Consensus 1 Ma~~~T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~~------------------------------------~~~~ 44 (274) +.-.+|...+.+-|.++ .++ ++...++++..+.-+.+. +.+. T Consensus 79 ~es~~t~~v~~~~P~Li-~lv-Rra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~ 156 (528) T protein:vir:80 79 AAGQTTGAITNVGPAVI-GMV-RRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSL 156 (528) T ss_pred cccccccccccCCchhh-hHH-HHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccc Confidence 22233444455778643 232 223444444433211100 0000 Q ss_pred -------------------------CCceEEEEe-------e--------c--c---------------CCcccccc--- Q lcl|NC_010147. 45 -------------------------PGDTLTFPA-------F--------V--Y---------------SGDAQVVA--- 64 (274) Q Consensus 45 -------------------------~g~tv~ip~-------~--------~--~---------------~~~~~~~~--- 64 (274) .|+.++... . . . .+....+. T Consensus 157 ~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm 236 (528) T protein:vir:80 157 AAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGM 236 (528) T ss_pred cccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCccccccccccccccccccccccc Confidence 000000000 0 0 0 00000011 Q ss_pred -----C---------CCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010147. 65 -----E---------GEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLE 126 (274) Q Consensus 65 -----e---------g~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~ 126 (274) | +..+++-..+..+++++.+-|+-.-+.|=|..- .-+-|..+++.+-|+..|...|+++++. T Consensus 237 ~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~ 316 (528) T protein:vir:80 237 ATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) T ss_pred chhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHh Confidence 1 112333444555566666655544455544322 2368899999999999999999999987 Q ss_pred Hhhccccc--------c--cccc------c-------CHHHHHHHHHHHhhc--------C-CCceEEEEcHHHHHHHHh Q lcl|NC_010147. 127 ALMGAKLT--------V--NADI------T-------KLNGLQSAIDKFNDE--------D-LEPMVLFINPLDAGKLRG 174 (274) Q Consensus 127 ~~~~a~~~--------~--~~~~------~-------~~d~i~~A~~~l~~~--------~-~~~~~~vv~p~~~~~L~k 174 (274) .+...... + .+.. . ..+.+-.+..++... . ....+++++|++.+.|.- T Consensus 317 ~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~ 396 (528) T protein:vir:80 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) T ss_pred hhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhh Confidence 76321110 0 0111 1 123333333333321 1 245899999999999965 Q ss_pred hcccccccccccc----ccceeccccceec-cceEEEcCCCCcceEEEEeCCe------EEEEeecCceeeeecchhhcc Q lcl|NC_010147. 175 DASTNFTRATELG----DDIIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKT 243 (274) Q Consensus 175 ~~~~~~~~~s~~~----~~~~~~g~ig~~~-G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~ 243 (274) .....+.+..... .+.--.-..|.+. |++|+++++.|..-..+.-+|. +-|+-=.+.....-.|+.+++ T Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfq 476 (528) T protein:vir:80 397 ADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFH 476 (528) T ss_pred ccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCcccc Confidence 4322222111100 0100111245554 5799999998854433332232 112211233334567999999 Q ss_pred eEEEEEEEEEEEEEcCccEE-------EEEecCCCCCC Q lcl|NC_010147. 244 TALYSDKHYVAYLYDESKAV-------KITKGSGSLEM 274 (274) Q Consensus 244 ~~v~~~~~yg~~~~~~~~~v-------~~~~~~a~~~~ 274 (274) =.+-...|||..+ ||-... ++....-...| T Consensus 477 P~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ 513 (528) T protein:vir:80 477 PVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDS 513 (528) T ss_pred ceeeeeeeeceee-cCcccccCCcccccccccchhhhh Confidence 9999999999874 663321 11111111111 No 250 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=42.98 E-value=0.86 Score=20.93 Aligned_cols=272 Identities=11% Similarity=0.009 Sum_probs=118.7 Q ss_pred CCC-ccceeee-eechHH---HHHHHHHH------HHHHhhhhccccccc-ccccCCCceEEEEeeccCCccccccC-CC Q lcl|NC_010147. 1 MPQ-GITKTSN-QIIPEV---LAPMMQAQ------LEKKLRFASFAEVDS-TLQGQPGDTLTFPAFVYSGDAQVVAE-GE 67 (274) Q Consensus 1 Ma~-~~T~~~~-~~~Pev---~~~~v~~~------~~~~~v~~~~~~~~~-~~~~~~g~tv~ip~~~~~~~~~~~~e-g~ 67 (274) |=- ..++..+ .|.-|. |+..-... ...........-.+. ......+.+.++...-....++...+ +. T Consensus 117 mRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~ 196 (468) T protein:vir:10 117 MRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANR 196 (468) T ss_pred EEEEecCCCCccceeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCc Confidence 110 0000000 010000 00000000 000000000000000 00000111122211111112232332 23 Q ss_pred cCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------cc Q lcl|NC_010147. 68 KIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------NA 137 (274) Q Consensus 68 ~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~------~~ 137 (274) .+++-..+..+++++.+-++..-++|=|..- .-+-|..+++.+-++..|...|+++++..+.+.+... .. T Consensus 197 ~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~ 276 (468) T protein:vir:10 197 LFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANA 276 (468) T ss_pred ccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccccc Confidence 4556666677777777777655556644322 2468899999999999999999999999886543221 11 Q ss_pred cc------cC----HHHHHHHHHHHh--------hc-CCCceEEEEcHHHHHHHHhhcccccccccccccc-----ceec Q lcl|NC_010147. 138 DI------TK----LNGLQSAIDKFN--------DE-DLEPMVLFINPLDAGKLRGDASTNFTRATELGDD-----IIVK 193 (274) Q Consensus 138 ~~------~~----~d~i~~A~~~l~--------~~-~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~~~~-----~~~~ 193 (274) .. .+ .+.+-.+..++. +. -...++++++|.+.+.|-...-.++....+.... .--+ T Consensus 277 Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~t 356 (468) T protein:vir:10 277 GIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDT 356 (468) T ss_pred ccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccC Confidence 11 11 122222222221 11 2467899999999999975333333332221100 0001 Q ss_pred c--cccee-ccceEEEcCCCC-----cceEEEEeCCe------EEEEeecCceeeeecchhhcceEEEEEEEEEEEEEcC Q lcl|NC_010147. 194 G--AFGEA-LGAIIVRTNKLE-----AGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDE 259 (274) Q Consensus 194 g--~ig~~-~G~~Vv~s~~v~-----~~~~~~~~~~a------~~~~~~~~~~ve~~rd~~~~~~~v~~~~~yg~~~~~~ 259 (274) | ..|.+ .|++|+++.... +|-.+.++ |. +-|+-=.+.....--|+.+++-.+-...|||..+ || T Consensus 357 g~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP 434 (468) T protein:vir:10 357 GNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP 434 (468) T ss_pred cceEEEEecCceEEEEccccccCCccceEEEEEe-cCcceeceeeeccccccccccccCCCcccceeeeeeeeceee-cc Confidence 1 24454 357999996542 22222222 21 1122111222223338889999999999999874 77 Q ss_pred ccEEE-EEecC-CCCCC Q lcl|NC_010147. 260 SKAVK-ITKGS-GSLEM 274 (274) Q Consensus 260 ~~~v~-~~~~~-a~~~~ 274 (274) -.... ++-.. -...| T Consensus 435 ~~~~~~~~~g~~~~~~~ 451 (468) T protein:vir:10 435 FVTTNGLYNGTPDGEAL 451 (468) T ss_pred cceeccccCCCcccccc Confidence 54321 11110 00112 No 251 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=39.23 E-value=1 Score=20.52 Aligned_cols=255 Identities=11% Similarity=0.080 Sum_probs=113.5 Q ss_pred CC--Cccceeeee--echHHHHHHHHHHHHHHhhhhcccccccccccCCCceEEEEeeccCCccccccC-CCcCCccccc Q lcl|NC_010147. 1 MP--QGITKTSNQ--IIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILE 75 (274) Q Consensus 1 Ma--~~~T~~~~~--~~Pev~~~~v~~~~~~~~v~~~~~~~~~~~~~~~g~tv~ip~~~~~~~~~~~~e-g~~i~~~~~t 75 (274) +| ++....+.- +.|. .+..+..++.+++-|-+.-++-.. ....|..|-+-.-+.+..-.+.+. ++.-|-+-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~Invv~V-~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~ 93 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPS-VQQKLEQRIQESSEFLKQINVYGV-DELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSA 93 (338) T ss_pred HHHHhCCCcccceeeeCHH-HHHHHHHHHHHHHHhhccCceecc-cceeeeEeeeccCccccccccCCCCCccccccccc Confidence 22 333333332 3554 666777888888877665444211 112233333321111111111111 1111221112 Q ss_pred cceeEEEeeeec--ceeeeeHHHHhhcCccHHHHHHHHHHHHHHHHHHHHHHHH-------------------------h Q lcl|NC_010147. 76 TKKREAKIRKIA--KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEA-------------------------L 128 (274) Q Consensus 76 ~~~~~~~~~~~~--~~~~vtd~~~~~~~~d~~~~~~~~~a~~~a~~~d~~~~~~-------------------------~ 128 (274) .+.......+.. ..+....++.=...+|+...+.+.+.+.+|...=.-=++. + T Consensus 94 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 173 (338) T protein:vir:11 94 LDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQY 173 (338) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 333444444443 3444444555556678877777777777654321111111 1 Q ss_pred hc---------ccc----cc----cccccCHHHHH-HHHHH-HhhcC--CCceEEEEcHHHHHH----HHhhcccccccc Q lcl|NC_010147. 129 MG---------AKL----TV----NADITKLNGLQ-SAIDK-FNDED--LEPMVLFINPLDAGK----LRGDASTNFTRA 183 (274) Q Consensus 129 ~~---------a~~----~~----~~~~~~~d~i~-~A~~~-l~~~~--~~~~~~vv~p~~~~~----L~k~~~~~~~~~ 183 (274) +. .+. .. .++-.++|+++ ||... +.+.. +..-+++|+.+..+. |.+.. -.. T Consensus 174 Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~----~~p 249 (338) T protein:vir:11 174 RNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKD----QPA 249 (338) T ss_pred HhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcC----CCh Confidence 00 000 00 12234667655 57754 45433 334578888765542 22211 011 Q ss_pred ccccccceeccc---cceeccceEEEcCCCCcceEEEEeCCeEEEEeecCce---ee----eecchhhc-ceEEEEEEEE Q lcl|NC_010147. 184 TELGDDIIVKGA---FGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFF---LE----VARDASTK-TTALYSDKHY 252 (274) Q Consensus 184 s~~~~~~~~~g~---ig~~~G~~Vv~s~~v~~~~~~~~~~~a~~~~~~~~~~---ve----~~rd~~~~-~~~v~~~~~y 252 (274) + +.+.... ..++.|+|.+.-+++|.+..++-.-..+.+..+.+.. ++ .+|-+... .+.=++...| T Consensus 250 t----E~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~ 325 (338) T protein:vir:11 250 T----EKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIENYESSNDAYVVEDY 325 (338) T ss_pred H----HHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhccceeeecc Confidence 1 1121111 3479999999999999999988888888777665532 11 11111111 1111111222 Q ss_pred EEEEEcCccEEEEEecCCCCC Q lcl|NC_010147. 253 VAYLYDESKAVKITKGSGSLE 273 (274) Q Consensus 253 g~~~~~~~~~v~~~~~~a~~~ 273 (274) | +.+.+. ...-.| T Consensus 326 ~-------~~a~ie-ni~~~~ 338 (338) T protein:vir:11 326 G-------LGCLVE-NIEVAE 338 (338) T ss_pred c-------cEEEee-cceecC Confidence 2 222221 111111 No 252 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=36.57 E-value=1.2 Score=20.22 Aligned_cols=273 Identities=11% Similarity=0.022 Sum_probs=118.3 Q ss_pred CCCccceeee-eechH-HHHHHH---------------HHHH------HHHhhhhcccccccccccC------------- Q lcl|NC_010147. 1 MPQGITKTSN-QIIPE-VLAPMM---------------QAQL------EKKLRFASFAEVDSTLQGQ------------- 44 (274) Q Consensus 1 Ma~~~T~~~~-~~~Pe-v~~~~v---------------~~~~------~~~~v~~~~~~~~~~~~~~------------- 44 (274) ++...+..-. .+-|. -|+... ...+ ....++.+.........+. T Consensus 134 ~~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~ 213 (519) T protein:vir:10 134 IAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTAL 213 (519) T ss_pred cccccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccc Confidence 1110000000 00110 011000 0000 0000000000000000000 Q ss_pred --CCceEEEEeeccCCcccc---cc--CCCcCCccccccceeEEEeeeecceeeeeHHHHh----hcCccHHHHHHHHHH Q lcl|NC_010147. 45 --PGDTLTFPAFVYSGDAQV---VA--EGEKIPTDILETKKREAKIRKIAKGTSITDEALL----SGYGDPQGEQVRQHG 113 (274) Q Consensus 45 --~g~tv~ip~~~~~~~~~~---~~--eg~~i~~~~~t~~~~~~~~~~~~~~~~vtd~~~~----~~~~d~~~~~~~~~a 113 (274) .|...++..--.+..++. ++ .+..+++-..+..+++++.+-|+..-+.|=|..- .-+-|..+++.+-++ T Consensus 214 ~~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILS 293 (519) T protein:vir:10 214 VEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILA 293 (519) T ss_pred cccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHH Confidence 011111111101111111 10 1223555666777777777776655555544322 236889999999999 Q ss_pred HHHHHHHHHHHHHHhhcc--------ccc--ccccccC-------------HHHHHHHHHHHhhc--------C-CCceE Q lcl|NC_010147. 114 LAHANKVDNDVLEALMGA--------KLT--VNADITK-------------LNGLQSAIDKFNDE--------D-LEPMV 161 (274) Q Consensus 114 ~~~a~~~d~~~~~~~~~a--------~~~--~~~~~~~-------------~d~i~~A~~~l~~~--------~-~~~~~ 161 (274) ..|...|+++++..+.-. +.+ ..+..++ .+.+-.+..++.+. . ...++ T Consensus 294 TEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ 373 (519) T protein:vir:10 294 TEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNF 373 (519) T ss_pred HHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccE Confidence 999999999999765211 111 0011122 12222233333221 1 24589 Q ss_pred EEEcHHHHHHHHhhcccccccccccccccee--c--ccccee-ccceEEEcCCCCcceEEEEeCCe------EEEEeecC Q lcl|NC_010147. 162 LFINPLDAGKLRGDASTNFTRATELGDDIIV--K--GAFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRD 230 (274) Q Consensus 162 ~vv~p~~~~~L~k~~~~~~~~~s~~~~~~~~--~--g~ig~~-~G~~Vv~s~~v~~~~~~~~~~~a------~~~~~~~~ 230 (274) ++++|++.+.|-......+..........-. . -..|.+ .|++|+++++.|..-..+.-+|. +-|+-=.+ T Consensus 374 ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~ 453 (519) T protein:vir:10 374 IIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA 453 (519) T ss_pred EEEchHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccc Confidence 9999999999976553323222211111100 1 123554 45799999998854333322232 11211112 Q ss_pred ceeeeecchhhcceEEEEEEEEEEEEEcCccE-------EEEEec----CCCCCC Q lcl|NC_010147. 231 FFLEVARDASTKTTALYSDKHYVAYLYDESKA-------VKITKG----SGSLEM 274 (274) Q Consensus 231 ~~ve~~rd~~~~~~~v~~~~~yg~~~~~~~~~-------v~~~~~----~a~~~~ 274 (274) ...-.--|+.+++-.+-...|||..+ ||-.- .++.-. +.+.+| T Consensus 454 l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g~~~~a~~~~~ 507 (519) T protein:vir:10 454 LTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) T ss_pred cccccccCCccccceeeeeeeeceee-cCcccccccCccceeccCchhhhccccC Confidence 22223458899999999999999874 66331 112111 112223 No 253 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=22.93 E-value=2.4 Score=18.52 Aligned_cols=269 Identities=12% Similarity=0.033 Sum_probs=125.4 Q ss_pred CCCcc-ceeeeeechHHHHHHHHHHHHHHhhhhccccccc------------------------------ccccCCCceE Q lcl|NC_010147. 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS------------------------------TLQGQPGDTL 49 (274) Q Consensus 1 Ma~~~-T~~~~~~~Pev~~~~v~~~~~~~~v~~~~~~~~~------------------------------~~~~~~g~tv 49 (274) -+..+ |.....+-|.+++ + .++...++++..+.-+.+ .+.|..+..- T Consensus 69 i~~st~t~~v~~~~P~Li~-l-vRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~ 146 (470) T protein:vir:10 69 SADATAAGPVAGFDPVLIS-L-IRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLD 146 (470) T ss_pred cccccccccccccCchhhh-h-HHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCccccccc Confidence 22222 3333457887655 3 344555566554432211 1222111000 Q ss_pred E----------------------EEeec----cCCccccccC----------------CCcCCccccccceeEEEeeeec Q lcl|NC_010147. 50 T----------------------FPAFV----YSGDAQVVAE----------------GEKIPTDILETKKREAKIRKIA 87 (274) Q Consensus 50 ~----------------------ip~~~----~~~~~~~~~e----------------g~~i~~~~~t~~~~~~~~~~~~ 87 (274) . -|... .......++- +.++++-..+..+++++.+-|+ T Consensus 147 ~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRa 226 (470) T protein:vir:10 147 DTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRA 226 (470) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccc Confidence 0 00000 0000000110 1223344445555666666555 Q ss_pred ceeeeeHHHHh----hcCccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cc------cc----CHHHHHH Q lcl|NC_010147. 88 KGTSITDEALL----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN------AD------IT----KLNGLQS 147 (274) Q Consensus 88 ~~~~vtd~~~~----~~~~d~~~~~~~~~a~~~a~~~d~~~~~~~~~a~~~~~------~~------~~----~~d~i~~ 147 (274) ..-++|=|..- .-+-|..+++.+-++..|...|+++++..+.+.+.... .. .. ..+.+-. T Consensus 227 LKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~ 306 (470) T protein:vir:10 227 LKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKG 306 (470) T ss_pred eeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHH Confidence 44455544322 23688999999999999999999999998866443211 11 11 1222333 Q ss_pred HHHHHh--------h-cCCCceEEEEcHHHHHHHHhhccccccccccc--cccceecccccee-ccceEEEcCCCC---- Q lcl|NC_010147. 148 AIDKFN--------D-EDLEPMVLFINPLDAGKLRGDASTNFTRATEL--GDDIIVKGAFGEA-LGAIIVRTNKLE---- 211 (274) Q Consensus 148 A~~~l~--------~-~~~~~~~~vv~p~~~~~L~k~~~~~~~~~s~~--~~~~~~~g~ig~~-~G~~Vv~s~~v~---- 211 (274) +..++. . .-...++++++|.+.+.|--..-.++.+..+. ..+..-+-..|.+ .|++|++++.+. T Consensus 307 l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~ 386 (470) T protein:vir:10 307 LIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGA 386 (470) T ss_pred HHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccccccccCCCCceEEEEecCceEEEeeccccccCc Confidence 333332 1 12467899999999998844332233222111 0011111124555 347999997432 Q ss_pred ---cceEEEEeCCeEE----EEeecCceeee--ecchhhcceEEEEEEEEEEEEEcCccEEEEEec----CCCCCC Q lcl|NC_010147. 212 ---AGTAILAKKGAVK----LILKRDFFLEV--ARDASTKTTALYSDKHYVAYLYDESKAVKITKG----SGSLEM 274 (274) Q Consensus 212 ---~~~~~~~~~~a~~----~~~~~~~~ve~--~rd~~~~~~~v~~~~~yg~~~~~~~~~v~~~~~----~a~~~~ 274 (274) +|-.+.++ |.-. ++...=++.+. --|+.+++=.+-...|||..+ ||-.... +-. +....| T Consensus 387 a~~dy~~vG~K-G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~-~~~~~~i~~~~n~ 459 (470) T protein:vir:10 387 AATQYYVVGYK-GSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYGLVE-NPFSQGT-TQGLGTLTRNSNR 459 (470) T ss_pred ccccEEEEEEe-cCcceecceeeccccccccCCCCCCccccceeeeeeeeceee-cCcccCC-CcccccccCCCCc Confidence 12222222 2111 11110022222 237888999999999999874 6653221 111 112233 Done!