Query         lcl|NC_019917.1_cdsid_YP_007236797.1 [gene=G167_gp51] [protein=major capsid protein] [protein_id=YP_007236797.1] [location=30331..31425]
Match_columns 364
No_of_seqs    75 out of 87
Neff          6.0
Searched_HMMs 1612
Date          Thu Nov 7 17:09:28 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_51 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_51_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:93696 Length: 364 100.0 1E-168 7E-172 941.4 32.4 364 1-364 1-364 (364)
2 protein:vir:105610 Length: 430 100.0 2E-152 1E-155 852.1 28.3 358 1-364 1-427 (430)
3 protein:vir:104439 Length: 404 100.0 4E-145 3E-148 812.2 29.6 352 5-362 1-404 (404)
4 protein:vir:10123 Length: 404 100.0 4E-145 3E-148 812.2 29.6 352 5-362 1-404 (404)
5 protein:vir:819 Length: 404 # 100.0 4E-145 3E-148 812.2 29.6 352 5-362 1-404 (404)
6 protein:vir:3298 Length: 404 # 100.0 4E-145 3E-148 812.2 29.6 352 5-362 1-404 (404)
7 protein:vir:2770 Length: 318 # 100.0 4E-112 2E-115 631.3 24.6 288 1-299 1-318 (318)
8 protein:vir:95875 Length: 401 99.5 4.8E-15 3E-18 99.1 16.6 325 1-362 11-401 (401)
9 protein:vir:105334 Length: 276 99.4 8.9E-15 5.5E-18 97.6 16.5 263 1-364 1-272 (276)
10 protein:vir:3613 Length: 272 # 99.4 1.4E-14 8.9E-18 96.5 16.3 265 1-364 1-268 (272)
11 protein:vir:96123 Length: 274 99.4 7.1E-14 4.4E-17 92.7 16.6 267 1-364 1-272 (274)
12 protein:vir:80180 Length: 381 99.3 6.8E-14 4.2E-17 92.8 15.4 299 1-364 15-331 (381)
13 protein:vir:9820 Length: 272 # 99.3 1.8E-13 1.1E-16 90.4 16.5 268 1-362 1-272 (272)
14 protein:vir:3033 Length: 272 # 99.3 1.8E-13 1.1E-16 90.4 16.5 268 1-362 1-272 (272)
15 protein:vir:93742 Length: 274 99.3 2E-13 1.3E-16 90.2 15.9 271 1-364 1-274 (274)
16 protein:vir:94494 Length: 274 99.3 5.3E-13 3.3E-16 87.9 16.8 263 1-364 1-274 (274)
17 protein:vir:97433 Length: 274 99.3 5.3E-13 3.3E-16 87.9 16.8 263 1-364 1-274 (274)
18 protein:vir:95898 Length: 274 99.3 6.8E-13 4.2E-16 87.3 17.0 264 1-364 1-269 (274)
19 protein:vir:96262 Length: 274 99.3 6.8E-13 4.2E-16 87.3 17.0 264 1-364 1-269 (274)
20 protein:vir:1239 Length: 274 # 99.3 6.8E-13 4.2E-16 87.3 16.5 264 1-364 1-272 (274)
21 protein:vir:96833 Length: 275 99.3 2.9E-13 1.8E-16 89.3 14.2 264 1-364 1-273 (275)
22 protein:vir:95107 Length: 270 99.2 4.9E-13 3E-16 88.1 15.1 257 1-364 1-267 (270)
23 protein:vir:80930 Length: 278 99.1 5.6E-12 3.5E-15 82.3 16.2 274 1-363 1-278 (278)
24 protein:vir:80213 Length: 334 99.1 2.4E-11 1.5E-14 78.8 19.6 309 1-361 1-334 (334)
25 protein:vir:78739 Length: 332 99.1 1E-11 6.5E-15 80.8 16.2 293 1-359 7-332 (332)
26 protein:vir:1541 Length: 347 # 99.1 7E-12 4.3E-15 81.7 14.6 307 1-361 1-347 (347)
27 protein:vir:7990 Length: 273 # 99.1 1.7E-11 1.1E-14 79.6 16.3 271 1-361 1-273 (273)
28 protein:vir:10450 Length: 344 99.1 1.5E-10 9.5E-14 74.4 20.9 301 1-359 1-344 (344)
29 protein:vir:8885 Length: 347 # 99.0 7.8E-11 4.9E-14 76.0 18.6 303 1-360 1-347 (347)
30 protein:vir:3364 Length: 347 # 99.0 5E-11 3.1E-14 77.1 16.8 309 1-361 1-347 (347)
31 protein:vir:2201 Length: 345 # 99.0 2.3E-10 1.4E-13 73.5 20.3 302 1-361 1-345 (345)
32 protein:vir:94622 Length: 341 99.0 9.3E-11 5.8E-14 75.6 17.9 298 1-364 1-336 (341)
33 protein:vir:94576 Length: 347 99.0 1.5E-10 9.4E-14 74.4 18.3 302 1-360 1-347 (347)
34 protein:vir:739 Length: 231 # 99.0 2.7E-11 1.7E-14 78.6 13.1 227 49-360 1-231 (231)
35 protein:vir:102944 Length: 330 98.9 3E-10 1.9E-13 72.8 16.6 284 1-364 1-298 (330)
36 protein:vir:1583 Length: 351 # 98.9 2.4E-10 1.5E-13 73.4 15.0 288 1-364 1-296 (351)
37 protein:vir:5974 Length: 324 # 98.8 1.6E-10 9.8E-14 74.3 13.4 282 1-364 1-292 (324)
38 protein:vir:78935 Length: 335 98.8 2.2E-09 1.4E-12 68.0 19.5 308 1-364 1-332 (335)
39 protein:vir:6324 Length: 335 # 98.8 4.9E-09 3.1E-12 66.1 19.4 309 1-364 1-332 (335)
40 protein:vir:94711 Length: 347 98.8 3.3E-09 2.1E-12 67.1 18.1 304 1-361 1-347 (347)
41 protein:vir:105822 Length: 273 98.7 4.1E-09 2.5E-12 66.6 16.5 271 1-361 1-273 (273)
42 protein:vir:102605 Length: 273 98.7 4.1E-09 2.5E-12 66.6 16.5 271 1-361 1-273 (273)
43 protein:vir:97031 Length: 402 98.7 9.3E-09 5.8E-12 64.6 18.4 305 1-364 1-370 (402)
44 protein:vir:100057 Length: 375 98.7 8.7E-09 5.4E-12 64.8 18.1 312 1-364 1-373 (375)
45 protein:vir:103323 Length: 364 98.6 1.5E-08 9.5E-12 63.4 16.0 304 1-364 1-342 (364)
46 protein:vir:99675 Length: 324 98.5 9.9E-08 6.1E-11 59.0 18.1 274 43-364 1-312 (324)
47 protein:vir:102655 Length: 322 98.4 2.5E-07 1.6E-10 56.7 18.0 297 1-362 13-322 (322)
48 protein:vir:80446 Length: 367 98.3 5.8E-08 3.6E-11 60.3 12.4 314 1-364 1-337 (367)
49 protein:vir:105645 Length: 400 98.1 2E-06 1.3E-09 51.8 18.7 308 1-364 1-368 (400)
50 protein:vir:79008 Length: 299 98.1 3.4E-06 2.1E-09 50.5 19.2 278 1-362 1-299 (299)
51 protein:vir:78387 Length: 349 98.0 1.6E-06 9.7E-10 52.4 15.3 301 1-364 1-317 (349)
52 protein:vir:94989 Length: 349 97.8 5E-06 3.1E-09 49.6 15.8 302 1-364 1-317 (349)
53 protein:vir:7019 Length: 401 # 97.7 2.5E-05 1.5E-08 45.8 18.6 308 1-364 1-356 (401)
54 protein:vir:99075 Length: 392 97.7 1.2E-05 7.2E-09 47.6 15.7 294 1-364 1-310 (392)
55 protein:vir:3525 Length: 423 # 97.6 1.4E-05 8.4E-09 47.3 15.0 291 1-364 1-307 (423)
56 protein:vir:108303 Length: 418 97.4 7.6E-05 4.7E-08 43.2 18.1 280 1-364 1-288 (418)
57 protein:vir:105374 Length: 423 96.8 0.00035 2.1E-07 39.6 16.0 289 1-364 1-307 (423)
58 protein:vir:8102 Length: 543 # 96.7 0.00044 2.7E-07 39.0 14.7 286 1-360 249-543 (543)
59 protein:vir:97331 Length: 319 96.5 0.00056 3.5E-07 38.4 16.9 270 1-364 19-299 (319)
60 protein:vir:94800 Length: 319 96.5 0.00056 3.5E-07 38.4 16.9 270 1-364 19-299 (319)
61 protein:vir:81160 Length: 371 96.2 0.00088 5.5E-07 37.3 14.7 271 1-359 91-371 (371)
62 protein:vir:4511 Length: 409 # 96.1 0.00099 6.2E-07 37.0 15.2 280 1-362 117-409 (409)
63 protein:vir:1383 Length: 421 # 96.1 0.00082 5.1E-07 37.5 13.2 271 1-364 114-396 (421)
64 protein:vir:3991 Length: 404 # 96.1 0.0011 6.6E-07 36.9 15.1 273 1-364 116-398 (404)
65 protein:vir:4997 Length: 397 # 96.0 0.0012 7.4E-07 36.6 14.8 271 1-364 109-388 (397)
66 protein:vir:174 Length: 423 # 95.8 0.0015 9.2E-07 36.1 14.8 287 1-364 1-310 (423)
67 protein:vir:95763 Length: 297 95.5 0.0019 1.2E-06 35.5 15.8 280 1-364 9-293 (297)
68 protein:vir:102119 Length: 404 95.5 0.002 1.2E-06 35.4 14.6 285 1-364 110-403 (404)
69 protein:vir:107120 Length: 329 95.2 0.0025 1.5E-06 34.9 17.8 271 1-364 30-310 (329)
70 protein:vir:105004 Length: 392 95.1 0.0027 1.7E-06 34.6 15.0 276 1-364 106-389 (392)
71 protein:vir:102082 Length: 392 95.1 0.0027 1.7E-06 34.6 15.0 276 1-364 106-389 (392)
72 protein:vir:102873 Length: 392 95.1 0.0027 1.7E-06 34.6 15.0 276 1-364 106-389 (392)
73 protein:vir:107593 Length: 392 95.1 0.0027 1.7E-06 34.6 15.0 276 1-364 106-389 (392)
74 protein:vir:4856 Length: 293 # 94.5 0.0041 2.6E-06 33.6 15.5 273 1-364 5-286 (293)
75 protein:vir:41 Length: 299 # N 94.2 0.005 3.1E-06 33.2 15.8 278 1-363 6-299 (299)
76 protein:vir:4953 Length: 397 # 94.2 0.0052 3.2E-06 33.1 14.4 270 1-364 109-388 (397)
77 protein:vir:4830 Length: 397 # 93.7 0.0065 4E-06 32.6 13.8 273 1-364 109-390 (397)
78 protein:vir:9759 Length: 303 # 93.7 0.0066 4.1E-06 32.5 18.3 287 1-362 1-303 (303)
79 protein:vir:1781 Length: 221 # 93.6 0.0055 3.4E-06 33.0 11.0 196 93-330 1-221 (221)
80 protein:vir:99920 Length: 311 92.8 0.0098 6.1E-06 31.6 16.9 294 1-364 1-308 (311)
81 protein:vir:7409 Length: 408 # 92.7 0.01 6.5E-06 31.5 14.8 270 1-364 116-398 (408)
82 protein:vir:1886 Length: 385 # 91.9 0.014 8.4E-06 30.8 14.0 271 1-362 105-385 (385)
83 protein:vir:191 Length: 385 # 91.9 0.014 8.4E-06 30.8 14.0 271 1-362 105-385 (385)
84 protein:vir:98339 Length: 415 91.7 0.015 9.1E-06 30.6 15.5 277 1-364 121-409 (415)
85 protein:vir:79987 Length: 415 91.7 0.015 9.1E-06 30.6 15.5 277 1-364 121-409 (415)
86 protein:vir:81100 Length: 415 91.7 0.015 9.1E-06 30.6 15.5 277 1-364 121-409 (415)
87 protein:vir:105522 Length: 423 91.7 0.015 9.1E-06 30.6 16.2 282 1-364 1-307 (423)
88 protein:vir:100135 Length: 418 91.3 0.017 1E-05 30.3 15.3 274 1-364 135-418 (418)
89 protein:vir:78920 Length: 290 90.2 0.022 1.4E-05 29.7 15.1 277 1-360 1-290 (290)
90 protein:vir:485 Length: 407 # 89.8 0.024 1.5E-05 29.4 16.3 289 1-364 106-403 (407)
91 protein:vir:95131 Length: 325 89.5 0.026 1.6E-05 29.3 10.6 288 1-364 1-303 (325)
92 protein:vir:3845 Length: 395 # 89.4 0.026 1.6E-05 29.2 13.3 272 1-364 107-391 (395)
93 protein:vir:9574 Length: 300 # 87.6 0.038 2.3E-05 28.4 17.2 287 1-364 1-297 (300)
94 protein:vir:4600 Length: 415 # 87.4 0.039 2.4E-05 28.3 14.9 271 1-364 121-409 (415)
95 protein:vir:4700 Length: 415 # 87.4 0.039 2.4E-05 28.3 14.9 271 1-364 121-409 (415)
96 protein:vir:4339 Length: 395 # 87.3 0.04 2.5E-05 28.3 15.3 275 1-359 113-395 (395)
97 protein:vir:105464 Length: 346 86.1 0.048 3E-05 27.8 16.8 285 1-364 1-302 (346)
98 protein:vir:105905 Length: 304 85.4 0.053 3.3E-05 27.6 16.4 283 1-361 9-304 (304)
99 protein:vir:94142 Length: 304 85.4 0.053 3.3E-05 27.6 16.4 283 1-361 9-304 (304)
100 protein:vir:80684 Length: 315 84.1 0.063 3.9E-05 27.2 16.1 292 1-364 1-311 (315)
101 protein:vir:4456 Length: 401 # 84.1 0.063 3.9E-05 27.2 14.5 284 1-359 107-401 (401)
102 protein:vir:1638 Length: 298 # 83.6 0.067 4.2E-05 27.0 14.8 287 1-364 1-296 (298)
103 protein:vir:104085 Length: 320 83.3 0.07 4.3E-05 26.9 13.7 281 1-362 14-320 (320)
104 protein:vir:9410 Length: 415 # 83.0 0.072 4.5E-05 26.8 14.6 271 1-364 121-409 (415)
105 protein:vir:3136 Length: 322 # 82.1 0.08 4.9E-05 26.6 11.1 290 1-364 1-322 (322)
106 protein:vir:4226 Length: 326 # 79.7 0.1 6.3E-05 26.0 13.7 282 1-362 20-326 (326)
107 protein:vir:9309 Length: 324 # 78.9 0.11 6.8E-05 25.8 16.9 280 1-364 27-312 (324)
108 protein:vir:2430 Length: 318 # 78.5 0.11 7.1E-05 25.8 13.6 284 1-364 14-318 (318)
109 protein:vir:94771 Length: 298 77.8 0.12 7.5E-05 25.6 17.5 286 1-364 1-296 (298)
110 protein:vir:81070 Length: 390 76.4 0.14 8.4E-05 25.3 13.2 265 1-359 113-390 (390)
111 protein:vir:100247 Length: 425 75.0 0.15 9.4E-05 25.1 15.0 290 1-362 130-425 (425)
112 protein:vir:1268 Length: 397 # 74.0 0.16 0.0001 24.9 12.6 267 1-361 123-397 (397)
113 protein:vir:6242 Length: 390 # 73.7 0.17 0.0001 24.8 12.9 268 1-360 111-390 (390)
114 protein:vir:9927 Length: 295 # 73.3 0.17 0.00011 24.8 12.6 271 1-364 1-291 (295)
115 protein:vir:1025 Length: 408 # 71.2 0.2 0.00012 24.4 14.6 272 1-364 116-398 (408)
116 protein:vir:100172 Length: 394 67.0 0.26 0.00016 23.8 14.9 261 1-364 111-389 (394)
117 protein:vir:2344 Length: 397 # 65.1 0.29 0.00018 23.5 15.2 277 1-364 10-337 (397)
118 protein:vir:104256 Length: 458 64.4 0.3 0.00019 23.4 12.5 289 1-361 162-458 (458)
119 protein:vir:97053 Length: 390 63.0 0.33 0.0002 23.3 17.1 265 1-359 113-390 (390)
120 protein:vir:100884 Length: 389 58.8 0.41 0.00025 22.7 14.6 263 1-364 109-387 (389)
121 protein:vir:78223 Length: 333 56.5 0.46 0.00028 22.5 16.2 296 1-360 10-333 (333)
122 protein:vir:1328 Length: 392 # 54.6 0.5 0.00031 22.2 14.2 276 1-362 111-392 (392)
123 protein:vir:1433 Length: 435 # 53.3 0.53 0.00033 22.1 18.3 293 1-364 132-430 (435)
124 protein:vir:7771 Length: 330 # 46.6 0.73 0.00045 21.3 18.1 291 1-364 10-325 (330)
125 protein:vir:8187 Length: 311 # 44.0 0.82 0.00051 21.0 17.4 292 1-364 1-307 (311)
126 protein:vir:97148 Length: 324 41.1 0.94 0.00058 20.7 14.6 277 1-364 27-319 (324)
127 protein:vir:96223 Length: 324 41.1 0.94 0.00059 20.7 16.6 280 1-364 27-312 (324)
128 protein:vir:79712 Length: 285 40.3 0.98 0.00061 20.6 16.0 270 1-361 1-285 (285)
129 protein:vir:78830 Length: 324 39.9 1 0.00062 20.6 17.1 278 1-364 27-317 (324)
130 protein:vir:96392 Length: 324 39.9 1 0.00062 20.6 17.1 278 1-364 27-317 (324)
131 protein:vir:7855 Length: 497 # 39.6 1 0.00063 20.6 15.2 306 1-363 151-497 (497)
132 protein:vir:101650 Length: 497 39.6 1 0.00063 20.6 15.2 306 1-363 151-497 (497)
133 protein:vir:103955 Length: 324 39.3 1 0.00064 20.5 16.8 280 1-364 27-320 (324)
134 protein:vir:96762 Length: 632 36.7 1.2 0.00072 20.2 14.4 269 1-364 357-630 (632)
135 protein:vir:6212 Length: 434 # 34.2 1.3 0.00081 19.9 12.3 277 1-364 141-434 (434)
136 protein:vir:79928 Length: 393 31.9 1.5 0.00091 19.7 9.4 284 1-364 59-381 (393)
137 protein:vir:78523 Length: 338 30.7 1.6 0.00097 19.5 17.8 299 1-362 10-338 (338)
138 protein:vir:80376 Length: 435 29.4 1.7 0.001 19.4 19.0 293 1-364 132-430 (435)
139 protein:vir:102335 Length: 312 28.9 1.7 0.0011 19.3 17.7 280 1-364 1-312 (312)
140 protein:vir:2504 Length: 305 # 28.7 1.7 0.0011 19.3 14.8 288 1-364 1-300 (305)
141 protein:vir:105038 Length: 428 27.4 1.8 0.0011 19.1 18.3 296 1-364 125-425 (428)
142 protein:vir:10364 Length: 390 24.7 2.1 0.0013 18.8 16.7 264 1-359 114-390 (390)
143 protein:vir:99749 Length: 324 24.4 2.2 0.0013 18.7 16.6 280 1-364 27-320 (324)
144 protein:vir:3870 Length: 400 # 24.2 2.2 0.0014 18.7 13.2 255 1-360 133-400 (400)
145 protein:vir:962 Length: 397 # 23.9 2.2 0.0014 18.7 12.0 255 1-359 132-397 (397)
146 protein:vir:108211 Length: 318 20.9 2.7 0.0017 18.2 8.2 280 1-364 1-314 (318)

No 1
>protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350
Probab=100.00 E-value=1.1e-168 Score=941.37 Aligned_cols=364 Identities=94% Similarity=1.408 Sum_probs=361.9

Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc
Q lcl|NC_019917.
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||+|++++|||+++|+|+++||++++++|+|.++|||+|+++|||+++||+|++||+|+|+|++||+|+||+||++|||| T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~leGn 80 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDARVEGK 80 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccCceeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) ||+|+|++|+|+|||.||||+++|+|+||||+||||++||++|++||++++||++|+||||+||+++++..+++|+++++ T Consensus 81 ee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~~ 160 (364) T protein:vir:93 81 EESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYAG 160 (364) T ss_pred ccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 
161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) |||+|||++||||++++|++++|+++|+||+++||+|+++|++++..+|++++|+||+++|+++|||||||+|++|||++ T Consensus 161 N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) T protein:vir:93 161 NPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMRTA 240 (364) T ss_pred cccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCccc Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFDW 320 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~w 320 (364) +||+|+|+||+|++++|++||||+|++||||||+||||+++|+|++++++++|+++|+|||||||+++|||+++|+||+| T Consensus 241 t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~~w 320 (364) T protein:vir:93 241 AGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFDW 320 (364) T ss_pred CCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCCCcee Confidence 99999999999988999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
321 EETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 321 ~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) +||.+||||+.||++++|+|+||+|||++|||||+|||||+||| T Consensus 321 ~Ee~~D~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 321 EETVKDYGNEPAIAAGFIAGMKKARFNNKDFGVISIDTAAKKHS 364 (364) T ss_pred eecccCCCCchhhhhhhHhhhhhcccCCccceEEEecccccccC Confidence 99999999999999999999999999999999999999999999 No 2 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=100.00 E-value=2.1e-152 Score=852.11 Aligned_cols=358 Identities=27% Similarity=0.427 Sum_probs=337.4 Q ss_pred Cc--eeecccCCchHHHHHHHHHHHHHHhhcccccceee----------------cCCCccEEEEeecCCCCCceEEEEE Q lcl|NC_019917. 1 MT--TTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIG----------------TSENAVIQRKTELESDAGDTISFDL 62 (364) Q Consensus 1 Ma--~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G----------------~g~~~~I~~~~dL~k~~Gd~v~f~L 62 (364) |+ +|+|++|||+++++||+.||+++.|+++|.+||+| ++.++|||+++||+|++||+|+|+| T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L 80 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFHF 80 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEeE Confidence 65 99999999999999999999999999999999999 6778899999999999999999999 Q ss_pred eeccccCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019917. 
63 SVHLRGKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGA 142 (364) Q Consensus 63 ~~~L~G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga 142 (364) ++||+|+||+||++||||||+|+|++|+|+|||.||||++||+|+||||+||||++||++|++||++++|||+|+||||+ T Consensus 81 ~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGa 160 (430) T protein:vir:10 81 VQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAGA 160 (430) T ss_pred eeccccCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccc------ccccccccccccccCcccCCCCCcEEeeccc-cc-------hhhhhhcccccHHHHHHHHHHHHhcccCC Q lcl|NC_019917. 143 RGIN------LDFVETPDFTGFAGNPLEAPDVDHLLYGGVA-TS-------KASLAATDIMAPIVIERAVEKAAMMQAEN 208 (364) Q Consensus 143 ~g~~------~~~~~~~~~~~~~~N~~~apt~~r~~~~~~a-t~-------~~~i~~~D~~s~~~i~~a~~~a~~~~~~~ 208 (364) ||.+ +|+..+|+|+.+++|+|+|||+||||++++. ++ +.+|+++|+||+++||+|+++|+++ T Consensus 161 rg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~---- 236 (430) T protein:vir:10 161 RGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQI---- 236 (430) T ss_pred hcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhh---- Confidence 9955 5678899999999999999999999997764 33 5679999999999999999999997 Q ss_pred CCCcceeeeEecCce------eEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccc Q lcl|NC_019917. 
209 PETANMVPVSIDGDD------HYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVI 282 (364) Q Consensus 209 ~~~~~i~Pv~~~g~~------~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~ 282 (364) .+||+||+++|++ +|||||||+|+++||++++..|+|+|+.|++++|++||||+|++||||||+||||+++| T Consensus 237 --~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~vi 314 (430) T protein:vir:10 237 --ELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPI 314 (430) T ss_pred --CCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCcee Confidence 4789999999988 59999999999999999988766999999999999999999999999999999999999 Q ss_pred ccc-----cccC----------------CcccccchheeeccchheEeeecC--CCCCccceechhhccchhHHHHHHHH Q lcl|NC_019917. 283 RFN-----DYGA----------------GANVEAARALFMGRQAGVIAYGTA--NGLRFDWEETVKDYGNEPAICAGFIA 339 (364) Q Consensus 283 ~~~-----~~~~----------------~~~v~v~ralllGaqA~~~A~g~~--~g~~~~w~Ee~~D~g~~~~i~i~~i~ 339 (364) ||+ ++++ +++++|+|+|||||||+++|||++ +|+||+|+||.+||||++||++++|+ T Consensus 315 rf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~ 394 (430) T protein:vir:10 315 RFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAIL 394 (430) T ss_pred eecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHh Confidence 997 4444 345679999999999999999985 88999999999999999999999999 Q ss_pred hhhhcccCC--------cccEEEEEeeeeeccC Q lcl|NC_019917. 340 GMKKARFNS--------KDFGVISIDTAAKKHS 364 (364) Q Consensus 340 G~~K~rf~~--------~DfGvi~idta~~~~~ 364 (364) |+||+|||. +|||||+|||||++|. 
T Consensus 395 G~kK~rF~~~~~~~~~~~DfGvi~idtaa~~~~ 427 (430) T protein:vir:10 395 GCSKIRFAVEATNGLEYTDHGVMAIDTAVKIIG 427 (430) T ss_pred ccceeeecCCCCCCceeeeeEEEEhhhhhhhhc Confidence 999999985 6999999999999999 No 3 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=100.00 E-value=4e-145 Score=812.20 Aligned_cols=352 Identities=34% Similarity=0.502 Sum_probs=323.7 Q ss_pred ecccCCchHHHHHHHHHHHHHHhhcccccce-----------------eecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 5 VIPFGDPKAVKRWSADLAVDVRKKSYFEQRF-----------------IGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 5 ~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~-----------------~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) -+++++|+|+++|+..||++..+.+|+.+++ +|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 3456777777777777777777766665444 57899999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|NC_019917. 
68 GKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARG--- 144 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g--- 144 (364) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+|| T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ---ccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecC Q lcl|NC_019917. 145 ---INLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDG 221 (364) Q Consensus 145 ---~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g 221 (364) +.+|...+|.|++++.|+|+|||++|||+++++|++++|+++|+||+++||+++++++++. +||+||+++| T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~------~pi~Pv~~~g 234 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA------HPLQPVRLSG 234 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC------CCCcceEecc Confidence 3367889999999999999999999999999999999999999999999999999998884 5699999999 Q ss_pred ce------eEEEEEechhHHHHhhcCC-HHHHHHHHHh-hhhhccCCCeeecCeEEEcCEEEEecCcc-cccccc----- Q lcl|NC_019917. 
222 DD------HYVVVMSEYQATDMRTAAG-GTWIDFQKAA-AAAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDY----- 287 (364) Q Consensus 222 ~~------~yV~~l~p~q~~~Lr~~~d-~~w~~~qk~A-~~~~g~~nPlF~G~~g~~ngvii~e~~~~-~~~~~~----- 287 (364) ++ +|||||||+|+++||++++ ++|+|+||+| ++++|++||||+|++||||||+||||+++ |||+.+ T Consensus 235 ~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred ccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeee Confidence 88 7999999999999999886 6799999976 55789999999999999999999999976 888532 Q ss_pred -----cC-----CcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccE Q lcl|NC_019917. 288 -----GA-----GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFG 352 (364) Q Consensus 288 -----~~-----~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfG 352 (364) ++ ++..+|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++ +||| T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEE Confidence 11 23457899999999999999999999999999999999999999999999999999996 5999 Q ss_pred EEEEeeeeec Q lcl|NC_019917. 
353 VISIDTAAKK 362 (364) Q Consensus 353 vi~idta~~~ 362 (364) ||+|||||++ T Consensus 395 vi~idta~~~ 404 (404) T protein:vir:10 395 VIAVDTAVKL 404 (404) T ss_pred EEEecccccC Confidence 9999999999 No 4 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=100.00 E-value=4e-145 Score=812.20 Aligned_cols=352 Identities=34% Similarity=0.502 Sum_probs=323.7 Q ss_pred ecccCCchHHHHHHHHHHHHHHhhcccccce-----------------eecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 5 VIPFGDPKAVKRWSADLAVDVRKKSYFEQRF-----------------IGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 5 ~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~-----------------~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) -+++++|+|+++|+..||++..+.+|+.+++ +|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 3456777777777777777777766665444 57899999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|NC_019917. 
68 GKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARG--- 144 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g--- 144 (364) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+|| T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ---ccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecC Q lcl|NC_019917. 145 ---INLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDG 221 (364) Q Consensus 145 ---~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g 221 (364) +.+|...+|.|++++.|+|+|||++|||+++++|++++|+++|+||+++||+++++++++. +||+||+++| T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~------~pi~Pv~~~g 234 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA------HPLQPVRLSG 234 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC------CCCcceEecc Confidence 3367889999999999999999999999999999999999999999999999999998884 5699999999 Q ss_pred ce------eEEEEEechhHHHHhhcCC-HHHHHHHHHh-hhhhccCCCeeecCeEEEcCEEEEecCcc-cccccc----- Q lcl|NC_019917. 
222 DD------HYVVVMSEYQATDMRTAAG-GTWIDFQKAA-AAAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDY----- 287 (364) Q Consensus 222 ~~------~yV~~l~p~q~~~Lr~~~d-~~w~~~qk~A-~~~~g~~nPlF~G~~g~~ngvii~e~~~~-~~~~~~----- 287 (364) ++ +|||||||+|+++||++++ ++|+|+||+| ++++|++||||+|++||||||+||||+++ |||+.+ T Consensus 235 ~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred ccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeee Confidence 88 7999999999999999886 6799999976 55789999999999999999999999976 888532 Q ss_pred -----cC-----CcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccE Q lcl|NC_019917. 288 -----GA-----GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFG 352 (364) Q Consensus 288 -----~~-----~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfG 352 (364) ++ ++..+|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++ +||| T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEE Confidence 11 23457899999999999999999999999999999999999999999999999999996 5999 Q ss_pred EEEEeeeeec Q lcl|NC_019917. 
353 VISIDTAAKK 362 (364) Q Consensus 353 vi~idta~~~ 362 (364) ||+|||||++ T Consensus 395 vi~idta~~~ 404 (404) T protein:vir:10 395 VIAVDTAVKL 404 (404) T ss_pred EEEecccccC Confidence 9999999999 No 5 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=100.00 E-value=4e-145 Score=812.20 Aligned_cols=352 Identities=34% Similarity=0.502 Sum_probs=323.7 Q ss_pred ecccCCchHHHHHHHHHHHHHHhhcccccce-----------------eecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 5 VIPFGDPKAVKRWSADLAVDVRKKSYFEQRF-----------------IGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 5 ~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~-----------------~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) -+++++|+|+++|+..||++..+.+|+.+++ +|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 3456777777777777777777766665444 57899999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|NC_019917. 
68 GKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARG--- 144 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g--- 144 (364) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+|| T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:81 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ---ccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecC Q lcl|NC_019917. 145 ---INLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDG 221 (364) Q Consensus 145 ---~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g 221 (364) +.+|...+|.|++++.|+|+|||++|||+++++|++++|+++|+||+++||+++++++++. +||+||+++| T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~------~pi~Pv~~~g 234 (404) T protein:vir:81 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA------HPLQPVRLSG 234 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC------CCCcceEecc Confidence 3367889999999999999999999999999999999999999999999999999998884 5699999999 Q ss_pred ce------eEEEEEechhHHHHhhcCC-HHHHHHHHHh-hhhhccCCCeeecCeEEEcCEEEEecCcc-cccccc----- Q lcl|NC_019917. 
222 DD------HYVVVMSEYQATDMRTAAG-GTWIDFQKAA-AAAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDY----- 287 (364) Q Consensus 222 ~~------~yV~~l~p~q~~~Lr~~~d-~~w~~~qk~A-~~~~g~~nPlF~G~~g~~ngvii~e~~~~-~~~~~~----- 287 (364) ++ +|||||||+|+++||++++ ++|+|+||+| ++++|++||||+|++||||||+||||+++ |||+.+ T Consensus 235 ~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:81 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred ccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeee Confidence 88 7999999999999999886 6799999976 55789999999999999999999999976 888532 Q ss_pred -----cC-----CcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccE Q lcl|NC_019917. 288 -----GA-----GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFG 352 (364) Q Consensus 288 -----~~-----~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfG 352 (364) ++ ++..+|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++ +||| T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:81 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEE Confidence 11 23457899999999999999999999999999999999999999999999999999996 5999 Q ss_pred EEEEeeeeec Q lcl|NC_019917. 
353 VISIDTAAKK 362 (364) Q Consensus 353 vi~idta~~~ 362 (364) ||+|||||++ T Consensus 395 vi~idta~~~ 404 (404) T protein:vir:81 395 VIAVDTAVKL 404 (404) T ss_pred EEEecccccC Confidence 9999999999 No 6 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=100.00 E-value=4e-145 Score=812.20 Aligned_cols=352 Identities=34% Similarity=0.502 Sum_probs=323.7 Q ss_pred ecccCCchHHHHHHHHHHHHHHhhcccccce-----------------eecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 5 VIPFGDPKAVKRWSADLAVDVRKKSYFEQRF-----------------IGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 5 ~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~-----------------~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) -+++++|+|+++|+..||++..+.+|+.+++ +|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 3456777777777777777777766665444 57899999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|NC_019917. 
68 GKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARG--- 144 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g--- 144 (364) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+|| T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:32 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ---ccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecC Q lcl|NC_019917. 145 ---INLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDG 221 (364) Q Consensus 145 ---~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g 221 (364) +.+|...+|.|++++.|+|+|||++|||+++++|++++|+++|+||+++||+++++++++. +||+||+++| T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~------~pi~Pv~~~g 234 (404) T protein:vir:32 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA------HPLQPVRLSG 234 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC------CCCcceEecc Confidence 3367889999999999999999999999999999999999999999999999999998884 5699999999 Q ss_pred ce------eEEEEEechhHHHHhhcCC-HHHHHHHHHh-hhhhccCCCeeecCeEEEcCEEEEecCcc-cccccc----- Q lcl|NC_019917. 
222 DD------HYVVVMSEYQATDMRTAAG-GTWIDFQKAA-AAAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDY----- 287 (364) Q Consensus 222 ~~------~yV~~l~p~q~~~Lr~~~d-~~w~~~qk~A-~~~~g~~nPlF~G~~g~~ngvii~e~~~~-~~~~~~----- 287 (364) ++ +|||||||+|+++||++++ ++|+|+||+| ++++|++||||+|++||||||+||||+++ |||+.+ T Consensus 235 ~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:32 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred ccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeee Confidence 88 7999999999999999886 6799999976 55789999999999999999999999976 888532 Q ss_pred -----cC-----CcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccE Q lcl|NC_019917. 288 -----GA-----GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFG 352 (364) Q Consensus 288 -----~~-----~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfG 352 (364) ++ ++..+|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++ +||| T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:32 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEE Confidence 11 23457899999999999999999999999999999999999999999999999999996 5999 Q ss_pred EEEEeeeeec Q lcl|NC_019917. 
353 VISIDTAAKK 362 (364) Q Consensus 353 vi~idta~~~ 362 (364) ||+|||||++ T Consensus 395 vi~idta~~~ 404 (404) T protein:vir:32 395 VIAVDTAVKL 404 (404) T ss_pred EEEecccccC Confidence 9999999999 No 7 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=100.00 E-value=4e-112 Score=631.31 Aligned_cols=288 Identities=32% Similarity=0.444 Sum_probs=263.6 Q ss_pred Cc---------------eeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec Q lcl|NC_019917. 1 MT---------------TTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH 65 (364) Q Consensus 1 Ma---------------~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~ 65 (364) |+ .|.+..++| ++++|+++|++++++.++|. +|+|+|+++|||+++||+|++||+|+|+|++| T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~-~vk~ws~~l~~~~~~~~~~~-~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~ 78 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKK-STKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCCh-HHHHHHHhhhhHHHhhhhhh-cccCCCCCceEEEeccCCCCCccEEEEeEeec Confidence 32 344444444 78899999999999977765 69999999999999999999999999999999 Q ss_pred cccCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_019917. 66 LRGKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGI 145 (364) Q Consensus 66 L~G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~ 145 (364) |+|+||+||+++|||||+|+|++|+|+|||.||||++||+|+||||+||||++||++|++||++++||++|+||||+||. 
T Consensus 79 L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~ 158 (318) T protein:vir:27 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) T ss_pred cccCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999993 Q ss_pred ------cccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEe Q lcl|NC_019917. 146 ------NLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSI 219 (364) Q Consensus 146 ------~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~ 219 (364) .+|+.++|.|+++++|+++|||++|||++|++|++++|+++|+||+++||+++.+++++. +||+||++ T Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a------~pi~PV~v 232 (318) T protein:vir:27 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA------HPLQPVRL 232 (318) T ss_pred cccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC------CCCcceee Confidence 367889999999999999999999999999999999999999999999999999998773 66999999 Q ss_pred cCce------eEEEEEechhHHHHhhcCC-HHHHHHHHHh-hhhhccCCCeeecCeEEEcCEEEEecCcc-cccccccCC Q lcl|NC_019917. 220 DGDD------HYVVVMSEYQATDMRTAAG-GTWIDFQKAA-AAAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDYGAG 290 (364) Q Consensus 220 ~g~~------~yV~~l~p~q~~~Lr~~~d-~~w~~~qk~A-~~~~g~~nPlF~G~~g~~ngvii~e~~~~-~~~~~~~~~ 290 (364) +|++ +|||||||+|+++||+++. ++|+++||+| ++++|++||||+|++||||||+||||+++ |||+ ++ T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~---~G 309 (318) T protein:vir:27 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY---QG 309 (318) T ss_pred ccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEc---CC Confidence 9988 7999999999999999875 7899999976 35678999999999999999999999986 9998 45 Q ss_pred cccccchhe Q lcl|NC_019917. 
291 ANVEAARAL 299 (364) Q Consensus 291 ~~v~v~ral 299 (364) .+|.++|.- T Consensus 310 ~~v~~~~~~ 318 (318) T protein:vir:27 310 QRFWYQRIT 318 (318) T ss_pred CeeeeeecC Confidence 566666654 No 8 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.46 E-value=4.8e-15 Score=99.06 Aligned_cols=325 Identities=16% Similarity=0.199 Sum_probs=183.4 Q ss_pred CceeecccCCchHHH-HHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC--ceecCcee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVK-RWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK--PTYGDART 77 (364) Q Consensus 1 Ma~T~~~~~dp~a~~-~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~l 77 (364) --.|..++++|+... .|.++....+.+.-+|.+ |. ...++-|+.|-+|.|.-..+|.-. |....-.. T Consensus 11 ~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~-fA---------~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 11 QKSSIDGANSDQMQTFFWLKKAIITARKEQYFMP-LA---------SVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred ccccccccccceeeehhhHHHHHhhhhhhhhhhh-cc---------cccccccccCCeEEEEecccccccccchhcCCCc Confidence 335667888899885 788888777777766653 43 234556778888988866666431 21111111 Q ss_pred ec------chh------------hhhhcccEEEEecccce-eeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019917. 78 EG------TEE------------NLRFYTDQVKIDQVRHP-VSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIY 138 (364) Q Consensus 78 eG------nee------------~L~~~~~~v~Idq~R~~-V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 138 (364) +| |-= =+......=+||+..+. 
++..+++.|- =++++..-..+ + --.|..++-| T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qy---G~~~e~Td~~~-d---t~~D~~l~~h 153 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKF---GFFYEFTQESI-D---FDSDDGLMEH 153 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeec---cCccchhhhhh-h---hhcchHHHHH Confidence 21 100 01111222234433222 2222222220 00111000000 0 0012222222 Q ss_pred -----hcccccccccccccccccccccCcccCCCCCcEEeeccccchhh----hhhcccccHHHHHHHHHHHHhcccCCC Q lcl|NC_019917. 139 -----LSGARGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKAS----LAATDIMAPIVIERAVEKAAMMQAENP 209 (364) Q Consensus 139 -----l~ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~----i~~~D~~s~~~i~~a~~~a~~~~~~~~ 209 (364) |.|+.++.++.....-.+ ....++|++.+++.++ ..++..++++.|.++...+.-.+.+ . T Consensus 154 ~s~ell~g~~~~t~d~i~~dll~----------ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRap-k 222 (401) T protein:vir:95 154 LSRELMNGATQITEAVLQKDLLA----------AAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTP-T 222 (401) T ss_pred HHHHHhhhhhhhHHHHHHHHHHh----------hcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccc-c Confidence 344444433332111111 1246788887777764 4578889999999998877543321 1 Q ss_pred CCcceee-eEecC---ceeEEEEEechhHHHHhhc----CCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcc Q lcl|NC_019917. 210 ETANMVP-VSIDG---DDHYVVVMSEYQATDMRTA----AGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNV 281 (364) Q Consensus 210 ~~~~i~P-v~~~g---~~~yV~~l~p~q~~~Lr~~----~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~ 281 (364) ++..|.- .+++. ..+||.||||.-..+|+.. 
.+|.|.+.+|||.+ -++|.||+|.++|+.++..|.+ T Consensus 223 ~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~-----~~i~~gEiG~i~~vR~i~~p~~ 297 (401) T protein:vir:95 223 QTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADA-----GTIMNGEVGSIDKFRIIQVPEM 297 (401) T ss_pred chhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCc-----cccccccccccCceeEEecccc Confidence 1111111 11222 3689999999444444321 35789999999854 4699999999999999999999 Q ss_pred cccccccC----------------CcccccchheeeccchheEeeecCCCC--Cccce---------echhhccchhHHH Q lcl|NC_019917. 282 IRFNDYGA----------------GANVEAARALFMGRQAGVIAYGTANGL--RFDWE---------ETVKDYGNEPAIC 334 (364) Q Consensus 282 ~~~~~~~~----------------~~~v~v~ralllGaqA~~~A~g~~~g~--~~~w~---------Ee~~D~g~~~~i~ 334 (364) -.|.+-|+ +++.+|...|+||.+|.+..=-+.+|+ +|... +...-||..--++ T Consensus 298 ~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vg 377 (401) T protein:vir:95 298 LHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSS 377 (401) T ss_pred eeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhh Confidence 88865543 244568999999999876552222332 22221 2234467777899 Q ss_pred HHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 335 AGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 335 i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) +++.+|...++ +.|.+ .|-+++++ T Consensus 378 wK~~~a~~vL~---~e~m~-~ies~a~~ 401 (401) T protein:vir:95 378 IKWYYGILVKR---PERLA-LIKTVAPL 401 (401) T ss_pred hhhhhhhheec---cceeE-EEEeecCC Confidence 99999976654 44555 78899999 No 9 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.44 E-value=8.9e-15 Score=97.62 Aligned_cols=263 Identities=17% Similarity=0.164 Sum_probs=177.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC--ceecCceee Q lcl|NC_019917. 
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK--PTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~le 78 (364) ||.+.+-..|=..+..|+.-+-....++..|.+ + ....++|+-++|++|+|+....+ |+ -+..++.+. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~-~--------~~~~~~l~g~~G~ti~iP~~~~i-gda~~~~eg~~i~ 70 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQ-F--------ADIDSTLVGQPGDTLTFPAFVYS-GDATVVPEGQKIP 70 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcc-c--------ceecccccCCCCCEEEeeeecCC-CccccccCCCccC Confidence 996666777778889999999888877776644 1 12346788889999999988777 44 233444443 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.|+..++++.|.+.-.++.+.. .+...+.-|+..++-+.+..+|++..|+.++..|.++... T Consensus 71 --~~~lt~~~~~a~i~~~~k~~~~tD-~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~------------- 134 (276) T protein:vir:10 71 --VDKIETNRREAKIHKIGKGTDITD-EALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT------------- 134 (276) T ss_pred --ccccccceeeEEeehccccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------- Confidence 788999999999999888888764 4556667899999999999999999999998877653210 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .+++.++.+.|.+|....... 
..+..+++|||..+..|| T Consensus 135 -------------------------~~~~~~t~d~i~~A~~~lgd~----------------~~~~~~ivv~p~~~~~L~ 173 (276) T protein:vir:10 135 -------------------------VSADIGTLAGLEAAIDTFDDE----------------DLEPMVLFINPKDAGKLR 173 (276) T ss_pred -------------------------ccccccCHHHHHHHHHHhccc----------------cCcccEEEEcHHHHHHHH Confidence 023457788888887654211 124578999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++.+.+|.+. .+...+.+.+|.+|.+.|+.|+..++++. ..++|+|..|+.+. ...+. T Consensus 174 k~~~~~f~~~------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gAi~~~--~~~~~-- 231 (276) T protein:vir:10 174 SSASDNFTRA------TELGDNIIVKGAFGEALGAVIVRSKKLDE------------GEAILAKRGAVKLI--TKRDF-- 231 (276) T ss_pred Hhcccccccc------ccccccceeccccceecceeEEEcCCCCc------------ceEEEEeccceeee--ecCCc-- Confidence 8776777432 33446789999999999999999887642 34578888776544 32222 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeee-------eeccC Q lcl|NC_019917. 319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTA-------AKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta-------~~~~~ 364 (364) . .|...|-..+ .+.|.+. +=|||.+++-. 
+.-.+ T Consensus 232 ~-vE~dRd~~~~----~d~i~~~-------~~y~~~~~~~~~vv~~t~~~~~~ 272 (276) T protein:vir:10 232 F-LETDRDPSTK----TTALYSD-------KHYVAYLYDESKAVKVTKGAGTT 272 (276) T ss_pred e-eecccchhhc----ccEEEEe-------eEEEEEEEcCcceEEEecCCcCC Confidence 2 4543333322 2333321 12333333321 11111 No 10 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.42 E-value=1.4e-14 Score=96.49 Aligned_cols=265 Identities=13% Similarity=0.135 Sum_probs=170.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc--eecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP--TYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g--v~Gd~~le 78 (364) ||.|.+...|-..+..|+.-+-....++.-|.+ + .....+|+-++|++|+|+....+ |+. +..++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~-~--------~~~~~~l~g~~G~ti~iP~~~~~-gda~~~~eg~~i- 69 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAP-L--------AQVDTTLQGQPGNTLKFPAFTYI-GDAADVAEGGEI- 69 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhcc-c--------cccccccccCCCCEEEEeeeccC-ccccccCCCCcc- Confidence 998888888877789999988666655554543 1 12345677789999999987655 432 3333344 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) ..+.|+..++++.|.+...++.+.. ++...+.-|+..++...++.+|++..|..++..|.|+. 
T Consensus 70 -~~~~lt~~~~~~~i~~~~k~~~vtD-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~--------------- 132 (272) T protein:vir:36 70 -SLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS--------------- 132 (272) T ss_pred -ChhhcCCcceeEeeehhhccccccH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 4788999999999999989888864 45666788999999999999999999999998887631 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) +. .+-..+.+.|.+|....... . .+-.+++|||..+..|| T Consensus 133 --~~----------------------~~~~~~~d~i~~A~~~lgd~--~--------------~~~~~ivv~p~~~~~L~ 172 (272) T protein:vir:36 133 --QT----------------------VSTKANVDGVQAALDIFNDE--D--------------AQAYVLIVNPKDAAKIR 172 (272) T ss_pred --cc----------------------ccccccHHHHHHHHHHhhhc--C--------------CCceEEEEcHHHHHHHh Confidence 00 01134666777776543211 1 12357999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++. .|.... .....+++++|.+|.|.|+.|+...+++... .+.-.++.|..|++. +...+. T Consensus 173 k~~--~~~~~~-----~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~--------~~~~~~~~~~gA~~~--~~~~~~-- 233 (272) T protein:vir:36 173 KDA--NAKNIG-----SEVGANALINGTYADVLGAQIVRSKKLAEGS--------ALMFKIVSNSPALKL--VLKRGV-- 233 (272) T ss_pred ccc--cccccc-----ccccccceeeeccceecCeeEEEeCCCCCCc--------eeEEEEEecccceee--eecCCc-- Confidence 754 332222 2234578999999999999999998876321 123456666665543 332222 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEe-eeeeccC Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISID-TAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~id-ta~~~~~ 364 (364) . .|...|-.. ..+.|.+ .+=||+-+++ +++..=+ T Consensus 234 ~-vE~~R~~~~----~~d~i~~-------~~~y~~~v~~~~~vv~~t 268 (272) T protein:vir:36 234 Q-VETDRDIVT----KTTVITA-------DEHYAAYLYDLTKVVNIT 268 (272) T ss_pred c-cccccchhh----cCcEEEE-------EEEEEEEEEcCccEEEEe Confidence 2 343222221 1223333 1234554443 2211111 No 11 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.36 E-value=7.1e-14 Score=92.66 Aligned_cols=267 Identities=16% Similarity=0.175 Sum_probs=171.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC--ceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK--PTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~le 78 (364) ||.+++...|=..+..|+.-+-....++.-|.+ + .....+|+-++|++|+|+... +.|+ -+..++.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~-~--------~~~~~~l~g~~G~tv~ip~~~-~~g~~~~~~~g~~i~ 70 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQ-F--------ADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIP 70 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcc-c--------ccccccccCCCCCEEEEEeec-cCCCccccCCCCcCc Confidence 998888888888899999888666544433322 1 223456777899999999876 4333 233333443 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.+...++++.|++...++.+... 
+...+..|+..++...+..+|++..|..++..|.++.- T Consensus 71 --~~~it~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~-------------- 133 (274) T protein:vir:96 71 --VDQIGTSKREAKVRKIGKGTELTDE-AVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------- 133 (274) T ss_pred --hhhcccceeEEEEEeeeceeeecHH-HHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-------------- Confidence 7789999999999998888888654 45567889999999999999999999999988765310 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) -..++.++++.|.+|..++.... .+..+++|||.++..|+ T Consensus 134 ------------------------~~~~~~~~~d~i~dA~~~l~d~~----------------~~~~~ivv~p~~~~~L~ 173 (274) T protein:vir:96 134 ------------------------TVEADITKLDGLQTAIDKFNDED----------------LEPMVLFVNPLDAGGLR 173 (274) T ss_pred ------------------------CcCcccccHHHHHHHHHHhcccC----------------CCceEEEeCHHHHHHHH Confidence 01124567888888766542110 13457999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++...+|.. ....-++.+.+|.+|.|.|+.|+...+++. ..++|+|..|++.+.+. + . T Consensus 174 k~~~~~f~~------~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gA~~~~~~~--~--~ 231 (274) T protein:vir:96 174 TSASDNFTR------PTQLGDNIIVKGAFGEALGAVIVRSNKLNK------------GEALLAKKGAVKLITKR--D--F 231 (274) T ss_pred hcccccccc------cccccccceeecccceecCeeEEEcCCCCc------------ceEEEEeCcceeeeecC--C--c Confidence 865444432 122334678899999999999999888752 23688888876554322 2 2 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCC---cccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARFNS---KDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~---~DfGvi~idta~~~~~ 364 (364) . .|...|-.. ..+.|.|.. +|.- ..=+++++ |-+++.- T Consensus 232 ~-vE~~Rd~~~----~~d~i~~~~--~yg~~~~~~~~vv~~-t~~~~~~ 272 (274) T protein:vir:96 232 F-LEKDRDASR----KSTALYSDK--HYVAYLYDESKVVKI-TKGAGDE 272 (274) T ss_pred c-cccccchhh----cccEEEEee--EEEEEEEcCccEEEE-EcCcccc Confidence 2 343222221 123333321 1100 11133222 2222222 No 12 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.34 E-value=6.8e-14 Score=92.78 Aligned_cols=299 Identities=13% Similarity=0.070 Sum_probs=168.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.|.+.. ...++|+..+.....++..|.+ ++. ..+++-..||+|+|+-.....-.....+..+. T Consensus 15 ~~~t~~~~---fiPev~s~~v~~~l~~~lv~~~-l~~---------~~~~~~~~GdTV~ip~~g~~~a~d~~~g~~i~-- 79 (381) T protein:vir:80 15 VDLSNVQV---FIPEVWSSEVRMFRDQKFAALE-ATK---------KIPFEGKKGDLIHIPNISRAAVYDKQPQTPVN-- 79 (381) T ss_pred cchhhHHh---hhhHHHHHHHHHHHHHhhhhhh-ccc---------cccceeecCceEEeeccCcceeeeecCCCccc-- Confidence 66666644 3457999999888767666643 432 23455567999999865555443444444443 Q ss_pred hhhhhhcccEEEEecccc-eeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRH-PVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~-~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+++...+.+|.||+.+. ++.+ ..+++..+..|+|.+....+...+++..|+.++..+....... 
+ T Consensus 80 ~~~~~~~~~~itID~~~~~~~~I-dd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~---------~--- 146 (381) T protein:vir:80 80 LQARTDSEFTFTVTKYKESSFMI-EDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFP---------S--- 146 (381) T ss_pred ccccCCceEEEEEeeeeecceee-chHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c--- Confidence 456777888999999874 4555 4778889999999999999999999999999987776422100 0 Q ss_pred cCcccCCCCCcEEeeccccchhhhh-hcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLA-ATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~-~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ..+...++. . .+.+.....+ .+..++++.|..|.++..... .|. ++ .+++++|.++.+|+ T Consensus 147 ~~~~t~~~~----i-~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~--VP~-----------eg-R~lvv~P~~~~~Ll 207 (381) T protein:vir:80 147 QRIYSYDTT----L-GDGTVNAHLTGTPAPLTYAALLLAKQKLDEAD--VPQ-----------EG-RIVMVSPAQYIDLL 207 (381) T ss_pred ccccccccc----c-cccccccccccchhhHHHHHHHHHHHHHhhcC--CCc-----------CC-cEEEeCHHHHHHHh Confidence 011111111 0 0111112222 234567888888877665442 221 22 46788999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc----CCccccc----chheeeccchheEee Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG----AGANVEA----ARALFMGRQAGVIAY 310 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~----~~~~v~v----~ralllGaqA~~~A~ 310 (364) . +++|.+... +..+.|..|.+|+|.|+-|++.++++.....+ +++.... .-.-..|.+.. 
T Consensus 208 ~--~~~~~~ad~------~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~---- 275 (381) T protein:vir:80 208 S--INQFISVDF------SQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAG---- 275 (381) T ss_pred h--chhhhhhhh------ccchhhhceeeeEEcceEEEeecccccccccceeeecccccccccccccccccccccc---- Confidence 6 457654332 23457899999999999999999886532211 1010000 00111111100 Q ss_pred ecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcc-cEEEEEe-------eeeeccC Q lcl|NC_019917. 311 GTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKD-FGVISID-------TAAKKHS 364 (364) Q Consensus 311 g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~D-fGvi~id-------ta~~~~~ 364 (364) .+...+|. .+|+-+++....-+.-..++.+-..+ .++..+. +++.-|+ T Consensus 276 ---~a~av~~~---k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (381) T protein:vir:80 276 ---TANVVNTG---SASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWATAVVCHP 331 (381) T ss_pred ---ceeeeeee---eeeceeeeeeeccceeeecceeeecCCCceeeeehhhhhhhhhccccc Confidence 00111222 35555554444444333333332111 1111111 4445555 No 13 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.32 E-value=1.8e-13 Score=90.41 Aligned_cols=268 Identities=17% Similarity=0.146 Sum_probs=168.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-ceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-PTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-gv~Gd~~leG 79 (364) ||.|++...+-..++.|+..+-....+++.+.+ + ..+..+++..+|++|+++....+... .+..++.+. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~-~--------~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~- 70 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAP-L--------AEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP- 70 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhc-c--------ccccccccCCCCCEEEEEEecCCCCcccccCCCccc- Confidence 998888888889999999987666555544432 1 11223566779999999876544322 343334443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.+.+.+.++.|.+..+.+.+... ...++..|+.....+.|...|++..|..++-.|.|+. T Consensus 71 -~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~---------------- 132 (272) T protein:vir:98 71 -MTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST---------------- 132 (272) T ss_pred -ccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------- Confidence 6789999999999998888888655 4556788999999999999999999999998776631 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +.+ +...+++.|.+|...+... . .+-.+++|||..+..||. T Consensus 133 -~~~----------------------~~~~t~d~i~da~~~l~~~--~--------------~~~~~~vv~p~~~~~L~k 173 (272) T protein:vir:98 133 -QTV----------------------EATATVDGVSKALDIFNDE--D--------------DAETVIVMNPADASTLRL 173 (272) T ss_pred -ccc----------------------ccccCHHHHHHHHHHHhcc--C--------------CCccEEEEcHHHHHHHHH Confidence 110 1123466677765543211 1 123479999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 
240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +....|... .+...+.+.+|.+|.|.|+.++..+.++.. -++++|..|++++ ...+. . T Consensus 174 ~~~~~~~~~------~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~------------t~~~~~~~a~~~~--~~~~~--~ 231 (272) T protein:vir:98 174 DAAKEWLGA------TEVGANRVVSGVYGEVLGVQIVRSRKCPKG------------TAYMVRKGALRIM--LKRNT--M 231 (272) T ss_pred hcccccccc------ccccccccccccchhhcCeeEEEcCCCCcc------------eEEEEcCCeEEEE--ecCCc--e Confidence 654444221 223346788999999999999999887521 2567777665554 32222 2 Q ss_pred ceechhhccchhHHHHHHHHhhhhcc--c-CCcccEEEEEeeeeec Q lcl|NC_019917. 320 WEETVKDYGNEPAICAGFIAGMKKAR--F-NSKDFGVISIDTAAKK 362 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~r--f-~~~DfGvi~idta~~~ 362 (364) .|...|-.. ....|.+..--- + +.+-+=++.+..|++. T Consensus 232 -ve~~r~~~~----~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 232 -VETDRDITK----AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -eeecccccc----ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 343333322 123333311100 1 2233344444444444 No 14 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.32 E-value=1.8e-13 Score=90.41 Aligned_cols=268 Identities=17% Similarity=0.146 Sum_probs=168.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-ceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-PTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-gv~Gd~~leG 79 (364) ||.|++...+-..++.|+..+-....+++.+.+ + ..+..+++..+|++|+++....+... .+..++.+. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~-~--------~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~- 70 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAP-L--------AEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP- 70 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhc-c--------ccccccccCCCCCEEEEEEecCCCCcccccCCCccc- Confidence 998888888889999999987666555544432 1 11223566779999999876544322 343334443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.+.+.+.++.|.+..+.+.+... ...++..|+.....+.|...|++..|..++-.|.|+. T Consensus 71 -~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~---------------- 132 (272) T protein:vir:30 71 -MTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST---------------- 132 (272) T ss_pred -ccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------- Confidence 6789999999999998888888655 4556788999999999999999999999998776631 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +.+ +...+++.|.+|...+... . .+-.+++|||..+..||. T Consensus 133 -~~~----------------------~~~~t~d~i~da~~~l~~~--~--------------~~~~~~vv~p~~~~~L~k 173 (272) T protein:vir:30 133 -QTV----------------------EATATVDGVSKALDIFNDE--D--------------DAETVIVMNPADASTLRL 173 (272) T ss_pred -ccc----------------------ccccCHHHHHHHHHHHhcc--C--------------CCccEEEEcHHHHHHHHH Confidence 110 1123466677765543211 1 123479999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 
240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +....|... .+...+.+.+|.+|.|.|+.++..+.++.. -++++|..|++++ ...+. . T Consensus 174 ~~~~~~~~~------~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~------------t~~~~~~~a~~~~--~~~~~--~ 231 (272) T protein:vir:30 174 DAAKEWLGA------TEVGANRVVSGVYGEVLGVQIVRSRKCPKG------------TAYMVRKGALRIM--LKRNT--M 231 (272) T ss_pred hcccccccc------ccccccccccccchhhcCeeEEEcCCCCcc------------eEEEEcCCeEEEE--ecCCc--e Confidence 654444221 223346788999999999999999887521 2567777665554 32222 2 Q ss_pred ceechhhccchhHHHHHHHHhhhhcc--c-CCcccEEEEEeeeeec Q lcl|NC_019917. 320 WEETVKDYGNEPAICAGFIAGMKKAR--F-NSKDFGVISIDTAAKK 362 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~r--f-~~~DfGvi~idta~~~ 362 (364) .|...|-.. ....|.+..--- + +.+-+=++.+..|++. T Consensus 232 -ve~~r~~~~----~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 232 -VETDRDITK----AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -eeecccccc----ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 343333322 123333311100 1 2233344444444444 No 15 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.30 E-value=2e-13 Score=90.18 Aligned_cols=271 Identities=15% Similarity=0.146 Sum_probs=172.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-ceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-PTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-gv~Gd~~leG 79 (364) ||.+.+...|=..+..|+..+-....++.-|.+ + ..+..+|+-++|++|+|+....+... -+..++.+. 
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~-~--------~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~- 70 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFAS-F--------AEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP- 70 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcc-c--------ccccccccCCCCCEEEEEeeccCCCcccccCCCccc- Confidence 999999999888889999999777766655543 2 12335677789999999987655322 233333443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.+...++++.|++...++.+... +...+..|+...+...+...|++..|..++..|.++.- T Consensus 71 -~~~it~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~--------------- 133 (274) T protein:vir:93 71 -TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------- 133 (274) T ss_pred -ccccccceeEEEeeeecccccccHH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 7889999999999998888887654 45556789999999999999999999999988765310 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + ..++.++.+.|.+|..++... ..+-.+++|||..+..|++ T Consensus 134 -------~----------------~~~~~~~~d~i~dA~~~l~d~----------------~~~~~~ivv~p~~~~~L~k 174 (274) T protein:vir:93 134 -------T----------------VNADITKLNGLQSAIDKFNDE----------------DLEPMVLFINPLDAGKLRG 174 (274) T ss_pred -------c----------------ccccccCHHHHHHHHHHhhhc----------------cCCccEEEeCHHHHHHHHh Confidence 0 013456788888886654211 1234579999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 
240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +...+|. + ....-.+.+.+|.+|.|.|+.|+..++++. .-++|+|..|++.+... + .. T Consensus 175 ~~~~~f~---~---~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gai~~~~~~--~--~~ 232 (274) T protein:vir:93 175 DASTNFT---R---ATELGDDIIVKGAFGEALGAIIVRTNKLEA------------GTAILAKKGAVKLILKR--D--FF 232 (274) T ss_pred hhhhccc---c---cccccccceeecccceecCeeEEEcCCCCc------------ceEEEEeCCeEEEEecC--C--cc Confidence 5433442 1 123334689999999999999999987752 23577887776544222 2 22 Q ss_pred ceechhhccchhHHHHHHHHhhhhcccC-CcccEEEEE-eeeeeccC Q lcl|NC_019917. 320 WEETVKDYGNEPAICAGFIAGMKKARFN-SKDFGVISI-DTAAKKHS 364 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-~~DfGvi~i-dta~~~~~ 364 (364) .|...|-..+ .+.|.|..---+. -.+=+++++ ..++++.- T Consensus 233 -vE~~Rd~~~~----~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 233 -LEVARDASTK----TTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred -cccccchhhc----ccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 4444332221 2223221100000 011122221 11122222 No 16 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.27 E-value=5.3e-13 Score=87.87 Aligned_cols=263 Identities=16% Similarity=0.160 Sum_probs=173.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc--eecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP--TYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g--v~Gd~~le 78 (364) ||-+.+...|=..++.|+..+-....++..|.+ +..+..+|+-++|++|+|+....+ |+. +..++.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~---------~~~~d~~l~g~~G~tv~iP~~~~~-g~a~~~~~g~~i~ 70 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFAS---------FAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIP 70 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcc---------cceecccccCCCCCEEEEeeecCC-CccccccCCCccc Confidence 999888888888899999999766655444432 222345677779999999987654 332 33333443 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.|...++++.|++.-+++.+... +...+.-|+..++.+.+..+|++..|+.++..|.++.. T Consensus 71 --~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-------------- 133 (274) T protein:vir:94 71 --TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------- 133 (274) T ss_pred --ccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-------------- Confidence 7789999999999998888888654 55557789999999999999999999999988765310 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) . ..++.++++.|.+|...+... ..+-.+++|||..+..|+ T Consensus 134 ---~---------------------~~~~~~~~d~i~dA~~~l~d~----------------~~~~~~ivv~p~~~~~L~ 173 (274) T protein:vir:94 134 ---T---------------------VNADITKLNGLQSAIDKFNDE----------------DLEPMVLFVNPLDAGKLR 173 (274) T ss_pred ---c---------------------ccccccCHHHHHHHHHHhhcc----------------CCCceEEEeCHHHHHHHH Confidence 0 012456788888887654211 123468999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 
239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++...+|.. .+....+.+.+|.+|.|.|+.|+..++++. ..++|+|..|+... ...+ . T Consensus 174 k~~~~~f~~------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gA~~~~--~~~~--~ 231 (274) T protein:vir:94 174 GDASTNFTR------ATELGDDIIVKGAFGEALGAIIVRTNKLEA------------GTAILAKKGAVKLI--LKRD--F 231 (274) T ss_pred hhhhhhccc------cCcccccceeccccceecCeeEEEcCCCCc------------ceEEEEeCcceEee--ecCC--c Confidence 754334432 233345789999999999999999987752 33678888776543 3222 2 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee---------eeeccC Q lcl|NC_019917. 319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT---------AAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt---------a~~~~~ 364 (364) . .|...|-..+ .+.|.+. +=|||-+++- .+++.- T Consensus 232 ~-vE~~Rd~~~~----~d~i~~~-------~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 232 F-LEVARDASTK----TTALYSD-------KHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred e-eccccchhhc----ccEEEEE-------EEEEEEEEcCCceEEEecCcccccC Confidence 2 4544332221 2222221 2234433332 222222 No 17 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.27 E-value=5.3e-13 Score=87.87 Aligned_cols=263 Identities=16% Similarity=0.160 Sum_probs=173.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc--eecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP--TYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g--v~Gd~~le 78 (364) ||-+.+...|=..++.|+..+-....++..|.+ +..+..+|+-++|++|+|+....+ |+. +..++.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~---------~~~~d~~l~g~~G~tv~iP~~~~~-g~a~~~~~g~~i~ 70 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFAS---------FAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIP 70 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcc---------cceecccccCCCCCEEEEeeecCC-CccccccCCCccc Confidence 999888888888899999999766655444432 222345677779999999987654 332 33333443 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.|...++++.|++.-+++.+... +...+.-|+..++.+.+..+|++..|+.++..|.++.. T Consensus 71 --~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-------------- 133 (274) T protein:vir:97 71 --TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------- 133 (274) T ss_pred --ccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-------------- Confidence 7789999999999998888888654 55557789999999999999999999999988765310 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) . ..++.++++.|.+|...+... ..+-.+++|||..+..|+ T Consensus 134 ---~---------------------~~~~~~~~d~i~dA~~~l~d~----------------~~~~~~ivv~p~~~~~L~ 173 (274) T protein:vir:97 134 ---T---------------------VNADITKLNGLQSAIDKFNDE----------------DLEPMVLFVNPLDAGKLR 173 (274) T ss_pred ---c---------------------ccccccCHHHHHHHHHHhhcc----------------CCCceEEEeCHHHHHHHH Confidence 0 012456788888887654211 123468999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 
239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++...+|.. .+....+.+.+|.+|.|.|+.|+..++++. ..++|+|..|+... ...+ . T Consensus 174 k~~~~~f~~------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gA~~~~--~~~~--~ 231 (274) T protein:vir:97 174 GDASTNFTR------ATELGDDIIVKGAFGEALGAIIVRTNKLEA------------GTAILAKKGAVKLI--LKRD--F 231 (274) T ss_pred hhhhhhccc------cCcccccceeccccceecCeeEEEcCCCCc------------ceEEEEeCcceEee--ecCC--c Confidence 754334432 233345789999999999999999987752 33678888776543 3222 2 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee---------eeeccC Q lcl|NC_019917. 319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT---------AAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt---------a~~~~~ 364 (364) . .|...|-..+ .+.|.+. +=|||-+++- .+++.- T Consensus 232 ~-vE~~Rd~~~~----~d~i~~~-------~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 232 F-LEVARDASTK----TTALYSD-------KHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred e-eccccchhhc----ccEEEEE-------EEEEEEEEcCCceEEEecCcccccC Confidence 2 4544332221 2222221 2234433332 222222 No 18 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.27 E-value=6.8e-13 Score=87.28 Aligned_cols=264 Identities=18% Similarity=0.159 Sum_probs=173.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc-eecCceeec Q lcl|NC_019917. 
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP-TYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g-v~Gd~~leG 79 (364) ||...+-..|=..+..|+.-+-.+..++..|.+ + .....+|+-++||+|+|+....+...- +..++.++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~-~--------~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~- 70 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFAS-F--------AEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP- 70 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccc-c--------ceecccccCCCCCEEEeeeecCCCccccccCCCccc- Confidence 998778888888889999999877766655543 1 223456877899999999877553222 22233443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.|+..++++.|++.-+++.+.+ .+...+.-|+..++.+.++.+|++..|..++..|.++.. T Consensus 71 -~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~--------------- 133 (274) T protein:vir:95 71 -TDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL--------------- 133 (274) T ss_pred -hhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 678999999999999888888864 355556779999999999999999999999887765321 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) . .+++.++.+.|.+|..+.... 
..+-.+++|||..+..|++ T Consensus 134 --~---------------------~~~~~~~~d~i~~A~~~lgd~----------------~~~~~~ivv~p~~~~~L~k 174 (274) T protein:vir:95 134 --T---------------------VEADITKLTGLQTAIDKFNDE----------------DLEPMVLFISPLDAGKLRG 174 (274) T ss_pred --c---------------------ccccccCHHHHHHHHHHhccc----------------cccccEEEeCHHHHHHHHh Confidence 0 012456788888886654211 0234689999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +..-+|. ....+..+.+..|.+|.|.|+.|+....++ ...++|+|-.|++. +...+ .. T Consensus 175 ~~~~~f~------~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~------------~~t~~l~~~gA~~~--~~~~~--~~ 232 (274) T protein:vir:95 175 DATTNFT------RATELGDDVIVKGAFGEALGAVIVRSNKLE------------AGTAILAKKGAVKL--ITKRD--FF 232 (274) T ss_pred hcccccc------ccccccccceeccccceecCeEEEEeCCCC------------CceEEEEeccceee--eecCC--cc Confidence 5422342 234445688999999999999999987653 23467888777654 33222 22 Q ss_pred ceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee--eeec--cC Q lcl|NC_019917. 320 WEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT--AAKK--HS 364 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt--a~~~--~~ 364 (364) +|-..|-.. 
..+.|.+- +=||+-+++- .|++ -| T Consensus 233 -vE~~Rd~~~----~~d~i~~~-------~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 233 -LETDRDPST----KTTALYSD-------KHYVAYLYDESKAVKITKGS 269 (274) T ss_pred -ccccccccc----ccCEEEEe-------EEEEEEEEcCCcEEEEEcCC Confidence 454433332 22222221 2244444332 1111 11 No 19 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.27 E-value=6.8e-13 Score=87.28 Aligned_cols=264 Identities=18% Similarity=0.159 Sum_probs=173.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc-eecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP-TYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g-v~Gd~~leG 79 (364) ||...+-..|=..+..|+.-+-.+..++..|.+ + .....+|+-++||+|+|+....+...- +..++.++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~-~--------~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~- 70 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFAS-F--------AEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP- 70 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccc-c--------ceecccccCCCCCEEEeeeecCCCccccccCCCccc- Confidence 998778888888889999999877766655543 1 223456877899999999877553222 22233443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.|+..++++.|++.-+++.+.+ .+...+.-|+..++.+.++.+|++..|..++..|.++.. 
T Consensus 71 -~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~--------------- 133 (274) T protein:vir:96 71 -TDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL--------------- 133 (274) T ss_pred -hhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 678999999999999888888864 355556779999999999999999999999887765321 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) . .+++.++.+.|.+|..+.... ..+-.+++|||..+..|++ T Consensus 134 --~---------------------~~~~~~~~d~i~~A~~~lgd~----------------~~~~~~ivv~p~~~~~L~k 174 (274) T protein:vir:96 134 --T---------------------VEADITKLTGLQTAIDKFNDE----------------DLEPMVLFISPLDAGKLRG 174 (274) T ss_pred --c---------------------ccccccCHHHHHHHHHHhccc----------------cccccEEEeCHHHHHHHHh Confidence 0 012456788888886654211 0234689999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +..-+|. ....+..+.+..|.+|.|.|+.|+....++ ...++|+|-.|++. +...+ .. T Consensus 175 ~~~~~f~------~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~------------~~t~~l~~~gA~~~--~~~~~--~~ 232 (274) T protein:vir:96 175 DATTNFT------RATELGDDVIVKGAFGEALGAVIVRSNKLE------------AGTAILAKKGAVKL--ITKRD--FF 232 (274) T ss_pred hcccccc------ccccccccceeccccceecCeEEEEeCCCC------------CceEEEEeccceee--eecCC--cc Confidence 5422342 234445688999999999999999987653 23467888777654 33222 22 Q ss_pred ceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee--eeec--cC Q lcl|NC_019917. 
320 WEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT--AAKK--HS 364 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt--a~~~--~~ 364 (364) +|-..|-.. ..+.|.+- +=||+-+++- .|++ -| T Consensus 233 -vE~~Rd~~~----~~d~i~~~-------~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 233 -LETDRDPST----KTTALYSD-------KHYVAYLYDESKAVKITKGS 269 (274) T ss_pred -ccccccccc----ccCEEEEe-------EEEEEEEEcCCcEEEEEcCC Confidence 454433332 22222221 2244444332 1111 11 No 20 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.26 E-value=6.8e-13 Score=87.27 Aligned_cols=264 Identities=15% Similarity=0.137 Sum_probs=171.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecccc-CceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRG-KPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G-~gv~Gd~~leG 79 (364) ||.+.+-..|-..+..|+.-+-....++..|.+ +..+..+|+-++||+|+|+....+.. .-+..++.++ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~---------~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~- 70 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFAS---------FAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP- 70 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcc---------cceecccccCCCCCEEEEeeecCCCccccccCCCccc- Confidence 998888888888889999999777655555543 12234567778999999998765421 1233334443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.|+..++++.|++.-.++.+.. .+..-+.-|+..++...+..+|++..|+.++..|.++.. 
T Consensus 71 -~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~--------------- 133 (274) T protein:vir:12 71 -TDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------- 133 (274) T ss_pred -hhhcccceeeEEeeeecceeeecH-HHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 778999999999999888888765 355556679999999999999999999998877765310 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) . .+++.++.+.|.+|..+.... ..+-.+++|||..+..|++ T Consensus 134 --~---------------------~~~~a~~~d~i~dA~~~lgd~----------------~~~~~~ivv~p~~~~~L~k 174 (274) T protein:vir:12 134 --T---------------------VNADITKLNGLQSAIDKFNDE----------------DLEPMVLFINPLDAGKLRG 174 (274) T ss_pred --c---------------------ccccccCHHHHHHHHHHhccc----------------cccccEEEeCHHHHHHHHh Confidence 0 012357788898887654211 0234679999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +...+|.. ++.+..+.+.+|.+|.|.|+.|+....++. .-++|+|..|++.. ...+ .. T Consensus 175 ~~~~~fv~------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gA~~~~--~~~~--~~ 232 (274) T protein:vir:12 175 DASTNFTR------ATELGDDIIVKGAFGEALGAIIVRSNKLEA------------GTAILAKKGAVKLI--LKRD--FF 232 (274) T ss_pred hhhhhccc------cccccccceecccceeecCeeEEEeCCCCc------------ceEEEEeccceeee--ecCC--ce Confidence 54334432 244445788999999999999999877652 23578887776543 3222 22 Q ss_pred ceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee-------eeeccC Q lcl|NC_019917. 
320 WEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT-------AAKKHS 364 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt-------a~~~~~ 364 (364) +|-..|-.. ..+.|.+- +=|||-+++- -+.+-- T Consensus 233 -vE~~Rd~~~----~~d~i~~~-------~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:12 233 -LEVARDAST----KTTALYSD-------KHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred -eccccchhh----cccEEEee-------eEEEEEEEcCCceEEEEcCCccc Confidence 454333332 12223221 2234333322 111111 No 21 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.25 E-value=2.9e-13 Score=89.32 Aligned_cols=264 Identities=16% Similarity=0.138 Sum_probs=169.4 Q ss_pred Cce-eecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-ceecCceee Q lcl|NC_019917. 1 MTT-TVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-PTYGDARTE 78 (364) Q Consensus 1 Ma~-T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-gv~Gd~~le 78 (364) ||- +.+...|=..+..|+.-+-....+...|.+ ......+|+-++|++|+|+....+... -+..++.+. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~---------~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~ 71 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQ---------FADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIP 71 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcc---------cceecccccCCCCCEEEeeeeccCCccccccCCCCcc Confidence 552 336666666789999999888776665543 122346788889999999987765322 222233343 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.|+..++++.|.+.-+++.+... +...+.-|+..++.+.++..|++..|+.++..|.++.. 
T Consensus 72 --~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~-------------- 134 (275) T protein:vir:96 72 --IDLIETKKRQATIRKIGKGTVLTDE-ALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL-------------- 134 (275) T ss_pred --hhhcccceeeEEeehhcccccccHH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------- Confidence 7789999999999999999888653 44455679999999999999999999998877765310 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) . .+++.++.+.|.+|..++... ..+-.+++|||..+..|| T Consensus 135 ---~---------------------~~~~~~~~d~i~dA~~~lgd~----------------~~~~~~ivv~p~~~~~L~ 174 (275) T protein:vir:96 135 ---K---------------------VEADITKLAGLQTAIDKFNDE----------------DLEPMVLFVNPLDAGKLR 174 (275) T ss_pred ---c---------------------ccccccCHHHHHHHHHHhccc----------------cCCccEEEeCHHHHHHHH Confidence 0 012457788888887654211 123468999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ++...+|.. +.....|.+..|.+|.|.|+.|+...+++. ..++|+|-.|+... ...+ . T Consensus 175 k~~~~~f~~------~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~i~~~gA~~~~--~~~~--~ 232 (275) T protein:vir:96 175 ASATDNFTR------ATLLGDNVIVKGAFGEALGAIIVRSNKIKE------------GEAILAKRGAVKLI--TKRD--F 232 (275) T ss_pred hcccccccc------cccccccceeccccceecCeeEEEeCCCCc------------ceEEEEeccceeee--ecCC--c Confidence 876555532 122334678899999999999999887642 23678887776543 3222 1 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEee-------eeeccC Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDT-------AAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idt-------a~~~~~ 364 (364) . .|-..|-..+ .+.|.+. +=||+-+++- .-++.- T Consensus 233 ~-vE~~Rd~~~~----~d~i~~~-------~~y~~~~~~~~~vv~~t~~~~~~ 273 (275) T protein:vir:96 233 F-LETERHASHK----STALFSD-------KHYVAYLYDESKVVKITKSASGL 273 (275) T ss_pred c-cccccchhhc----CcEEEEe-------EEEEEEEEcCccEEEEEeccccc Confidence 2 4443333221 2333221 1233333222 111111 No 22 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.24 E-value=4.9e-13 Score=88.08 Aligned_cols=257 Identities=16% Similarity=0.111 Sum_probs=166.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC--ceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK--PTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~le 78 (364) ||.|...-- ..+..|+.-+-.+..+++.|.+ + + ....+|.-++|++|+|+... +.|+ .+..++.++ T Consensus 1 Ma~T~~~d~--I~Pev~~~~V~e~~~~~~~~~~-~-~-------~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~ 68 (270) T protein:vir:95 1 MTQTKKANL--INPEVLANVVSAQMQNAIRFTP-Y-A-------VTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMD 68 (270) T ss_pred CCceehhhh--cchHHHHHHHHHHHHhHHhhcc-c-c-------ccccccCCCCCCEEEeeeec-CCCccccccCCCccc Confidence 999987642 4567888888777777766654 2 1 12356888899999999876 6554 344444554 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+.|+..++..+|-+...++.+... 
+..-+.-|...++...++.+|++..|..++-.|.|+..- T Consensus 69 --~~~lt~~~~~a~i~~~gk~~~itD~-a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~------------- 132 (270) T protein:vir:95 69 --TTQMSMTTTKVTVKETGKAVEVTQT-AIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQT------------- 132 (270) T ss_pred --hhhcccchheeeeehhhCcceecHH-HHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 7799999999999998888888654 444444599999999999999999999999888875310 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .+..++.+.+.+|.... +. +.+...+++|||..+..|| T Consensus 133 --------------------------~~~~~t~~~~~dA~~~l--gd--------------~~~~~~~i~vhs~~~~~Lr 170 (270) T protein:vir:95 133 --------------------------ATVSADATGILDAIEVF--NS--------------ENDEDYVLYVNPKDYNKLV 170 (270) T ss_pred --------------------------cccccCHHHHHHHHHHh--cc--------------ccCCCcEEEEcHHHHHHHH Confidence 11235666677765532 21 1134568999999999999 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEE-EecCcccccccccCCcccccchheeeccchheEeeecCCCCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVL-HKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLR 317 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii-~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~ 317 (364) ++. |.+ ..+.-.|.+.+|.+|.|.|+.+ .....+ ....++|.+..|+.+. ...+ T Consensus 171 k~~---~~~------~~~~~~~~~~~G~ig~~~G~~Viv~s~~~------------~~~~~~l~~~gAi~~~--~~~~-- 225 (270) T protein:vir:95 171 KSL---FKV------GGNVQDRAISKGDLVEIVGVSDIVKSKRV------------SENTAFLQRYGAMEIV--NKKK-- 225 (270) T ss_pred hhh---ccc------ccccccchhcccccceecceeEEEeCCCC------------CceeEEEEeccceeee--ecCC-- Confidence 754 322 2344557789999999999964 433221 1134678887776544 4222 Q ss_pred ccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEe-------eeeeccC Q lcl|NC_019917. 
318 FDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISID-------TAAKKHS 364 (364) Q Consensus 318 ~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~id-------ta~~~~~ 364 (364) .. .|...|-..+ ...|.+ .+-|||-.++ |..++-| T Consensus 226 ~~-vEtdRd~~~~----~d~i~~-------~~~y~v~~~~~skvv~~t~~~a~~ 267 (270) T protein:vir:95 226 PE-AYTDFDILKR----THLLST-------NYHYSVNLKDETGVVKVTFKPSGS 267 (270) T ss_pred ce-eeeccchhhc----ccEEEe-------eeEEEEEEEccceEEEEEecCCCC Confidence 22 5544333222 122222 2334554444 3334444 No 23 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.14 E-value=5.6e-12 Score=82.29 Aligned_cols=274 Identities=12% Similarity=0.063 Sum_probs=162.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-ceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-PTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-gv~Gd~~leG 79 (364) ||...+...|-..+..|+..+-....++.-|.+ + .....+|+-++||+|+|+....++.. -+..++.+. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~-~--------~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~- 70 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGK-I--------APIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAID- 70 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcc-c--------ceecccccCCCCCEEEEeeeccCCcceeecCCCcCc- Confidence 996555555667789999998666544433332 1 12345677778999999988766322 233334443 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.|...++++.|++.-.++.+.. .+..-+..|+..++...++.+|++..|..++-.|.|+... 
T Consensus 71 -~~~lt~~~~~~~i~~~~~a~~v~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~-------------- 134 (278) T protein:vir:80 71 -YSALETESVKHGIKKAGKGVKLTD-ESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE-------------- 134 (278) T ss_pred -ccccccceeeEeeehhhccccccH-HHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 678999999999999888887754 4566678899999999999999999999999998874210 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .-.+++. +..| -.++.+..+..+...... | ...++++||.++..|++ T Consensus 135 --~~~~~t~---------------~~~~-~~~~~~~da~~~l~~~~~--~-------------~~~~ivv~p~~~~~L~k 181 (278) T protein:vir:80 135 --VKGAINI---------------GLID-KIENTFTDAPDAIEDESI--T-------------TTGVLFLNYKDTAKLRE 181 (278) T ss_pred --ccccccc---------------chhh-hHHHHHHHHHHhhcccCC--C-------------cccEEEECHHHHHHHHh Confidence 0001110 0001 123444444332211110 1 12368999999999997 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) +...+|.. ....-.+.+.+|.+|.|.|+.|+...+++. ..+++++..|++ |+...+. . T Consensus 182 ~~~~~~~~------~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~------------~t~~l~~~gAi~--~~~~~~~--~ 239 (278) T protein:vir:80 182 EAAGSWTK------ASQLGDDLLVKGAFGELLGWEIVRTKKLAD------------GNALAVKAGALK--TFLKRNL--L 239 (278) T ss_pred hhhhhccc------cccccccceeeccceeecceeEEEcCCCCc------------ceEEEEecccee--eeecCCc--c Confidence 65434421 122223567789999999999999988852 235777777654 3332222 2 Q ss_pred ceechhhccchhHHHHHHHHhhhhcccCC---cccEEEEEeeeeecc Q lcl|NC_019917. 
320 WEETVKDYGNEPAICAGFIAGMKKARFNS---KDFGVISIDTAAKKH 363 (364) Q Consensus 320 w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~---~DfGvi~idta~~~~ 363 (364) .|...|-.. ..+.|.+.. .|.- .+-+++ +-|.++.. T Consensus 240 -vE~~Rd~~~----~~d~i~~~~--~yg~~v~~~~~~v-~it~~a~~ 278 (278) T protein:vir:80 240 -AESGRDMDH----KLTKFNADQ--HYAVALVDETKAV-KVVPVAGN 278 (278) T ss_pred -cccccchhh----ccceeeeee--EEEEEEEcCcceE-EEeeccCC Confidence 343222221 223333321 1100 122332 23444444 No 24 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.14 E-value=2.4e-11 Score=78.82 Aligned_cols=309 Identities=13% Similarity=0.040 Sum_probs=177.4 Q ss_pred Ccee------eccc---CC--chHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC Q lcl|NC_019917. 1 MTTT------VIPF---GD--PKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK 69 (364) Q Consensus 1 Ma~T------~~~~---~d--p~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~ 69 (364) |+.- +.+. ++ .+-.++|+.+++....+++-|.++ ..++ ++ ..|+++.|+-+...+-. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~-------~~~r---~i--~~G~s~~~~~iG~~~~~ 68 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASW-------MNVR---SL--RGTNQLRVDRVGASTIA 68 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcc-------ceee---ec--cccceEEEeeecceeee Confidence 7643 2222 22 223499999999999999888763 1222 33 35999999988877766 Q ss_pred ceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_019917. 70 PTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGIN 146 (364) Q Consensus 70 gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~ 146 (364) ..+=++.+.++ .+.-...+|.||+. |+.| ..+++-..++|+|.+.-.....=+++..||.++..|.-+.... 
T Consensus 69 ~~~~g~~l~~~--~~~~~~~~l~ID~~l~~~~~V---ddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~ 143 (334) T protein:vir:80 69 GRKAGEELVVQ--KNVSDKLNLTVDTVLYARHFF---DKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFL 143 (334) T ss_pred eecCCCCCCCC--CcccCceEEEEeeeeehhhhH---hhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 66667777654 57778889999995 4566 5688889999999999999999999999999988876322211 Q ss_pred ccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhccc--CCCCCcceeeeEecCcee Q lcl|NC_019917. 147 LDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQA--ENPETANMVPVSIDGDDH 224 (364) Q Consensus 147 ~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~--~~~~~~~i~Pv~~~g~~~ 224 (364) -|....+.|.. -+....-.++. +++..-+.+.|.+|+..|+.... .-|.. +-+- T Consensus 144 ~~~~~~~~~~~------------G~~~~~~~~g~---~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~---------~~~~ 199 (334) T protein:vir:80 144 APAHLKPAFHD------------GILLPSTISGL---AADAAADADVLVAAHRQGVEAMVFRDLGDQ---------LMSE 199 (334) T ss_pred ccccccccccC------------Ccceeeccccc---ccchhhhHHHHHHHHHHHHHHHHhcCCCCC---------cCCc Confidence 11111111111 00000000010 11122335555555555543321 11210 1133 Q ss_pred EEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccC--Ccccccc------ Q lcl|NC_019917. 225 YVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGA--GANVEAA------ 296 (364) Q Consensus 225 yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~--~~~v~v~------ 296 (364) -++++.|.|+..|..+ +++.+ +.. ...+..+++=.|.++.|+|+.|++.++++..+..+. ++..++. 
T Consensus 200 R~~vv~P~~y~~Ll~~--~r~~n--~d~-~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~ 274 (334) T protein:vir:80 200 GVTLLDPVIFSFLLEH--DRLMN--VEF-GAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRR 274 (334) T ss_pred eEEEeChHHHHHHhcc--ccccc--cee-ccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccc Confidence 6889999999999964 34321 111 123345778889999999999999999987643222 1111111 Q ss_pred -hheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 297 -RALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 297 -ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) -+++....|++.+-...--+..++.|+.+- .-|-....+|.+=+|= +--+||-++.--+ T Consensus 275 ~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~----d~i~~~~a~G~g~lRP--eaa~vv~~~~~~~ 334 (334) T protein:vir:80 275 KMITFIPSMALISAQVHPVSAQFWEEKKDFG----HYLDTFQSYNIGQRRP--DAVAVHDITVTNP 334 (334) T ss_pred eEEEEEeCceEEEEEEeecceeeeechhhHH----HHHHHHHHcCCceecc--ceEEEEEEeeecC Confidence 145666666665544321223333333211 1233334455444443 4445544444444 No 25 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.11 E-value=1e-11 Score=80.79 Aligned_cols=293 Identities=16% Similarity=0.140 Sum_probs=173.3 Q ss_pred Cceeecc-------cCC---chHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCc Q lcl|NC_019917. 1 MTTTVIP-------FGD---PKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKP 70 (364) Q Consensus 1 Ma~T~~~-------~~d---p~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~g 70 (364) |+..+.+ .+| .+-.++|+.+|.....+.|.|.+.. +..++. .|++|.|+-+...+-.. 
T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~----------~~r~i~--~G~tv~i~~ig~~~~~~ 74 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLV----------RSYDLR--GGKSKQFMFTGKLSAGY 74 (332) T ss_pred ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcc----------cccccc--ccceEEEEeccceeEee Confidence 7777776 444 2555899999999999998887632 112232 59999999888888777 Q ss_pred eecCceeecchhhhhhcccEEEEeccc---ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019917. 71 TYGDARTEGTEENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINL 147 (364) Q Consensus 71 v~Gd~~leGnee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~ 147 (364) .+.++.+.++ +.+.-...+|.||+.. +.| ..+++..+++|||.+.-.....=+++..|+.++..|..+...+. T Consensus 75 ~~~g~~l~~~-~~~~~~~~~l~ID~~ky~~~~V---ddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~ 150 (332) T protein:vir:78 75 HTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFV---YSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) T ss_pred ecCCCCCCCC-CCCCCceEEEEEehhhhhHHHH---HhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccC Confidence 7777788776 4578888899999954 555 46889999999999999999999999999999998875332221 Q ss_pred cccccccccccccCcccCCCCCcEEeeccccchhhhhhcccc----cHHHHHHHHHHHHhcccCCCCCcceeeeEecCce Q lcl|NC_019917. 148 DFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIM----APIVIERAVEKAAMMQAENPETANMVPVSIDGDD 223 (364) Q Consensus 148 ~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~----s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~ 223 (364) +....|... .+ .++++... -++.|.+|.+.+.... .|. ++ T Consensus 151 ~~~~~~g~~-------------~~----------~~~~~~~~~~~~~~~~i~~a~~~Lde~~--VP~-----------~g 194 (332) T protein:vir:78 151 PVTGEPGGF-------------HV----------NIGAGNTNDAQAIVDGFFEAAAVLDERS--APQ-----------EG 194 (332) T ss_pred ccccccccc-------------cc----------ccCCccccCHHHHHHHHHHHHHHHhhcC--CCc-----------cC Confidence 111111110 10 11111122 2344555544443222 121 23 Q ss_pred eEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecC-eEEEcCEEEEecCccccccc--ccCCc--------- Q lcl|NC_019917. 
224 HYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGG-LGMINNVVLHKHRNVIRFND--YGAGA--------- 291 (364) Q Consensus 224 ~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~-~g~~ngvii~e~~~~~~~~~--~~~~~--------- 291 (364) .++++.|.++..|....|+++.+ ... .+..-.+..|. ++.|+|+.|++.++++.... +..++ T Consensus 195 -R~~vv~P~~y~~Ll~~~d~~~~n--~~~---~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~ 268 (332) T protein:vir:78 195 -RVAVLSPRQYYSLISSVDTNILN--REI---GNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQ 268 (332) T ss_pred -CEEEeCHHHHHHHHhhcCceeee--eec---cccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccc Confidence 56679999999998655655311 111 12334566665 89999999999999975421 11110 Q ss_pred -ccccchheeeccchheEeeecCC---CCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeee Q lcl|NC_019917. 292 -NVEAARALFMGRQAGVIAYGTAN---GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTA 359 (364) Q Consensus 292 -~v~v~ralllGaqA~~~A~g~~~---g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta 359 (364) ...-.-+|++...|++.+-...- -++-+|.|+.+ ...|-....+|.+=+|= | ++++|=+| T Consensus 269 ~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~----~d~i~~~~~~G~~v~rP---e-~~v~l~~a 332 (332) T protein:vir:78 269 VDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ----GDLIVGKLAMGCGSLRT---S-VAGSFQAA 332 (332) T ss_pred cccccceEEeecccceeeeeeeccchhhhhcccchhhh----HhhhhhhhhhcCceecc---c-ceEEEeeC Confidence 11122367777777655533311 11224444332 12233333344211111 2 23333333 No 26 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.09 E-value=7e-12 Score=81.75 Aligned_cols=307 Identities=11% Similarity=0.042 Sum_probs=168.2 Q ss_pred Cceeeccc------------CCc--hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc Q lcl|NC_019917. 
1 MTTTVIPF------------GDP--KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL 66 (364) Q Consensus 1 Ma~T~~~~------------~dp--~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) ||.|..+- +|+ +-.++|+..++....+.|.|.+. + +..++. .|++|.|+-+... T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~-~---------~~~~~~--~G~sv~i~~ig~~ 68 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPR-H---------MLRSIA--SGKSAQFPVIGRT 68 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhc-c---------cccccc--ccceeEeeeccce Confidence 88665533 222 34588999999988888877652 1 222232 5999999999999 Q ss_pred ccCceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019917. 67 RGKPTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) Q Consensus 67 ~G~gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~ 143 (364) +....+..+.+.++.++......+|.||+. ++.| ..+++..+++|+|.+.-.....=+++..|+.++.+|.+++ T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~V---ddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:15 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLI---YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeeeccCCCCCCCCCCCccceEEEEechhhhhhHHh---hhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 988888888899998889999999999987 4556 5788889999999999999999999999999999998643 Q ss_pred cccccccccccccccccCcccCCCCCcEEee-ccccc--hhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEec Q lcl|NC_019917. 144 GINLDFVETPDFTGFAGNPLEAPDVDHLLYG-GVATS--KASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSID 220 (364) Q Consensus 144 g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~-~~at~--~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~ 220 (364) ...-.......+ |-..-+.-. ...++ .....+.+.+ ++.|..|.+++... ..|. 
T Consensus 146 ~~~~~~~~~~~~----------~g~~~~~~~~~~~~~~~~~~~~~~~~i-~d~~~~a~~~Lde~--~VP~---------- 202 (347) T protein:vir:15 146 NLPDASNENIEG----------LGKPTVLTLVKPTTGDLTDPVELGKAI-IAQLTIARASLTKN--YVPA---------- 202 (347) T ss_pred hccccccccccc----------cCccccccccccccccchhhhhHHHHH-HHHHHHHHHHHhhc--CCCc---------- Confidence 211000000000 000000000 00000 0001111111 44555554444322 2221 Q ss_pred CceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc--------CC-- Q lcl|NC_019917. 221 GDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG--------AG-- 290 (364) Q Consensus 221 g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~--------~~-- 290 (364) ++ .++++.|.++..|..+. ++. . + .-. ....+-+|.+|.++|+.|++.++++.+.... .. T Consensus 203 -~g-R~~vv~P~~y~~LL~~~--~~~---~-~-d~~-~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~ 272 (347) T protein:vir:15 203 -AD-RTFYTTPDNYSAILAAL--MPN---A-A-NYQ-ALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHA 272 (347) T ss_pred -cC-CEEEeCHHHHHHHhccc--ccc---c-c-ccc-ccccccceEEEEEeceEEEeccccccccccccccccccccccc Confidence 23 56899999999998643 321 1 1 111 2234678999999999999999987543210 00 Q ss_pred ----ccc--c----cchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeee Q lcl|NC_019917. 291 ----ANV--E----AARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAA 360 (364) Q Consensus 291 ----~~v--~----v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~ 360 (364) .+. . ..-+|++-..|++.+-...--+.-.|.++. ++ ..|-....+|.+=+| .+-=++|.+.--+ T Consensus 273 ~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~--~~--d~i~~~~~~G~~vlr--P~~av~~~~~~~~ 346 (347) T protein:vir:15 273 FPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANY--QA--DQIIAKYAMGHGGLR--PEAAGAIVLPKVS 346 (347) T ss_pred ccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchh--hh--hhhehhhhcCCceec--cccEEEEecCCCC Confidence 000 0 011345555544433221101111233321 11 111122222322222 1111222221111 Q ss_pred e Q lcl|NC_019917. 361 K 361 (364) Q Consensus 361 ~ 361 (364) . 
T Consensus 347 ~ 347 (347) T protein:vir:15 347 E 347 (347) T ss_pred C Confidence 1 No 27 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.08 E-value=1.7e-11 Score=79.57 Aligned_cols=271 Identities=17% Similarity=0.135 Sum_probs=153.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||.+++ ..++|+..+.....+++.|.+ |+-+ .-++....||+|+|+.....+-.-..+... ... T Consensus 1 MA~~~~------~pei~~~~v~~~~~~~lv~~~-l~~~--------~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~-~~~ 64 (273) T protein:vir:79 1 MAFNNF------IPELWSDMLLEEWTAQTVFAN-LVNR--------EYEGIASKGNVVHIAGVVAPTVKDYKAAGR-QTS 64 (273) T ss_pred Ccchhh------hHHHHHHHHHHHHHhhccchh-hhhc--------cccccccCCcEEEEeecCcccccccccCCC-ccC Confidence 998664 468999999988877776654 5422 123334579999999765544222221111 245 Q ss_pred hhhhhhcccEEEEecc-cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQV-RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~-R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+++...+.++.||+. ..++.+. .+++..+..||++..+. +..=+++..|+.++..++++.. 
T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~vD~~i~~~~~~a~~--------------- 127 (273) T protein:vir:79 65 ADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGT--------------- 127 (273) T ss_pred ccccccceEEEEEeeecccceeec-cHHHHhhcccHHHHHHH-HHHHHHHHHHHHHHHHHhhccc--------------- Confidence 7889999999999996 4678775 45667788899986655 5556889999988888775321 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .|+..+ .++.+ -.++.|..|..++.... .|. ++ .+++++|.++..|++ T Consensus 128 ~~~~~~----------------~~~~~--~~~~~i~~a~~~ld~~~--vP~-----------~~-R~lvv~p~~~~~Ll~ 175 (273) T protein:vir:79 128 ALTGSA----------------PSDAD--DAFDLIASALKELTKAN--VPN-----------VG-RVVVVNAEMAFWLRS 175 (273) T ss_pred cccccc----------------ccchh--hHHHHHHHHHHHhhhcc--CCc-----------cC-cEEEECHHHHHHHhh Confidence 011111 11111 12456766655443222 232 22 457899999999996 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecC-CCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA-NGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~-~g~~~ 318 (364) +. ..+.+.. ..+..++|-+|.+|.|.|+-|+++++++....+...+.. ..| +++++- .-... T Consensus 176 ~~-~~~~~~~-----~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~---------~~A--~~~a~~~~~~e~ 238 (273) T protein:vir:79 176 SG-SKLTSAD-----TSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH---------PSA--AAYVSQIDTVEA 238 (273) T ss_pred ch-hhhhhhh-----hcccccceeeeEeeEEeceEEEecccccccCceEEEEEe---------ccc--eeeeeehhhhhc Confidence 42 2233222 234567889999999999999999998755432221111 111 122220 00111 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) .+.|+. ++.. |-....+|.+=+| .=||+++-...+ T Consensus 239 ~r~~~~--~~~~--v~~~~~yg~~v~~----p~~vv~~~~~g~ 273 (273) T protein:vir:79 239 LRDQDS--FSDR--IRALHVYGGKVVR----PTGVVVFNKTGS 273 (273) T ss_pred ccCccc--ceee--eeeeeeeeeEEec----CceEEEEeccCC Confidence 222221 1111 1112233332222 116655554444 No 28 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.06 E-value=1.5e-10 Score=74.40 Aligned_cols=301 Identities=13% Similarity=0.116 Sum_probs=167.3 Q ss_pred Cceeecc-------------cCCchH--HHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec Q lcl|NC_019917. 1 MTTTVIP-------------FGDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH 65 (364) Q Consensus 1 Ma~T~~~-------------~~dp~a--~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~ 65 (364) ||.|.++ ++|+.+ .++|+.+++....+.+-|.++. .+ .++. .|.++.|+-+.. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~-------~~---r~i~--~g~s~~~~~iG~ 68 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRH-------MV---RSIS--SGKSAQFPVLGR 68 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccc-------ee---eeec--ccceEEEEeece Confidence 8866433 344444 4999999999999998887631 12 2343 489999998888 Q ss_pred cccCceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019917. 66 LRGKPTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGA 142 (364) Q Consensus 66 L~G~gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga 142 (364) .+-...+-.+.+.|.-+++.-...+|.||+. 
|+.| ..+++..+.+|+|.+....+..=+++..|+.++.+|..+ T Consensus 69 ~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~V---dDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~ 145 (344) T protein:vir:10 69 TQAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGL 145 (344) T ss_pred eEEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhh---hhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 8777777777788887788888899999995 4666 478999999999999999999999999999999998753 Q ss_pred ccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhccccc----HHHHHHHHHHHHhcccCCCCCcceeeeE Q lcl|NC_019917. 143 RGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMA----PIVIERAVEKAAMMQAENPETANMVPVS 218 (364) Q Consensus 143 ~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s----~~~i~~a~~~a~~~~~~~~~~~~i~Pv~ 218 (364) ...--|-...|... ++. .+... +..+...++...+ ++.|..|.+.+.... .|. T Consensus 146 a~~~~~~~~~~~g~---------~~~--~~~~~--~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~--VP~-------- 202 (344) T protein:vir:10 146 CNVESQYNENITGL---------GTA--TVIET--TQDKTTLTDQVALGKEIIAALTKARAALTKNY--VPS-------- 202 (344) T ss_pred hccccccccccccc---------ccc--ceeec--ccccccccchhhhHHHHHHHHHHHHHHHhhcC--CCc-------- Confidence 22111111111100 000 01100 0011111111122 344555544443222 121 Q ss_pred ecCceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccC--------- Q lcl|NC_019917. 219 IDGDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGA--------- 289 (364) Q Consensus 219 ~~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~--------- 289 (364) ++ .++++.|.++..|+.+. . +.+.. -+..+.+-+|.+|.++|+.|++.++++.....+. 
T Consensus 203 ---~g-R~~vv~P~~y~~Ll~~~--~---~~~~~---~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~ 270 (344) T protein:vir:10 203 ---SD-RVFYCDPDSYSAILAAL--M---PNAAN---YAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKH 270 (344) T ss_pred ---cC-CEEEeChHHHHHHhhcc--c---ccccc---cccccceeeeEEEEEeceEEEeccccccccCCcccccccCccc Confidence 23 45779999999998543 2 22211 2334567789999999999999999875422110 Q ss_pred ------Ccccccc----hheeeccchheEeeecCCC--CCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEe Q lcl|NC_019917. 290 ------GANVEAA----RALFMGRQAGVIAYGTANG--LRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISID 357 (364) Q Consensus 290 ------~~~v~v~----ralllGaqA~~~A~g~~~g--~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~id 357 (364) +++..++ .+|+|=--|++.+ +... +..+|.|+. .+ ..|-....+|.+=+| .+-=|+|-+= T Consensus 271 ~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v--~~~~~~~e~~r~~~~--~~--d~i~g~~~~G~~vlR--Pe~a~~v~~~ 342 (344) T protein:vir:10 271 AFPATKSGNDKVAKDNVIGLFMHRSAVGTV--KLRDLALERARRANF--QA--DQIIAKYAMGHGGLR--PEAAGAVVFK 342 (344) T ss_pred cccCCcccceeeecceeEEEeechhhhhhh--hhccceeecccchhH--HH--HHHHHHhhcccceec--ccceEEEEee Confidence 1111111 1222222222111 1111 111222211 11 122233334433322 2223333333 Q ss_pred ee Q lcl|NC_019917. 358 TA 359 (364) Q Consensus 358 ta 359 (364) |. T Consensus 343 ~~ 344 (344) T protein:vir:10 343 TK 344 (344) T ss_pred cC Confidence 33 No 29 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.05 E-value=7.8e-11 Score=75.99 Aligned_cols=303 Identities=12% Similarity=0.053 Sum_probs=168.0 Q ss_pred Ccee--------ecccC----Cc--hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc Q lcl|NC_019917. 
1 MTTT--------VIPFG----DP--KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL 66 (364) Q Consensus 1 Ma~T--------~~~~~----dp--~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) ||-+ +.+.+ |+ +-.++|+.++.....+.|-|.++ + +..++ ..|+++.|+-+... T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~-~---------~~r~i--~~G~sv~~~~iG~~ 68 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDK-H---------MVRTI--QNGKSASFPVMGRT 68 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhc-c---------ccccc--cCcceEEEeeecce Confidence 7743 33332 43 44599999999988888877652 1 11223 25999999999998 Q ss_pred ccCceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019917. 67 RGKPTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) Q Consensus 67 ~G~gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~ 143 (364) +....+-.+.+.+..+++.....+|.||+. ++.| ..+++-...+|+|++.......=|++..|+.++.+|..+. T Consensus 69 ~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~V---dd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a 145 (347) T protein:vir:88 69 KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLI---YDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) T ss_pred eeeeeccccCCCCCCCCCccceEEEEEechhhhhhhh---hhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 888877778888888889999999999997 6677 4688889999999999999999999999999999987532 Q ss_pred cccccccccccccccccCcccCCCCCcEEeeccccchhhhhhccc---ccHHHHHHHHHHHHhcccCCCCCcceeeeEec Q lcl|NC_019917. 144 GINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDI---MAPIVIERAVEKAAMMQAENPETANMVPVSID 220 (364) Q Consensus 144 g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~---~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~ 220 (364) .... . .++..++-..-...+.+..++ +..... --++.|.+|.+..... 
..| T Consensus 146 ~~~~--~---------~~~~~~g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~~~Lde~--~VP----------- 199 (347) T protein:vir:88 146 NLPA--A---------SNENIAGLGQAVVLNIGAAAD--LVDVEARGKAILKGLTLARARLTKN--YVP----------- 199 (347) T ss_pred cccc--c---------cccccCCcccccccccccccc--ccchhhhHHHHHHHHHHHHHHHhhc--CCC----------- Confidence 2110 0 011111111111111111111 111111 1145555555443322 122 Q ss_pred CceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc----cCCccc-c- Q lcl|NC_019917. 221 GDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY----GAGANV-E- 294 (364) Q Consensus 221 g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~----~~~~~v-~- 294 (364) ++-.++++.|.++..|.+. .+.. +.-+ .....+-.|.+|.++|+-|++.++++..... +.+.+. . T Consensus 200 -~~gR~~vv~P~~y~~Ll~~--~~~~---~~~~---~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~ 270 (347) T protein:vir:88 200 -AGDRRFYCAPEDYSAILSA--LMPN---AANY---AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQ 270 (347) T ss_pred -CCCCEEEeCHHHHHHHhcc--hhhh---hhhh---ccccchhcceeeeeccceEEEeeccccccccccccccccccccc Confidence 2336677999999999853 2321 1111 1223467899999999999999998642111 000000 0 Q ss_pred ---------------cch--heeeccchheEeeecCCCCCccceechhhccchh-HHHHHHHHhhhhcccCCcccEEEEE Q lcl|NC_019917. 295 ---------------AAR--ALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEP-AICAGFIAGMKKARFNSKDFGVISI 356 (364) Q Consensus 295 ---------------v~r--alllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~i~~i~G~~K~rf~~~DfGvi~i 356 (364) ..+ +|++-.-|++.+ + ...-+.|-..|-.++. .|-....+|.+=+| .+-=++|.+ T Consensus 271 ~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v--~---~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~r--Pe~a~~~~~ 343 (347) T protein:vir:88 271 KHIFPATATGDDRVAQNNVVGLFNHRSAVGTV--K---LKDMALERARRPEFQADQIIGKYAMGHGGLR--PEAAGALVF 343 (347) T ss_pred ccccccccccccccccCcEEEEEechhhhhhe--e---cccceeeeeechhhHHHHhhhhhhhcCceec--cceEEEEEe Confidence 000 122222222111 1 1111233332322221 23333444433332 222344444 Q ss_pred eeee Q lcl|NC_019917. 
357 DTAA 360 (364) Q Consensus 357 dta~ 360 (364) ..+| T Consensus 344 ~~a~ 347 (347) T protein:vir:88 344 TPAA 347 (347) T ss_pred CCCC Confidence 4444 No 30 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.03 E-value=5e-11 Score=77.07 Aligned_cols=309 Identities=12% Similarity=0.051 Sum_probs=173.5 Q ss_pred Cce--------eecccC----Cc--hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc Q lcl|NC_019917. 1 MTT--------TVIPFG----DP--KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL 66 (364) Q Consensus 1 Ma~--------T~~~~~----dp--~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) ||- |+.+.+ |+ +-.++|+.+|.....+.|.|.+. + +..++ ..|++|.|+-+... T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~-v---------~~r~~--~~G~sv~i~~iG~~ 68 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPR-H---------MLRSI--ASGKSAQFPVIGRT 68 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhh-h---------ccccc--cccceeEeeeccce Confidence 773 444443 33 45699999999988888877652 2 11122 25999999999999 Q ss_pred ccCceecCceeecchhhhhhcccEEEEeccc---ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019917. 67 RGKPTYGDARTEGTEENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) Q Consensus 67 ~G~gv~Gd~~leGnee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~ 143 (364) +....+..+.+.|+.++......+|.||+.. 
+.| ..+++-.+++|+|.+.......=+++..|+.++.+|+.++ T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~V---ddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:33 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLI---YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHH---hhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9888888889999988999999999999886 566 4678888999999999999999999999999999987543 Q ss_pred cccc-cccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCc Q lcl|NC_019917. 144 GINL-DFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGD 222 (364) Q Consensus 144 g~~~-~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~ 222 (364) .... +-...+.|.+.-..+...+ ..+...+.. ...+ .-++.|..|.+++.... .|. + T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~-------~tg~~~d~~-~~a~-~i~~~i~~a~~~Lde~~--VP~-----------~ 203 (347) T protein:vir:33 146 NLPDGSNENIEGLGKPTVLTLVKP-------TTGSLTDPV-ELGK-AIIAQLTIARASLTKNY--VPA-----------A 203 (347) T ss_pred hhhccccccccccccccccccccc-------ccccccchh-hhHH-HHHHHHHHHHHHHhhcC--CCc-----------c Confidence 2110 0011111111000000000 001111110 1111 22455555555443222 221 2 Q ss_pred eeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccC-------C----- Q lcl|NC_019917. 223 DHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGA-------G----- 290 (364) Q Consensus 223 ~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~-------~----- 290 (364) + .++++.|.++..|..+. ++.+ +. ..+. -.+-+|.+|.|+|+-|++.++++.....+. . 
T Consensus 204 g-R~~vv~P~~y~~Ll~~~--~~~~--~d---~~~~-~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~ 274 (347) T protein:vir:33 204 D-RTFYTTPDNYSAILAAL--MPNA--AN---YQAL-LDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFP 274 (347) T ss_pred C-cEEEeCHHHHHHHhccc--cccc--cc---cccc-cccccceeEEEeceeEEEecccccCcccccccccccccccccc Confidence 3 55789999999999643 3321 11 1122 246789999999999999999876532110 0 Q ss_pred ----ccccc----chheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 291 ----ANVEA----ARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 291 ----~~v~v----~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) .+..+ .-+|++-..|++.+-...--+.-.|.|+. ++ ..|-....+|.+=+| .+-=++|.+.--+. T Consensus 275 ~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~--~~--d~i~~~~~~G~~vlr--P~~av~i~~~~~~~ 347 (347) T protein:vir:33 275 ATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANY--QA--DQIIAKYAMGHGGLR--PEAAGAIVLPKVSE 347 (347) T ss_pred CCcccceeccccceeeeeecchhheeeeeeceeeeeccchhh--hh--HhhhhhhhcCCceec--ccceEEEecCCCCC Confidence 00111 12466666666544322101222333321 11 122222233432222 11122222222222 No 31 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.02 E-value=2.3e-10 Score=73.46 Aligned_cols=302 Identities=14% Similarity=0.100 Sum_probs=171.0 Q ss_pred Cceeec-------------ccCCchHH--HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec Q lcl|NC_019917. 1 MTTTVI-------------PFGDPKAV--KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH 65 (364) Q Consensus 1 Ma~T~~-------------~~~dp~a~--~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~ 65 (364) ||.... +++|+.+. ++|+.+++....+.|-|.++. +..+++ .|.++.|+-+.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~----------~~r~i~--~gks~~~~~iG~ 68 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRH----------MVRSIS--SGKSAQFPVLGR 68 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccc----------eeeecc--ccceEEEeeecc Confidence 553222 25775554 999999999999998887631 222343 588999998888 Q ss_pred cccCceecCceeecchhhhhhcccEEEEeccc---ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019917. 66 LRGKPTYGDARTEGTEENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGA 142 (364) Q Consensus 66 L~G~gv~Gd~~leGnee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga 142 (364) .+-...+-.+.+.+..++......+|.||+.. +.| ..+++...++|+|.+.-..+..=+++..||.++.+|..+ T Consensus 69 ~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~V---ddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~ 145 (345) T protein:vir:22 69 TQAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGL 145 (345) T ss_pred eEEEeeecCCCCCCCCCCcccceEEEEecchhhhhhhH---hhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 88777777788998888888888999999976 445 478999999999999999999999999999999998753 Q ss_pred ccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccc---cHHHHHHHHHHHHhcccCCCCCcceeeeEe Q lcl|NC_019917. 143 RGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIM---APIVIERAVEKAAMMQAENPETANMVPVSI 219 (364) Q Consensus 143 ~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~---s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~ 219 (364) ....-+....|. +-.+.......+ +...++..-+. -++.|..|.+.+... ..|. 
T Consensus 146 a~~~~~~~~~~~-----------~~~~~~~~~~~~-~g~~~t~~~~~~~~~~~ai~~a~~~Lde~--~VP~--------- 202 (345) T protein:vir:22 146 CNVESKYNENIE-----------GLGTATVIETTQ-NKAALTDQVALGKEIIAALTKARAALTKN--YVPA--------- 202 (345) T ss_pred hccccccccccc-----------cccccccccccc-ccccccccccCHHHHHHHHHHHHHHhhhc--CCCc--------- Confidence 221111111110 000000000000 01111111111 134444444433222 2221 Q ss_pred cCceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc----------- Q lcl|NC_019917. 220 DGDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG----------- 288 (364) Q Consensus 220 ~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~----------- 288 (364) .+ .++++.|.++..|+.+. .. .. ..-+..+.+=+|.++.++|+.|++.++++...... T Consensus 203 --~~-R~~vv~P~~y~~Ll~~~--~~---~~---~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~ 271 (345) T protein:vir:22 203 --AD-RVFYCDPDSYSAILAAL--MP---NA---ANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHV 271 (345) T ss_pred --cC-CEEEeChHHHHHHhccc--cc---cc---cccccccccccceEEEEeceEEEecccccccccCccccCccccccc Confidence 23 56999999999998533 32 11 11234455568999999999999998886432110 Q ss_pred -----CCccccc----chheeeccchheEeeecCCC--CCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEe Q lcl|NC_019917. 289 -----AGANVEA----ARALFMGRQAGVIAYGTANG--LRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISID 357 (364) Q Consensus 289 -----~~~~v~v----~ralllGaqA~~~A~g~~~g--~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~id 357 (364) ..+++.+ ..++++-.-|++. ++... +..+|.|+ ..+ ..|-....+|.+=+| .+--++|. T Consensus 272 ~~~~~g~~~~~~~~~~~~~l~~h~~A~~~--v~~~~~~~e~~r~~~--~~~--d~I~~~~a~G~~vlR--Peaa~~i~-- 341 (345) T protein:vir:22 272 FPANKGEGNVKVAKDNVIGLFMHRSAVGT--VKLRDLALERARRAN--FQA--DQIIAKYAMGHGGLR--PEAAGAVV-- 341 (345) T ss_pred ccccccceeeeeccCceEEEEEehhheee--eeeecceeeeeechh--HHH--HHHHHHHhcCCcccc--cceeEEEE-- Confidence 0011111 1245554444432 23211 12222222 111 123333444433333 23333333 Q ss_pred eeee Q lcl|NC_019917. 
358 TAAK 361 (364) Q Consensus 358 ta~~ 361 (364) .-++ T Consensus 342 ~~~~ 345 (345) T protein:vir:22 342 FKVE 345 (345) T ss_pred EeeC Confidence 3333 No 32 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.02 E-value=9.3e-11 Score=75.57 Aligned_cols=298 Identities=14% Similarity=0.124 Sum_probs=159.0 Q ss_pred CceeecccC----Cc----hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCcee Q lcl|NC_019917. 1 MTTTVIPFG----DP----KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTY 72 (364) Q Consensus 1 Ma~T~~~~~----dp----~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~ 72 (364) ||.-|+--+ .+ ...++|+..+...-.++..|.+ ++ +. .+.+-..||+|+|+......-.-.. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~-~~--------~d-~~~~~~~Gdtv~ip~~g~~~~~d~~ 70 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTS-VV--------KT-WGAQVKKGDTFHVPRISELGVEDKA 70 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhh-cc--------cc-ccccccCCceEEEeccCcceeeeec Confidence 554333332 11 2458999999877777666654 32 11 1222235999999966554433333 Q ss_pred cCceeecchhhhhhcccEEEEeccc-ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Q lcl|NC_019917. 73 GDARTEGTEENLRFYTDQVKIDQVR-HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVE 151 (364) Q Consensus 73 Gd~~leGnee~L~~~~~~v~Idq~R-~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~ 151 (364) .+..+. .+++.-.+.+|.||+.+ .++.+. .+++..+..|+|++........+++..|+.++..++.+.+... 
T Consensus 71 ~~~~i~--~~~~~~~~~~itiD~~~~~~~~i~-d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~---- 143 (341) T protein:vir:94 71 TDVPVG--VQPVNDTDFVITVDTDRTTAVALD-DLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTAS---- 143 (341) T ss_pred CCCccc--cccccCceEEEEEeeeeecceeec-hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---- Confidence 333343 56777889999999986 666664 6788889999999999999999999999998887765432110 Q ss_pred cccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEec Q lcl|NC_019917. 152 TPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSE 231 (364) Q Consensus 152 ~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p 231 (364) .+++..++. ......+.++.+.|..|...+.... .|. ++ .+++++| T Consensus 144 --------~~~~~~~~~------------~~t~~~~~~~~~~i~~a~~~Lde~~--VP~-----------~g-R~lvv~P 189 (341) T protein:vir:94 144 --------QNVFSSSNG------------AITGNGQAFSFAVFLAARRLLLEAD--VPE-----------EK-IVLLISP 189 (341) T ss_pred --------CccccCccc------------cccCchhhhhHHHHHHHHHHHhhcC--CCc-----------cC-CEEEeCH Confidence 011111111 0111234467788877766554332 221 33 4568899 Q ss_pred hhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCc----------ccccchheee Q lcl|NC_019917. 232 YQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGA----------NVEAARALFM 301 (364) Q Consensus 232 ~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~----------~v~v~ralll 301 (364) .++..|+. +++|...... .++.|-+|.+|.|.|+-|++.++++....++... ...+....-+ T Consensus 190 ~~~~~Ll~--~~~~~~~~~~------g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 261 (341) T protein:vir:94 190 GQESALFT--IPQFISKDFI------NNAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYL 261 (341) T ss_pred HHHHHHhh--chhhhhhhcc------ccchhheeeeeeEeceEEEEeccccccccccccccccceecccccccccccccc Confidence 99999995 5566544332 2356889999999999999999987654432211 0111111112 Q ss_pred ccchh------eEee-----ecCCCCCcccee--------chhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
302 GRQAG------VIAY-----GTANGLRFDWEE--------TVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 302 GaqA~------~~A~-----g~~~g~~~~w~E--------e~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) |.+.. +++| +.....++.|-- -..+|.-+. -.+.|.| |.-|. +||+==+.++-+ T Consensus 262 ~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~i~~--~~~~G---~~~lrp~~~v~~ 334 (341) T protein:vir:94 262 PKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENRE--QVWLMVG--RQAYG---ARLYRPLHAVNI 334 (341) T ss_pred cccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhh--hhhhhhh--hhhhc---ccccCcceeEEE Confidence 22111 0000 000001111100 001122111 2233333 11222 122211222333 Q ss_pred cC Q lcl|NC_019917. 363 HS 364 (364) Q Consensus 363 ~~ 364 (364) |. T Consensus 335 ~~ 336 (341) T protein:vir:94 335 HT 336 (341) T ss_pred ec Confidence 33 No 33 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.00 E-value=1.5e-10 Score=74.42 Aligned_cols=302 Identities=12% Similarity=0.056 Sum_probs=170.9 Q ss_pred Cce--------eecccC----Cc--hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc Q lcl|NC_019917. 1 MTT--------TVIPFG----DP--KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL 66 (364) Q Consensus 1 Ma~--------T~~~~~----dp--~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) ||- |+.+.+ |+ +-.++|+.++.....+.+-|.++ + .++ ++ ..|+++.|+-+... T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~-~------~~r---ti--~~G~sv~~~~iG~~ 68 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNK-H------LVR---SI--QSGKSAQFPVLGRT 68 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhh-h------hhe---ec--cccceEEeeeccce Confidence 773 444333 33 35599999999999999888763 1 112 23 25999999999888 Q ss_pred ccCceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019917. 
67 RGKPTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) Q Consensus 67 ~G~gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~ 143 (364) +-...+-.+.+.+.-+++.....+|.||+. ++.| ..+++....+|+|.+.-.....=|++..||.++.+|.-+. T Consensus 69 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~V---ddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a 145 (347) T protein:vir:94 69 KAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLI---YDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLC 145 (347) T ss_pred eEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhh---hhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 888777778888777889999999999997 4556 4678889999999999999999999999999998875321 Q ss_pred cccccccccccccccccCcccCCCCCcEEeeccccchhhhhhccccc----HHHHHHHHHHHHhcccCCCCCcceeeeEe Q lcl|NC_019917. 144 GINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMA----PIVIERAVEKAAMMQAENPETANMVPVSI 219 (364) Q Consensus 144 g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s----~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~ 219 (364) ....+-...+... |....+-.+.+ +.++.+..-+ ++.|.+|...+... ..| T Consensus 146 ~~~~~~~~~~~g~---------~~~~~v~i~~~----~~~~~~~~~~~~~~~d~i~~a~~~Lde~--dVP---------- 200 (347) T protein:vir:94 146 NLPTANNENIAGL---------GKAHVLEVGDQ----ATLQGDQVKLGQAIIAQLTLARAKLTGN--YVP---------- 200 (347) T ss_pred ccccccccccccC---------CcceeEeeecc----ccccccccccHHHHHHHHHHHHHHhhhc--CCC---------- Confidence 1110000000000 00000000011 1111111122 44455555444322 122 Q ss_pred cCceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCC--ccc---- Q lcl|NC_019917. 220 DGDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAG--ANV---- 293 (364) Q Consensus 220 ~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~--~~v---- 293 (364) ++-.++++.|.++..|-+..++. . ......+.+=+|.++.++|+.|++.++++.+...... 
+.+ T Consensus 201 --~~~R~~vv~P~~y~~LLk~~~~~---~-----~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~ 270 (347) T protein:vir:94 201 --SSDRVFYTTPDNYSAILAALMPN---A-----ANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTN 270 (347) T ss_pred --CCCCEEEeChHHHHHHHHhhccc---c-----cccccccccccceeEEeeceEEEEcCccccccCccccccccccccc Confidence 23478899999999997533222 1 1122234556799999999999999998765321111 000 Q ss_pred -----------------ccchheeeccchheEeeecCCCCCccceechhhccchh-HHHHHHHHhhhhcccCCcccEEEE Q lcl|NC_019917. 294 -----------------EAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEP-AICAGFIAGMKKARFNSKDFGVIS 355 (364) Q Consensus 294 -----------------~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~i~~i~G~~K~rf~~~DfGvi~ 355 (364) .-..+|++-.-|++.+-.. .+. .|-.+|..++. .|-....+|..=.| =+..+.. T Consensus 271 ~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~----~~~-~e~~~~~~~~~~~i~~~~a~G~g~~r---Pe~a~~i 342 (347) T protein:vir:94 271 QKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLK----DMA-LERARRANFQADQIIAKYAMGHGGLR---PEACGAL 342 (347) T ss_pred ccccccccccccccccccceEEEEechhhhhhhhhc----ccc-eeeeechhhhhhhhhhhhhhcCcccc---cceeEEE Confidence 0012455555555433111 111 11112222222 22223334432222 2445444 Q ss_pred Eeeee Q lcl|NC_019917. 356 IDTAA 360 (364) Q Consensus 356 idta~ 360 (364) ..++| T Consensus 343 ~~~~a 347 (347) T protein:vir:94 343 VFKKA 347 (347) T ss_pred EecCC Confidence 44444 No 34 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.97 E-value=2.7e-11 Score=78.56 Aligned_cols=227 Identities=11% Similarity=0.061 Sum_probs=140.2 Q ss_pred ecCCCCCceEEEEEeeccccC--ceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHH Q lcl|NC_019917. 49 ELESDAGDTISFDLSVHLRGK--PTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDY 126 (364) Q Consensus 49 dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w 126 (364) |=.-+.||+|+|+ ...|+ -+..++.+. 
.|.|++.+++..|-+...++.+... ++....-|+..++...|+.- T Consensus 1 ~~~~~~Gdtit~P---~~iGda~~v~eG~~i~--~~~l~~t~~~atIk~~gk~~~itD~-a~l~~~gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINLANLCEYP---NDIGDAADVAEGGEIS--LDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) T ss_pred CccccCCceEEec---ccccchhhhcCCCcCC--hhhccccceeeeEeeeccceeeeHH-HHhhccCchHHHHHHHHHHH Confidence 5556899999998 23454 333333333 7889999999999999999998653 55556779999999999999 Q ss_pred HHHHHHHHHHHHhcccccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhccc Q lcl|NC_019917. 127 FYKFTDELLFIYLSGARGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQA 206 (364) Q Consensus 127 ~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~ 206 (364) +++..|..++..|.++. ++.+-.++++.|.+|....... T Consensus 75 iA~kvD~di~~~~~~a~---------------------------------------l~~~~~~t~d~i~~A~~~fgde-- 113 (231) T protein:vir:73 75 LANKVDDDLLKAAKTTS---------------------------------------QTVSTKANVDGVQAALDIFNDE-- 113 (231) T ss_pred HHHhhhHHHHHhhcccc---------------------------------------ccccccccHHHHHHHHHHhccc-- Confidence 99999999887776532 0111136899999987654211 Q ss_pred CCCCCcceeeeEecCceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccc Q lcl|NC_019917. 207 ENPETANMVPVSIDGDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFND 286 (364) Q Consensus 207 ~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~ 286 (364) .++-++++|||.++.+||.+.+..|. ...+-+|.+++|.+|++.||.|...++++.... 
T Consensus 114 --------------~~~~~vivv~p~~~~~Lrk~~~~~~~-------~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~ 172 (231) T protein:vir:73 114 --------------DAQAYVLIVNPKDAAKIRKDANAKNI-------GSEVGANALINGTYADVLGAQIVRSKKLAEGSA 172 (231) T ss_pred --------------cccceEEEEcchHHHhhhhccchhhh-------hhhhccceeeecccceEcceEEEEcCCCCCCce Confidence 13457899999999999986653332 234556789999999999999999988864222 Q ss_pred ccCCcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHH-HHHHhhhhcccCCcccEEEEE-eeee Q lcl|NC_019917. 287 YGAGANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICA-GFIAGMKKARFNSKDFGVISI-DTAA 360 (364) Q Consensus 287 ~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i-~~i~G~~K~rf~~~DfGvi~i-dta~ 360 (364) + ...++....|+.+. ... .+. .|...|-..+.-.-. ..+++++ + ..+=+|+.+ +..| T Consensus 173 ~--------~~~~i~~~gAl~~~--~k~--~~~-vEtdRd~~~k~~~i~~~~~y~v~-l---~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 173 L--------MFKIVSNSPALKLV--LKR--GVQ-VETDRDIVTKTTVITADEHYAAY-L---YDLTKVVNITFTGV 231 (231) T ss_pred e--------eeeEEeeccceeee--ecc--cce-eeccccccccccEEEEeEEEEEE-E---EcCccEEEEEeecC Confidence 1 11233334444333 211 112 554444433322111 1111100 0 011123222 1222 No 35 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.90 E-value=3e-10 Score=72.78 Aligned_cols=284 Identities=12% Similarity=0.092 Sum_probs=169.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecC---CCCCceEEEEEeeccccCc-ee--cC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELE---SDAGDTISFDLSVHLRGKP-TY--GD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~---k~~Gd~v~f~L~~~L~G~g-v~--Gd 74 (364) ||.|.+-.-|-.....+..-+-....+++.|.. +.+|+...+|. .++|+.|++++-..|.|+. +. 
|+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~q-------SG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~ 73 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQ-------SGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGD 73 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhh-------cccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc Confidence 996555554444455666666666666665653 24555544443 3699999999999998874 33 23 Q ss_pred ceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 75 ~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) +.++ -+.|...++..+|=....++.... ++..-+--|...++...+++||.+..+..++..|.|.++-... .. T Consensus 74 ~~i~--~~ki~t~~~~a~i~~~~k~~~~tD-~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~--~~-- 146 (330) T protein:vir:10 74 KALE--TGKITAGADIACVLYRGRGWAANE-LTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTA--GE-- 146 (330) T ss_pred cccc--hhhcccceeEEEEEeecceeeehh-hhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhc--cc-- Confidence 3454 588999999999998888887753 3556677899999999999999999999999999986542100 00 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) + .+-+. .+.. . .-.++-.++.+.+-+|..+ ++-. 
.++..+++|||..+ T Consensus 147 ------~--~~~~~-~~~~-~------~~~~~a~~s~~~l~~A~~~--~GD~--------------~~~~~~ivmhS~v~ 194 (330) T protein:vir:10 147 ------K--GALEE-THVS-D------QSKASTGIDAGMVLDAKQL--LGDS--------------ADQVTAIAMHSAVY 194 (330) T ss_pred ------c--hhhhh-hhee-c------ccccccccCHHHHHHHHHH--hccc--------------cccceEEEEcHHHH Confidence 0 00000 0000 0 0113346888888887543 3310 13578999999999 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) .+||+. +..+..++. . ..+.+|.|+|+.|...-.++.. + .+..++++|..|+.+.-+.. T Consensus 195 ~~L~~~---~li~~~~~s---~------~~~~i~~~~G~~VivdD~~p~~------~--~~yt~yl~~~GAi~~~~~~~- 253 (330) T protein:vir:10 195 TKLQKD---NLIQYIQPT---T------ATINIPTYLGYRVIIDDGIAPT------G--DIYTSYLFRTGSIGLNTGNP- 253 (330) T ss_pred HHHHHh---hhhhhhccc---c------cCcccccccceEEEEeCCCCCC------C--CceeEEEEecCceeeecccC- Confidence 999963 233333432 1 1356899999999877666421 1 23668999988876554432 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEe-----eeee---ccC Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISID-----TAAK---KHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~id-----ta~~---~~~ 364 (364) .++-..|-..|-.... ..+...-+.++++- .++. 
-.| T Consensus 254 -~~~v~~EtdRd~~~g~------------~~l~~r~~~~~hp~G~s~~~~~~~~~~~s 298 (330) T protein:vir:10 254 -SGLTTFETSREAAKGN------------DMIYTRRALVMHPYGVKWTGAEVDAGNIT 298 (330) T ss_pred -CccccccccCCccccc------------eEEEEeeEEEeeeeeeeecccccccCcCC Confidence 2334445433321111 11111222333321 1111 011 No 36 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.87 E-value=2.4e-10 Score=73.37 Aligned_cols=288 Identities=15% Similarity=0.127 Sum_probs=172.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeec---CCCCCceEEEEEeeccccCc--eecCc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTEL---ESDAGDTISFDLSVHLRGKP--TYGDA 75 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL---~k~~Gd~v~f~L~~~L~G~g--v~Gd~ 75 (364) ||.|... |......|+.-+-....+++.|.. +.+|+...+| -.++|+.|+|++-..|+|++ +.|+. T Consensus 1 MA~T~ls--d~i~PEvf~~yv~~~~~~~~~l~q-------SG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~ 71 (351) T protein:vir:15 1 MAETHLS--DLIVPEVFGNYVVNQIIKTNRFVQ-------SGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSD 71 (351) T ss_pred CCceeee--eeechhHHHHHHhhhhHHhhhHhh-------cccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc Confidence 9988753 334445566655555556665543 2455554444 34799999999999998874 44555 Q ss_pred eeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc Q lcl|NC_019917. 76 RTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDF 155 (364) Q Consensus 76 ~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~ 155 (364) .++ .+.|...++.-+|=....++... .+++..+.=|+..++...|++||++..+..+|..|.|.++... 
T Consensus 72 ~i~--~~kitt~~~~a~i~~~~kg~~~t-D~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~-------- 140 (351) T protein:vir:15 72 DID--VNNLTSGKQQGIKFYQTKAYGYT-DLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTK-------- 140 (351) T ss_pred ccc--hheecccceeEEEEeeccceehh-hhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchh-------- Confidence 554 68899999999998888888774 4567777889999999999999999999999999998765321 Q ss_pred cccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHH Q lcl|NC_019917. 156 TGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQAT 235 (364) Q Consensus 156 ~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~ 235 (364) . .+.|.+. .+ ..-.++-+||.+.+-+|..+. +.. .++...+++|||..+. T Consensus 141 --~---------~~~~~~d--~t--~~~~~~~~is~~~l~~A~~~~--GD~-------------~~~~~~~ivmhS~v~~ 190 (351) T protein:vir:15 141 --I---------ANSKVYD--QT--KVSPSEPMFGAKGFTGAIGLM--GDL-------------QDTAFGAIAVNSATYS 190 (351) T ss_pred --h---------cccceec--cc--cccccccccCHHHHHHHHHHh--ccc-------------cccceEEEEEChHHHH Confidence 0 0111221 11 011234569999999987653 210 0123689999999999 Q ss_pred HHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 236 ~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) .||+.. .++..++. .+ .+.+|.|+|+.|.....++.-.. ++.-.+..++|+|..|+. |+.. T Consensus 191 ~L~~~~---li~~~~~s---~~------~~~i~t~~G~~VivdD~~p~~~~---~~~~~~ytsyl~~~GAi~--~~~~-- 251 (351) T protein:vir:15 191 LMKVQG---LIETIQPQ---NG------ATPFEAYNGLRIVLDDDIEIDLT---DKTKPVSTSYIFAPGAVR--YSTN-- 251 (351) T ss_pred HHHhhh---hhhhcccc---cc------CcccceecceEEEEcCCCccccC---CCCCceeEEEEEecceee--eecC-- Confidence 999643 33444443 11 23579999999988776653222 122234778999998875 4442 Q ss_pred CCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeee---eeccC Q lcl|NC_019917. 
316 LRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTA---AKKHS 364 (364) Q Consensus 316 ~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta---~~~~~ 364 (364) .++ +|-..|....- +.+.+.- .-+|.-.=+|+ ..+-+ ..-.| T Consensus 252 ~~~--ve~~rd~~~~~--g~d~l~~--r~~~~~hp~G~-s~~~~~~~~~~~s 296 (351) T protein:vir:15 252 MRS--TETKYDPLING--GQDVIVQ--KRVGTIHVAGT-SIKASFSPSKASF 296 (351) T ss_pred CcC--cceeecccCCC--CceEEEE--eeeeeeeeeee-eecccccccCcCC Confidence 222 34333322211 1111111 11121111111 11100 01111 No 37 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.85 E-value=1.6e-10 Score=74.33 Aligned_cols=282 Identities=15% Similarity=0.068 Sum_probs=169.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecC--CCCCceEEEEEeeccccCc--eecCce Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELE--SDAGDTISFDLSVHLRGKP--TYGDAR 76 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~--k~~Gd~v~f~L~~~L~G~g--v~Gd~~ 76 (364) ||+|... |......+..-+-....+++.|. ..|--.+...+.++- ..+|+.|+++.-..|.|++ +.+++. T Consensus 1 MA~T~ls--d~i~peVf~~yv~~~~~~~~~l~----qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~ 74 (324) T protein:vir:59 1 MAYTKIS--DVIVPELFNPYVINTTTQLSAFF----QSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDD 74 (324) T ss_pred CCceeee--ceechhHHHHHHHhhhHHHHHHh----hcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcc Confidence 9987653 33444556665655555555443 333222222333332 2589999999999998774 444444 Q ss_pred eecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 77 TEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 77 leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) +. -+.|...++.-+|=..-.++... ..++..+.-|...++...|++||++..+..++..|.|.++.+. 
T Consensus 75 i~--~~~l~t~~~~a~i~~~~k~~~~t-D~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~--------- 142 (324) T protein:vir:59 75 LV--PQKINAGQDKAVLILRGNAWSSH-DLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDD--------- 142 (324) T ss_pred cc--hhhcccceeeEEEEeecCceeeh-hhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc--------- Confidence 44 68899999999998888888765 4466778889999999999999999999999999998765321 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) ..++.+-.. -+++-++|.+.+.+|..+ ++- +.++..+++|||..+.+ T Consensus 143 ----------~~~~~~dvs-------a~~~~~~s~~~l~~A~~~--~GD--------------~~~~~~~ivmhS~v~~~ 189 (324) T protein:vir:59 143 ----------MKDNKLDIS-------GTADGIYSAETFVDASYK--LGD--------------HESLLTAIGMHSATMAS 189 (324) T ss_pred ----------cccceeeee-------ccccceecHHHHHHHHHH--hCC--------------cccCcEEEEEchHHHHH Confidence 001111100 112235899999998664 331 11467899999999999 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) ||+.. -.+..++. .+ .+.+|.|+|+.|...-.++.-. .... -.+..++++|..|+....++ T Consensus 190 L~~~~---li~~~~~s---~~------~~~i~~~~G~~VivdD~~p~~~-~~~~--~~~y~s~l~~~GAi~~~~~~---- 250 (324) T protein:vir:59 190 AVKQD---LIEFVKDS---QS------GIRFPTYMNKRVIVDDSMPVET-LEDG--TKVFTSYLFGAGALGYAEGQ---- 250 (324) T ss_pred HHHhh---hhhhcccc---cc------CceeeeecccEEEEeCCCCccc-cCCC--CceEEEEEEecCeEEEeecC---- Confidence 99643 22222322 11 2457999999998776654211 1111 22477899998886555433 Q ss_pred CccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEE------eeeeeccC Q lcl|NC_019917. 
317 RFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISI------DTAAKKHS 364 (364) Q Consensus 317 ~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~i------dta~~~~~ 364 (364) +.--+|...|-..+... +..+-+-++++ ++.+...| T Consensus 251 ~~v~vE~dRd~~~g~~~------------l~~r~~~~~~p~G~s~~~~~~~~~s 292 (324) T protein:vir:59 251 PEVPTETARNALGSQDI------------LINRKHFVLHPRGVKFTENAMAGTT 292 (324) T ss_pred CCcceecccCccccceE------------EEEeeEEEeEeeeEEecccccCCCC Confidence 33335655453221111 11111222222 23222223 No 38 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.84 E-value=2.2e-09 Score=68.02 Aligned_cols=308 Identities=14% Similarity=0.076 Sum_probs=173.8 Q ss_pred Cce----eeccc---CCchH--HHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTT----TVIPF---GDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~----T~~~~---~dp~a--~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+. |+.++ +++.+ .++|+.+++....+++-|.++ ..++. + ..|.++.|+-+.+..-... T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~-------~~~rt---i--~~g~s~~~~~iG~~~~~~~ 68 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPL-------MNIRD---L--RGSNVVRLDRLGNVEAKGR 68 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccc-------cceee---e--ccceeEEEeeeeeeeeccc Confidence 763 44443 22223 499999999999999988763 22332 3 3589999998888876666 Q ss_pred ecCceeecchhhhhhcccEEEEec---ccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_019917. 
72 YGDARTEGTEENLRFYTDQVKIDQ---VRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLD 148 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq---~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~ 148 (364) +=.+.+.|+ ........|.||+ .|+.| ..+++-.+++|+|++-...+..-+++..||.++.+|.-+....-+ T Consensus 69 ~pG~~l~~~--~~~~~k~~itID~ll~a~~~V---ddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~ 143 (335) T protein:vir:78 69 RAGEELERS--RVVNDKWNLTVDTLLYLRHQF---DHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAP 143 (335) T ss_pred ccCcccCCC--CcccCCeEEEecceeechhhH---hhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 666677665 4566777899999 67777 467888899999999999999999999999999988764432222 Q ss_pred ccccccccccccCcccCCCCCcEEeeccccchhhhhhc-ccccHHHHHHHHHHHHhcc--cCCCCCcceeeeEecCceeE Q lcl|NC_019917. 149 FVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAAT-DIMAPIVIERAVEKAAMMQ--AENPETANMVPVSIDGDDHY 225 (364) Q Consensus 149 ~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~-D~~s~~~i~~a~~~a~~~~--~~~~~~~~i~Pv~~~g~~~y 225 (364) -...+.|. +|+ +....++.. -.-....+..|++.|.... ...|+. +.+-- T Consensus 144 ~~~~~~~~-----------------~G~-~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~---------~~~~r 196 (335) T protein:vir:78 144 VDLEDAFS-----------------PGV-LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDA---------VYSEG 196 (335) T ss_pred cccCCCcC-----------------CCc-ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCC---------CCCcc Confidence 11111111 111 111111100 0112445555655554332 222321 12347 Q ss_pred EEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc----cCCccc---cc-c- Q lcl|NC_019917. 226 VVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY----GAGANV---EA-A- 296 (364) Q Consensus 226 V~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~----~~~~~v---~v-~- 296 (364) |+++.|.|++.|..+. +.. .+. ....+..+++=.|.++.++||.|++.++++..... +.+.|+ +. . 
T Consensus 197 v~vv~P~~y~~Ll~~~--~l~--n~~-~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~ 271 (335) T protein:vir:78 197 LTPMSPRVFSLLLEHD--KLM--SVE-YQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQ 271 (335) T ss_pred EEEeChHHHHHHhccc--ccc--ccc-ccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccce Confidence 8999999999999643 321 111 11223346677899999999999999999866422 122222 11 1 Q ss_pred hheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 297 RALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 297 ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) -++++=..|++.+-...-..+..|.+.. .. .-|-....+|..=.|- |..+..--|-..+-+ T Consensus 272 ~~~~~~~~Al~t~~~~~~~~e~~~~~~~--~~--~~i~~~~a~G~g~lRP---e~a~~i~~tg~~~~~ 332 (335) T protein:vir:78 272 IALFLPSKTLITAQVAPVQAKLWEDHDQ--FS--WVLDTFQMYNIGARRP---DTAGAIELKGIEAFD 332 (335) T ss_pred EEEEEecceEEEEEEEecccceeeccch--hh--HhhhHHHHcCCcccCc---ceEEEEEecCCCccc Confidence 2344333443333222212333333322 11 1222233334333222 333333334444444 No 39 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.78 E-value=4.9e-09 Score=66.12 Aligned_cols=309 Identities=13% Similarity=0.057 Sum_probs=175.3 Q ss_pred Cce----eeccc---CCchH--HHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTT----TVIPF---GDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~----T~~~~---~dp~a--~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+. |+.++ +++.+ .++|+.+++....+++-|.++ +.++.+ ..|+++.|+-+.+.+-... 
T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~-------~~~rti-----~~g~s~~~~~iG~~~~~~~ 68 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPL-------MNIRDL-----RGSNVVRLDRLGNVEAKGR 68 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccc-------cceeee-----ccceeEEEeeeeeeeeecc Confidence 763 33333 22332 399999999999999888763 233333 3599999998888877777 Q ss_pred ecCceeecchhhhhhcccEEEEec---ccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_019917. 72 YGDARTEGTEENLRFYTDQVKIDQ---VRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLD 148 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq---~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~ 148 (364) +=.+.+.|+- -......|.||. .||.| ..+++-..++|+|++-...+..-+++..||.+|.+|.-+....-+ T Consensus 69 ~pG~~l~~~~--~~~~k~~itVD~ll~a~~~I---~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~ 143 (335) T protein:vir:63 69 RAGEELERSR--VVNDKWNLTVDTLLYLRHQF---DHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAP 143 (335) T ss_pred cCCcCcCCCC--ccccceEEEecceeechhhh---hhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCc Confidence 6777777764 344677899999 67777 467888889999999999999999999999999888764432211 Q ss_pred ccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhc--ccCCCCCcceeeeEecCceeEE Q lcl|NC_019917. 149 FVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMM--QAENPETANMVPVSIDGDDHYV 226 (364) Q Consensus 149 ~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~--~~~~~~~~~i~Pv~~~g~~~yV 226 (364) -...+.|.... +.+-+ .++..+.+ ..+.|..|++.|... 
....|+ +|-+--+ T Consensus 144 ~~~~~~~~~G~-------~~~~~-----~tg~~~~~-----~~~~l~~a~~~a~~~L~e~dVP~---------~~~~dr~ 197 (335) T protein:vir:63 144 VDLEDAFSPGV-------LEKLD-----LTGLTAKQ-----AADKIVRMHRRVVETFIDRDLGD---------AVYSEGL 197 (335) T ss_pred cccCCCcCCCc-------ceeee-----eccCcccc-----cHHHHHHHHHHHHHHHHhccCCC---------cccCceE Confidence 11111111100 00000 11111111 244455454444322 222222 1113368 Q ss_pred EEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc----CCccc---c--cch Q lcl|NC_019917. 227 VVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG----AGANV---E--AAR 297 (364) Q Consensus 227 ~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~----~~~~v---~--v~r 297 (364) +++.|.|++.|..+. +.. .+.. ...+..++.=.|.++.++||.|++.++++.....+ .+.|+ + -.- T Consensus 198 ~vv~P~~y~~Ll~~~--~l~--n~~~-~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~ 272 (335) T protein:vir:63 198 TPMSPRVFSLLLEHD--KLM--NVEY-QATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQI 272 (335) T ss_pred EEeChHHHHHHhccc--ccc--cccc-ccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeE Confidence 899999999998643 332 1111 11223466778999999999999999997654221 22121 1 123 Q ss_pred heeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 298 ALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 298 alllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++++-.-|++.+-...-.....+.++. .. 
.-|-....+|..=.| .+-.++|.+ |-.-+-+ T Consensus 273 ~~~~~~~Al~t~~~~~vt~e~~~~~~~--~~--~~i~~~~a~G~g~lR--Pe~a~~i~~-tg~~~~~ 332 (335) T protein:vir:63 273 ALFLPSKTLITAQVAPVQAKLWEDNEK--FS--WVLDTFQMYNIGARR--PDTAGAIEL-KGIGAFD 332 (335) T ss_pred EEEEecceEEEEEEeecccceeeccch--hh--HHhHHHHHcCCcccc--cceEEEEEE-cCCCcee Confidence 566666665554333222233333322 11 122223333433222 233333332 4444545 No 40 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.77 E-value=3.3e-09 Score=67.06 Aligned_cols=304 Identities=12% Similarity=0.084 Sum_probs=164.0 Q ss_pred Cceeec-------cc----CC--chHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 1 MTTTVI-------PF----GD--PKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 1 Ma~T~~-------~~----~d--p~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) ||-|+- +. +| .+..++|..+++.+..+.|-|.+ ++ +..++ ..|++|.|+-+...+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~-~~---------~~r~i--~~G~sv~i~~iG~~t 68 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTAD-KH---------IVRTI--QNGKSAQFPVMGRTS 68 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhc-cc---------ccccc--cccceEEEeccccee Confidence 764432 22 23 34458999999999888776665 22 22233 259999999999998 Q ss_pred cCceecCceeecchhhhhhcccEEEEecc---cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_019917. 68 GKPTYGDARTEGTEENLRFYTDQVKIDQV---RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARG 144 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~~~~~~v~Idq~---R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g 144 (364) -...+-++.+.|+-+++.-.+..|.||+. |+.| ..+++...++|+|.+.......=+++..|+.++.+|..... 
T Consensus 69 v~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~V---ddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa 145 (347) T protein:vir:94 69 GVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMI---FDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCN 145 (347) T ss_pred eeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHh---hhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 88888888998888888888888999998 5667 36788889999999999999999999999999987754221 Q ss_pred ccccccccccccccccCcccCCCCCcEEeeccccc--hhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCc Q lcl|NC_019917. 145 INLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATS--KASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGD 222 (364) Q Consensus 145 ~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~--~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~ 222 (364) ...+ ......++ .+++ .+-.+..+. ....+ .+.+ ++.|..|.+.+... ..|. + T Consensus 146 ~~~~--~~~~~~g~-----~~~s---~~~~~~~~~~~~~~~~-~~~~-~~~i~~a~~~Lde~--~VP~-----------~ 200 (347) T protein:vir:94 146 LPAA--SNENIAGL-----GTAS---VLEVGKKADLDTPAKL-GEAI-IGQLTIARAKLTSN--YVPA-----------G 200 (347) T ss_pred cccc--cccccCCC-----cccc---eeeccccccccchhhh-HHHH-HHHHHHHHHHHhhc--CCCC-----------C Confidence 1100 00000000 0011 011000000 00000 0111 34455554444322 1221 2 Q ss_pred eeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc----CCccc----- Q lcl|NC_019917. 223 DHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG----AGANV----- 293 (364) Q Consensus 223 ~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~----~~~~v----- 293 (364) + .++++.|.++..|..+ ........ ... ..+=+|.+|.++|+.|++.++++...... 
.+.++ T Consensus 201 ~-R~~vv~P~~~~~Ll~~--~~~~~~~~---~~~---~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~ 271 (347) T protein:vir:94 201 D-RYFYTTPDNYSAILAA--LMPNAANY---AAL---IDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQK 271 (347) T ss_pred C-cEEEeCHHHHHHHhcc--chhhhhhc---ccc---ccccccceEEEeceEEEecCcccccccccccccCcceecCccc Confidence 3 5667999999999853 33322111 111 22456999999999999999997542211 00000 Q ss_pred ---------------ccchheeeccchheEeeecCCCCCccceechhhccch-hHHHHHHHHhhhhcccCCcccEEEEEe Q lcl|NC_019917. 294 ---------------EAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNE-PAICAGFIAGMKKARFNSKDFGVISID 357 (364) Q Consensus 294 ---------------~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~i~~i~G~~K~rf~~~DfGvi~id 357 (364) .-..+|++=.-|+ +.++... +. .|-..|-.+. ..|-....+|.+=.|- +-=|+|... T Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~h~~A~--~~v~~~~--~~-~e~~r~~~~~~d~i~~~~~~G~~~~rP--~~a~~~~~~ 344 (347) T protein:vir:94 272 HAFPATASSDVKVTMDNVVGLFSHRSAV--GTVKLRD--LA-LERDRDVDAQGDLIVGKYAMGHGGLRP--EAAGALVFS 344 (347) T ss_pred ccccccchhhhcccccceeEEEeehhhh--hhhhccc--cc-ccchhchhhHHHHhhhhhhhcCccccc--ceeEEEEec Confidence 0011233322222 2222111 11 2211121111 1233334444333332 223444333 Q ss_pred eeee Q lcl|NC_019917. 358 TAAK 361 (364) Q Consensus 358 ta~~ 361 (364) +|. T Consensus 345 -~A~ 347 (347) T protein:vir:94 345 -PAE 347 (347) T ss_pred -CCC Confidence 333 No 41 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.69 E-value=4.1e-09 Score=66.56 Aligned_cols=271 Identities=17% Similarity=0.176 Sum_probs=145.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||.+++ ..++|+..+.....+.+.|.+ ++-+.- +.+-..||+|+|+-...++-..-++... ... T Consensus 1 MA~~~~------~pe~~~~~v~~~~~~~lv~~~-l~~~~~--------~~~~~~Gdtv~ip~~~~~~~~d~~~~~~-~~~ 64 (273) T protein:vir:10 1 MAFNNF------IPELWSDMLLEEWTAQTVFAN-LVNREY--------EGTASKGNVVHIAGVVAPTVKDYKAAGR-QTS 64 (273) T ss_pred Ccchhh------hHHHHHHHHHHHHHhhhccch-hhcccc--------ccccccCceEEEeecccccccccccCCC-ccC Confidence 988654 358999999888877766654 554321 1222469999999765544221111111 234 Q ss_pred hhhhhhcccEEEEeccc-ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVR-HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R-~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+++...+.++.||+.+ .++.+. .+++.-...|+++..+. +..=+++..|+.++..++++-.. T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~-------------- 128 (273) T protein:vir:10 65 ADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------- 128 (273) T ss_pred ccccccceEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc-------------- Confidence 67888999999999875 566665 44666677898875554 56678888999998887763110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |. ....++.+. .++.|..|...+.... .|. 
++ .+++++|.++..|++ T Consensus 129 -~~----------------~~~~~~~~~--~~~~i~~a~~~ld~~~--vP~-----------~~-R~lvv~p~~~~~L~~ 175 (273) T protein:vir:10 129 -LT----------------GSAPTDADD--AFDLIAKALKELTKAN--VPN-----------VG-RVVVVNAEMAFWLRS 175 (273) T ss_pred -cc----------------cccccchhH--HHHHHHHHHHHhhhcC--CCc-----------CC-CEEEECHHHHHHHhc Confidence 10 011122111 2556767655543332 221 22 457899999999996 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecC-CCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA-NGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~-~g~~~ 318 (364) + +..+.+... .+..+.|=+|.+|.+.|+-|+++++++....+.+ +..=..|+ ++++- ..+.. T Consensus 176 ~-~~~~~~~~~-----~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~---------~~~~~~A~--~~a~q~~~~e~ 238 (273) T protein:vir:10 176 S-GSKLTSADT-----SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQF---------VAFHPSAA--AYVSQIDTVEA 238 (273) T ss_pred c-hhhhhhhhc-----cccccceeeeeeeEEeceEEEEecccccCCccEE---------EEEeccce--eeeeeeehhhc Confidence 4 222322222 2344567789999999999999998875443221 11111222 22220 00111 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) .+.|+. ++.. 
|-...++|.+=+| .+ |++++=...+ T Consensus 239 ~r~~~~--~~~~--v~~~~~yg~~v~~---~~-~~~~l~~~g~ 273 (273) T protein:vir:10 239 LRDQDS--FSDR--IRALHVYGGKVVR---PT-GVVVFNKTGS 273 (273) T ss_pred ccCCCc--ceee--eeeeeeeeeeEec---cc-eEEEEeccCC Confidence 112211 1111 1111222222121 11 4444332222 No 42 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.69 E-value=4.1e-09 Score=66.56 Aligned_cols=271 Identities=17% Similarity=0.176 Sum_probs=145.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||.+++ ..++|+..+.....+.+.|.+ ++-+.- +.+-..||+|+|+-...++-..-++... ... T Consensus 1 MA~~~~------~pe~~~~~v~~~~~~~lv~~~-l~~~~~--------~~~~~~Gdtv~ip~~~~~~~~d~~~~~~-~~~ 64 (273) T protein:vir:10 1 MAFNNF------IPELWSDMLLEEWTAQTVFAN-LVNREY--------EGTASKGNVVHIAGVVAPTVKDYKAAGR-QTS 64 (273) T ss_pred Ccchhh------hHHHHHHHHHHHHHhhhccch-hhcccc--------ccccccCceEEEeecccccccccccCCC-ccC Confidence 988654 358999999888877766654 554321 1222469999999765544221111111 234 Q ss_pred hhhhhhcccEEEEeccc-ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVR-HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R-~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+++...+.++.||+.+ .++.+. .+++.-...|+++..+. +..=+++..|+.++..++++-.. 
T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~-------------- 128 (273) T protein:vir:10 65 ADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------- 128 (273) T ss_pred ccccccceEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc-------------- Confidence 67888999999999875 566665 44666677898875554 56678888999998887763110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |. ....++.+. .++.|..|...+.... .|. ++ .+++++|.++..|++ T Consensus 129 -~~----------------~~~~~~~~~--~~~~i~~a~~~ld~~~--vP~-----------~~-R~lvv~p~~~~~L~~ 175 (273) T protein:vir:10 129 -LT----------------GSAPTDADD--AFDLIAKALKELTKAN--VPN-----------VG-RVVVVNAEMAFWLRS 175 (273) T ss_pred -cc----------------cccccchhH--HHHHHHHHHHHhhhcC--CCc-----------CC-CEEEECHHHHHHHhc Confidence 10 011122111 2556767655543332 221 22 457899999999996 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecC-CCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA-NGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~-~g~~~ 318 (364) + +..+.+... .+..+.|=+|.+|.+.|+-|+++++++....+.+ +..=..|+ ++++- ..+.. T Consensus 176 ~-~~~~~~~~~-----~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~---------~~~~~~A~--~~a~q~~~~e~ 238 (273) T protein:vir:10 176 S-GSKLTSADT-----SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQF---------VAFHPSAA--AYVSQIDTVEA 238 (273) T ss_pred c-hhhhhhhhc-----cccccceeeeeeeEEeceEEEEecccccCCccEE---------EEEeccce--eeeeeeehhhc Confidence 4 222322222 2344567789999999999999998875443221 11111222 22220 00111 Q ss_pred cceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) .+.|+. ++.. |-...++|.+=+| .+ |++++=...+ T Consensus 239 ~r~~~~--~~~~--v~~~~~yg~~v~~---~~-~~~~l~~~g~ 273 (273) T protein:vir:10 239 LRDQDS--FSDR--IRALHVYGGKVVR---PT-GVVVFNKTGS 273 (273) T ss_pred ccCCCc--ceee--eeeeeeeeeeEec---cc-eEEEEeccCC Confidence 112211 1111 1111222222121 11 4444332222 No 43 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.69 E-value=9.3e-09 Score=64.62 Aligned_cols=305 Identities=15% Similarity=0.132 Sum_probs=163.6 Q ss_pred Cceee----cc---cCCchHH--HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTTTV----IP---FGDPKAV--KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~T~----~~---~~dp~a~--~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+.-+ .+ ++|+.+. |+|+.+++....+.+-|.++ +.++. + ..|.++.|.-+...+-... T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~-------~~vrt---i--~~GkS~qf~~iG~~~a~y~ 68 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSY-------FDVQT---V--TGTNTVSNKYLGETELQVL 68 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCc-------ceeee---e--cccceEEEEEEeeeEEeee Confidence 87433 22 2455554 89999999999998888763 23333 3 2788999998877776666 Q ss_pred ecCceeecchhhhhhcccEEEEec---ccceeeccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHH--Hhcccccc Q lcl|NC_019917. 72 YGDARTEGTEENLRFYTDQVKIDQ---VRHPVSAGGRMSRKRSVHN-IRRIARDRLGDYFYKFTDELLFI--YLSGARGI 145 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq---~R~~V~~~~~m~~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~--~l~ga~g~ 145 (364) +-.+.+.| +.+......|.||. 
.||.| ..+++-...+| +|.+-...+..-+++..||.++- .+++..- T Consensus 69 ~~G~~ldg--~~~~~~k~~ItID~lL~a~~~V---~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~- 142 (402) T protein:vir:97 69 APGQSPNA--TPTQADKNQLVIDTTVIARNTV---AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN- 142 (402) T ss_pred ccccccCC--CCcccccEEEEeCceeechhhh---hhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc- Confidence 65566765 46777777899998 45666 46788888999 89999899999999999997753 3343211 Q ss_pred cccccccccccccccCcccCCCCCcEEeeccccchhhhhhcc-cccHHHHHHHHHHHHhc--ccCCCCCcceeeeEecCc Q lcl|NC_019917. 146 NLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATD-IMAPIVIERAVEKAAMM--QAENPETANMVPVSIDGD 222 (364) Q Consensus 146 ~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D-~~s~~~i~~a~~~a~~~--~~~~~~~~~i~Pv~~~g~ 222 (364) . .+|+.+--..+.+....-..+..+ .-+...|-++++.|... .+..|. + T Consensus 143 t-----------------~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~-----------~ 194 (402) T protein:vir:97 143 T-----------------KAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI-----------S 194 (402) T ss_pred c-----------------ccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc-----------c Confidence 0 011111101111100000000011 12444444444444322 222231 2 Q ss_pred eeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc-----------cCCc Q lcl|NC_019917. 223 DHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY-----------GAGA 291 (364) Q Consensus 223 ~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~-----------~~~~ 291 (364) + -++++.|.|+.-|..+. +..+ +.. .. ...+.+=.|.+++++||.|++.++++..-.. |... 
T Consensus 195 d-Rv~vv~P~~y~~Ll~~~--rl~n--~d~-~~-~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y 267 (402) T protein:vir:97 195 D-VAIMMPWKFFNALRDAD--RIVD--KTY-TI-SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRY 267 (402) T ss_pred c-cEEEeChHHHHHHhhcc--cccc--hhh-cc-ccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccC Confidence 3 68999999999999643 3221 111 11 1123345799999999999999999754211 1222 Q ss_pred cc----ccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEE------------ Q lcl|NC_019917. 292 NV----EAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVIS------------ 355 (364) Q Consensus 292 ~v----~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~------------ 355 (364) ++ .-.++++.=.-|++.+-...--+.++|.++.+ -.-|-....+|..-.|-. --|||. T Consensus 268 ~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~----~~~id~~~a~G~g~~RPe--aa~vv~~~~~~t~~~~~~ 341 (402) T protein:vir:97 268 DPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK----TYYIDTFMAEGAIPDRWE--AVSVVTTKRDATTGDAGG 341 (402) T ss_pred CcCcccceeEEEEEecceEEEEEeeccccchhhchhHH----HHHHHHHHHhCCcccCcc--ceEEEEEecccccccCCc Confidence 22 22345555445554442222122333333211 111444455555544431 112210 Q ss_pred --Eee------------------eeeccC Q lcl|NC_019917. 356 --IDT------------------AAKKHS 364 (364) Q Consensus 356 --idt------------------a~~~~~ 364 (364) =|+ ++.+.| T Consensus 342 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (402) T protein:vir:97 342 PGDDHATVLARAQRKAVYVKTEGAAAAFS 370 (402) T ss_pred cccchhhhhcccccceEEEeccccchhcc Confidence 000 111111 No 44 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.69 E-value=8.7e-09 Score=64.76 Aligned_cols=312 Identities=13% Similarity=0.081 Sum_probs=167.5 Q ss_pred Cceeecc---------------cCCchH--HHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe Q lcl|NC_019917. 
1 MTTTVIP---------------FGDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS 63 (364) Q Consensus 1 Ma~T~~~---------------~~dp~a--~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~ 63 (364) |+.++.. ++|+.+ .++|+.+++....+.+-|.++ +++.++. .|.++.|+-+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~----------~~~rti~--~Gksv~f~~i 68 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDL----------VTKRTLK--NGKSLQFIYT 68 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhcc----------ccccccc--cCceEEEEee Confidence 5544432 236444 499999999999998887753 2223332 5889999988 Q ss_pred eccccCceecCceeecc-hhhhhhcccEEEEeccc---ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019917. 64 VHLRGKPTYGDARTEGT-EENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYL 139 (364) Q Consensus 64 ~~L~G~gv~Gd~~leGn-ee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l 139 (364) ...+-...+..+.+.|+ .++..-.+.+|.||+.. +.| ..+++...++|||++.......=|++..|+.++..| T Consensus 69 G~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~V---dDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l 145 (375) T protein:vir:10 69 GRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFV---YDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSI 145 (375) T ss_pred eeeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhH---hhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88887788888888887 45667778899999984 556 468889999999999999999999999999999998 Q ss_pred cccccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEe Q lcl|NC_019917. 140 SGARGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSI 219 (364) Q Consensus 140 ~ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~ 219 (364) ..+.....|-...+.+. |....+..+.+.++...++. .--++.|..+.+.+... ..|. 
T Consensus 146 ~kaa~~~~p~~~~~~~~---------~Gg~~i~~~sg~~~~~~~ta--~~~~~ai~~a~~~Lde~--~VP~--------- 203 (375) T protein:vir:10 146 TRGARSASPVSATNFVE---------PGGTQIRVGSGTNESDAFTA--SALVNAFYDAAAAMDEK--GVSS--------- 203 (375) T ss_pred HHhhhhccccccccccc---------cCcceeeeccccccccccCH--HHHHHHHHHHHHHHhhc--CCCC--------- Confidence 74332222211111111 11111111122222222211 11234444444333222 1221 Q ss_pred cCceeEEEEEechhHHHHhhcCCHH-HHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc----cCCccc- Q lcl|NC_019917. 220 DGDDHYVVVMSEYQATDMRTAAGGT-WIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY----GAGANV- 293 (364) Q Consensus 220 ~g~~~yV~~l~p~q~~~Lr~~~d~~-w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~----~~~~~v- 293 (364) .+ -++++.|.++.-|..+.|+. .. .+.. +.+...=.|.++.++|+.|++.++++....+ ++..++ T Consensus 204 --~~-R~~vv~P~~y~~Ll~~~d~~~~~--n~d~----~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~ 274 (375) T protein:vir:10 204 --QG-RCAVLNPRQYYALIQDIGSNGLV--NRDV----QGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGET 274 (375) T ss_pred --CC-CEEEeChHHHHHHHhcCCcccee--eecc----cccceeccceEEEEeceEEEEecccccccccccccccccccc Confidence 24 34779999999998654432 11 1111 1112223577899999999999999877542 111111 Q ss_pred ----ccchheeeccchheEeeecCCCCCccceechhhc--------cchhHHHHHHHHhhhhcccCC------c-c---- Q lcl|NC_019917. 294 ----EAARALFMGRQAGVIAYGTANGLRFDWEETVKDY--------GNEPAICAGFIAGMKKARFNS------K-D---- 350 (364) Q Consensus 294 ----~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~--------g~~~~i~i~~i~G~~K~rf~~------~-D---- 350 (364) ...+.+..+...++.+ |.+ +-+|. .+|+ ..+..+++-...|+.=-+|++ + | T Consensus 275 a~~~~~~~~~~~~~~~~~~~-g~~---~~y~~--d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~ 348 (375) T protein:vir:10 275 SPGNLGSHIGPTPENANATG-GVN---NDYGT--NAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILG 348 (375) T ss_pred chhhhhccccccCCcceeec-ccc---ccccc--cccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeee Confidence 1223333333332222 110 00111 1111 122222322223322222210 0 0 Q ss_pred ---c--------EEEEEeeeeeccC Q lcl|NC_019917. 
351 ---F--------GVISIDTAAKKHS 364 (364) Q Consensus 351 ---f--------Gvi~idta~~~~~ 364 (364) + .++.|.+.+.+-+ T Consensus 349 ~~a~G~~~lrp~~av~l~~~~~~~~ 373 (375) T protein:vir:10 349 RMAMGADYLNPAAAVELYIGATAPS 373 (375) T ss_pred eeeeccCccCceeEEEEecCcCccc Confidence 1 1233444433333 No 45 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.56 E-value=1.5e-08 Score=63.42 Aligned_cols=304 Identities=13% Similarity=0.092 Sum_probs=159.7 Q ss_pred Cceeec----c---cCCchHH--HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTTTVI----P---FGDPKAV--KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~T~~----~---~~dp~a~--~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+.-+. + ++|+.+. |+|+.+++....+.+-|.++ +.++.+ ..|.++.|+-+...+-... T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~-------~~~rti-----~~gkS~q~~~iG~~~~~~~ 68 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQW-------FDVQEV-----VGTNSVSNKYIGETELQVL 68 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCc-------ceeeee-----cccceEEeeeeeeeEEeee Confidence 874332 2 2455554 89999999999998888763 233333 2788999998877776666 Q ss_pred ecCceeecchhhhhhcccEEEEec---ccceeeccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhc-cccccc Q lcl|NC_019917. 72 YGDARTEGTEENLRFYTDQVKIDQ---VRHPVSAGGRMSRKRSVHN-IRRIARDRLGDYFYKFTDELLFIYLS-GARGIN 146 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq---~R~~V~~~~~m~~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~l~-ga~g~~ 146 (364) +-.+.+.| +.+.....+|.||+ .||.| ..+++-...+| +|++-...+..=+++..||.++..+. .+.... 
T Consensus 69 ~~G~~ld~--~~~~~~k~~itID~ll~a~~~V---~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~ 143 (364) T protein:vir:10 69 SPGKSPDA--SPTEFDKNRLVVDTTVIARNTV---AHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNT 143 (364) T ss_pred ccCcccCC--CCcccCcEEEEecceeeechhh---hhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 65556654 57777788999999 56666 46788888999 89998888888999999998765442 221110 Q ss_pred ccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHh--cccCCCCCcceeeeEecCcee Q lcl|NC_019917. 147 LDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAM--MQAENPETANMVPVSIDGDDH 224 (364) Q Consensus 147 ~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~--~~~~~~~~~~i~Pv~~~g~~~ 224 (364) . +.+. ++.-.|-...+ -.++.++.. .-+...+-++++.|.. ..+..|. +- T Consensus 144 ~-----~~~~----~~~~~~~g~~i-~~~~~a~~~------~~~~~~l~~ai~~a~~~LdEkdVP~------------~~ 195 (364) T protein:vir:10 144 E-----AIRK----NPRVAGHGFSI-HIVGLASSF------LTSPQYMMAAIEMAMEQQTEQEVDT------------SE 195 (364) T ss_pred c-----cccc----CCcccCCccee-eecccCcch------hhhHHHHHHHHHHHHHHHhhcCCCc------------cc Confidence 0 1000 01000111011 111111111 1222333333333322 2222231 33 Q ss_pred EEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccccccc---------------- Q lcl|NC_019917. 225 YVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG---------------- 288 (364) Q Consensus 225 yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~---------------- 288 (364) -++++.|.|+.-|..+. +.. .+.. 
...+ .+..=+|.+++++||.|++.++++.....+ T Consensus 196 R~~vv~P~~y~~Ll~~~--~lv--n~d~-~~~~-~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~ 269 (364) T protein:vir:10 196 LCGLMPWTAFNCLRDAD--RIV--DKSY-TIAA-SDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGN 269 (364) T ss_pred cEEEeChHHHHHHhcCC--ccc--cccc-cccC-CCccccceeEEEeceEEEeccccccccccccccccccccccccccC Confidence 78999999999998643 322 1111 1111 234558999999999999999997653211 Q ss_pred -CC----cccccchheeeccchheEeeecCCCCCccceec-hhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 289 -AG----ANVEAARALFMGRQAGVIAYGTANGLRFDWEET-VKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 289 -~~----~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee-~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) .. +...-.+++++=.-|++.+-...--+..+|.+. ..|+.+ ....+|..=.|- | ++.+|=+++.. T Consensus 270 g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id-----a~~a~G~g~lRP---e-aa~~i~~~~~~ 340 (364) T protein:vir:10 270 GNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID-----TFLAEGAIPDRW---E-AVAVVTAADTA 340 (364) T ss_pred CcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee-----eehcccCcccCc---c-ceEEEEecCCC Confidence 11 111224455554444433322211112222221 122222 133344433332 1 11111111111 Q ss_pred cC Q lcl|NC_019917. 363 HS 364 (364) Q Consensus 363 ~~ 364 (364) .- T Consensus 341 ~~ 342 (364) T protein:vir:10 341 EL 342 (364) T ss_pred CC Confidence 11 No 46 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.46 E-value=9.9e-08 Score=58.99 Aligned_cols=274 Identities=11% Similarity=0.019 Sum_probs=149.2 Q ss_pred cEEEEeecCCCCCceEEEEEeeccccCceecCceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHH Q lcl|NC_019917. 
43 VIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDR 122 (364) Q Consensus 43 ~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~ 122 (364) .||.++ .|.++.|+-+...+-...+=.+.+.|+-+++.-....|.||+..-.=-.=..+++...++|||.+.-.. T Consensus 1 ~vr~i~-----~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MTRTIT-----SGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Ceeeee-----cCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHH Confidence 666664 599999998888877777777788888778888888899999763321114678888999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcccccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhccccc----HHHHHHHH Q lcl|NC_019917. 123 LGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMA----PIVIERAV 198 (364) Q Consensus 123 L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s----~~~i~~a~ 198 (364) ...=|++..||.+|.+++......- +...+ |....... -++.. .....++.-+ ++.|..|. T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a-----~~~~~---~~~~~g~~-~~~~~------~~~~~~~~~~~~~~~dai~~a~ 140 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRK-----ETTNE---NIEGLGAA-SLVKI------TGKKEDPAKYGTQVIQALTYAR 140 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhccc-----ccccC---CcccCCcc-ceecc------cccccccccCHHHHHHHHHHHH Confidence 9999999999999999875221110 00010 11000000 00000 1111122233 34444444 Q ss_pred HHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEec Q lcl|NC_019917. 199 EKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKH 278 (364) Q Consensus 199 ~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~ 278 (364) +.+.... .|. .+ -++++.|.++..|+.+. . ..... .+..+.+=+|.+|.++|+.|++. 
T Consensus 141 ~~Lde~~--VP~-----------~g-R~~vv~P~~y~~Ll~~~--~---~~~~~---~~~~~~~~~G~V~~i~Gf~V~~S 198 (324) T protein:vir:99 141 AAFAKKY--IPA-----------GD-RTFYTDPDTYSAILAAL--M---PNAAN---YAALIDPETGNIRNVMGFEVVET 198 (324) T ss_pred HHHhhcC--CCC-----------CC-CEEEeChHHHHHHhhcc--c---ccccc---cccccceecceEEEEeceEEEec Confidence 4433221 221 23 45899999999998432 2 11111 12335677899999999999999 Q ss_pred Ccccccccc---cCC------------cc--------cccchheeeccchheEeeecCCCCCccceechhhccchhHHHH Q lcl|NC_019917. 279 RNVIRFNDY---GAG------------AN--------VEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICA 335 (364) Q Consensus 279 ~~~~~~~~~---~~~------------~~--------v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i 335 (364) ++++..... .+. .+ ..-..+|++=.+|++..-...--+.-+|.|+. .-.-|-. T Consensus 199 n~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~----~~d~i~~ 274 (324) T protein:vir:99 199 PHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEY----QADQIIA 274 (324) T ss_pred CCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceechhh----HHHhhhh Confidence 998753110 000 00 01123466655554333222111233444321 1122333 Q ss_pred HHHHhhhhcccCCcccEEEEEeeeee-----------ccC Q lcl|NC_019917. 336 GFIAGMKKARFNSKDFGVISIDTAAK-----------KHS 364 (364) Q Consensus 336 ~~i~G~~K~rf~~~DfGvi~idta~~-----------~~~ 364 (364) ...+|.+=+| .+--++|.+..-+. 
|-| T Consensus 275 ~~a~G~~~lR--Pe~a~~v~l~~~~~~~~~~~~~~~~~~~ 312 (324) T protein:vir:99 275 KYAMGHGGLR--PEAVGAIIFEDGETPAVAPDVITGVASF 312 (324) T ss_pred hhhhcCcccc--cceEEEEEEccCccccccchhhhhhccc Confidence 3444544332 33344444322210 000 No 47 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.35 E-value=2.5e-07 Score=56.74 Aligned_cols=297 Identities=12% Similarity=0.023 Sum_probs=165.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec--cccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH--LRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~--L~G~gv~Gd~~le 78 (364) |+++... .-+++|+..+...+.++..- |-++ ++..+.- ..++.+..-.... .-|..-.+..... T Consensus 13 Ms~~i~~----~fv~qy~~~v~~~~qq~~s~---L~~t-----V~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 13 IAGDIDQ----AFVQTYETTLRILSQQKSAK---LKQY-----CQHKNES--SESHNWETLASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred eechhhh----HHHHHHHHHHHHHHHHhhhh---hhcc-----ccccccc--ccccceeecccccccccccccccccccC Confidence 8876543 34699999998888776533 3332 2211111 1222222111111 1121111111111 Q ss_pred cc----hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 79 GT----EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 79 Gn----ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) +. ..+.....-.+.+++...++.+ ..+++-|..+|+|...-..+..=+++..|+.++-.+.|.-.. 
T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~V-Dd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~--------- 148 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVV-EQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASI--------- 148 (322) T ss_pred cccCCCccccccceEEEeecccccceec-chHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccc--------- Confidence 11 1122334444555566666655 577888999999999999999999999999888777663211 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) +....++..|+++-+.- .+=.++.+.|..|+...+.... |+ +..-++++.|.|+ T Consensus 149 --~~~gt~v~~~ss~~i~~-----------g~~g~t~~kl~~a~~~l~~~dv--p~-----------d~~R~~vv~p~~~ 202 (322) T protein:vir:10 149 --KGTGQPVEFLATQEIGD-----------GTKPISFDYVTEITERFLENEI--EP-----------EVSKVIVIGPTQA 202 (322) T ss_pred --cccccccccCCCccccc-----------CccchhHHHHHHHHHHHHhcCC--CC-----------CCCeEEEeCHHHH Confidence 11112333333322111 1125788888888877655532 21 2223589999999 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeee-cCeEEEcCEEEEecCccccccccc------CCcccccchheeeccchhe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYG------AGANVEAARALFMGRQAGV 307 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-G~~g~~ngvii~e~~~~~~~~~~~------~~~~v~v~ralllGaqA~~ 307 (364) .+|=. ++++-+....+ .++|+. |.+|.|-|+-+..+++++...... ...+..+.+++.-=.+|++ T Consensus 203 ~~LL~--d~~~ts~D~~~------~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~ 274 (322) T protein:vir:10 203 RKLLQ--ITEATSADYTS------AMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALG 274 (322) T ss_pred HHHhc--chhhhhhhccc------chhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCcee Confidence 99974 55664433322 356775 889999999999999987543211 1122234555544455554 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) .|-++.-.++..| .-|..+-.-|-....+|-.-+ ++=||+.|+---++ T Consensus 275 ~a~~~dv~~~i~~---~~~~~~a~~I~~~~~~Ga~ri----~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 275 YHSCKDIWTKVAE---DPSASFAWRIYSAFTADCVRV----EDEHIFKLRLKNSL 322 (322) T ss_pred EEEeeeeeEEeec---cCCcchhhhhhhhhhhCceEe----ccCcEEEEEEeccC Confidence 4433321222222 224444455555566676555 44699999986666 No 48 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.25 E-value=5.8e-08 Score=60.28 Aligned_cols=314 Identities=13% Similarity=0.048 Sum_probs=173.2 Q ss_pred Cceee--cccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecC---CCCCceEEEEEeeccccCc-ee-c Q lcl|NC_019917. 1 MTTTV--IPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELE---SDAGDTISFDLSVHLRGKP-TY-G 73 (364) Q Consensus 1 Ma~T~--~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~---k~~Gd~v~f~L~~~L~G~g-v~-G 73 (364) |+..+ +...|......|..=+.....+++.|.. +.+|+.-.+|. +..|+.|++++...|.|.. .. + T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~q-------SGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~ 73 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFL-------SGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhh-------cceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCC Confidence 98755 4444444555565555555555555553 44666666664 4899999999999998853 22 3 Q ss_pred Cce-eecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_019917. 74 DAR-TEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVET 152 (364) Q Consensus 74 d~~-leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~ 152 (364) ++. .+-.-..+.-.+|.-+|=..-.+.... -+++--+--|..+....++++||.+.....+|..|.|.++.+..-... 
T Consensus 74 d~~~~~~t~~kittg~~~a~v~~r~kaw~~~-Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~ 152 (367) T protein:vir:80 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFA 152 (367) T ss_pred CCCcccccccccccchheeeeehhcccchhh-hHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchh Confidence 222 222335677777777776666666543 345555667999999999999999999999999999987654322111 Q ss_pred ccccccccCc-ccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEec Q lcl|NC_019917. 153 PDFTGFAGNP-LEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSE 231 (364) Q Consensus 153 ~~~~~~~~N~-~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p 231 (364) ....+...++ ++....++++=-.+.++ +++-+||.+.+-+|.. .++-. .++.=+++||+ T Consensus 153 ~~~~~~~~~a~~~~~~~~~~~Dis~~t~----~~~~~~s~~~~~~A~~--~lGD~--------------~~~l~~i~mHS 212 (367) T protein:vir:80 153 TIKTRGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAF--TMGDH--------------VGSIAAIAVHS 212 (367) T ss_pred hhhhhhccccccccccCceeeeeeccCC----CccceecHHHHHHHHH--Hhccc--------------cccccEEEEch Confidence 1111111111 11112222222222221 1234699999988833 23311 13567899999 Q ss_pred hhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeee Q lcl|NC_019917. 232 YQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYG 311 (364) Q Consensus 232 ~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g 311 (364) ..+..|++.. -++.-++. .+ ...++.|+|.+|+.--.|+.....+. .+..+.|+|..|.. 
|+ T Consensus 213 ~V~~~L~~~~---li~~i~~s---d~------~~~i~ty~G~~VIvDD~~Pv~~~~a~----~~yttYlfg~GAi~--~~ 274 (367) T protein:vir:80 213 MVYKRMTNND---EIEFIPDS---KG------QLTIPTYMGKVVIVDDGMPVFGTGAD----KTYLSILFGGAAFG--YA 274 (367) T ss_pred HHHHHHHhcc---ccccccCC---CC------ccccceecceeEEEeCCCcccccCCC----ceEEEEEEecceee--ec Confidence 9999999642 23333332 22 25689999999999878875543221 24779999998764 55 Q ss_pred cCCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeee--------------eccC Q lcl|NC_019917. 312 TANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAA--------------KKHS 364 (364) Q Consensus 312 ~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~--------------~~~~ 364 (364) ..++..+ .|-..|--..-+-+.+.++- +-||--.=+|+=-..+.+ +..| T Consensus 275 ~~~~~~~--~E~~Rd~~~~~~gG~d~L~~--Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~s 337 (367) T protein:vir:80 275 DGAPQVP--VAVGRRELRGNGSGLEYILE--RKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) T ss_pred ccCCccc--eecccchhhhcCCceEEEEe--eeeEEeecceeeecccccccccccccccccccccCC Confidence 4333322 34333332111112222111 011111112221111111 0111 No 49 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.14 E-value=2e-06 Score=51.78 Aligned_cols=308 Identities=15% Similarity=0.114 Sum_probs=166.4 Q ss_pred Cce----eecc---cCCchHH--HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTT----TVIP---FGDPKAV--KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~----T~~~---~~dp~a~--~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+. |..+ ++++.+. ++|+.++++...+++-|.++ +.+|.++ .|.++.|+-+....-... 
T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~-------~~vRtI~-----~gkS~qf~~lG~s~a~y~ 68 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSY-------FDVQTVT-----GTNTVSNKYLGETELQVL 68 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhccc-------ceeeeec-----ccceEEEEEeeeeEEeee Confidence 874 3332 3556555 99999999999999988763 3455443 788999998888877777 Q ss_pred ecCceeecchhhhhhcccEEEEeccc---ceeeccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHH--Hhcccccc Q lcl|NC_019917. 72 YGDARTEGTEENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHN-IRRIARDRLGDYFYKFTDELLFI--YLSGARGI 145 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~--~l~ga~g~ 145 (364) +-.+.+.|+ ........|.||... |.| ..+++-...+| +|.+-...+..-+++..||.++. .+++ .. T Consensus 69 ~pG~~ldg~--~~~~dk~~ItIDtLL~a~~~V---~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~-~a- 141 (400) T protein:vir:10 69 APGQSPAAT--STQADKNQLVIDATVIARNTV---AHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGG-IA- 141 (400) T ss_pred cCCCCcCCC--CcccCcEEEEeCceeeecchh---hhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc- Confidence 777778866 467777789999875 555 46788888999 89999999999999999997763 3343 10 Q ss_pred cccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhc--ccCCCCCcceeeeEecCce Q lcl|NC_019917. 146 NLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMM--QAENPETANMVPVSIDGDD 223 (364) Q Consensus 146 ~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~--~~~~~~~~~i~Pv~~~g~~ 223 (364) +....+.+.+.-.+.. .-.+. ++ +....-+..-+..|++.|... 
.+..| .+ T Consensus 142 --~t~~~~~~~~g~~~g~-----s~~v~--~~------~~~~~~~~~~l~~A~~~A~~~LdEkdVP------------~~ 194 (400) T protein:vir:10 142 --NTQAKRTNPRVKGHGF-----SVNVE--VN------EGEALVNPQYVMAAVEFALEQQLEQEVD------------IS 194 (400) T ss_pred --ccccccccCCcccccc-----ceeec--cc------ccccccCHHHHHHHHHHHHHHHHhcCCC------------cc Confidence 1111111111110100 00000 11 111122334444444444222 22212 12 Q ss_pred eEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc-----------cCCcc Q lcl|NC_019917. 224 HYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY-----------GAGAN 292 (364) Q Consensus 224 ~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~-----------~~~~~ 292 (364) -+++|+.|..+.-|+. .|.- +.+...... .+..=+|.+++++||.|+|.++++.+... |...+ T Consensus 195 d~vvl~pp~~Ys~Ll~-~dkL---vnrdf~~s~--~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~ 268 (400) T protein:vir:10 195 DVAILMPWRYFNVLRD-ADRI---VDKSYTISQ--SGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYD 268 (400) T ss_pred ceEEEcCHHHHHHHHh-CCcc---cchhccccC--CCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCC Confidence 3677777766666663 3311 223221111 24466789999999999999999754211 22222 Q ss_pred c----ccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC------cccEEEEE------ Q lcl|NC_019917. 293 V----EAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS------KDFGVISI------ 356 (364) Q Consensus 293 v----~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~------~DfGvi~i------ 356 (364) + .-.++++.=..|++..=.. .-+.-.|.|+. . .-.-|-....+|+.-.|-.- .|=.+-++ T Consensus 269 ~t~d~s~~~av~F~~sAv~tvk~~-~lt~~~~~d~r-~--~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~ 344 (400) T protein:vir:10 269 PIAEMNGAIAVLFTADALLVGRSI-DVIGDIFYEKK-E--KTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAA 344 (400) T ss_pred ccccccceeEEEEehhheEEEEee-ccccccccchh-h--HHHHHHHHHHhCCcccchhheEEEEecCCcccccccCcch Confidence 2 2344555555555543222 11222344331 1 11234445566666555521 11111111 Q ss_pred ----------------eeeeeccC Q lcl|NC_019917. 
357 ----------------DTAAKKHS 364 (364) Q Consensus 357 ----------------dta~~~~~ 364 (364) -+++++.| T Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~ 368 (400) T protein:vir:10 345 QHTQVLNRAQRKAVYVKNAAPAGA 368 (400) T ss_pred hHHHHHhhcccceEEEeccccccc Confidence 12222222 No 50 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.10 E-value=3.4e-06 Score=50.55 Aligned_cols=278 Identities=12% Similarity=0.058 Sum_probs=153.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceee-- Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTE-- 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le-- 78 (364) ||..+++ ++|+..|-....+.+.+.. |.++..+.-|. . ..|++|.++-+ +..++ +|-... T Consensus 1 MA~~n~a-------~~~~~~Ld~~~~~~l~~~~-L~~~~~~~~v~-~-----~gg~tVkI~~i---~~~gl-~DY~R~~~ 62 (299) T protein:vir:79 1 MAALNYA-------KEYSNVLAQAYPYTLNFGD-LYATPNNGRYR-W-----TGSKTIEIPTI---STTGR-VDSNRDTI 62 (299) T ss_pred CccchhH-------HHHHHHHHHHHHhhceeee-eccCcccceee-e-----cCCCEEEEecc---ccccc-cccccCCC Confidence 9954443 7899999999888776653 55544433331 1 35899998833 33333 444332 Q ss_pred c-chhhhhhcccEEEEecccceeeccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhc-cccccccccccccc Q lcl|NC_019917. 79 G-TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNI--RRIARDRLGDYFYKFTDELLFIYLS-GARGINLDFVETPD 154 (364) Q Consensus 79 G-nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l~-ga~g~~~~~~~~~~ 154 (364) | +.++++....++.+||.|----.=..|+...+...+ -...+....+...-..|.-.|-.|+ ++.+. 
T Consensus 63 g~~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~--------- 133 (299) T protein:vir:79 63 AVAQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL--------- 133 (299) T ss_pred cccccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc--------- Confidence 2 344788889999999999322111455544443333 1122222333344455655555553 22110 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) +..++...++++.. ++.|+.+..+.+...-| .+-.+|+++|... T Consensus 134 --------------------g~~~~~~~~T~~n~--y~~i~~~~~~lde~~vP--------------~~~rvl~vtp~~~ 177 (299) T protein:vir:79 134 --------------------GNTADTTVLTTTNV--LEVFDKLMEKMTEARVP--------------ENGRILYVTPVVN 177 (299) T ss_pred --------------------CCcccccccCHHHH--HHHHHHHHHHHHhcCCC--------------CCCeEEEeCHHHH Confidence 11122233444443 68888887776655322 2347899999999 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcc-----cccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNV-----IRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~-----~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) .-|+.+. + ++|.-. -...+....|.+|.+|||.|+|.|.- +.|.++...+..+-...+||....+.++ T Consensus 178 ~~L~~~~--~---f~k~~~--~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~ 250 (299) T protein:vir:79 178 TLIKNAK--E---IQRTVN--IKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQIFMSLVHPSAIIT 250 (299) T ss_pred HHHhhch--h---hhcccc--cccccceeeeeeeeecceEEEEechhhcCccceeccCccccCcccccceEEEcCCeeee Confidence 9999643 4 444421 12345678999999999999996653 3344443333333345788888888888 Q ss_pred eecCCCCCcc----------ceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
310 YGTANGLRFD----------WEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 310 ~g~~~g~~~~----------w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) .-+....+.+ |.++ .-|.+-. -++++-=|+.+--..|++ T Consensus 251 ~~K~~~~~~~~P~~~~~~~~~~~~-r~y~d~~-------------v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 251 PVSYQFSKLDEPTAVTEGKYFYFE-ESFEDVF-------------ILNKKADAIQFVVEGAGA 299 (299) T ss_pred eEeeeeEEeecCCCCCccceeeee-eeeeeee-------------eeccccCeEEEEeeecCC Confidence 7664333332 2221 1111110 012333366554445555 No 51 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.98 E-value=1.6e-06 Score=52.41 Aligned_cols=301 Identities=13% Similarity=0.109 Sum_probs=157.2 Q ss_pred Cceeecc-cCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecC---CCCCceEEEEEeeccccC--c-eec Q lcl|NC_019917. 1 MTTTVIP-FGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELE---SDAGDTISFDLSVHLRGK--P-TYG 73 (364) Q Consensus 1 Ma~T~~~-~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~---k~~Gd~v~f~L~~~L~G~--g-v~G 73 (364) ||.|... .=-|+ ...|..-+.....+++.|.. +.+|+.-.+|. .+.|+.|++++..+|.|+ + |.+ T Consensus 1 Ma~T~l~D~iipe-~~vf~~Yv~~~~~e~~~l~q-------SGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~ 72 (349) T protein:vir:78 1 MAITTIGDIVTGN-IPVLASYMTEDPVEKTAFFD-------SGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSN 72 (349) T ss_pred CCceEEeeeeccC-HHHHHHHHHHhhHHhhhhhh-------ccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCC Confidence 9998854 22222 11344444444445554443 35666666665 478999999999999985 3 434 Q ss_pred Ccee-ecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_019917. 74 DART-EGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVET 152 (364) Q Consensus 74 d~~l-eGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~ 152 (364) |..- ..--+.+.-+++.-++=..-++.... 
-+++--|--|..+....++++||.+.....++..|.|.++.+..-... T Consensus 73 D~~~~~~t~~kitt~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~ 151 (349) T protein:vir:78 73 DVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDA 151 (349) T ss_pred CCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccch Confidence 3211 11234566677766665555555432 334444555889999999999999999999999999976533211100 Q ss_pred ccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEech Q lcl|NC_019917. 153 PDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEY 232 (364) Q Consensus 153 ~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~ 232 (364) ..+. +.|.+ ++.++..++.+.+-.|..+ ++.... --.+++.=+++||+. T Consensus 152 ~~~~------------~~~t~--------d~s~~a~~~~~~~~dA~~~--lgda~~---------Gd~~~~lt~i~mHS~ 200 (349) T protein:vir:78 152 YHEQ------------NDMVV--------DVSATLGFDAGAFIDATQT--MGDALM---------GNGGEVLGAIAMHSF 200 (349) T ss_pred hhhc------------cccee--------eeccccCCChhhhhhhHHH--HHHHhc---------cccccceeEEEEchH Confidence 0000 11111 1112223566665555432 221100 001245678999999 Q ss_pred hHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 233 QATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 233 q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) .+..|++.. -++..+. +++ ...++.|+|..+..--.++....+ ...+..++|+|..|. .|+. 
T Consensus 201 v~~~L~~~~---li~~i~~---s~~------~~~i~ty~G~~VivDD~~Pv~~~g----~~~~yttylfg~GAi--~~~~ 262 (349) T protein:vir:78 201 VYAQARKAQ---LIDFIRD---AEN------NTMFATYQGYRVIVDDSMTVVGQG----AQRKFISIIFGQGAI--GYGE 262 (349) T ss_pred HHHHHHhhh---hhhhccC---ccc------CcccceecCeEEEEeCCCccccCC----CCceEEEEEeecceE--EEcc Confidence 999999532 2222222 221 224788999999888777654322 233577899997765 4444 Q ss_pred CCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec--------cC Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK--------HS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~--------~~ 364 (364) . .+..-.|-..|--.+.+-+.+.+. ..-+|--.=+|+ ..+.++.+ -| T Consensus 263 ~--~~~~~~et~rd~~~g~~~G~d~l~--~R~~~~~hp~G~-s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:78 263 G--NPVMPLEYEREASRANGGGVETLW--TRKTWLLHPFGY-RFTSAVITGNGTETIARS 317 (349) T ss_pred C--CCccceeeecccccCCcceeEEEE--EeeEEEeeeeee-eeccccccCCccccccCC Confidence 2 232223433343222111111111 011111111222 11111100 11 No 52 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.84 E-value=5e-06 Score=49.65 Aligned_cols=302 Identities=13% Similarity=0.114 Sum_probs=157.8 Q ss_pred Cceeecc-cCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecC---CCCCceEEEEEeeccccC--c-eec Q lcl|NC_019917. 1 MTTTVIP-FGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELE---SDAGDTISFDLSVHLRGK--P-TYG 73 (364) Q Consensus 1 Ma~T~~~-~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~---k~~Gd~v~f~L~~~L~G~--g-v~G 73 (364) ||.|... .=-|+ ...|..-+.....+++.|.. +.+|+.-.+|. 
++.|+.|++++..+|.|+ + +.| T Consensus 1 Ma~T~l~D~iipe-~~vf~~Yv~~~~~e~~~l~q-------SGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~ 72 (349) T protein:vir:94 1 MAITTIGNIVTGN-IPVLASYMTEDPVEKTAFFN-------SGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSN 72 (349) T ss_pred CCceEEeeeeccC-hHHHHHHHHHhHHHhhhhhh-------ccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCC Confidence 9998854 21121 11344444444445554443 35676666665 478999999999999885 3 445 Q ss_pred Cceee-cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_019917. 74 DARTE-GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVET 152 (364) Q Consensus 74 d~~le-Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~ 152 (364) |...+ ..-..+...+|.-++=..-++.... -+++.-|--|..+....++++||.+.....++..|.|.++.+.-.... T Consensus 73 dt~~~~~t~~kit~~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~ 151 (349) T protein:vir:94 73 DVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDA 151 (349) T ss_pred CCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhccccccccc Confidence 54321 2234566666666555555554332 344544556889999999999999999999999999976532111110 Q ss_pred ccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEech Q lcl|NC_019917. 153 PDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEY 232 (364) Q Consensus 153 ~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~ 232 (364) ..+. +.|.+ .+.++..++.+.+-.|..+ ++.... --..+..=+++||+. 
T Consensus 152 ~~~~------------~~~~~--------d~~~~a~~~~~~~~~A~~~--~Gdaa~---------Gd~~~~lt~i~mHS~ 200 (349) T protein:vir:94 152 YHEQ------------NDMVV--------DVSATSGFDAGAFIDATQT--MGDALM---------GNGGEVLGAIAMHSF 200 (349) T ss_pred cccc------------CceeE--------EecccCCCChhhHHHHHHH--HHHHhc---------cccccceeEEEEchH Confidence 0000 11111 1223344666666666443 221100 001135678999999 Q ss_pred hHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 233 QATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 233 q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) .+..|++.. -++.-+ .+++ ...++-|+|..+..--.++....+ +..+..++|+|..|. .|+. T Consensus 201 v~~~L~~~~---li~~i~---~s~~------~~~i~ty~G~~VivDD~~Pv~~~g----~~~~yttylfg~GAi--~~~~ 262 (349) T protein:vir:94 201 VYAQARKAQ---LIDFIR---DAEN------NTMFATYQGYRVIVDDSMTVVGQD----TSRKFISIIFGQGAI--GYGE 262 (349) T ss_pred HHHHHHhcc---hhhhcc---Cccc------CcccceecCcEEEEeCCCccccCC----CCceEEEEEeecceE--Eeec Confidence 999999632 122222 1221 123688999988887777654222 223578999997765 4444 Q ss_pred CCCCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec-------cC Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK-------HS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~-------~~ 364 (364) .+ +..-.|-..|--.+.+-+.+.+. ..-+|--.=+|+=-..+.+.. 
-| T Consensus 263 ~~--~~~~~E~~rd~~~g~~~G~d~L~--~R~~~~~hp~G~s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:94 263 GN--PEMPLEYEREASRANGGGVETLW--TRKTWLLHPFGYSFTSAVITGNGTETIARS 317 (349) T ss_pred CC--CCcceeeecccccCCcceeEEEE--EeeEEEeeeeeeeecccccCCCccccccCC Confidence 22 22223433333222111112111 011111112222111111111 01 No 53 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.72 E-value=2.5e-05 Score=45.84 Aligned_cols=308 Identities=14% Similarity=0.096 Sum_probs=159.6 Q ss_pred Ccee----ecc---cCCchHH--HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce Q lcl|NC_019917. 1 MTTT----VIP---FGDPKAV--KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT 71 (364) Q Consensus 1 Ma~T----~~~---~~dp~a~--~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv 71 (364) |+.- ..+ ++++.+. ++|+.+++....+++-|.++ +.+|.++ .|.++.|+-+...+-... T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~-------~~vRti~-----~gkS~qf~~~G~s~~~~~ 68 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSY-------FDVQTVT-----GTNTVSNKYLGETELQVL 68 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhccc-------ceeeeec-----ccceEEEEEeeeeEeeee Confidence 8743 222 3555554 99999999999999988763 3455443 788999998887776666 Q ss_pred ecCceeecchhhhhhcccEEEEeccc---ceeeccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHh--cccccc Q lcl|NC_019917. 72 YGDARTEGTEENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHN-IRRIARDRLGDYFYKFTDELLFIYL--SGARGI 145 (364) Q Consensus 72 ~Gd~~leGnee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~l--~ga~g~ 145 (364) +-.+.+.| +........|.||... |.| ..+++-.+.+| +|.+-...+..-+++..||.++..+ ++ +.. 
T Consensus 69 ~pG~~ld~--~~~~~dK~~ItID~lL~a~~~V---~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa-~an 142 (401) T protein:vir:70 69 APGQSPAA--TSTQADKNQLVIDATVIARNTV---AHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGG-IAN 142 (401) T ss_pred cCCCCcCC--CCcccccEEEEeCceeehhhhh---hhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc Confidence 66666775 4567777789999875 555 46788888999 8999999999999999999774333 43 110 Q ss_pred cccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhc--ccCCCCCcceeeeEecCce Q lcl|NC_019917. 146 NLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMM--QAENPETANMVPVSIDGDD 223 (364) Q Consensus 146 ~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~--~~~~~~~~~i~Pv~~~g~~ 223 (364) ..+- .-||..-+. ...+-.++++... .-+...|..|+..|... .+..| .+ T Consensus 143 a~~~---------~~~p~~~~~-G~~i~v~~~~~~~------~~~~~~l~~ai~dA~~~LdEkdVP------------~~ 194 (401) T protein:vir:70 143 TQAK---------RTNPRVKGH-GFSINVEVAEGEA------LVNPQYVMAAVEFALEQQLEQEVD------------IS 194 (401) T ss_pred cccc---------ccCCCcCCC-ceEEecccccccc------ccCHHHHHHHHHHHHHHHHhcCCC------------cc Confidence 0000 001100011 1111111221111 12333333444433222 22212 13 Q ss_pred eEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccc-----------cCCcc Q lcl|NC_019917. 224 HYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDY-----------GAGAN 292 (364) Q Consensus 224 ~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~-----------~~~~~ 292 (364) -|++|+.|..+.-|.. .|.- +.+..... ..+..=+|.+++++||.|+|.++++.+... 
|...+ T Consensus 195 r~vvl~pp~~Ys~Ll~-~d~L---~nrd~~~s--~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~ 268 (401) T protein:vir:70 195 DVAILMPWRYFNVLRD-ADRI---VDKTYTIS--QSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYD 268 (401) T ss_pred ceEEEcCHHHHHHHHh-cCcc---cchhhccc--cCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCC Confidence 4777777777766763 3311 12222111 124455788999999999999999764321 22222 Q ss_pred c----ccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC------cccEEEE----Ee- Q lcl|NC_019917. 293 V----EAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS------KDFGVIS----ID- 357 (364) Q Consensus 293 v----~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~------~DfGvi~----id- 357 (364) + .-.++++.=..|++..=.. .-+.-+|.|+. .. -.-|-....+|..-.|-.- ++-||+- .| T Consensus 269 ~~~d~s~~~~v~f~~~Av~tvk~~-~lt~~~~~d~r-~~--~~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~ 344 (401) T protein:vir:70 269 PLPAMNGAIAVLFTADALLVGRSI-DVTGDIFYEKK-EK--TYYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDG 344 (401) T ss_pred CCccccceeEEEEehhheEEEEee-ccccchhhhhh-hh--HHHHHHHHHhCCcccchhheEEEeecCcccccccccCCc Confidence 2 2344555555555443222 11122344431 11 1112234444544444310 1111100 00 Q ss_pred --e---eeeccC Q lcl|NC_019917. 358 --T---AAKKHS 364 (364) Q Consensus 358 --t---a~~~~~ 364 (364) + -+.+.+ T Consensus 345 ~~~~~~~~~~~~ 356 (401) T protein:vir:70 345 AQHTIVKNRAQR 356 (401) T ss_pred chhhhhhhhccc Confidence 0 000111 No 54 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.68 E-value=1.2e-05 Score=47.64 Aligned_cols=294 Identities=13% Similarity=0.055 Sum_probs=143.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceee-- Q lcl|NC_019917. 
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTE-- 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le-- 78 (364) ||-+.+ ...+|++.+...-.+..-|.+ ++=+.-. .|+.-..||+|+++...........-....+ T Consensus 1 Ma~~~~------~p~~~a~~~l~~l~~~lv~~~-lv~~~~~------~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~ 67 (392) T protein:vir:99 1 MANAFS------KPTAVVDTAIQMLQNELILTN-LVWLNGI------GDFAHKFNDTITVRVPAPSRGHTRKLRGAGAER 67 (392) T ss_pred Cccccc------cHHHHHHHHHHHHHhhccchh-hhccccc------cccccCCCCeEEEeecccccceeeeccccccCC Confidence 996553 457999988766655555543 4322111 2443357999999876665544433222222 Q ss_pred -cchhhhhhcccEEEEeccc-ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 79 -GTEENLRFYTDQVKIDQVR-HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 79 -Gnee~L~~~~~~v~Idq~R-~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) ...+++.-...++.||+.+ +++.+.+ .+.-....|++++.-+....=+++..|+.++.-++++... T Consensus 68 ~~~~~~~~~~~~~~~id~~k~~~~~i~d-~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~----------- 135 (392) T protein:vir:99 68 NLTVSDFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE----------- 135 (392) T ss_pred cccccccccceEEEEEeeeeecceeech-HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------- Confidence 2345677788899998887 5566654 3555578899888878788888888898888777763110 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) + .+ ....++. ...++.|-.|..++.... . | ++ .+++++|..... 
T Consensus 136 ----~----~~-----------~~~~~~~--~~~~~~i~~a~~~L~~~~--v-------P-----~~-R~~vv~p~~~~~ 179 (392) T protein:vir:99 136 ----A----AG-----------AVHEVAP--DEFFKGVNGARRALNELY--I-------P-----QG-RVLVVGTAVTEQ 179 (392) T ss_pred ----c----cc-----------cccccCh--hhhHHHHHHHHHHHhhcC--C-------C-----CC-CEEEEcHHHHHH Confidence 0 00 0001111 123555666655443332 1 2 12 366788999999 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCC--cccccch--heeeccchheEeeec Q lcl|NC_019917. 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAG--ANVEAAR--ALFMGRQAGVIAYGT 312 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~--~~v~v~r--alllGaqA~~~A~g~ 312 (364) |+.+ +.+...+..- ......+-.|.+|.+.|+-+++.+++......... +-..+.+ ....|+.... ++.. T Consensus 180 l~~~--~~~~~~~~~g---~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~-s~s~ 253 (392) T protein:vir:99 180 ILND--DRFIKYESQG---QSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRST-AISG 253 (392) T ss_pred Hhcc--cceeeccccc---chhhhhhhcceeeeeeeeEEEeecccccccceeeecccccccccccccccccccee-EEec Confidence 9864 4544333321 01113366799999999999999887543221110 0000000 1112221111 1111 Q ss_pred CCCCCccceechhhccchhH---HHHHHHHhhhhcc---cCC--cccEEEEEeeeeeccC Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPA---ICAGFIAGMKKAR---FNS--KDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~---i~i~~i~G~~K~r---f~~--~DfGvi~idta~~~~~ 364 (364) .+.....|. .||..... ..+....|...+. ... 
....+-+....+.+.+ T Consensus 254 ~~~v~~~~~---~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) T protein:vir:99 254 DQRIAMRWL---VDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) T ss_pred ccceeccee---ecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeee Confidence 111122232 23332211 1122222221111 000 0011111111111111 No 55 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.60 E-value=1.4e-05 Score=47.28 Aligned_cols=291 Identities=10% Similarity=0.029 Sum_probs=137.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||-|. .. ...+.|++++...-.++.-|. +++=+.-..-|. ..+.||+|+++......-.-....+...-+ T Consensus 1 MAN~l-lT---~iP~iia~~al~~l~~~lV~~-~lV~r~y~ge~~-----~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~ 70 (423) T protein:vir:35 1 MANNL-ES---NISQIVLKKFLPGFMSDIVLC-KTVDRQLLSGEI-----NSNTGDSVSFKRPHQFKSERTETGDITGKD 70 (423) T ss_pred Cccch-hh---hhHHHHHHHHHHHHHhhcccc-hhcccCCCcccc-----cccCCCEEEEeeCCcceeecccCcCCCCcc Confidence 99332 11 235789988876665555544 454222111111 235799999997765532211111111223 Q ss_pred hhhhhhcccEEEEecccc-eeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRH-PVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~-~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+.+.-.+-.|.||+..+ ++.+.++ ++.-..-||.+..++... -+++..|+.++..|...-. 
T Consensus 71 ~~~~~e~~v~l~id~~k~~a~~v~d~-e~~l~i~~~~~~l~~a~~-ala~~vd~~l~~~l~~~a~--------------- 133 (423) T protein:vir:35 71 KNGLFSAKATGKVGKYITVAVEWTQI-EEALKLNQLDQILSPIHE-RMVTDLETELAHFMMNNGA--------------- 133 (423) T ss_pred ccccccceeeEEeccceeccceeCHH-HHHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhccc--------------- Confidence 466666677899999987 7877654 222245577666666654 4566667777655543100 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |-+-.|+ +..-.++.|..|..++.... .|. .+ -.+++.|.-...|.. T Consensus 134 -~~vgt~~------------------t~~~~~~~i~~a~~~Ld~~~--vP~-----------~~-R~~Vv~p~~~a~Ll~ 180 (423) T protein:vir:35 134 -LSLGSPN------------------TAIKKWADVAQTASFIKDIG--IKT-----------GE-NYAIMDPWSAQRLAD 180 (423) T ss_pred -ccccccc------------------CCcchHHHHHHHHHHHHHhc--CCc-----------CC-CEEEeCHHHHHHHhc Confidence 1000011 01123666777665554332 222 12 567889998888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCe-EEEcCEEEEecCcccccccccCCcccccchheeeccchheE-eeecCCCCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGL-GMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVI-AYGTANGLR 317 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~-g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~-A~g~~~g~~ 317 (364) ++..+...++. ...-|=.|.+ |.+.|+-+++.++++...++..++.+-+.++... .++..- +-+...+.. T Consensus 181 -~~~~~~~~~~~------~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v-~~~a~~~~~~~~~~~~ 252 (423) T protein:vir:35 181 -AQSGLHAADQL------VRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNV-DYLSVKDSYQFTVALT 252 (423) T ss_pred -cccceeccccc------hhHHHhhccceeeecceEEEEcCCCccccccccccceeecccccc-ccccccccccceeeee Confidence 23333222221 1223556776 9999999999999987766554443333322211 010000 000001111 Q ss_pred ccceech--hhccchhHHHHHHHHhhh------hcccCCcccE-----EEEEeeeeeccC Q lcl|NC_019917. 
318 FDWEETV--KDYGNEPAICAGFIAGMK------KARFNSKDFG-----VISIDTAAKKHS 364 (364) Q Consensus 318 ~~w~Ee~--~D~g~~~~i~i~~i~G~~------K~rf~~~DfG-----vi~idta~~~~~ 364 (364) -.|.... .--|+ +-+|-|++ |-++..-|.+ +++=|+...+++ T Consensus 253 ~~~~~~~g~l~~GD-----~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g 307 (423) T protein:vir:35 253 GATPSKTGFLKAGD-----QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASG 307 (423) T ss_pred eeeeccCCcEEecc-----eEEeeeeeeccccccceeecccCCceeEEEEeccccccccC Confidence 2233211 11122 22334422 2222111211 111111111111 No 56 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.40 E-value=7.6e-05 Score=43.17 Aligned_cols=280 Identities=13% Similarity=0.036 Sum_probs=147.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc-cCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR-GKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~-G~gv~Gd~~leG 79 (364) ||.=. |.=+..+.|++++...-.+...|.+ ++-+.-.. |.. ++||+|+++....+. .++ ..+. T Consensus 1 m~~~~---N~~ltp~iia~~~l~~l~~~lV~~~-lv~r~y~~------e~~-~~GDTV~I~vp~~~~v~dg----~~~~- 64 (418) T protein:vir:10 1 MAVQD---NNLLTDDVIAKEALRLLKNNLVMAK-CVYRNYEK------TFG-KVGDTIRLKLPYRVKSASG----RTLV- 64 (418) T ss_pred CCccc---cccccHHHHHHHHHHHHHHhccchh-hhcCCCch------HHh-hCCCEEEEeeCCceeeccc----CCcc- Confidence 87522 2223458999999887777776654 66443222 222 479999998755553 111 1122 Q ss_pred chhhhhhcccEEEEeccc-ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVR-HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R-~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+++.-.+-.|.||+.. 
+++.+.++ ++..+.-||++..-+....=+++..|+.++..+.++- T Consensus 65 -~~~~te~~v~l~id~~k~~~~~itD~-e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~--------------- 127 (418) T protein:vir:10 65 -KQPMVDQTIPFKIAYQEHVGLEYTVK-DKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAF--------------- 127 (418) T ss_pred -ccccccceEEEEEecccccceeechH-HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 45666677789998877 56777653 3344567888777666667777777777765555421 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) |.+..+. + . .-.++.+-.+..++.... .|. +.-..+++.|.....|. T Consensus 128 --~~~gt~g----------t-----~---~~~~~~i~~a~~~Ld~~~--VP~-----------~G~R~lVv~P~~~~~L~ 174 (418) T protein:vir:10 128 --HSSGTPG----------V-----R---PGAFIDFANAGAKQTTYA--VPQ-----------DGMRHAVLDPFTCASLS 174 (418) T ss_pred --cccccCC----------c-----C---cchHHHHHHHHHHHHhcC--CCC-----------CCceEEEeCHHHHHHHh Confidence 1100000 0 0 012555555554443232 232 22345669999999997 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) .+. .+ .+. ..+....|-.|.+|.+.|+-|++.++++....+..+++. 
+..|+++-+-+.+..+ T Consensus 175 ~~~--~~-----~~~-~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~-----~v~ga~~~~~~~~~~~---- 237 (418) T protein:vir:10 175 DEV--TK-----LFK-ESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTP-----LVNGTVVNGDTVGFDG---- 237 (418) T ss_pred hhc--cc-----ccc-ccccchhhheeeeeeeeceEEEEecCCCcccccccccce-----eeecccccceeEEEee---- Confidence 532 22 111 122334577899999999999999999866554433322 2334433222221111 Q ss_pred cce--echhhccchhHH-HHHHHHhhhhcccCC-cccEEEEEe--eeeeccC Q lcl|NC_019917. 319 DWE--ETVKDYGNEPAI-CAGFIAGMKKARFNS-KDFGVISID--TAAKKHS 364 (364) Q Consensus 319 ~w~--Ee~~D~g~~~~i-~i~~i~G~~K~rf~~-~DfGvi~id--ta~~~~~ 364 (364) .|. +-..--|+...| ++..+.++.|-.-.. +-| |+..| +.+..+. T Consensus 238 ~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f-~V~~~~~~~~~~~~ 288 (418) T protein:vir:10 238 GTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEF-VVLEDVDTDAGGAG 288 (418) T ss_pred cceeeccceeeccEEEECceeecccccccccccceEE-EEEeeccccccCcc Confidence 111 111222333222 234444454444432 345 33333 3333333 No 57 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=96.78 E-value=0.00035 Score=39.56 Aligned_cols=289 Identities=9% Similarity=0.008 Sum_probs=132.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||-| +... ..+.|++.+...-.+...|. +++=+.-..-|.. 
...||+|+++......-.-..+..--.-+ T Consensus 1 MaN~-llT~---~p~iia~~aL~~l~~~lV~~-~lVnr~y~~ef~~-----~k~GDTV~I~~p~~~~~~d~~~~~~~~~~ 70 (423) T protein:vir:10 1 MPNN-LDSN---VSQIVLKKFLPGFMSDLVLA-KTVDRQLLAGEIN-----SSTGDSVSFKRPHQFSSLRTPTGDISGQN 70 (423) T ss_pred Cccc-hhhh---hHHHHHHHHHHHHHhhcccc-hhhcccCCCcccc-----cccCCEEEEeeCCceeeeccCCccccccc Confidence 8833 2222 34679988876666665554 3543222111111 24899999987666543333322111124 Q ss_pred hhhhhhcccEEEEecccc-eeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRH-PVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~-~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) -++|.-.+-.|.||+..+ ++.+..+ ++....-||- ..-.....=+++..|+.++..+.+. +. T Consensus 71 ~~dl~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~-~~l~~A~~aLA~~vd~~ia~~~~~~-~~-------------- 133 (423) T protein:vir:10 71 KNNLISGKATGRVGNYITVAVEYQQL-EEAIKLNQLE-EILAPVRQRIVTDLETELAHFMMNN-GA-------------- 133 (423) T ss_pred cCccccceeEEEeeceeeeeeeechH-HHhcChhhHH-HHHHHHHHHHHHHHHHHHHHHHhhc-cc-------------- Confidence 678888888999999987 7777543 2222334452 2222223446666777666544431 00 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |.+..|. .. .-.++.+-.+..++... ..|. .+ -.+++.|.-...|.. 
T Consensus 134 -~~~gt~~--------t~----------~~a~~~i~~a~~~Ld~~--~vP~-----------~~-R~~Vv~p~~~a~Ll~ 180 (423) T protein:vir:10 134 -LSLGSPN--------TP----------ITKWSDVAQTASFLKDL--GVNE-----------GE-NYAVMDPWSAQRLAD 180 (423) T ss_pred -cccccCC--------cc----------cchHHHHHHHHHHHHhc--cCCc-----------CC-CEEEeChHHHHHHhc Confidence 0000000 00 01255555554444322 2232 23 345899999888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCe-EEEcCEEEEecCcccccccccCCcccccchhee--eccchheEeeecCCCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGL-GMINNVVLHKHRNVIRFNDYGAGANVEAARALF--MGRQAGVIAYGTANGL 316 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~-g~~ngvii~e~~~~~~~~~~~~~~~v~v~rall--lGaqA~~~A~g~~~g~ 316 (364) + +......+ .+...-|=.|.+ |.+.|+-+++.++++...++..+....+.++-. .+++..+.. ...+. T Consensus 181 ~-~~~~~~~~------~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~--~~~~~ 251 (423) T protein:vir:10 181 A-QTGLHASD------QLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQ--FTVTL 251 (423) T ss_pred c-ccceeccc------ccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccce--eeeee Confidence 3 32322112 122233556776 999999999999998776554433222222111 112111100 00001 Q ss_pred Ccccee--chhhccchhHHHHHHHHh------hhhcccCC------cccEEEEEeeeeeccC Q lcl|NC_019917. 317 RFDWEE--TVKDYGNEPAICAGFIAG------MKKARFNS------KDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~E--e~~D~g~~~~i~i~~i~G------~~K~rf~~------~DfGvi~idta~~~~~ 364 (364) .-.|.. -..--|+ +-.|-| +.|-.+.. 
+-|-|++ |.-+.+++ T Consensus 252 ~~~~~~~~~~l~~GD-----~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a-~~~~~~~g 307 (423) T protein:vir:10 252 TGATASVTGFLKAGD-----QVKFTNTYWLQQQTKQALYNGATPISFTATVTA-DANSDSGG 307 (423) T ss_pred eeccccccCceeecc-----eEEecceeeecccccccccccccCcceEEEEEe-eeeeccCC Confidence 111211 0111122 112333 22222211 2333332 21111111 No 58 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=96.65 E-value=0.00044 Score=39.01 Aligned_cols=286 Identities=10% Similarity=0.043 Sum_probs=142.9 Q ss_pred Cc-eeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeec Q lcl|NC_019917. 1 MT-TTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma-~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG 79 (364) ++ .++...+-......++..++....+..+-+.++. + ++ ...|+...+.....-...+|...... T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~-~----~~-------~~~g~~~~~~~~~~~~a~~v~Eg~~~-- 314 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFA-R----QV-------VATGDVWHGVSSAAVQWSWDAEFEEV-- 314 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhc-c----cc-------cCCcceEEEEecCCcceeecccCccc-- Confidence 22 2222233345567777777766665543333321 0 00 01222211111111122333333333 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..++|..-++.+.....-|.+..++-+ -+ .||-..-...|..-+....|+.+| .|+ ++ .....|++ T Consensus 315 ~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~~~~~i~~~l~~~~~~~~d~ail---~G~------Gt-~~~p~Gi~ 382 (543) T protein:vir:81 315 SDDSPEFGQPEIPVKKAQGFVPISIEALQ-DE-ANVTETVALLFAEGKDELEAVTLT---TGT------GQ-GNQPTGIV 382 (543) T ss_pred cccccccceeeeeeeeeEeeehhhHHHHh-cc-HHHHHHHHHHHHHHHHHHHHHHHh---ccC------CC-Ccccccch Confidence 25567777777777777777777666554 24 699999999999999999999886 331 10 01233332 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .++ +. ......-.+++.++++.+.++.......- ...-+++|+|.-+..|++ T Consensus 383 ~~~----~~--------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~----------------~~~~~~v~n~~~~~~l~~ 434 (543) T protein:vir:81 383 TAL----AG--------TAAEIAPVTAETFALADVYAVYEQLAARH----------------RRQGAWLANNLIYNKIRQ 434 (543) T ss_pred hhc----cc--------ccccccccccccccHHHHHHHHHhhhccc----------------cCCcEEEEcHHHHHHHHH Confidence 211 00 00111111344567777766654332110 122357899998888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCee----ecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIF----KGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF----~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) -.| +..+||| .|.-+++.|.+++....++.......+.+. ..+++|-=.. 
+.++..++ T Consensus 435 lkd--------------~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~---~~i~~gd~~~-~~i~~~~~ 496 (543) T protein:vir:81 435 FDT--------------QGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADN---FVLLYGNFQN-YVIADRIG 496 (543) T ss_pred hhc--------------CCCceeccCcCCCCCccccceeeEEeccccccccccccCCc---ceEEEeeccc-eeEEeecc Confidence 222 1224555 466678999999999888766544333322 1356665332 33344445 Q ss_pred CCccceechh-hcc---chhHHHHHHHHhhhhcccCCcccEEEEEeeee Q lcl|NC_019917. 316 LRFDWEETVK-DYG---NEPAICAGFIAGMKKARFNSKDFGVISIDTAA 360 (364) Q Consensus 316 ~~~~w~Ee~~-D~g---~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~ 360 (364) +...+..+.+ ++. +.+.+-+...+|++.. +.+-|-++.+-|+| T Consensus 497 ~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~--~~~A~~~l~~~~~a 543 (543) T protein:vir:81 497 MTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVV--NPNAFRLLNVETAS 543 (543) T ss_pred cEEEEeccccccchhhcCceEEEEEEeeccEee--cccceEEEEecccC Confidence 5555554421 111 1122222222332221 24557666666666 No 59 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=96.51 E-value=0.00056 Score=38.41 Aligned_cols=270 Identities=10% Similarity=0.014 Sum_probs=146.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCcee--e Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDART--E 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l--e 78 (364) .|.-++-.|..--.++|+..|-....+..+-.+ +.. |.-++- ..|++|.++-+. ..++ +|-.. 
- T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~-~~~---N~~~e~------~gg~tVkIp~i~---~~gl-~DY~R~~g 84 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTP-ALI---SNDAIF------MEGRSFTVMKGD---TTEL-KDYKRNAT 84 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhh-ccc---CcceEe------ccCcEEEEeeec---cccc-ccccCCCC Confidence 455556666666667788777654444433322 221 221211 368999987443 2333 23221 1 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNI--RRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) .+.++++....++.+||.|----.=..|+..-+..++ -........+......|.-.|..|++.-+. T Consensus 85 ~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----------- 153 (319) T protein:vir:97 85 NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----------- 153 (319) T ss_pred cccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----------- Confidence 2356788999999999998332222567777776665 233344455555556776666666642110 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) .++..++++. .++.|+.+..+.+.... . + -.|||+.|....- T Consensus 154 ---------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~V-------------P-~-~Rvl~Vtp~~~~~ 195 (319) T protein:vir:97 154 ---------------------HLTVGTGSDA--QYDAVLDVSVELDEIKA-------------P-E-NRVLFVSPTFYKG 195 (319) T ss_pred ---------------------ccccccCHHH--HHHHHHHHHHHHHhcCC-------------C-C-CcEEEeCHHHHHH Confidence 0111222222 37888888776654431 1 2 2578999999999 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 
237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) |+.+. .+ ++.. ......+..|.+|.+|||.|++.|.. ++ . ...+++|......+.-+..-+ T Consensus 196 L~~~~--~f---~~~~---~~~~~~~~~g~Vg~idG~~Vi~vps~-~~------k----~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:97 196 IKKFV--IA---LPQG---DTRQQVLGKGVQGELDGFVIVKVPTK-LL------Q----GLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred HHhhh--hh---hccc---cccccceeeeeceeecCeEEEEeccc-cc------c----cceEEEEcCCeeeeeeeeeee Confidence 98643 33 3322 11245789999999999999997653 11 1 125677766554443332222 Q ss_pred Cccce-echhhccchhHHHHHHHHhhhhccc------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 317 RFDWE-ETVKDYGNEPAICAGFIAGMKKARF------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~-Ee~~D~g~~~~i~i~~i~G~~K~rf------~~~DfGvi~idta~~~~~ 364 (364) +.+-. +.. . .+.+.| ..| +++--||+++=.+.++.. T Consensus 257 ~~~~p~~~~------~---a~~v~g---r~y~d~~V~~~k~~~Iy~~~~~~~~~~ 299 (319) T protein:vir:97 257 KTNSNIPGM------F---GTLAEQ---LLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) T ss_pred eccCCCccc------c---ceeeee---eeeeeeEEeccccceEEEeecCCcccC Confidence 22211 111 1 123333 333 234446665544444433 No 60 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=96.51 E-value=0.00056 Score=38.41 Aligned_cols=270 Identities=10% Similarity=0.014 Sum_probs=146.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCcee--e Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDART--E 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l--e 78 (364) .|.-++-.|..--.++|+..|-....+..+-.+ +.. |.-++- ..|++|.++-+. ..++ +|-.. 
- T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~-~~~---N~~~e~------~gg~tVkIp~i~---~~gl-~DY~R~~g 84 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTP-ALI---SNDAIF------MEGRSFTVMKGD---TTEL-KDYKRNAT 84 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhh-ccc---CcceEe------ccCcEEEEeeec---cccc-ccccCCCC Confidence 455556666666667788777654444433322 221 221211 368999987443 2333 23221 1 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNI--RRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) .+.++++....++.+||.|----.=..|+..-+..++ -........+......|.-.|..|++.-+. T Consensus 85 ~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----------- 153 (319) T protein:vir:94 85 NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----------- 153 (319) T ss_pred cccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----------- Confidence 2356788999999999998332222567777776665 233344455555556776666666642110 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) .++..++++. .++.|+.+..+.+.... . + -.|||+.|....- T Consensus 154 ---------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~V-------------P-~-~Rvl~Vtp~~~~~ 195 (319) T protein:vir:94 154 ---------------------HLTVGTGSDA--QYDAVLDVSVELDEIKA-------------P-E-NRVLFVSPTFYKG 195 (319) T ss_pred ---------------------ccccccCHHH--HHHHHHHHHHHHHhcCC-------------C-C-CcEEEeCHHHHHH Confidence 0111222222 37888888776654431 1 2 2578999999999 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 
237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) |+.+. .+ ++.. ......+..|.+|.+|||.|++.|.. ++ . ...+++|......+.-+..-+ T Consensus 196 L~~~~--~f---~~~~---~~~~~~~~~g~Vg~idG~~Vi~vps~-~~------k----~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:94 196 IKKFV--IA---LPQG---DTRQQVLGKGVQGELDGFVIVKVPTK-LL------Q----GLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred HHhhh--hh---hccc---cccccceeeeeceeecCeEEEEeccc-cc------c----cceEEEEcCCeeeeeeeeeee Confidence 98643 33 3322 11245789999999999999997653 11 1 125677766554443332222 Q ss_pred Cccce-echhhccchhHHHHHHHHhhhhccc------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 317 RFDWE-ETVKDYGNEPAICAGFIAGMKKARF------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~-Ee~~D~g~~~~i~i~~i~G~~K~rf------~~~DfGvi~idta~~~~~ 364 (364) +.+-. +.. . .+.+.| ..| +++--||+++=.+.++.. T Consensus 257 ~~~~p~~~~------~---a~~v~g---r~y~d~~V~~~k~~~Iy~~~~~~~~~~ 299 (319) T protein:vir:94 257 KTNSNIPGM------F---GTLAEQ---LLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) T ss_pred eccCCCccc------c---ceeeee---eeeeeeEEeccccceEEEeecCCcccC Confidence 22211 111 1 123333 333 234446665544444433 No 61 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.20 E-value=0.00088 Score=37.32 Aligned_cols=271 Identities=10% Similarity=0.004 Sum_probs=132.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee---ccccCceecCcee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV---HLRGKPTYGDART 77 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~---~L~G~gv~Gd~~l 77 (364) |+.+....+.......++..+.......+++.. +++.-. +. +...++.... .-...+|-..+. 
T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~-~~~~~~---------~~---~~~~~~~~~~~~~~~~a~~v~Eg~~- 156 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQN-LITVEP---------VT---TLSGSRVFKKRSQQTGFVEVAEGAA- 156 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhh-hceeee---------cc---CCceeEEEEeecCCcceeeeccccc- Confidence 776666667677778888888777767666543 433100 00 1111111111 111112211111 Q ss_pred ecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 78 EGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 78 eGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) .......+|..-++.+....+-+.+..++- +-+.+||...-.+.|.+=+....|+.++.- .|+ T Consensus 157 ~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g-~g~--------------- 219 (371) T protein:vir:81 157 IGEKATPQFTLLQYQVKKYAGFFRVTNELL-NDSTEAIVNTLVRWIGDESRVTRNGLIINV-LNT--------------- 219 (371) T ss_pred cccccccceeeEEeeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHhh-ccc--------------- Confidence 111234566777777777777676654433 235678888888888887887777766532 111 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) .+|+ ...+.+.|..+.... +... .+ ..=+.+|||.-+..| T Consensus 220 ------~~~~-------------------~~~~~~~i~~~~~~~-l~~~-------~~-------~~a~~vmn~~~~~~L 259 (371) T protein:vir:81 220 ------KAKT-------------------AIADLDGLKQIINVQ-LDPV-------FR-------STSSVIVNQDAFNWL 259 (371) T ss_pred ------cccc-------------------ccccHHHHHHHHHhh-cchh-------hh-------cCCEEEEcHHHHHHH Confidence 0111 112222232222111 1100 00 112467899988888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 
238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++-.| +..+|||. |..+.+.|.+++....++....+..+.... .-.+++|-=.-.+-++- T Consensus 260 ~~lkd--------------~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~-~~~i~~Gd~~~~~~~~~ 324 (371) T protein:vir:81 260 DTLKD--------------QNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQ-FAPIIVGDLKEAVVMFD 324 (371) T ss_pred HHhhc--------------cCCCeeeecccCCCCCceecceeEEEecccccCccccccccCC-cceEEEEehhceEEEEe Confidence 74222 22367775 445799999999988887665443222221 33466773221122222 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeee Q lcl|NC_019917. 313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTA 359 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta 359 (364) ..++...|.++..|+ .+.+.+-+...+|.+-. +.+-|-++.+-+| T Consensus 325 ~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~--~~~a~~~~~~~~A 371 (371) T protein:vir:81 325 RQRTEIMSSNVAMDAFETDATLWRAIERMDVKMR--DDEAFVFGEVQLA 371 (371) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEEecC Confidence 345666777665443 23333333333332211 1233444443333 No 62 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=96.11 E-value=0.00099 Score=37.05 Aligned_cols=280 Identities=12% Similarity=0.088 Sum_probs=132.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc--cCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR--GKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~--G~gv~Gd~~le 78 (364) |..|+-..|-......|+..+.......++..+ +.- ++. + ..|..+.+....... 
+..|...+ + T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~-~~~--------~~~-~--~~~~~~~~~~~~~~~~~~~~v~E~~--~ 182 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIAS-VAQ--------ILT-T--SDGRTMEWATADGTSEVGVLLGENE--E 182 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhh-hce--------eee-c--CCCceEEEEeeccCccccccccccc--c Confidence 666665666667788899888877766665532 211 000 0 011122222222111 22332222 2 Q ss_pred cchhhhhhcccEEEEecc-cceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQV-RHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~-R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) -.+....|..-++..-.. -+-|.+..++-+. +.+||...-+..|++=+....|+.++.- +|+.. .....+ T Consensus 183 ~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d-s~~~l~~~i~~~la~a~~~~~~~a~l~G-~G~~~-------~~~p~G 253 (409) T protein:vir:45 183 AGEEDTDFGMGSLGALKMTSKIIRVSNELLQD-SAIDMEAYLARRIAERIGRGEARYLIQG-TGAGT-------PKQPKG 253 (409) T ss_pred ccccccccceeeeeeeeeeeeehhhhHHHHhc-cHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCC-------ccccce Confidence 234455555545544333 2445454444333 5678888888888888888887776621 22110 001123 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) +..+. 
+......+++.++.+.|.++.......- .....+++++||..+..| T Consensus 254 il~~~---------------~~~~~~~~~~~~~~d~i~~l~~~l~~~~--------------~~~a~~~~~~n~~~~~~l 304 (409) T protein:vir:45 254 LAASV---------------TGTTQTAAANAVKWQEILALKHSIDPAY--------------RRGPKFRLAFNDNTLKLI 304 (409) T ss_pred eeecc---------------ccccccccccccchHHHHHHHHhhhhhh--------------ccCCeEEEEECHHHHHHH Confidence 22111 1112223455677776766544332110 112468899999888887 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++= |. +..+|||. |.-+.+.|.+++....++-. +++. ..+++|=-.-. ..+- T Consensus 305 ~~l---------kd-----~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~---~~~~-----~~i~~Gd~~~~-~i~~ 361 (409) T protein:vir:45 305 SEM---------ED-----GQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDI---GAGK-----KFMFCGDFDRF-IIRR 361 (409) T ss_pred HHh---------hc-----CCCceeeccCcCCCCCceecceeeEEecCcCCc---cCCc-----cEEEEeehhhh-heee Confidence 641 21 23467775 44468889999887776421 1111 13555542211 1122 Q ss_pred CCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeec Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKK 362 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~ 362 (364) .+++...+..+.+--.+.+.+-+. .||. 
.+-|-++.+-+++.+ T Consensus 362 ~~~~~~~~~~d~~~~~~~~~~~~~-------~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 362 VRYMILKRLVERYAEYDQTGFLAF-------HRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccceEEEEeecccccCCcEEEEEE-------EEeccEeechhheEEEEeccCCCC Confidence 234445544432211111111111 1332 234455554444444 No 63 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=96.08 E-value=0.00082 Score=37.51 Aligned_cols=271 Identities=10% Similarity=0.040 Sum_probs=128.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCce-ecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPT-YGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv-~Gd~~leG 79 (364) .+.+....+.......++..+.......++..+ ++.. .-. .+.++++.....-...+. ..++.-+= T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~-l~~~---------~~~---~~~~~~~~~~~~~~~~~~~~~~E~~~~ 180 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKE-HCHV---------IPV---NRNAGKMPVRAGASVDKLANLAKDTEL 180 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhh-hcee---------eec---cCCceEEEEeecCCccceeeccccccc Confidence 122222334445667777777666655555433 2210 000 111222222111111111 01111111 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..++|..-++.+....+-|.+..+|-+ -+.+||...-++.|.+-+....|..++..+.|.. 
T Consensus 181 ~~s~~~f~~i~~~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~---------------- 243 (421) T protein:vir:13 181 VKAMLKTQPMAYDIDDYGLLAPIDNSLLE-DSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVL---------------- 243 (421) T ss_pred cccccceeEEEeeeeeeEeehhhhHHHHh-hhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcc---------------- Confidence 24456777777777777777766555433 2567777776777766666666655543332210 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +.+...+.+-|.++...+.... ...-+++|||.-+..|++ T Consensus 244 ------------------------~~~~~~~~d~i~~~~~~l~~~~----------------~~~a~~v~n~~~~~~l~~ 283 (421) T protein:vir:13 244 ------------------------AEETINDYAGLVKTINSLVPNA----------------RKRAIIVTNSDGRAYLDG 283 (421) T ss_pred ------------------------ccccccchHHHHHHHHHhhhhh----------------cCCCEEEEcHHHHHHHHH Confidence 0011123445555554443221 112356889988888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) - |.+ ..+|||. |.-+.+.|.+++..+.+.- +.++ .-.+++|--.-++.++-.++ T Consensus 284 l---------kd~-----~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~----~~~~----~~~~~~gd~~~~~~~~~~~~ 341 (421) T protein:vir:13 284 L---------MDK-----QGRPLLKELSDGGDLVFKGRPVIELEESIF----DVGD----ETKFIVSDFKTLIKFMDRKQ 341 (421) T ss_pred h---------hcC-----CCceeecCcCCCCCceecceeeEEeccccc----cCCC----ceEEEEEeccccEEEEEecc Confidence 2 222 3357774 5557888999987766542 1111 12466665433233333455 Q ss_pred CCccceechhhccc-hhHHHHHHHHhhhhcc------cCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
316 LRFDWEETVKDYGN-EPAICAGFIAGMKKAR------FNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 316 ~~~~w~Ee~~D~g~-~~~i~i~~i~G~~K~r------f~~~DfGvi~idta~~~~~ 364 (364) ....|..+. +|.+ .+.+-+...++.+... |.-..+++++-++-+++-| T Consensus 342 ~~v~~~~~~-~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 342 YLIDQSKEA-GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred eEEEeeccc-ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCCC Confidence 666665442 2221 1222222222322221 2234667766665555554 No 64 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=273 Identities=10% Similarity=0.029 Sum_probs=136.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) |..++...+.......++..+.....+.++... +++. .-+....|.-..+.+... ....+|-.+... . T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~-~ 184 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-YVRV---------ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI-P 184 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHh-hcce---------eeccCCcceEEEEeecCCccceeeecCcccc-c Confidence 665655556667788899998888877777654 4321 111111111111111110 011222211111 0 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ......|.+-++.+....+-+.+..++-+ -+.+||...-++.|.+.+....|+.++.- .|. 
T Consensus 185 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~~~~~~d~~il~g-~g~----------------- 245 (404) T protein:vir:39 185 DLDNPRLTIIKYLIKRYAGIITATNTLLK-DTAENILAWLSSWIAKKVVVTRNQAIIAA-MGT----------------- 245 (404) T ss_pred cccccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc----------------- Confidence 12346677777777777777766554443 47889999999999999999999987632 111 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + .|. .. ..+++-|-.+...+ .... . ...=+.+|||..+..|++ T Consensus 246 -~---~~~-------------~~-----~~~~~~i~~~~~~~-~~~~-------~-------~~~a~~v~n~~~~~~L~~ 288 (404) T protein:vir:39 246 -V---PKK-------------PT-----IAKFDDVITMINTS-VDPA-------I-------IATSSLLTNQSGLNKLAL 288 (404) T ss_pred -c---ccc-------------cc-----cccHHHHHHHHHHh-hhhh-------h-------ccCCEEEEcHHHHHHHHH Confidence 0 000 00 11122222221111 1100 0 011267999999988874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeec Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGT 312 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~ 312 (364) - |. +..+|||. |..+.+.|.+++-........ .... ...+++|- +++.++ - T Consensus 289 l---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~--~~~~----~~~~~~gd~~~~~~~~--~ 346 (404) T protein:vir:39 289 V---------KT-----AEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPN--SGST----VYPLYYGDMSQAITLF--D 346 (404) T ss_pred h---------hc-----cCCceeeccCcCCCCcceecceeEEEecccccCc--cCCC----ccEEEEEeccccEEEE--e Confidence 2 21 12357775 444688898877554321111 1111 22467773 333333 2 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ..+....|..+..|+ .+.+.+-+...+|.+.. +.+-|-++.+.+++.+-+ T Consensus 347 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~--~~~a~~~~~~~~~a~~~~ 398 (404) T protein:vir:39 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTT--DSEALVAGSFTAIADQVG 398 (404) T ss_pred ecceEEEEeccchhhhhhceeeEEEEeeeccEEe--cccceEEEEeeccccCCC Confidence 345667776654332 22233333333333222 245565666656655554 No 65 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=95.97 E-value=0.0012 Score=36.63 Aligned_cols=271 Identities=9% Similarity=0.007 Sum_probs=126.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) |+.+....|.......++..+.......++... ++.. .-...+.|....+...... .+..|-..... T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-- 176 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQE-YVNV---------ENVTTLTGSRVYEKWADITGLAKLDDEGGQI-- 176 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhh-hcce---------eeccCCcceEEEEeeccCCcceeeecccccc-- Confidence 777666667677788888888777766666543 3321 0011111111111110000 11122111111 Q ss_pred chh-hhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 80 TEE-NLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 80 nee-~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+. ..+|..=++.+.....-+.+..++- +.+.+||...-+..|...+....|+.+|.- .|.. 
T Consensus 177 ~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~ail~G-~g~~--------------- 239 (397) T protein:vir:49 177 GQNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILEA-IGTL--------------- 239 (397) T ss_pred ccccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHhc-cccc--------------- Confidence 111 1234444555555555555544433 347899999999999999999999887622 1110 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) + | .+...+++-|.++........ ...-+.+|||..+..|+ T Consensus 240 --~----~------------------~~~~~~~d~i~~~~~~l~~~~----------------~~~a~~v~n~~~~~~l~ 279 (397) T protein:vir:49 240 --P----N------------------KPTLAKWDDIIDLQAKVDPAI----------------KQTSLFLTNTSGFTALK 279 (397) T ss_pred --c----c------------------cccccCHHHHHHHHHhhhhhh----------------cCCCEEEEcHHHHHHHH Confidence 0 0 011234455555544432211 11236789999998887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA 313 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~ 313 (364) +- |. ...+|||. |.-+.+.|.+++.....+... +.++ ...+++|--.-.+-++-. T Consensus 280 ~l---------kd-----~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~--~~~~----~~~~~~gd~~~~~~~~~~ 339 (397) T protein:vir:49 280 KV---------KN-----AMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPN--GTGG----AMPLYFGDLKQAVTLFDR 339 (397) T ss_pred Hh---------hc-----cCCceeecccccCCCCceecceeeEEeccccccc--ccCC----ceeEEEeeccceEEEEee Confidence 52 21 12367764 555789998877654432211 1112 224566632211222222 Q ss_pred CCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
314 NGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++....+.++..++ .+.+.+-+...+|.+-. ..-+++.+...+.+++ T Consensus 340 ~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~----~~~a~~~~~~~~~~~~ 388 (397) T protein:vir:49 340 QHLSLLSTNIGGGAFETDTTKVRVIDRFDVVST----DTEAFVPASFKAIADQ 388 (397) T ss_pred cccEEEEeccccchhhcCeeeEEEEEeeccEEe----cccceEEEEecccccc Confidence 34444444322111 11111112111221110 1234445555555554 No 66 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=95.77 E-value=0.0015 Score=36.08 Aligned_cols=287 Identities=10% Similarity=0.023 Sum_probs=129.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeec-CCCCCceEEEEEeeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTEL-ESDAGDTISFDLSVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG 79 (364) ||-|. .. ...+.|++.+...-.+...|.+ ++=+.-.. |. ....||+|+++......-.-..+.+ ..+ T Consensus 1 MaN~l-lT---~ip~iia~~al~~l~~~lV~~~-lVnr~y~~------e~~~~k~GDTV~I~~p~~~~~~~~~~~~-~~~ 68 (423) T protein:vir:17 1 MPNNL-DS---NVSQIVLKKFLPGFMSDLVLAK-TVDRQLLA------GEINSSTGDSVSFKRPHQFSSLRTPTGD-ISG 68 (423) T ss_pred Cccch-hh---hhHHHHHHHHHHHHHhhcccch-hhcccCCc------chhhcccCCEEEEeeCCcceeecccCcc-cCC Confidence 98332 22 2357799888766666555543 44221111 11 1248999999865554432222211 112 Q ss_pred -chhhhhhcccEEEEecccc-eeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 -TEENLRFYTDQVKIDQVRH-PVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 -nee~L~~~~~~v~Idq~R~-~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +-++|.-.+-.|.||+..+ ++.+.++ ++.-..-|| +..-+....=+++..|+.++..+.+. 
+ T Consensus 69 ~~~~~l~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~-a------------- 132 (423) T protein:vir:17 69 QNKNNLISGKATGRVGNYITVAVEYQQL-EEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNN-G------------- 132 (423) T ss_pred cccCccccceeEEEeeceeeeeeeecHH-HHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhc-c------------- Confidence 3567777777899999987 6777653 222233445 22222223446666676665444331 0 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) .|.+..|.. .. -.++.+-.+..++... ..|. .+ -.+++.|.-...| T Consensus 133 --~~~~gt~~t----------------~~--~a~~~i~~a~~~Ld~~--~vP~-----------~~-R~~Vv~p~~~a~L 178 (423) T protein:vir:17 133 --ALSLGSPNT----------------PI--TKWSDVAQTASFLKDL--GVNE-----------GE-NYAVMDPWSAQRL 178 (423) T ss_pred --ccccccCCc----------------cc--ccHHHHHHHHHHHHhc--cCCc-----------CC-CEEEeChHHHHHH Confidence 011000110 00 1355565555444322 2232 23 3458999998888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeeecCe-EEEcCEEEEecCcccccccccCCcccccchhee--eccchheEeeecCC Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFKGGL-GMINNVVLHKHRNVIRFNDYGAGANVEAARALF--MGRQAGVIAYGTAN 314 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~G~~-g~~ngvii~e~~~~~~~~~~~~~~~v~v~rall--lGaqA~~~A~g~~~ 314 (364) ..+ +......+. +...-|=.|.+ |.+.|+-+++..+++...++..++...+.++-. .++++.... ... T Consensus 179 l~~-~~~~~~~~~------~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~--~~~ 249 (423) T protein:vir:17 179 ADA-QTGLHASDQ------LVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQ--FTV 249 (423) T ss_pred hcc-ccceecccc------cchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccc--eee Confidence 753 323221121 11222445666 899999999999998776655443222222211 111111000 000 Q ss_pred CCCcccee--chhhccchhHHHHHHHHhh------hhcccC------CcccEEEEEee---eeeccC Q lcl|NC_019917. 
315 GLRFDWEE--TVKDYGNEPAICAGFIAGM------KKARFN------SKDFGVISIDT---AAKKHS 364 (364) Q Consensus 315 g~~~~w~E--e~~D~g~~~~i~i~~i~G~------~K~rf~------~~DfGvi~idt---a~~~~~ 364 (364) +.--.|.. ...--|+. -+|-|+ .|-.+. .+-|.|.+ |. +..++. T Consensus 250 ~~~~~~~~~~g~l~~GD~-----~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~-~~~~~a~~~~t 310 (423) T protein:vir:17 250 TLTGATTSVTGFLKAGDQ-----VKFTNTYWLQQQTKQALYNGATPISFTATVTA-DANSDSSGDVT 310 (423) T ss_pred eeeeeeeeccCceeecce-----EEecceeeecccccccccccccccceEEEEEe-cccccccCceE Confidence 11112221 01111221 133332 222221 12344322 11 111111 No 67 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=95.52 E-value=0.0019 Score=35.49 Aligned_cols=280 Identities=12% Similarity=0.057 Sum_probs=138.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEE-EeeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFD-LSVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~-L~~~L~G~gv~Gd~~leG 79 (364) |..|....+.......++.+++....+.++... +... + . .+ .+..+.+. .........|.+.+... T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~-~~~~-----~-~---~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~- 75 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQ-LGQY-----Q-E---ME--GEQEKTVYVQTDGISAYWVNETEKIK- 75 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhh-hcce-----e-e---cC--CCccEEEEEEcCCceeEEeecCcccc- Confidence 666666667778889999999988877777653 3211 0 0 00 01112222 11122223343333332 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +...+|..-++.+.....-+.+..++- +-+.+||...-++.|.+-+.+..|+.+|.- .|+.+ | .++ T Consensus 76 -~~~~~f~~v~l~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~d~a~l~G-~g~~~--------~--~gi- 141 (297) T protein:vir:95 76 -TDKPEVVPVTLKAHKLGIILVTSREAL-NYTWKKFFEDMKPQIVEAFYKKIDEAGLLG-HDTPF--------A--NSV- 141 (297) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhcc-cCCcc--------c--ccc- Confidence 445677777777766666665543322 236789999999999999999999998821 11110 0 111 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ... ....+..+++.++++.|.++....... +.+.-+.+|||.-+..|++ T Consensus 142 -------------~~~--~~~~~~~~~~~~t~~~i~~~~~~l~~~----------------~~~~~~~v~~~~~~~~L~~ 190 (297) T protein:vir:95 142 -------------AKA--AKDANKVIGGPINYDNILKLQDALYDA----------------DVEPNAFVSKIQNRSALRE 190 (297) T ss_pred -------------ccc--ccccceecccccCHHHHHHHHHHhhhc----------------cCCcCEEEEcHHHHHHHHH Confidence 110 111222345667888787776544321 1122357889999998874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCcc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~~ 319 (364) -.| +..+|||.+..+.+.|.+++..+... ...+ -+++|=-.- +.+|-.++..+. 
T Consensus 191 l~d--------------~~G~~i~~~~~~~l~G~Pv~~~~~~~-----~~~~------~~~~gd~s~-~~~~~~~~~~i~ 244 (297) T protein:vir:95 191 ARD--------------GNKVSIYDKAANTIDGITTVDLKSAR-----FEKG------DLLAGDFDN-LIYGVPYNITYK 244 (297) T ss_pred hhc--------------cCCceeecCCCCcccceeeEeecCCC-----CCCc------eEEEEeccc-EEEEEecCeEEE Confidence 221 23479999988999998876443211 1111 133333221 123333344443 Q ss_pred ceechhhc--cchhHHHHH-HHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 320 WEETVKDY--GNEPAICAG-FIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 320 w~Ee~~D~--g~~~~i~i~-~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) -.++..-. .+..+..+. +-.++-.+|... =||+|+-=+..++++. T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 293 (297) T protein:vir:95 245 ISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTP 293 (297) T ss_pred EeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEee Confidence 33321100 000000000 111222222211 1444433333333333 No 68 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=95.49 E-value=0.002 Score=35.42 Aligned_cols=285 Identities=8% Similarity=0.025 Sum_probs=130.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+....+.......|+..+.......+++.. +++. .-+....|......+...-...+|..++..... 
T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~-l~~~---------~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~ 179 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYN-MVDY---------EPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTN 179 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhh-hhce---------eeccCCccceEEEEecCCcceeecccccccccc Confidence 666666666566678888888877777776653 4332 001111111111111122223334333333222 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) ...++|..-++.+.....-+.+..++- +.+.++|....++.|.+.+....|+.+|.- +|. +. ...++ T Consensus 180 ~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~il~G-~g~-~~--------~~~gi-- 246 (404) T protein:vir:10 180 GDNGKLERFNFKLKDLADFMSIPNDLL-KFADKSLEDWIINWFVDKVRITRNAEILYG-AGG-DE--------HATGI-- 246 (404) T ss_pred ccccceeeeEeeheeeEeeehhhHHHH-hhcHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCC-CC--------cccce-- Confidence 234555544555555555555544432 357789999999999999999999977622 221 00 00111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) ... .+...++.+...+.+.+..++.... .. . . 
...-+.+|||.-+..|++- T Consensus 247 ------------~~~--~~~~~~~~~~~~~~~~~~~~~~~~l-~~-----------~-~--~~~~~~v~n~~~~~~L~~l 297 (404) T protein:vir:10 247 ------------MTA--NKFKKITLPKSPALKDFKKCKNVEL-LN-----------V-F--KATSSWIVNQDGFNYLDSL 297 (404) T ss_pred ------------eec--cccceeeccccccHHHHHHHHHhhh-hc-----------c-c--cCCCEEEEcHHHHHHHHHh Confidence 100 0011222233345666666544321 10 0 0 1112468999888888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeecC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGTA 313 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~~ 313 (364) |. +..+|||. |.-+++.|.+++..+.. ...++++ .-.+++|- +++.+.. . T Consensus 298 ---------kd-----~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~---~~~~~~~----~~~~~~gd~s~~~~~~~--~ 354 (404) T protein:vir:10 298 ---------ED-----KTGRPYLQPDPKDPTQYRFLGLPVIELPND---LLLSTES----AIPVLLGDTKEAYKYVS--D 354 (404) T ss_pred ---------hc-----cCCceeeccCcCCCCCccccceeeEEeccc---ccCCCCC----ccEEEEEeccccEEEEE--e Confidence 21 23467775 34467888887653321 1111222 22567774 3433332 2 Q ss_pred CCCCccceech-hhcc-chhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 314 NGLRFDWEETV-KDYG-NEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~-~D~g-~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) .+....+..+. .||. +.+.+-+...+|++-. 
+.+-|-++.+ ++++-+ T Consensus 355 ~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~--~~~a~~~~~~--~~aa~~ 403 (404) T protein:vir:10 355 GAYELATTNIGAGAFETNTTKARIIMRIDGNVK--DSEALLIAEI--PVESVQ 403 (404) T ss_pred cceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEe--ecccCC Confidence 34444433221 1111 2222222222222111 1123333332 222222 No 69 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=95.24 E-value=0.0025 Score=34.88 Aligned_cols=271 Identities=10% Similarity=0.035 Sum_probs=144.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCce-eec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDAR-TEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~-leG 79 (364) .|--.+-.|+-.--+++...|-....++++-.+.++ | ++.+...|++|.++-+.- .++ +|-. -.| T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~----N------~~~e~~~g~tVkIp~i~~---~gl-~DY~R~~g 95 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVI----S------NDAIFMQGRSFTVIKGDV---TEL-KDYKRNAT 95 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeec----c------cceeeccCcEEEEeeecc---ccc-ccccCCCC Confidence 444444445444457777777655555443332111 1 222345799999885432 233 2222 122 Q ss_pred -chhhhhhcccEEEEecccceeeccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 80 -TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNI--RRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 80 -nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) +.++++....++.+||.|----.=..|+..-+..++ -......+.+......|.-.|-.|++.-+. 
T Consensus 96 ~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~----------- 164 (329) T protein:vir:10 96 NEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK----------- 164 (329) T ss_pred ccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc----------- Confidence 345788899999999998322222567766666555 333444555556667776666666542110 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) .++..++++ -.++.|+.+..+.+... + | + -.+||+.|....- T Consensus 165 ---------------------~~~~~~t~~--nay~~i~~a~~~Lde~~---v------p-----~-~Rvl~VtP~~~~~ 206 (329) T protein:vir:10 165 ---------------------HLTVGSGAD--AQYDAVLDVSVELDEIG---A------G-----A-SRILFVTPKFYKG 206 (329) T ss_pred ---------------------ccccccCHH--HHHHHHHHHHHHHHhcC---C------C-----C-CcEEEeCHHHHHH Confidence 111122322 24778888877665432 1 1 2 2578999999999 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) |+. ++.+ .+. .......++.|.+|.+|||.|++.|+.. + . ...+|+|......+.-+..-+ T Consensus 207 Lk~--~~~f---~~~---~~~~~~~~~~g~Vg~idG~~Ii~vps~~-~------k----~in~ii~~~~A~~~~~K~~~~ 267 (329) T protein:vir:10 207 IKK--FVIE---LPQ---GDNRQQVLGKGVQGELDGFTIVKVPSKM-L------Q----GVEAMAVIGEVMASPIQANEA 267 (329) T ss_pred HHh--hhhh---hcc---ccccccceeeeeeeeecCeEEEEecCCc-c------c----ceeEEEEcCCceeeeeeeeee Confidence 985 3333 221 2234567899999999999999976531 1 1 225677766544443332222 Q ss_pred CccceechhhccchhHHHHHHHHhhhhccc------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
317 RFDWEETVKDYGNEPAICAGFIAGMKKARF------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf------~~~DfGvi~idta~~~~~ 364 (364) +.+-.++ +. ..+.+.| ..| +++--||++.-+.++.-| T Consensus 268 ~~~~p~~-----~~---~a~~v~g---r~yyd~~V~~~k~~~I~~~~~~a~~~~ 310 (329) T protein:vir:10 268 KLNSNVP-----GM---FGTLAEQ---MLYTGAFVPEHLQKYIFTIGGKEVETN 310 (329) T ss_pred eeeCCCC-----cc---chheeee---eeeeeeEEEccccCEEEEecccCcccC Confidence 2221111 11 1123333 222 234446666555444444 No 70 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=95.12 E-value=0.0027 Score=34.64 Aligned_cols=276 Identities=11% Similarity=0.028 Sum_probs=128.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+....|.......++..+.......++... +.+. .-+....|....+.........+|...... .. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~~ 174 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ-YVTV---------EPVRTRSGSRVLEKNSDMIPFAEITEMGEI-PE 174 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhh-hcee---------eeccCCceeEEEEeecCCccceeecccccc-cc Confidence 666666666667778888888777766666543 3221 111111111111111111122233222211 11 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 
81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .+...|..-++.+....+-+.+..+|-+ -+.+||...-++.|++=+....|..++.- .|+ T Consensus 175 ~~~~~~~~v~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~~~g-~g~------------------ 234 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAGILPLSRSLLQ-DSDQNILKYVTKWLGKKSKVTRNVLILGV-IEK------------------ 234 (392) T ss_pred cccccceeEEeeeeeEEEeehhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc------------------ Confidence 2235566666666666666666555433 36788888888888888888888777621 111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) ..+++ ..+.+.|-.+.... +... .+ ..=+.+|||..+..|++- T Consensus 235 --~~~~~--------------------~~~~d~i~~~~~~~-l~~~-------~~-------~~a~~vm~~~~~~~L~~l 277 (392) T protein:vir:10 235 --LTKQA--------------------IKSLDDIKDVLNVK-LDPA-------IS-------PNAILLTNQDGFNYLDKL 277 (392) T ss_pred --ccccC--------------------ccCHHHHHHHHHHh-hhhh-------hc-------cCCEEEEcHHHHHHHHHh Confidence 00111 12233333332211 1110 00 112367899988888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEE-EEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVV-LHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvi-i~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) |. ...+|||. |.-+.+.|.+ ++....+.-.+....++. -.+++|-=.-++.++... 
T Consensus 278 ---------kd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~----~~~~~gdfs~~~~i~~~~ 339 (392) T protein:vir:10 278 ---------KD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK----APLIIGDLKEAIVLFKRE 339 (392) T ss_pred ---------hc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc----eEEEEEehhceEEEEeec Confidence 22 23478884 4456667753 332222222222222222 245666422222223334 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++.+.|.++..++ .+.+.+.+..-+|.+-. +.+-|-++.+-+++++-+ T Consensus 340 ~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW--DNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEecccccccC Confidence 5666666543211 12222333333332211 234566666667777776 No 71 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=95.12 E-value=0.0027 Score=34.64 Aligned_cols=276 Identities=11% Similarity=0.028 Sum_probs=128.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+....|.......++..+.......++... +.+. .-+....|....+.........+|...... .. 
T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~~ 174 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ-YVTV---------EPVRTRSGSRVLEKNSDMIPFAEITEMGEI-PE 174 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhh-hcee---------eeccCCceeEEEEeecCCccceeecccccc-cc Confidence 666666666667778888888777766666543 3221 111111111111111111122233222211 11 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .+...|..-++.+....+-+.+..+|-+ -+.+||...-++.|++=+....|..++.- .|+ T Consensus 175 ~~~~~~~~v~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~~~g-~g~------------------ 234 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAGILPLSRSLLQ-DSDQNILKYVTKWLGKKSKVTRNVLILGV-IEK------------------ 234 (392) T ss_pred cccccceeEEeeeeeEEEeehhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc------------------ Confidence 2235566666666666666666555433 36788888888888888888888777621 111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) ..+++ ..+.+.|-.+.... +... .+ ..=+.+|||..+..|++- T Consensus 235 --~~~~~--------------------~~~~d~i~~~~~~~-l~~~-------~~-------~~a~~vm~~~~~~~L~~l 277 (392) T protein:vir:10 235 --LTKQA--------------------IKSLDDIKDVLNVK-LDPA-------IS-------PNAILLTNQDGFNYLDKL 277 (392) T ss_pred --ccccC--------------------ccCHHHHHHHHHHh-hhhh-------hc-------cCCEEEEcHHHHHHHHHh Confidence 00111 12233333332211 1110 00 112367899988888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEE-EEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 
241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVV-LHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvi-i~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) |. ...+|||. |.-+.+.|.+ ++....+.-.+....++. -.+++|-=.-++.++... T Consensus 278 ---------kd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~----~~~~~gdfs~~~~i~~~~ 339 (392) T protein:vir:10 278 ---------KD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK----APLIIGDLKEAIVLFKRE 339 (392) T ss_pred ---------hc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc----eEEEEEehhceEEEEeec Confidence 22 23478884 4456667753 332222222222222222 245666422222223334 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++.+.|.++..++ .+.+.+.+..-+|.+-. +.+-|-++.+-+++++-+ T Consensus 340 ~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW--DNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEecccccccC Confidence 5666666543211 12222333333332211 234566666667777776 No 72 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=95.12 E-value=0.0027 Score=34.64 Aligned_cols=276 Identities=11% Similarity=0.028 Sum_probs=128.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+....|.......++..+.......++... +.+. .-+....|....+.........+|...... .. 
T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~~ 174 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ-YVTV---------EPVRTRSGSRVLEKNSDMIPFAEITEMGEI-PE 174 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhh-hcee---------eeccCCceeEEEEeecCCccceeecccccc-cc Confidence 666666666667778888888777766666543 3221 111111111111111111122233222211 11 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .+...|..-++.+....+-+.+..+|-+ -+.+||...-++.|++=+....|..++.- .|+ T Consensus 175 ~~~~~~~~v~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~~~g-~g~------------------ 234 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAGILPLSRSLLQ-DSDQNILKYVTKWLGKKSKVTRNVLILGV-IEK------------------ 234 (392) T ss_pred cccccceeEEeeeeeEEEeehhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc------------------ Confidence 2235566666666666666666555433 36788888888888888888888777621 111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) ..+++ ..+.+.|-.+.... +... .+ ..=+.+|||..+..|++- T Consensus 235 --~~~~~--------------------~~~~d~i~~~~~~~-l~~~-------~~-------~~a~~vm~~~~~~~L~~l 277 (392) T protein:vir:10 235 --LTKQA--------------------IKSLDDIKDVLNVK-LDPA-------IS-------PNAILLTNQDGFNYLDKL 277 (392) T ss_pred --ccccC--------------------ccCHHHHHHHHHHh-hhhh-------hc-------cCCEEEEcHHHHHHHHHh Confidence 00111 12233333332211 1110 00 112367899988888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEE-EEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 
241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVV-LHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvi-i~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) |. ...+|||. |.-+.+.|.+ ++....+.-.+....++. -.+++|-=.-++.++... T Consensus 278 ---------kd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~----~~~~~gdfs~~~~i~~~~ 339 (392) T protein:vir:10 278 ---------KD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK----APLIIGDLKEAIVLFKRE 339 (392) T ss_pred ---------hc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc----eEEEEEehhceEEEEeec Confidence 22 23478884 4456667753 332222222222222222 245666422222223334 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++.+.|.++..++ .+.+.+.+..-+|.+-. +.+-|-++.+-+++++-+ T Consensus 340 ~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW--DNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEecccccccC Confidence 5666666543211 12222333333332211 234566666667777776 No 73 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=95.12 E-value=0.0027 Score=34.64 Aligned_cols=276 Identities=11% Similarity=0.028 Sum_probs=128.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+....|.......++..+.......++... +.+. .-+....|....+.........+|...... .. 
T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~~ 174 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ-YVTV---------EPVRTRSGSRVLEKNSDMIPFAEITEMGEI-PE 174 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhh-hcee---------eeccCCceeEEEEeecCCccceeecccccc-cc Confidence 666666666667778888888777766666543 3221 111111111111111111122233222211 11 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .+...|..-++.+....+-+.+..+|-+ -+.+||...-++.|++=+....|..++.- .|+ T Consensus 175 ~~~~~~~~v~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~~~g-~g~------------------ 234 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAGILPLSRSLLQ-DSDQNILKYVTKWLGKKSKVTRNVLILGV-IEK------------------ 234 (392) T ss_pred cccccceeEEeeeeeEEEeehhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc------------------ Confidence 2235566666666666666666555433 36788888888888888888888777621 111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) ..+++ ..+.+.|-.+.... +... .+ ..=+.+|||..+..|++- T Consensus 235 --~~~~~--------------------~~~~d~i~~~~~~~-l~~~-------~~-------~~a~~vm~~~~~~~L~~l 277 (392) T protein:vir:10 235 --LTKQA--------------------IKSLDDIKDVLNVK-LDPA-------IS-------PNAILLTNQDGFNYLDKL 277 (392) T ss_pred --ccccC--------------------ccCHHHHHHHHHHh-hhhh-------hc-------cCCEEEEcHHHHHHHHHh Confidence 00111 12233333332211 1110 00 112367899988888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEE-EEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 
241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVV-LHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvi-i~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) |. ...+|||. |.-+.+.|.+ ++....+.-.+....++. -.+++|-=.-++.++... T Consensus 278 ---------kd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~----~~~~~gdfs~~~~i~~~~ 339 (392) T protein:vir:10 278 ---------KD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK----APLIIGDLKEAIVLFKRE 339 (392) T ss_pred ---------hc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc----eEEEEEehhceEEEEeec Confidence 22 23478884 4456667753 332222222222222222 245666422222223334 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++.+.|.++..++ .+.+.+.+..-+|.+-. +.+-|-++.+-+++++-+ T Consensus 340 ~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW--DNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEecccccccC Confidence 5666666543211 12222333333332211 234566666667777776 No 74 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=94.54 E-value=0.0041 Score=33.64 Aligned_cols=273 Identities=7% Similarity=-0.004 Sum_probs=125.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) |+.+....|.......++..+.......++... +... + -++...|........... ....|.+++.. 
T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~-~~~~-----~----~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~-- 72 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQE-YVNV-----E----NVTTLTGSRVYEKWTDITGLANIDDEAGKI-- 72 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhh-hcee-----e----eccCCcceEEEEeecCCCcceeeecCCccc-- Confidence 888888887778889999999777766665543 3211 0 011112211111111111 11223222111 Q ss_pred ch-hhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 80 TE-ENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 80 ne-e~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+ +..+|..-++.+.....-+.+..++-+ =+.+||...-++.|.+-+....|+.++-.+... T Consensus 73 ~~~~~~~~~~i~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~---------------- 135 (293) T protein:vir:48 73 ADIDDPKLSLIKYTIKRYAGISTVTNSLLA-DSAENILAWLSGWIAKKVVVTRNKAILGVVDKL---------------- 135 (293) T ss_pred ccccccceeEEEEeeeEEEEeehhhHHHHh-hhhHHHHHHHHHHHHHHHHHHHHhHHhhccccc---------------- Confidence 12 235565556666665555555444332 356889888888899888888888777332210 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) . +.....+++-|.++...+..+. ...-+.+|||.-+..|+ T Consensus 136 ------------------~------~~~~~~~~d~i~~~~~~l~~~~----------------~~~a~~vmn~~~~~~L~ 175 (293) T protein:vir:48 136 ------------------P------TKPTLTKWDDIIDLEAKVDPAI----------------KQTSFFLTNTSGFTALK 175 (293) T ss_pred ------------------c------ccccccCHHHHHHHHHhhhhhh----------------cCCCEEEEcHHHHHHHH Confidence 0 0112345555655544332110 11235678999888887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecC Q lcl|NC_019917. 
239 TAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA 313 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~ 313 (364) +- |. +..+|||. |..+.+.|.+++.......... .++ ...+++|.-.-++..+-. T Consensus 176 ~l---------kd-----~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~--~~~----~~~~~~gd~~~~~~~~~~ 235 (293) T protein:vir:48 176 KV---------KN-----ALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNA--SSG----VMPLYFGDLKQAVTLFDR 235 (293) T ss_pred Hh---------hc-----cCCceEeecCcCCCCCceecceeeEEecccccCCc--cCC----ceEEEEEeccceEEEEEe Confidence 42 21 23367775 4456888988876443321111 111 224566642221222222 Q ss_pred CCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 314 NGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) .+......++..++ .+...+-+..-.|.+.. +.+-|-++.+-+++..-. T Consensus 236 ~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~--~~~a~~~l~~~~~~~~~~ 286 (293) T protein:vir:48 236 QQMSLLSTNIGGGAFETDTTKVRVIDRFDVVAT--DTEAFVPASFKAIADQKG 286 (293) T ss_pred cceEEEEecccchhhhcCeEEEEEEEeeCcEEe--cccceEEEEeeccccCCc Confidence 34444444432111 11111111111111111 112222222222221111 No 75 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=94.22 E-value=0.005 Score=33.18 Aligned_cols=278 Identities=14% Similarity=0.128 Sum_probs=137.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.|....+.......++.++.....+.++... +.- .+ . + .|...++..........|...+. -. 
T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-~~~-----~~-~---~---~~~~~~~~~~~~~~a~~v~E~~~--~~ 70 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMK-LAK-----AV-P---M---TKPEEEFTFMSGVGAFWVDEAER--IQ 70 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhh-hce-----ee-e---c---CCCcEEEEEEcCCceeeeecCcc--cc Confidence 777777666667888999999888777766543 321 00 0 0 11122222222222333422222 23 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) +...+|..-++.+.....-+.+..++- +-+..||...-.+.|++-+.+..|+.+| .|. . .+.-.++.. T Consensus 71 ~~~~~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G~------g--~~~~~gil~ 138 (299) T protein:vir:41 71 TSKPTFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVF---TGV------E--SPYNWNILK 138 (299) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHh---hcc------c--Ccccccccc Confidence 556677666666666555555544433 3467899999999999999999998877 221 0 011111111 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) . .+...+..+.+..+++.|.++........ 
...-..+|||..+..|++- T Consensus 139 ~---------------~~~~~~~~~~~~~~~~~l~~~~~~l~~~~----------------~~~~~~v~n~~~~~~L~~l 187 (299) T protein:vir:41 139 S---------------ATDASNLVEETANKYDDLNEAIGLIEAED----------------LEPNGIATIRKQRVKYRST 187 (299) T ss_pred c---------------ccccceeeccccccHHHHHHHHHhhhccc----------------CCcCEEEEcHHHHHHHHHh Confidence 0 11111222344567777777755432111 1223578999998888742 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) |. +..+|||. ++.+.+.|.+++..+.+.- ..+. ..+++|--+-++ ++-.++. T Consensus 188 ---------kd-----~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~-----~~~~----~~~~~gdfs~~~-i~~~~~~ 243 (299) T protein:vir:41 188 ---------KD-----GNGMPIFNTATSNGVDDVLGLPIAYTPKYTF-----GDKD----ISELVGDWNQAY-YGILRGV 243 (299) T ss_pred ---------hc-----cCCceeecCCcCCCCceecceeeEEecccCC-----CCCc----eEEEEEecccEE-EEEecCc Confidence 22 23467774 4557788999998877741 1111 135556544432 4443444 Q ss_pred Cccceechh--hccchhHHHH-HHHHhhhhccc---------CCcccEEEEEeeeeecc Q lcl|NC_019917. 317 RFDWEETVK--DYGNEPAICA-GFIAGMKKARF---------NSKDFGVISIDTAAKKH 363 (364) Q Consensus 317 ~~~w~Ee~~--D~g~~~~i~i-~~i~G~~K~rf---------~~~DfGvi~idta~~~~ 363 (364) .+.-.+|.. .+.+.-+..+ -+-.++-.+|. +.+-|-++..-+ +. 
T Consensus 244 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a---a~ 299 (299) T protein:vir:41 244 EYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA---GN 299 (299) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc---CC Confidence 444333321 0000000000 01122222221 112333332222 22 No 76 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=94.18 E-value=0.0052 Score=33.12 Aligned_cols=270 Identities=8% Similarity=0.011 Sum_probs=124.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) |+.++...+.......++..+.......+++.. +++. .-.....|....+.....- ...+|-...... T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~- 177 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQE-YVNV---------ENVTTLTGSRVYEKWTDITGLANIDDEAGKIA- 177 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHh-hhce---------eecccCccceEEEeeccCCcceeeecCccccc- Confidence 665555556566677888888777666666543 3321 0011111111111111100 112221111111 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ....++|..=++.+.....-+.+..+|- +.+.+||...-++.|.+.+....|+.++.-. |.. 
T Consensus 178 ~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~-g~~---------------- 239 (397) T protein:vir:49 178 DVDDPKLSLIKYTIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILEAI-AAL---------------- 239 (397) T ss_pred cccccceeeEEeeeeeEEeeehhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc---------------- Confidence 1124566666666666666666655544 3578999999999999999999998887331 100 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + |. .-..+.+.|.++...+.... + ..=+.+|||..+..|+. T Consensus 240 -~----~~------------------~~~~~~d~i~~~~~~l~~~~---------~-------~~a~~vmn~~~~~~l~~ 280 (397) T protein:vir:49 240 -P----TK------------------PTLTKWDDIIDLEAKVDPAI---------K-------QTSFFLTNTSGFTALKK 280 (397) T ss_pred -c----cc------------------cccccHHHHHHHHHhhhhhh---------c-------CCCEEEEcHHHHHHHHH Confidence 0 00 00123444555544332111 1 11256889999999874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchheEeeec Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGVIAYGT 312 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~~A~g~ 312 (364) - |.+ ..+|||. |..+.+.|.+++.....+.. .+..+ ...+++| .+++.+. . T Consensus 281 l---------kd~-----~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~--~~~~~----~~~i~~gd~~~~~~~~--~ 338 (397) T protein:vir:49 281 V---------KNA-----LGDYLMERDVKSPTGYSIDGFAVKEVADRWLA--NGTGG----AMPLYFGDLKQAVTLF--D 338 (397) T ss_pred h---------hcC-----CCceeeccCcCCCCCceecceeeEEecccccc--cccCC----ceeEEEeeccceEEEE--e Confidence 2 222 2356664 45678999888654332111 11111 2356667 3333232 2 Q ss_pred CCCCCccceechh-hc-cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
313 ANGLRFDWEETVK-DY-GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~-D~-g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ..|.++...++.. +| .+.+.+-+...+|.+..+- + +++.+...+.+-+ T Consensus 339 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~--~--a~~~~~~~~~~~~ 388 (397) T protein:vir:49 339 RQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDT--E--AFVPASFKAIADQ 388 (397) T ss_pred ecceEEEEeccccchhhcCceeEEEEeeeCcEEecc--c--ceEEEEeecccCC Confidence 2344444333221 11 1112222222222221111 1 2222222222222 No 77 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=93.75 E-value=0.0065 Score=32.57 Aligned_cols=273 Identities=9% Similarity=0.030 Sum_probs=120.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) |+.++...+.......|+..+.......++... ++.. ..++...|....+.....- ....|.+.+.. . T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~ 177 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQE-YVNV---------ENVTTLTGSRVYEKWADITGLAKLDDEAGSI-G 177 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHh-hhce---------eeccCCcceEEEEeecCCCcceeeecccccc-c Confidence 555444445556678888888777777776643 3211 1111122222222211111 11122211111 1 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .....+|..=++.+....+-+.+..++- +-+.+||...-++.|+.=+....|+.+|.-. |.. 
T Consensus 178 ~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~-g~~---------------- 239 (397) T protein:vir:48 178 TNDDPKLYPIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILEAI-ATL---------------- 239 (397) T ss_pred cccccceeeEEeeheeeeeehhhHHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccc---------------- Confidence 1223455555555566555555544433 3478899999999999999999998877321 100 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ++ + +...+.+.|.++........ ...-+.+|||..+..|++ T Consensus 240 -~~---~-------------------~~~~~~d~i~~~~~~l~~~~----------------~~~a~~v~n~~~~~~L~~ 280 (397) T protein:vir:48 240 -PT---K-------------------PTLTKWDDIIDLQAKVDPAI----------------KQTSFFLTNTSGFTALKK 280 (397) T ss_pred -cc---c-------------------cccccHHHHHHHHHHhhhhh----------------cCCCEEEECHHHHHHHHH Confidence 00 0 01123344444433322110 112356789999988874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |. +..+|||. |.-+++.|.+++.....+-.. +..+ .-.+++|--.-++.++-.+ T Consensus 281 l---------kd-----~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~--~~~~----~~~~~~gd~~~~~~~~~~~ 340 (397) T protein:vir:48 281 V---------KN-----AFGDYLMERDVKSPTGYSIDGFAVKEVADRWLAN--ASSG----AMPLYFGDLKQAVTLFDRQ 340 (397) T ss_pred h---------hc-----CCCceeeccCcCCCCCceeccceeEEecccccCC--cCCC----ceEEEEEeccceEEEEeec Confidence 2 21 22367774 445789998887654322211 1111 2245666322112222223 Q ss_pred CCCccceech---hhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
315 GLRFDWEETV---KDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~---~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) +......++. ++++ .+.+-+..-++++.. +.+-|-++.+.+++..-. T Consensus 341 ~~~i~~~~~~~~~~~~~-~~~~r~~~r~d~~~~--~~~a~~~~~~~~~~~~~~ 390 (397) T protein:vir:48 341 QMSLLSTNIGGGAFETD-TTKIRVIDRFDVVAT--DTESFVPASFKAIADQKG 390 (397) T ss_pred ceEEEEeccchhhhhcC-ceeEEEEeeeccEEe--cccceEEEEecccccCCC Confidence 4444433322 1111 111111111111111 112222222222111111 No 78 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=93.71 E-value=0.0066 Score=32.52 Aligned_cols=287 Identities=11% Similarity=0.036 Sum_probs=137.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEE-eeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDL-SVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L-~~~L~G~gv~Gd~~leG 79 (364) ||++..+. -.....++.++.......|.+.. +.. .|. + .+.++++.. .......+|-+.+.. T Consensus 1 m~t~t~gg--~liP~~~~~~ii~~l~~~s~i~~-l~~-----~~~----~---~~~~~~ip~~~~~~~a~wv~E~~~~-- 63 (303) T protein:vir:97 1 MGTETSKA--SLFDKHLVSDLINKVKGHSSLAK-LSS-----QKP----I---PFNGSKEFTFTLDSDIDVVAENGKK-- 63 (303) T ss_pred CcccCCCC--eEcchhHHHHHHHHHHhhchhhh-hcc-----eee----c---CCCceEEEEEecCcceEEeecCccc-- Confidence 99655443 36778888888777766776654 421 111 1 111233321 122233344333332 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRK--RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) .+..++|..-++.+-....-+.+..++-+| -+.++|-+.-++.|++-+.+..|+.++.-.-...|...... 
T Consensus 64 ~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~------- 136 (303) T protein:vir:97 64 THGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVI------- 136 (303) T ss_pred cccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccc------- Confidence 356677766666666666666554433221 35678999999999999999999988843211111110000 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) |. -.+.+..+.....++++ .+.+.|.++........ .+.=..+|||..+..| T Consensus 137 --------~~---~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~L 188 (303) T protein:vir:97 137 --------GT---NHFDSKVTQVVKFTESE-DADANIEAAVNLIQGAE----------------GVVTGLAMDTEFSTAL 188 (303) T ss_pred --------cc---ccccccccccccccccc-chHHHHHHHHHHHhhcC----------------CCccEEEEcHHHHHHH Confidence 00 01111111111122222 34566666554332111 1111378899999988 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee------cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeee Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK------GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYG 311 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~------G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g 311 (364) ++-.| + ..+|||. +..+.+.|.+++....+.-....+... ..+++|-=.-++.|+ T Consensus 189 ~~lkd---------~-----~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~-----~~~~~Gdf~~~~~~~ 249 (303) T protein:vir:97 189 AKVTN---------G-----EMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESK-----DLVIIGDFESMFKWG 249 (303) T ss_pred HHhhc---------c-----CCCeEEecCccCCCCCceecceeeEEecccCCccccCCCc-----cEEEEeeccccEEEE Confidence 74222 1 2255653 334678899999887775332222222 136777655555666 Q ss_pred cCCCCCccceechhhccchhHHHHH-HHHhhhhcccCC-cccEEEE-----Eeeeeec Q lcl|NC_019917. 
312 TANGLRFDWEETVKDYGNEPAICAG-FIAGMKKARFNS-KDFGVIS-----IDTAAKK 362 (364) Q Consensus 312 ~~~g~~~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi~-----idta~~~ 362 (364) -.++..+.+.++ ++..+..+. +..++--.|... =|+.|+. .-+-++- T Consensus 250 ~~~~~~~~~~~~----~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 250 YAKQIPMEIIKY----GDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EecCcEEEEeec----cCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 655666666543 322221111 111211122110 0222211 1111111 No 79 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=93.56 E-value=0.0055 Score=32.98 Aligned_cols=196 Identities=13% Similarity=0.069 Sum_probs=97.9 Q ss_pred EecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccCcccCCCCCcEE Q lcl|NC_019917. 93 IDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAGNPLEAPDVDHLL 172 (364) Q Consensus 93 Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~ 172 (364) ||.+..+--.=..+++..+.+|+|.+.-..+..=+++-.|+-++..+..+.....|....+ .+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~--~g--------------- 63 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQD--GG--------------- 63 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccc--cC--------------- Confidence 8877655333357899999999999999999999999999999998875322221211110 00 Q ss_pred eeccccchhhhhhccccc----HHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhcCCHHHHHH Q lcl|NC_019917. 173 YGGVATSKASLAATDIMA----PIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTAAGGTWIDF 248 (364) Q Consensus 173 ~~~~at~~~~i~~~D~~s----~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~~d~~w~~~ 248 (364) +...+.++..-+ ++.|..|...+. .+..| .+--++++.|.+++.|=...|+.. 
+ T Consensus 64 ------~~~~~~a~~t~~~~~l~dai~~a~~~Ld--ekdVP------------~~gR~~vv~P~~y~~LL~~~d~~~--~ 121 (221) T protein:vir:17 64 ------FSVNIGAGNTNNAQAIVDGFFEAAAVLD--ERSAP------------MDGRVAVLSPRQYYSLISSVDTNI--L 121 (221) T ss_pred ------cceeccccccCCHHHHHHHHHHHHHHHh--hcCCC------------CCCCEEEeCcHHHHHHHHhcCcce--e Confidence 011111112222 344444433332 22222 234567889999988853223332 1 Q ss_pred HHHhhhhhccCCCeeec-CeEEEcCEEEEecCccccccc--ccC--Cc---------cccc----chheeeccchheEee Q lcl|NC_019917. 249 QKAAAAAEGRNNPIFKG-GLGMINNVVLHKHRNVIRFND--YGA--GA---------NVEA----ARALFMGRQAGVIAY 310 (364) Q Consensus 249 qk~A~~~~g~~nPlF~G-~~g~~ngvii~e~~~~~~~~~--~~~--~~---------~v~v----~ralllGaqA~~~A~ 310 (364) .+....+.| -+..| ++|+++|+.|++.++++.... +.. +. ...+ .-+|+.=..|++..= T Consensus 122 n~d~~~s~g---~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvk 198 (221) T protein:vir:17 122 NREIGNTQG---DMNTGKGLYVNAGIRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVE 198 (221) T ss_pred eeecccccc---cccccceeeeecCcEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchheeeee Confidence 111111222 24456 699999999999999975211 110 00 0000 114444444443332 Q ss_pred ecCCCCCccceechhh---ccch Q lcl|NC_019917. 311 GTANGLRFDWEETVKD---YGNE 330 (364) Q Consensus 311 g~~~g~~~~w~Ee~~D---~g~~ 330 (364) -..-..|+-.+-..|- -+++ T Consensus 199 l~~~~~~~~~~~~~~~~~~~~~~ 221 (221) T protein:vir:17 199 VLLPPSRPPLVISMFSIRRPDRR 221 (221) T ss_pred eecCCCCCceeeeeeeccCCCCC Confidence 2211222221111110 0111 No 80 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=92.83 E-value=0.0098 Score=31.60 Aligned_cols=294 Identities=12% Similarity=0.018 Sum_probs=126.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 
1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) ||++....+ ......++.++.......+... ++.-. | -+.. ..+++... ......+|-+.+... T Consensus 1 Mat~tt~~g-~~vP~~~~~~ii~~~~~~s~l~-~~~~~-----i----~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~- 65 (311) T protein:vir:99 1 MATFGTGNL-KNLPRNIADGMVKDVVQGSTVA-VLSAR-----K----PQRF---GNEDIITFNGRPKAEFVGEGQQKS- 65 (311) T ss_pred CceecCCCc-eeccHHHHHHHHHHHHhhchhh-hhcce-----e----eccC---CceEEEEEeCCceeEEeecCcccc- Confidence 997766544 4557788888887776666553 33210 0 0110 11233211 122223343222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhh--hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc-cccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSR--KRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGA-RGINLDFVETPDFT 156 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~--qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga-~g~~~~~~~~~~~~ 156 (364) +...+|.+-++.+-....-+.+..++-+ --+.+||-..-++.|++-+++..|+.+|.- .|+ .|...... T Consensus 66 -~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G-~g~~~g~~~~g~------ 137 (311) T protein:vir:99 66 -STTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHR-INPLTGTVIPGW------ 137 (311) T ss_pred -cccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcc-cCcccCcccccc------ Confidence 4456666555555444444444333321 135688999999999999999999888833 221 11110000 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhccccc-HHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMA-PIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQAT 235 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s-~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~ 235 (364) .|... ..+....+++.+... ...|+.+........... +.=..+|||..+. 
T Consensus 138 ---~~~~~-----------~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--------------~~~~~vmn~~~~~ 189 (311) T protein:vir:99 138 ---SNYLG-----------AASKRVELTADTIANPDLAIEAAVGLLVANGHPT--------------PVNGLALHPSIAW 189 (311) T ss_pred ---ccccc-----------cccceeeccccccchhHHHHHHHHHHHhhhccCC--------------CccEEEEcHHHHH Confidence 00000 011111122222222 233445443332221100 0112688999998 Q ss_pred HHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcc-c-ccch-heeeccchhe Q lcl|NC_019917. 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGAN-V-EAAR-ALFMGRQAGV 307 (364) Q Consensus 236 ~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~-v-~v~r-alllGaqA~~ 307 (364) .|++- |. ...+|||. +..+.+.|.+++....+........... + .... -+++|--.-+ T Consensus 190 ~L~~l---------kd-----~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~ 255 (311) T protein:vir:99 190 GLSTA---------RY-----TDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANG 255 (311) T ss_pred HHHhh---------hc-----cCCCeeecCcccCCCCceecceeeEeecccccccccccccchhhccCcceEEEeecccc Confidence 88742 22 12367774 4467899999988776643322111110 0 0011 2455543333 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) +.|+-..+..+.-.++ -|+++.... +-.++-..|+.. -||.|.- +-++++.. 
T Consensus 256 ~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~d~~~~r~~~r~d~~v~~-~~~v~~~~ 308 (311) T protein:vir:99 256 IHWGVQRDIPVELIKY-GDPDGQGDL---KRHNQIALRLEIVYGWYVFT-DRFVVIEN 308 (311) T ss_pred EEEEEecCceEEEeec-CCCCcchhh---hhcCcEEEEEEEeecceecC-hhHeeeec Confidence 4444322222222211 122211100 122222222211 1222211 11122111 No 81 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=92.68 E-value=0.01 Score=31.45 Aligned_cols=270 Identities=11% Similarity=0.052 Sum_probs=128.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeec-----CCCc-cEEEEeecCCCCCceEEEEEeeccccCceecC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGT-----SENA-VIQRKTELESDAGDTISFDLSVHLRGKPTYGD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~-----g~~~-~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd 74 (364) |..+....+.....+.|+..+.......++... +++. ++.. +|.+.. +. ...+..|.++ T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~--~~------------~~~~~~v~E~ 180 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-YVRVESVSTSSGSRVYEKWT--DV------------TPLKAMDEED 180 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhh-hcceeeccCCcceEEEEeec--CC------------cccccccccc Confidence 666666656666778888888888877777654 4331 1111 111111 00 0011122111 Q ss_pred ceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 75 ~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) ... ......+|..-++.+....+-+.+..++-+ -+.+||....+..|.+.+....|+.++.- .|. 
T Consensus 181 ~~~-~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~~~~~~d~~il~G-~G~------------ 245 (408) T protein:vir:74 181 GKI-PDLDNPRLTIIKYLIKRYAGIITATNTLLK-DTAENILAWLSSWIAKKVVVTRNQAIIAA-MGT------------ 245 (408) T ss_pred ccc-ccccccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc------------ Confidence 111 112446777777777777777766555443 37889999999999999999999887632 111 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) +. |. ++ ..+.+-|..+.. +.+. |.-. ..=+.+|||..+ T Consensus 246 ------~~---~~-------~~-----------~~~~~~i~~~~~-~~l~-----------~~~~---~~a~~v~n~~~~ 283 (408) T protein:vir:74 246 ------VP---KK-------PT-----------IANFDDVITMIN-TSVD-----------PAII---ATSSLLTNQSGL 283 (408) T ss_pred ------cc---cc-------cc-----------cccHHHHHHHHH-Hhhh-----------hhhc---CCCEEEEcHHHH Confidence 00 00 00 011222222211 1111 0000 112467899998 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) ..|++- |. +..+|||. |.-+.+.|.+++-.+...-.. .++. .-.+++|--.-++. T Consensus 284 ~~l~~l---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~--~~~~----~~~i~~gd~~~~~~ 343 (408) T protein:vir:74 284 NKLALV---------KT-----AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN--SGST----VYPLYYGDMSQAIT 343 (408) T ss_pred HHHHHh---------hc-----CCCceEeccCcCCCCCceecceeeEEecCccccc--ccCC----cceEEEEehhccEE Confidence 888742 22 23467775 334688898877654321111 1111 22467774322222 Q ss_pred eecCCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 
310 YGTANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ++-..+..+.|..+..+. -+.+.+-+...+|.+.. +.+-|-++.+.+.+.+-. T Consensus 344 ~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~--~~~a~~~~~~~~~~~~~~ 398 (408) T protein:vir:74 344 LFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT--DSEALVAGSFTAIADQVG 398 (408) T ss_pred EEEecceEEEEeccccchhhcceeeEEEEEeeCcEEe--cccceEEEEeecccCCCC Confidence 233345666666543211 12222323333333211 123343333322222222 No 82 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=91.95 E-value=0.014 Score=30.83 Aligned_cols=271 Identities=10% Similarity=0.065 Sum_probs=127.4 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec--cccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH--LRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~--L~G~gv~Gd~~le 78 (364) |..+....|. .....++..+.......++... +... +. . .|..+++....- -...+|...+ + T Consensus 105 ~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~-~~~~------~~---~---~~~~~~~~~~~~~~~~a~~v~E~~--~ 168 (385) T protein:vir:18 105 LGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRD-LLAQ------GR---T---SSNALEYVREEVFTNNADVVAEKA--L 168 (385) T ss_pred hccccccCCc-eecchhhhHHHHHhhhccchhh-hcce------ec---c---cCcceEEEEEecCCcceeeeccCc--c Confidence 5554444443 2345677788877777776653 3211 00 0 111233322211 1122332222 1 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) -.+...+|..-++.+.....-+.+..++-+ ...+|-..-++.|.+-+....|+.+| .|+ +... 
...++ T Consensus 169 ~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~~d~~~l---~G~------g~~~-~~~Gi 236 (385) T protein:vir:18 169 KPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALKEEGQLL---NGD------GTGD-NLEGL 236 (385) T ss_pred ccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHHHHHHHH---hcc------CCCC-ccccc Confidence 224455666666666666666655444432 23468888888888888888888777 221 1110 11222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ... ........+++...+++.|.++...+.... .+.=+++|||..+..|+ T Consensus 237 ~~~--------------~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~----------------~~~~~~~~~~~~~~~l~ 286 (385) T protein:vir:18 237 NKV--------------ATAYDTSLNATGDTRADIIAHAIYQVTESE----------------FSASGIVLNPRDWHNIA 286 (385) T ss_pred ccc--------------cccccccccccccchHHHHHHHHHhhcccc----------------CCCCEEEEcHHHHHHHH Confidence 111 001111223334456777777755432110 12226789999999887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCee----ecCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeec Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIF----KGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGT 312 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF----~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~ 312 (364) +-.| + ..+||| .|.-+.+.|++++..+.++.. .+++|- +++.++ - T Consensus 287 ~lkd---------~-----~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~-------------~~~~gd~~~~~~~~--~ 337 (385) T protein:vir:18 287 LLKD---------N-----EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAG-------------TFTVGGFDMASQVW--D 337 (385) T ss_pred Hhhc---------C-----CCceeccCcccCCCceecceeeEEcCcCCCC-------------cEEEeecccEEEEE--E Confidence 4222 1 124454 567788999999988776421 244443 333332 2 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) ..+....+..+..|+ -+.+.+-+..-+|.+-. +.+ +++.+.-++++ T Consensus 338 ~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~--~~~--a~~~~~~~aa~ 385 (385) T protein:vir:18 338 RMDATVEVSREDRDNFVKNMLTILCEERLALAHY--RPT--AIIKGTFSSGS 385 (385) T ss_pred ecceEEEEeccccchhhcCcEEEEEEEeeccEEe--ccc--ceEEEEeccCC Confidence 234444454443332 11111111111221111 123 33333333333 No 83 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=91.95 E-value=0.014 Score=30.83 Aligned_cols=271 Identities=10% Similarity=0.065 Sum_probs=127.4 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec--cccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH--LRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~--L~G~gv~Gd~~le 78 (364) |..+....|. .....++..+.......++... +... +. . .|..+++....- -...+|...+ + T Consensus 105 ~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~-~~~~------~~---~---~~~~~~~~~~~~~~~~a~~v~E~~--~ 168 (385) T protein:vir:19 105 LGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRD-LLAQ------GR---T---SSNALEYVREEVFTNNADVVAEKA--L 168 (385) T ss_pred hccccccCCc-eecchhhhHHHHHhhhccchhh-hcce------ec---c---cCcceEEEEEecCCcceeeeccCc--c Confidence 5554444443 2345677788877777776653 3211 00 0 111233322211 1122332222 1 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) -.+...+|..-++.+.....-+.+..++-+ ...+|-..-++.|.+-+....|+.+| .|+ +... 
...++ T Consensus 169 ~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~~d~~~l---~G~------g~~~-~~~Gi 236 (385) T protein:vir:19 169 KPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALKEEGQLL---NGD------GTGD-NLEGL 236 (385) T ss_pred ccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHHHHHHHH---hcc------CCCC-ccccc Confidence 224455666666666666666655444432 23468888888888888888888777 221 1110 11222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ... ........+++...+++.|.++...+.... .+.=+++|||..+..|+ T Consensus 237 ~~~--------------~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~----------------~~~~~~~~~~~~~~~l~ 286 (385) T protein:vir:19 237 NKV--------------ATAYDTSLNATGDTRADIIAHAIYQVTESE----------------FSASGIVLNPRDWHNIA 286 (385) T ss_pred ccc--------------cccccccccccccchHHHHHHHHHhhcccc----------------CCCCEEEEcHHHHHHHH Confidence 111 001111223334456777777755432110 12226789999999887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCee----ecCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeec Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIF----KGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGT 312 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF----~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~ 312 (364) +-.| + ..+||| .|.-+.+.|++++..+.++.. .+++|- +++.++ - T Consensus 287 ~lkd---------~-----~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~-------------~~~~gd~~~~~~~~--~ 337 (385) T protein:vir:19 287 LLKD---------N-----EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAG-------------TFTVGGFDMASQVW--D 337 (385) T ss_pred Hhhc---------C-----CCceeccCcccCCCceecceeeEEcCcCCCC-------------cEEEeecccEEEEE--E Confidence 4222 1 124454 567788999999988776421 244443 333332 2 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) ..+....+..+..|+ -+.+.+-+..-+|.+-. +.+ +++.+.-++++ T Consensus 338 ~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~--~~~--a~~~~~~~aa~ 385 (385) T protein:vir:19 338 RMDATVEVSREDRDNFVKNMLTILCEERLALAHY--RPT--AIIKGTFSSGS 385 (385) T ss_pred ecceEEEEeccccchhhcCcEEEEEEEeeccEEe--ccc--ceEEEEeccCC Confidence 234444454443332 11111111111221111 123 33333333333 No 84 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=91.71 E-value=0.015 Score=30.65 Aligned_cols=277 Identities=11% Similarity=0.048 Sum_probs=128.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) .+.+....+.......|+..+.......++... ++.. .-+....|.-............+|....... . T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~-~ 189 (415) T protein:vir:98 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTV---------KRVTNGSGKYPVVRQSEVAALEKVEELEENP-E 189 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhh-heee---------eeccCCceeEEEEeecCCccceeeccccccC-c Confidence 233334445556778888888776666665543 3321 0011111110111111111112332111111 0 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .....|.+-++.+.....-+.+..++- +-+.+||...-++.|++-+....|+.++..+. + |.. ..... 
T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~-g~~-------~~~~~-- 257 (415) T protein:vir:98 190 LAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVIT-K-GST-------GSTSS-- 257 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccc-c-Ccc-------ccccc-- Confidence 122345555555555555555544433 34788999999999999999999988775442 1 100 00000 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) + ........+.+...+++.|.+++....... ...=.++|||.-+..|++- T Consensus 258 ~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~n~~~~~~l~~l 307 (415) T protein:vir:98 258 G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKPN----------------YEHNVAIVSQTMFAKLDKM 307 (415) T ss_pred c--------------ccccccccccccccchhHHHHHHHhhhhhc----------------cCCCEEEEcHHHHHHHHHh Confidence 0 001111223344566776766654432110 1112467899888888741 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchheEeeecC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGVIAYGTA 313 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~~A~g~~ 313 (364) |. +..+|||. |.-+.+.|.+++-.+.++- +.++.+ .+++| .+++.+ +.. T Consensus 308 ---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~----~~~~Gd~~~~~~~--~~~ 363 (415) T protein:vir:98 308 ---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKGNN----TLIIGNLKDAIVL--FDR 363 (415) T ss_pred ---------hc-----cCCceeeccCcCCCCCceecceeeEEeccccc----CCCCcc----EEEEEehhccEEE--Eee Confidence 22 23467774 3456899999887665532 222222 47788 444433 223 Q ss_pred CCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccEEEEEeeeeeccC Q lcl|NC_019917. 
314 NGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfGvi~idta~~~~~ 364 (364) .+....|.. +++. ...+ ++. .||.. +-|-++.+.++++--. T Consensus 364 ~~~~v~~~~--~~~~-~~~~-----~~~--~r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:98 364 SQYQASWTD--YMHF-GECL-----MIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred cceEEEEec--cccC-ceEE-----EEE--EEeccEEeccccEEEEEEeccCCCCC Confidence 455666543 2221 1111 111 24432 2233333322222221 No 85 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=91.71 E-value=0.015 Score=30.65 Aligned_cols=277 Identities=11% Similarity=0.048 Sum_probs=128.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) .+.+....+.......|+..+.......++... ++.. .-+....|.-............+|....... . T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~-~ 189 (415) T protein:vir:79 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTV---------KRVTNGSGKYPVVRQSEVAALEKVEELEENP-E 189 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhh-heee---------eeccCCceeEEEEeecCCccceeeccccccC-c Confidence 233334445556778888888776666665543 3321 0011111110111111111112332111111 0 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .....|.+-++.+.....-+.+..++- +-+.+||...-++.|++-+....|+.++..+. + |.. ..... 
T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~-g~~-------~~~~~-- 257 (415) T protein:vir:79 190 LAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVIT-K-GST-------GSTSS-- 257 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccc-c-Ccc-------ccccc-- Confidence 122345555555555555555544433 34788999999999999999999988775442 1 100 00000 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) + ........+.+...+++.|.+++....... ...=.++|||.-+..|++- T Consensus 258 ~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~n~~~~~~l~~l 307 (415) T protein:vir:79 258 G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKPN----------------YEHNVAIVSQTMFAKLDKM 307 (415) T ss_pred c--------------ccccccccccccccchhHHHHHHHhhhhhc----------------cCCCEEEEcHHHHHHHHHh Confidence 0 001111223344566776766654432110 1112467899888888741 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchheEeeecC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGVIAYGTA 313 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~~A~g~~ 313 (364) |. +..+|||. |.-+.+.|.+++-.+.++- +.++.+ .+++| .+++.+ +.. T Consensus 308 ---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~----~~~~Gd~~~~~~~--~~~ 363 (415) T protein:vir:79 308 ---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKGNN----TLIIGNLKDAIVL--FDR 363 (415) T ss_pred ---------hc-----cCCceeeccCcCCCCCceecceeeEEeccccc----CCCCcc----EEEEEehhccEEE--Eee Confidence 22 23467774 3456899999887665532 222222 47788 444433 223 Q ss_pred CCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccEEEEEeeeeeccC Q lcl|NC_019917. 
314 NGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfGvi~idta~~~~~ 364 (364) .+....|.. +++. ...+ ++. .||.. +-|-++.+.++++--. T Consensus 364 ~~~~v~~~~--~~~~-~~~~-----~~~--~r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:79 364 SQYQASWTD--YMHF-GECL-----MIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred cceEEEEec--cccC-ceEE-----EEE--EEeccEEeccccEEEEEEeccCCCCC Confidence 455666543 2221 1111 111 24432 2233333322222221 No 86 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=91.71 E-value=0.015 Score=30.65 Aligned_cols=277 Identities=11% Similarity=0.048 Sum_probs=128.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) .+.+....+.......|+..+.......++... ++.. .-+....|.-............+|....... . T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~-~ 189 (415) T protein:vir:81 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTV---------KRVTNGSGKYPVVRQSEVAALEKVEELEENP-E 189 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhh-heee---------eeccCCceeEEEEeecCCccceeeccccccC-c Confidence 233334445556778888888776666665543 3321 0011111110111111111112332111111 0 Q ss_pred hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc Q lcl|NC_019917. 81 EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFAG 160 (364) Q Consensus 81 ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~~ 160 (364) .....|.+-++.+.....-+.+..++- +-+.+||...-++.|++-+....|+.++..+. + |.. ..... 
T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~-g~~-------~~~~~-- 257 (415) T protein:vir:81 190 LAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVIT-K-GST-------GSTSS-- 257 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccc-c-Ccc-------ccccc-- Confidence 122345555555555555555544433 34788999999999999999999988775442 1 100 00000 Q ss_pred CcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhhc Q lcl|NC_019917. 161 NPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRTA 240 (364) Q Consensus 161 N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~~ 240 (364) + ........+.+...+++.|.+++....... ...=.++|||.-+..|++- T Consensus 258 ~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~n~~~~~~l~~l 307 (415) T protein:vir:81 258 G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKPN----------------YEHNVAIVSQTMFAKLDKM 307 (415) T ss_pred c--------------ccccccccccccccchhHHHHHHHhhhhhc----------------cCCCEEEEcHHHHHHHHHh Confidence 0 001111223344566776766654432110 1112467899888888741 Q ss_pred CCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchheEeeecC Q lcl|NC_019917. 241 AGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGVIAYGTA 313 (364) Q Consensus 241 ~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~~A~g~~ 313 (364) |. +..+|||. |.-+.+.|.+++-.+.++- +.++.+ .+++| .+++.+ +.. T Consensus 308 ---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~----~~~~Gd~~~~~~~--~~~ 363 (415) T protein:vir:81 308 ---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKGNN----TLIIGNLKDAIVL--FDR 363 (415) T ss_pred ---------hc-----cCCceeeccCcCCCCCceecceeeEEeccccc----CCCCcc----EEEEEehhccEEE--Eee Confidence 22 23467774 3456899999887665532 222222 47788 444433 223 Q ss_pred CCCCccceechhhccchhHHHHHHHHhhhhcccCC-----cccEEEEEeeeeeccC Q lcl|NC_019917. 
314 NGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-----KDFGVISIDTAAKKHS 364 (364) Q Consensus 314 ~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-----~DfGvi~idta~~~~~ 364 (364) .+....|.. +++. ...+ ++. .||.. +-|-++.+.++++--. T Consensus 364 ~~~~v~~~~--~~~~-~~~~-----~~~--~r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:81 364 SQYQASWTD--YMHF-GECL-----MIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred cceEEEEec--cccC-ceEE-----EEE--EEeccEEeccccEEEEEEeccCCCCC Confidence 455666543 2221 1111 111 24432 2233333322222221 No 87 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=91.68 E-value=0.015 Score=30.63 Aligned_cols=282 Identities=11% Similarity=0.081 Sum_probs=126.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) ||-|... +..+.|++++...-.+...|. +++=+.-..-+. ....||+|+++.-....-.-..+. .+.++ T Consensus 1 MANsl~~----l~p~iia~~al~~l~~~lV~~-~lV~r~y~~ef~-----~ak~GDTV~I~~P~~~~~~d~~~~-~~t~~ 69 (423) T protein:vir:10 1 MANNLDA----NVSQIVLKKFLPGFMSDLVLC-KTVDRQLLAGEI-----NSSTGDSVSFKRPHQFKSERTMDG-DITGK 69 (423) T ss_pred Ccccccc----ccHHHHHHHHHHHHHhhcccc-hhhccCCCcccc-----ccccCCEEEEeeCCceeeecccCc-ccCcc Confidence 9944433 346799999876666665554 354332211111 124899999986655432211111 12232 Q ss_pred -hhhhhhcccEEEEecccc-eeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 81 -EENLRFYTDQVKIDQVRH-PVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 81 -ee~L~~~~~~v~Idq~R~-~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .++|.-.+-.|.|||..+ ++.+..+ ++.-...||-+ .-.....=+++..|+.+...|...-. 
T Consensus 70 ~~~~l~e~~v~l~id~~k~~a~~v~d~-E~~l~i~~~~~-~l~~A~~aLA~~vd~~ia~~~~~~~~-------------- 133 (423) T protein:vir:10 70 SKNSLISAKATGEVGNYITVAVEYRQI-EEALKLNQLDQ-ILVPINERMVTDLETELALFMMKHGA-------------- 133 (423) T ss_pred cccccccceEEEEecceeeeeeeeChH-HHhcChhHHHH-HHHHHHHHHHHHHHHHHHHHhhhccc-------------- Confidence 245555667899999887 7777653 22234556633 33333455677777766655543110 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) |.+..|.. . .+ .++.+-.+..++. ....|. .+ -.+++.|.-...|. T Consensus 134 --~~vgt~~t--------~--------~~--a~~~~a~a~~~L~--~~~vP~-----------~~-R~~Vv~p~~~a~Ll 179 (423) T protein:vir:10 134 --LSLGSPNT--------P--------IK--KWSDVAQTASFLK--DLGINS-----------GE-NYAVMDPWAAQRLA 179 (423) T ss_pred --cccccccc--------c--------cc--cHHHHHHHHHHHh--hccCCc-----------CC-CEEEeCHHHHHHHh Confidence 11111110 0 01 2445555544433 222232 23 34599999888886 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCe-EEEcCEEEEecCcccccccccCCcc------cc----------cchheee Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGL-GMINNVVLHKHRNVIRFNDYGAGAN------VE----------AARALFM 301 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~-g~~ngvii~e~~~~~~~~~~~~~~~------v~----------v~ralll 301 (364) . ++......++.. .-.+=.|.+ |.+.|+-+++..+++...++..++. .. +.++-.+ T Consensus 180 ~-~~~~~~~~~~~~------~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~ 252 (423) T protein:vir:10 180 D-AQSGLHVSEQLV------RTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLT 252 (423) T ss_pred h-hhhhhccccccc------hHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEeccccccccccccee Confidence 4 233322222221 122445655 9999999999999876533222111 10 1222233 Q ss_pred ccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC------cccEEEEEeeeeeccC Q lcl|NC_019917. 
302 GRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS------KDFGVISIDTAAKKHS 364 (364) Q Consensus 302 GaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~------~DfGvi~idta~~~~~ 364 (364) ++-+-.-++-+. |--|..- ++.++.=+.|-++-. +-|-| +-|+-+-+++ T Consensus 253 ~~T~s~~g~l~~-GD~~t~a------------Gv~~v~~~tk~~l~~~~~~~~~~~~V-~~~~~~~a~~ 307 (423) T protein:vir:10 253 GATASKKGFLKV-GDQLQFD------------DTHWLNQQSKQTLYNGASALSFTATV-MEDANAHSSG 307 (423) T ss_pred eccceeceeEEe-cceEeec------------ceeeecccccceeecccCCcceEEEE-EecccccccC Confidence 322211111111 0000000 001111122222211 12222 1121111111 No 88 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=91.25 E-value=0.017 Score=30.32 Aligned_cols=274 Identities=10% Similarity=0.084 Sum_probs=125.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc--ccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL--RGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L--~G~gv~Gd~~le 78 (364) ...|....+.......|+..+.....+.+++.. ++..- . . .|.++++.....- ...+|...+.. T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~-~~~~~------~---~---~~~~~~~~~~~~~~~~a~~v~E~~~~- 200 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRD-LLMPG------Q---T---SSSSIEYTVETGFTNNAAAVAEGAQK- 200 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHh-hccee------e---c---cCCceeEEEEecCCCceeeeccCccc- Confidence 223333344456788899998888877777754 33210 0 0 1111222211110 11122222221 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 
79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+..++|..-++.+.....-+.+..++-+ -+ .||...-+..|.+-+....|+.+|.- +|. .. ...++ T Consensus 201 -~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds-~~l~~~i~~~l~~a~~~~~d~a~l~G-~g~--------~~-~p~Gi 267 (418) T protein:vir:10 201 -PTSDLKFNLKNQPVRTIAHLFKASRQILD-DA-PALQSYIDGRARYGLQLTEEGQILKG-DGT--------GA-NILGI 267 (418) T ss_pred -cccccceeeEEEeeeeEEEeehhhHHHHH-hH-HHHHHHHHHHHHHHHHHHHHHHHhcc-CCC--------Cc-ccccc Confidence 23445666666666666666655544433 23 47888888889999999999877621 111 00 01122 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ... ..........++..+++.|..++..+... +...-.++|+|..+..|+ T Consensus 268 ~~~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~----------------~~~~~~~v~n~~~~~~L~ 317 (418) T protein:vir:10 268 LPQ--------------ASAFMPSITLANATPIDKIRLALLQAVLA----------------EFPATGIVLNPIDWASIE 317 (418) T ss_pred ccc--------------cccccccccccccccHHHHHHHHHhhccc----------------cCCCCEEEEcHHHHHHHH Confidence 110 00011122233344566666654433211 011224778999998887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeec Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGT 312 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~ 312 (364) +-.| ...+|||. |..+.+.|.+++..+.++.. 
.+++|- |++.++ - T Consensus 318 ~lkd--------------~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~-------------~~~~gd~s~~~~~~--~ 368 (418) T protein:vir:10 318 LTKD--------------SQGRYIVGNPVNGTTPRLWNLPVVETQAMTAN-------------EFLVGAFSMAAQIF--D 368 (418) T ss_pred Hhhc--------------CCCceeccccccCCCceecceeeEEcCCCCCC-------------cEEEeeccceEEEE--E Confidence 4221 12356663 55688999999988766421 134453 222222 1 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) ..+....|.++..++ -+.+.+-+...++++ ++ ..-+++.+....++-. T Consensus 369 ~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~-~~---~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 369 RMEIEVLLSTENVDDFEKNMVSIRAEERLALA-VY---RPESFVTGALVEQAGG 418 (418) T ss_pred ecceEEEEecccchhhhcCceEEEEEEeeccE-Ee---cccceEEEEeccCCCC Confidence 234445555432211 011111111111111 00 1123333333322223 No 89 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=90.24 E-value=0.022 Score=29.68 Aligned_cols=277 Identities=13% Similarity=0.108 Sum_probs=141.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) || .+ -.++|+..|-....+.+.+.. | ... -+ .. .-|++|.++=+. -.|+.-=++-.|. 
T Consensus 1 Ma-----in---~a~~~~~~Ld~~~~~~~~t~~-l-~~~---~~-~~-----~ggktVkI~~i~---~~gl~DY~R~~g~ 58 (290) T protein:vir:78 1 MA-----IN---YVDKYGKELDQKLVFGTYTNE-L-ETP---NL-LW-----LDAKTFKIQTIT---TTGLKAHTRNKGY 58 (290) T ss_pred Cc-----hh---HHHHHHHHHHHHHHhhheeee-c-ccc---ce-ee-----ccCCEEEEeeec---cCcccccccCCCc Confidence 76 22 237899999999888877654 3 211 11 11 358999988333 2332111111222 Q ss_pred -hhhhhhcccEEEEecccceeeccchhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 81 -EENLRFYTDQVKIDQVRHPVSAGGRMSRKRS--VHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 81 -ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) ..+.+....++.+||-|-=--.=..|+...+ ...+-.....-..+......|.-.|-.|+...+. T Consensus 59 ~~g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~------------ 126 (290) T protein:vir:78 59 NEGSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKT------------ 126 (290) T ss_pred ccCccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhc------------ Confidence 2335667888999998832111134444333 2344444455555556667777777777642110 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) .| ......++++ =-++.|+.+..+.+ . .| .+-.+|+++|.-..-| T Consensus 127 --~~---------------~~~~~t~t~~--n~~~~i~~~~~~ld--e--vp------------~~~rvl~vtp~~~~lL 171 (290) T protein:vir:78 127 --NS---------------NSVAEEITKD--NVFTKLKAAIRKVK--K--YG------------TQNLVMYVSPDVMAAL 171 (290) T ss_pred --cC---------------cccccccCHH--HHHHHHHHHHHHHH--h--cC------------CCCeEEEECHHHHHHH Confidence 00 0011122222 23455556544432 1 11 2458999999999999 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccc------cccccCCcccccchheeeccchheEeee Q lcl|NC_019917. 
238 RTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIR------FNDYGAGANVEAARALFMGRQAGVIAYG 311 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~------~~~~~~~~~v~v~ralllGaqA~~~A~g 311 (364) +.+. + +++... ......-...|.+|.+||+.|+|.|.--| |.++...+..+-...+||-...+.+|.- T Consensus 172 ~~~~--~---f~r~~~-~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~~~~a~i~~~ 245 (290) T protein:vir:78 172 ELSD--D---FVRAIN-VQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLVNKGSVVGGA 245 (290) T ss_pred hhCh--h---hhcccc-ccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCccceeEEEEcCCceeeee Confidence 8643 4 455321 11112234689999999999999886434 4444333434445689999999999987 Q ss_pred cCCCCCccceechhhccchhHHHHHHHHhhhh----cccCCcccEEEEEeeee Q lcl|NC_019917. 312 TANGLRFDWEETVKDYGNEPAICAGFIAGMKK----ARFNSKDFGVISIDTAA 360 (364) Q Consensus 312 ~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K----~rf~~~DfGvi~idta~ 360 (364) ++.-.+.+=.+...+ |+. +.+.-++ .-++++-=|+++ =++| T Consensus 246 K~~~~~~~~P~~~~~-~d~------~~~~~r~y~d~~v~~nk~~~i~~-~~~~ 290 (290) T protein:vir:78 246 KHASIYLHAPGSVGQ-GDG------WLYQYRVYHDIFVLDQQKDGVIA-STEV 290 (290) T ss_pred eeeEEEeeCCCCCcC-cce------eeeeeeeeeeeeeeccccCeeEE-EeeC Confidence 754443332222111 110 0000000 001222224433 1222 No 90 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=89.77 E-value=0.024 Score=29.42 Aligned_cols=289 Identities=11% Similarity=0.051 Sum_probs=134.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) |.+++...|-...+..|+..++......++... +.. +++ + .+..+++..... ....+|...+.. . 
T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~-~~~--------~~~-~---~~~~~~~~~~~~~~~a~~v~E~~~~-~ 171 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQ-EAT--------VIT-L---GGSDYKKLVNLGGTTSGWVGETDAR-P 171 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhh-hce--------eee-c---CCCceEEEEecCCcceeeecccccc-c Confidence 666665555566778899998877766655543 221 000 0 011122221111 111122111111 0 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ......|..-++.+-...+-+.+..++-+ -+.+||-..-+..|..=+....|+.++. =+|+ + .| .|+. T Consensus 172 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~~~a~l~-G~G~---~-----~p--~Gil 239 (407) T protein:vir:48 172 ETATSKLGLIEPFMGEIYGNPQATQKMLD-DAFFNVEDWINSELALEFAEQEEIAFTS-GDGS---K-----KP--KGFL 239 (407) T ss_pred ccccccceeEEeeeeeeEeehhhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHhhhhc-cCCC---C-----cc--ceee Confidence 01112455555555555555555444332 4678898888888888888888887652 1222 1 11 2222 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ..+-...+ ....+.+........+++.++.+.|-++....... . 
...-+.+|||..+..|++ T Consensus 240 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~------------~----~~~a~~v~n~~~~~~L~~ 301 (407) T protein:vir:48 240 AYESTDED--DKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKA------------H----RSGAKFMMNNSSLFAIRL 301 (407) T ss_pred eccccccc--ccccccccccccccccccccChHHHHHHHHhhchh------------h----hcCCEEEEcHHHHHHHHH Confidence 11110000 00011111111122345667777776665433211 0 011245789998888864 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |. +..+|||. |..+++.|.+++..+.++.. +++. ..+++|-=.-++-++... T Consensus 302 l---------kD-----~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~---~~~~-----~~i~~Gd~~~~~~i~~~~ 359 (407) T protein:vir:48 302 L---------KD-----NDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI---AADA-----KAIAFGNFKRGYTIVDRI 359 (407) T ss_pred h---------hc-----cCCceeeccCcCCCCCceecceeeEEecCcCCc---cCCc-----cEEEEEeccccEEEEEee Confidence 2 21 23467774 44568889999888776531 1111 235566432222222223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccCC---cccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS---KDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~---~DfGvi~idta~~~~~ 364 (364) +.... .+ +|-.+-.+++. .-.||.. 
..=+++.+.-++++|| T Consensus 360 ~~~i~-~d---~~~~~~~~~~~-----~~~r~d~~v~~~~a~~~l~~~aa~~~ 403 (407) T protein:vir:48 360 GTRIL-RD---PYTNKPFVGFY-----TTKRTGGMLVDSQAIKLMKIGAATRQ 403 (407) T ss_pred ceEEE-ee---ccccCCcEEEE-----EEEEeccEEecccceEEEEeeccCCC Confidence 33332 11 33222111111 1123422 1225667778888888 No 91 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=89.50 E-value=0.026 Score=29.27 Aligned_cols=288 Identities=15% Similarity=0.071 Sum_probs=121.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhccc-ccceeecCCCccEEEEeecCCCCCceEEEEEeeccccC-----ceecC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYF-EQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGK-----PTYGD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f-~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-----gv~Gd 74 (364) ||....-+=+|++ +..+.++.+- ..+|- ..++..|+.-+++. .||-+.+++-.+|.|. .+.++ T Consensus 1 m~lsD~~vfN~~~--------~~a~~e~~~q~~~~fn-~as~gai~l~~~~~--~Gd~~~~pf~~~l~g~~~~~~~~~~~ 69 (325) T protein:vir:95 1 MALSDLAVYSEYA--------YSAFSETLRQQVDLFN-TATGGAIMLQSAAH--QGDFSDVAFFAKVTGGLVRRRNAYGS 69 (325) T ss_pred Cchhhhhhhhhhh--------hhhhhhhhhhhHhhhh-hcccceeEeccccc--cCceeeccccccccccccccccCCCC Confidence 8876655433443 2222222110 01121 34556676655554 3999999999999773 34444 Q ss_pred ceeecchhhhhhccc-EEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh----cccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTD-QVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYL----SGARGINLDF 149 (364) Q Consensus 75 ~~leGnee~L~~~~~-~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l----~ga~g~~~~~ 149 (364) ..++. ..|....+ .+.+-..+-++.. 
..++-..-.|-...+...+.++|+++..+.++..+ .++.+.+ T Consensus 70 ~~vt~--~kitt~~~~av~~~r~~g~~~~--d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~--- 142 (325) T protein:vir:95 70 GTVAE--KVLKHLVDTSVKVAAGTPPVRL--DPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV--- 142 (325) T ss_pred ceecc--ceeccccceeeEEecccCcccc--cHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--- Confidence 44442 34444444 2232222222221 12222222233333444555555555555544444 3321111 Q ss_pred cccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEE Q lcl|NC_019917. 150 VETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVM 229 (364) Q Consensus 150 ~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l 229 (364) .+.+.-....++ .++..+|.+++-+|..+ ++-. .++-=.++| T Consensus 143 ------------------~~~v~dis~~~~----~~~~~~s~~~l~~A~~k--lGD~--------------~~~l~~~~M 184 (325) T protein:vir:95 143 ------------------SDVVYDATANTD----AADKLPTWNNLNNGQAK--FGDQ--------------SSQIAAWIM 184 (325) T ss_pred ------------------ccceeeeecccC----cccccccHHHHHHHHHH--hccc--------------ccceeEEEE Confidence 011111111111 12345799999888664 3311 135668899 Q ss_pred echhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 230 SEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 230 ~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) |+.-+.+|++..-.++..+..+. .. + .+.-|.|-++.---.++.. ..+.....+.++||..|.++. 
T Consensus 185 HS~v~~~L~~~~L~~~~~~~~~~-g~----~-----~i~t~~G~~VIVdD~~p~~----~~g~~~~ytty~lg~GAi~~~ 250 (325) T protein:vir:95 185 HSTPMHKLYGSNLTNGERLFTYG-TV----N-----VVRDPFGKLLVMTDSPNLF----AAGTPNVYHILGLVPGGVLIG 250 (325) T ss_pred chHHHHHHHHhhccccccccccC-Cc----c-----cccccCCcEEEEeCCCCCC----CccCceeEEEEEEecCeEEec Confidence 99999999864322221111111 11 1 1223444333332223221 122233578999998885444 Q ss_pred eecCCCCCccceechhhccchhHHHHH----HHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 310 YGTANGLRFDWEETVKDYGNEPAICAG----FIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~~D~g~~~~i~i~----~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) .+. . +.-.....+=..+.+.... .+++.+=.+|....-|+ . +|.+.|-. T Consensus 251 ~~~--~--~~~~~~~~~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~-s-Pt~aeL~~ 303 (325) T protein:vir:95 251 QNN--D--FDANEETKNGDENIIRTYQAEWSYNIGVKGFAWDKANGGK-S-PTDAALFT 303 (325) T ss_pred CCC--C--ccccccccCcccceeeeeeeeeeEEeecceeeeecccccC-C-cChHhhcC Confidence 332 2 2212211121222222222 11222223331100000 0 11111111 No 92 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=89.45 E-value=0.026 Score=29.25 Aligned_cols=272 Identities=9% Similarity=0.028 Sum_probs=119.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) .+.|..+.+.......|+..+.......++... ++.. .-.+...|.-....+...- .+..|...... . 
T Consensus 107 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~---------~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-~ 175 (395) T protein:vir:38 107 SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLES-LANV---------ENVTTSHGSRVYEKLADITPLKDLDDESALI-G 175 (395) T ss_pred hccCccCCCceecchhHhhHHHHHHHhhcchhh-hcce---------eeccCCcceEEEEeeccCCcccccccccccc-c Confidence 334444455556678888888877777666543 4321 0011111111111111100 01222111111 1 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ..+..+|..-++.+.....-+.+..++- +.+.+||...-...|...+....|+.+|.- .|+- . T Consensus 176 ~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~il~g-~g~~--------~------- 238 (395) T protein:vir:38 176 DNDDPELTVVKYLIHRYAGITTVTNTLL-KDTVDNIIQWLVNWAAKKDVVTRNAKILEV-MGKA--------P------- 238 (395) T ss_pred cccccceeeEEeeeeeeEeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccc--------c------- Confidence 1223455555555555555555443333 357889999999999999999999887732 1110 0 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + +. +.+ +.+.|..+.... +... . ...-+.+|||..+..|++ T Consensus 239 -~----~~--------~~~-----------~~~~i~~~~~~~-l~~~-------~-------~~~a~~v~n~~~~~~L~~ 279 (395) T protein:vir:38 239 -K----KP--------TIS-----------QFDNIKDLENNT-LDPA-------I-------ESTSSFITNQSGYNILSK 279 (395) T ss_pred -c----cc--------ccc-----------cHHHHHHHHHHh-hhhh-------h-------cCCCEEEEcHHHHHHHHH Confidence 0 00 001 112222221110 1100 0 112356899988888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeecc--chheEeeec Q lcl|NC_019917. 
240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGVIAYGT 312 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~~A~g~ 312 (364) -. . ...+|||. |..+++.|.+++..+.+.-.. ..+ .-.+++|- +++.+. - T Consensus 280 lk---------d-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~---~~~----~~~i~~gd~~~~~~i~--~ 336 (395) T protein:vir:38 280 VK---------D-----ADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPD---VSG----SHPLYFGDLKQGITLF--D 336 (395) T ss_pred hh---------c-----cCCceeeccCcCCCCcceeccceeEEecccccCc---CCC----cceEEEEeccccEEEE--E Confidence 22 1 12357774 445688899887775532221 111 12466773 333332 2 Q ss_pred CCCCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEe---eeeeccC Q lcl|NC_019917. 313 ANGLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISID---TAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~id---ta~~~~~ 364 (364) ..+....|.++..++ -+...+-+...+|.+.. +.+-|-++.+- |.+++.| T Consensus 337 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~--~~~a~~~~~~~~~~~~~~~~~ 391 (395) T protein:vir:38 337 RQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLI--DDGAFAAASFKTVANQAQGTA 391 (395) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEe--cccceEEEEeecccCCCCCcc Confidence 234555555543222 11122212111222111 12333333332 2222222 No 93 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=87.62 E-value=0.038 Score=28.40 Aligned_cols=287 Identities=9% Similarity=0.026 Sum_probs=139.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) ||.|....|.. ....++.++.......|... ++.. ++.. ++..+++... ......+|-+++... 
T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~-~l~~------~~~~------~~~~~~~p~~~~~~~a~wv~Eg~~~~- 65 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIA-KLSP------QKPI------PFNGQREFVFDFDSDIDIVAENGKKT- 65 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhh-hhcc------eeec------cCCceEEEEEecCcceEEeeCCcccc- Confidence 99888887764 56778888877666666554 3321 1110 1112333321 122233443333322 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRK--RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +...+|..-++.+-....-+.+..++-++ -+.+||-..-++.|..=+++..|+.+|.-.-...|..... .+ T Consensus 66 -~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~------~~ 138 (300) T protein:vir:95 66 -HGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTI------IG 138 (300) T ss_pred -cccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCccc------cc Confidence 55677777777777766666654444321 2458898999999999999999999983321111110000 00 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) .....+......+++.+.+.+.|.++........ 
.+.=+.+|||..+..| T Consensus 139 --------------~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~L 188 (300) T protein:vir:95 139 --------------DNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSE----------------RDITGAILDPIFTTAL 188 (300) T ss_pred --------------ccccccccceeecccccchHHHHHHHHHHhhhcC----------------CCccEEEECHHHHHHH Confidence 0000111112223345566677777765443221 1111478899999888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++-.| +..+|||. |..+.+.|.+++-.+.+.- +...+- --+|+|--.-++.||- T Consensus 189 ~~lkd--------------~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~----~~~~~~---~~~~~GDf~~~~~~~~ 247 (300) T protein:vir:95 189 SKMKN--------------AEGGKLYPELAWGGVPDAINGLAVDKNRTVSY----SQTDPK---NTAIVGDFETMFKWGY 247 (300) T ss_pred HHhhc--------------cCCCeeccCccccCCCceecceeeEEecCCCC----CCCCCc---cEEEEeeccceEEEEE Confidence 74222 12255663 4568899999987766521 111111 1245565444445555 Q ss_pred CCCCCccceechhhccchhHHHHHH-HHhhhhcccC-CcccEEEEEeeeeeccC Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGF-IAGMKKARFN-SKDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~-i~G~~K~rf~-~~DfGvi~idta~~~~~ 364 (364) ..+..+.+.++ +..-+-++.. -..+--.|+. 
-=|++|.--...+++.- T Consensus 248 ~~~~~~~v~~~----~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~ 297 (300) T protein:vir:95 248 AKEVPMEIIKY----GDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVK 297 (300) T ss_pred ecccEEEEeec----cCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEec Confidence 45555555533 2211111110 0000000110 01333322222222222 No 94 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=87.36 E-value=0.039 Score=28.29 Aligned_cols=271 Identities=11% Similarity=0.054 Sum_probs=125.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceee-----cCCCc-cEEEEeecCCCCCceEEEEEeeccccCceecC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIG-----TSENA-VIQRKTELESDAGDTISFDLSVHLRGKPTYGD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G-----~g~~~-~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd 74 (364) .+.+....+.......++..+.......++... +.+ .+... +|.+.+ .+ -...+|... T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~-----~~----------~~~~~v~Eg 184 (415) T protein:vir:46 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTVKRVTNGSGKYPVVRQS-----EV----------AALEKVEEL 184 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhh-hcceeeccCCceeEEEEEec-----CC----------cceeecccc Confidence 233344445556778888888777666666543 432 11111 111110 00 001112111 Q ss_pred ceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 75 ~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) .... ......|..-++.+.....-+.+..++- +-+.+||...-++.|.+-+....|+.++.-+.. |.. . 
T Consensus 185 ~~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~--g~~-------~ 253 (415) T protein:vir:46 185 EENP-ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITK--GST-------G 253 (415) T ss_pred cccc-cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--CCc-------c Confidence 1110 0122344444444444444454433333 347789999999999999999999888755321 100 0 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) .... . .........++...+++.|.+++....... .+.=.++|||..+ T Consensus 254 ~~~~--~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~n~~~~ 301 (415) T protein:vir:46 254 STSS--G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKPN----------------YEHNVAIVSQTMF 301 (415) T ss_pred cccc--c--------------cccccceeccccccchHHHHHHHHhhhhhc----------------cCCCEEEEcHHHH Confidence 0000 0 000011122334456666666554332110 1112567999988 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchhe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGV 307 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~ 307 (364) ..|++- |. +..+|||. |.-+.+.|.+++..+.++- ++++ +..+++| .+++. T Consensus 302 ~~L~~l---------kd-----~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~----~~~~~~gd~~~~~~ 359 (415) T protein:vir:46 302 AKLDKM---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKG----NNTLIIGNLKDAIV 359 (415) T ss_pred HHHHHh---------hc-----cCCCeeeccCcCCCCCccccceeeEEeccccc----cCCC----ccEEEEEehhccEE Confidence 888641 21 22356763 4557889999887665532 2222 2246777 34333 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeeccC Q lcl|NC_019917. 
308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKKHS 364 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~~~ 364 (364) + +...+....|.. +++... . +++. .|+. .+-|-++.+.++++--. T Consensus 360 ~--~~~~~~~v~~~~--~~~~~~-~-----~~~~--~r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:46 360 L--FDRSQYQASWTD--YMHFGE-C-----LMIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred E--EeecceEEEeec--cccCce-E-----EEEE--EEeccEEeccccEEEEEeeccCCCCC Confidence 2 222455666653 233221 1 1121 2332 23343333333333332 No 95 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=87.36 E-value=0.039 Score=28.29 Aligned_cols=271 Identities=11% Similarity=0.054 Sum_probs=125.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceee-----cCCCc-cEEEEeecCCCCCceEEEEEeeccccCceecC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIG-----TSENA-VIQRKTELESDAGDTISFDLSVHLRGKPTYGD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G-----~g~~~-~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd 74 (364) .+.+....+.......++..+.......++... +.+ .+... +|.+.+ .+ -...+|... T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~-----~~----------~~~~~v~Eg 184 (415) T protein:vir:47 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTVKRVTNGSGKYPVVRQS-----EV----------AALEKVEEL 184 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhh-hcceeeccCCceeEEEEEec-----CC----------cceeecccc Confidence 233344445556778888888777666666543 432 11111 111110 00 001112111 Q ss_pred ceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 75 ~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) .... ......|..-++.+.....-+.+..++- +-+.+||...-++.|.+-+....|+.++.-+.. |.. . 
T Consensus 185 ~~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~--g~~-------~ 253 (415) T protein:vir:47 185 EENP-ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITK--GST-------G 253 (415) T ss_pred cccc-cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--CCc-------c Confidence 1110 0122344444444444444454433333 347789999999999999999999888755321 100 0 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) .... . .........++...+++.|.+++....... .+.=.++|||..+ T Consensus 254 ~~~~--~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~n~~~~ 301 (415) T protein:vir:47 254 STSS--G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKPN----------------YEHNVAIVSQTMF 301 (415) T ss_pred cccc--c--------------cccccceeccccccchHHHHHHHHhhhhhc----------------cCCCEEEEcHHHH Confidence 0000 0 000011122334456666666554332110 1112567999988 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchhe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGV 307 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~ 307 (364) ..|++- |. +..+|||. |.-+.+.|.+++..+.++- ++++ +..+++| .+++. T Consensus 302 ~~L~~l---------kd-----~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~----~~~~~~gd~~~~~~ 359 (415) T protein:vir:47 302 AKLDKM---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKG----NNTLIIGNLKDAIV 359 (415) T ss_pred HHHHHh---------hc-----cCCCeeeccCcCCCCCccccceeeEEeccccc----cCCC----ccEEEEEehhccEE Confidence 888641 21 22356763 4557889999887665532 2222 2246777 34333 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeeccC Q lcl|NC_019917. 
308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKKHS 364 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~~~ 364 (364) + +...+....|.. +++... . +++. .|+. .+-|-++.+.++++--. T Consensus 360 ~--~~~~~~~v~~~~--~~~~~~-~-----~~~~--~r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:47 360 L--FDRSQYQASWTD--YMHFGE-C-----LMIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred E--EeecceEEEeec--cccCce-E-----EEEE--EEeccEEeccccEEEEEeeccCCCCC Confidence 2 222455666653 233221 1 1121 2332 23343333333333332 No 96 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=87.29 E-value=0.04 Score=28.26 Aligned_cols=275 Identities=9% Similarity=0.069 Sum_probs=124.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc--ccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL--RGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L--~G~gv~Gd~~le 78 (364) +..+....+.......|+..+.......+++.+ ++..-. . .|..+++.....- ...+|...+. T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~-l~~~~~---------~---~~~~~~~~~~~~~~~~a~~v~E~~~-- 177 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRD-LVAPGT---------T---ESNSVEYVRETGFVNNAAPVSEGTQ-- 177 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHh-hcccee---------c---CCCceEEEEEecCCCceeeecCCcc-- Confidence 222333333345667888888888777777654 432110 0 1222333321111 1223322222 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) -.+..++|..-++.+.....-+.+..++-+ -+ .+|-..-+..|.+-+....|+.+|.- +|+- . 
...++ T Consensus 178 ~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~l~~~v~~~la~a~~~~~d~~~l~G-~g~~--------~-~~~Gi 245 (395) T protein:vir:43 178 KPYSDLTFELENAPVRTIAHLFKASRQILD-DA-SALQSYIDARARYGLMLVEECQLLYG-NGTG--------A-NLHGI 245 (395) T ss_pred ccccccceeEEEEeeeeEEEeehhhHHHHH-hH-HHHHHHHHHHHHHHHHHHHHHHHHhc-cCCC--------C-ccccc Confidence 234556666666666666666666555433 23 36888888888888888889877632 2211 0 11222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .... +..+.......+....++.|.++...+.... ...-+++|||..+..|+ T Consensus 246 ~~~~------------~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~l~ 297 (395) T protein:vir:43 246 IPQA------------QAYAPPSGVVVTAEQRIDRIRLAILQAQLAE----------------FPASGIVLNPIDWALIE 297 (395) T ss_pred cccc------------cccccccccccccchhHHHHHHHHHhhcccc----------------CCCcEEEEcHHHHHHHH Confidence 1100 0111111222333456777777654432111 11225789999988887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCee----ecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIF----KGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF----~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) +-.| + ..+||| .|..+.+.|++|+..+.++.. .+++|--.-+.-++-.. T Consensus 298 ~lkd---------~-----~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~-------------~~~~gd~~~~~~~~~~~ 350 (395) T protein:vir:43 298 LNKD---------A-----ENRYIIGSPQNGTTPTLWRLPVVETQAITQD-------------EFLTGAFSLGAQIFDRM 350 (395) T ss_pred Hhhc---------c-----CCceeccccccCCCceecceeeEEcCCCCCC-------------cEEEEeccceEEEEEec Confidence 4221 1 224555 455677889998887665421 13444322111111112 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeee Q lcl|NC_019917. 
315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta 359 (364) +....+.++..++ -|.+.+-+..-+|++-. +.+-|-++.+-++ T Consensus 351 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~~~~taa 395 (395) T protein:vir:43 351 DIEVLVSTENDKDFENNMVTIRAEERLAFAVY--RPEAFVTGSLTAS 395 (395) T ss_pred ceEEEEeccccchhhcCcEEEEEEEeeccEEe--cccceEEEEeccC Confidence 3344444332111 01111111111111110 1222333332222 No 97 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=86.10 E-value=0.048 Score=27.81 Aligned_cols=285 Identities=13% Similarity=0.114 Sum_probs=141.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCc-eeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDA-RTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~-~leG 79 (364) ||- + -.++|+..|-....+.+.+...+.++-.+.-| .. ..|.+|.++-+.--+| . +|- +-.| T Consensus 1 Mai-----n---ya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v-~~-----~ggktVkIp~is~tsG--l-~DY~R~~g 63 (346) T protein:vir:10 1 MTI-----N---YAEKYQAAVQQAFYDGHLYSAELWNSPSNSII-KF-----DGAKHIKVPRLEITSG--R-KDRQRRTI 63 (346) T ss_pred Ccc-----h---hHHHHHHHHHHHHHhhhccchhhcccccccce-Ee-----cCCCEEEEEEeeeecc--c-ccccccCC Confidence 662 2 23688888877776665554444444444322 22 2578898774421112 1 122 2222 Q ss_pred c--hhhhhhcccEEEEecccceeeccchhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccccccccccccc Q lcl|NC_019917. 80 T--EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSV--HNIRRIARDRLGDYFYKFTDELLFIYLSG-ARGINLDFVETPD 154 (364) Q Consensus 80 n--ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~--~dlr~~ar~~L~~w~~~~~D~~~~~~l~g-a~g~~~~~~~~~~ 154 (364) - ..+++....++.++|-|----.=..|+...|. ..+-.....-..+...-..|.-.|..|+. 
+.+.+ T Consensus 64 ~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~-------- 135 (346) T protein:vir:10 64 TTPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH-------- 135 (346) T ss_pred cccccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc-------- Confidence 2 24678888888898888322111344432221 11111111112222223446555555542 11100 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) ++...+.+++++. -++.|+.+..+.+...- | .+-.|||++|.-. T Consensus 136 --------------------~~~~~~~a~T~~n--i~~~i~~~~~~lde~~v--p------------~~~rvl~vTp~~~ 179 (346) T protein:vir:10 136 --------------------DGGITTNTLDEKN--ILPAFDNMMLDFDEARI--P------------STNRILYVTPKTN 179 (346) T ss_pred --------------------cccccccccCHHH--HHHHHHHHHHHHHHccC--C------------CCCeEEEECHHHH Confidence 0111122233332 24566676665443321 1 2458999999999 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcc-----cccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNV-----IRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~-----~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) .-|+++. . ++|.-. . +..+ ...|.+|.+|||.|++.|.- +.|.++...+..+-...+||-...+.+| T Consensus 180 ~lLk~s~--~---f~k~~~-v-~~~~-~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t~ak~INfiiv~~~A~ia 251 (346) T protein:vir:10 180 AILKRAE--A---MNRALT-L-KDPN-NIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIIDTAKQIEMFLIYNGVQIA 251 (346) T ss_pred HHHhhch--h---heeccc-c-cccc-ccceeeeeecCeEEEEcchhhcccchhhccCccccCCccceeEEEECCceeee Confidence 9998744 3 344321 1 1122 35899999999999998764 2344444444444456888888888888 Q ss_pred eecCCCCCccceechhhccchhHHHHHHHHhhhhccc------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
310 YGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARF------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf------~~~DfGvi~idta~~~~~ 364 (364) .-+..-.+.+=... ...|. ..+.+ -+| +++-=|+.+.-.-+++.. T Consensus 252 ~~K~~~~~if~P~~-~~~g~------~l~~~---R~Y~D~fv~~nk~~~Iyv~~~~a~~~~ 302 (346) T protein:vir:10 252 PEKYSFVGFDQPSA-ATSGN------YLYYE---QSYDDVLLLNTKTKGIQFVVSDKPKKD 302 (346) T ss_pred eeeeeeeEeeCCCC-Ccccc------eeeee---eeeeeeeeeccccceEEEeeecccccC Confidence 77654443332211 11111 00111 111 233336655555555444 No 98 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=85.36 E-value=0.053 Score=27.56 Aligned_cols=283 Identities=11% Similarity=0.106 Sum_probs=132.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) +..|....+.......++.++.....+.+++.. +.- + .-+. +..+++.... .-....+..++ +- T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~-~~~--------~-~~~~---~~~~~ip~~~~~~~a~~v~E~~--~~ 73 (304) T protein:vir:10 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMK-LAK--------N-EPMT---AQKKKFTYLAKGVGAYWVSETE--RI 73 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhh-hcc--------e-eecc---CCceEEEEEeCCcceEEeecCc--cc Confidence 344444444556778899999888877777653 321 0 0010 1112222111 11122332222 33 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|..-++.+.....-+.+..++ .+-+.+||...-++.|.+-+++..|+.+|.- .|+. .|. +.. 
T Consensus 74 ~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G-~g~~--------~~~--~~~ 141 (304) T protein:vir:10 74 QTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG-TKSP--------YNT--STS 141 (304) T ss_pred ccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheec-cCCC--------ccc--ccc Confidence 356677777777777777666664443 2346789999999999999999999888621 1211 000 000 Q ss_pred cCcccCCCCCcEEeeccccchhhhh-hcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLA-ATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~-~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .+. +..+.. ....+ ++...+++.|.++........ .+.-.++|||.-+..|+ T Consensus 142 ~~~---------~~~~~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~----------------~~~~~~v~~~~~~~~L~ 194 (304) T protein:vir:10 142 GKP---------LVEGAE--EKGNVVTDTNNLYVDLSALMATIEDEE----------------LDPNGVLTTRSFRSKMR 194 (304) T ss_pred ccc---------cccccc--ccccccccccchHHHHHHHHHHhhhcc----------------CCcCEEEEcHHHHHHHH Confidence 000 011101 11111 223456777777755443221 11224688999999998 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) +- |. +..+|||.+..+.+.|.+++..+.+.- .+... -+++|--..+ .+|..++... T Consensus 195 ~l---------kd-----~~G~~l~~~~~~~l~G~PV~~~~~~~~-----~~~~~----~~~~gd~~~~-~~~~~~~~~i 250 (304) T protein:vir:10 195 NA---------LD-----ANDRPLFDANGNEIMGLPLSYTGADVY-----DKKKS----LALMGDWDYA-RYGILQGIEY 250 (304) T ss_pred Hh---------hc-----cCCcEeecCCCccccceeeEEeccccc-----CCCCc----EEEEEehhhE-EEEEecceEE Confidence 42 22 234799999999999999987766632 11111 2455554432 2443333333 Q ss_pred cceechh----hccchhHHHHH-HHHhhhhcccCC-cccEEE-----EEeeeee Q lcl|NC_019917. 
319 DWEETVK----DYGNEPAICAG-FIAGMKKARFNS-KDFGVI-----SIDTAAK 361 (364) Q Consensus 319 ~w~Ee~~----D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi-----~idta~~ 361 (364) ...+|-. .+.+--+-.+. +-.++-+.|... -|+.|. ++-+.+. T Consensus 251 ~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 251 AISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 2222210 00000000000 001111111100 011111 1112222 No 99 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=85.36 E-value=0.053 Score=27.56 Aligned_cols=283 Identities=11% Similarity=0.106 Sum_probs=132.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) +..|....+.......++.++.....+.+++.. +.- + .-+. +..+++.... .-....+..++ +- T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~-~~~--------~-~~~~---~~~~~ip~~~~~~~a~~v~E~~--~~ 73 (304) T protein:vir:94 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMK-LAK--------N-EPMT---AQKKKFTYLAKGVGAYWVSETE--RI 73 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhh-hcc--------e-eecc---CCceEEEEEeCCcceEEeecCc--cc Confidence 344444444556778899999888877777653 321 0 0010 1112222111 11122332222 33 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|..-++.+.....-+.+..++ .+-+.+||...-++.|.+-+++..|+.+|.- .|+. .|. +.. 
T Consensus 74 ~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G-~g~~--------~~~--~~~ 141 (304) T protein:vir:94 74 QTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG-TKSP--------YNT--STS 141 (304) T ss_pred ccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheec-cCCC--------ccc--ccc Confidence 356677777777777777666664443 2346789999999999999999999888621 1211 000 000 Q ss_pred cCcccCCCCCcEEeeccccchhhhh-hcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLA-ATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~-~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .+. +..+.. ....+ ++...+++.|.++........ .+.-.++|||.-+..|+ T Consensus 142 ~~~---------~~~~~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~----------------~~~~~~v~~~~~~~~L~ 194 (304) T protein:vir:94 142 GKP---------LVEGAE--EKGNVVTDTNNLYVDLSALMATIEDEE----------------LDPNGVLTTRSFRSKMR 194 (304) T ss_pred ccc---------cccccc--ccccccccccchHHHHHHHHHHhhhcc----------------CCcCEEEEcHHHHHHHH Confidence 000 011101 11111 223456777777755443221 11224688999999998 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) +- |. +..+|||.+..+.+.|.+++..+.+.- .+... -+++|--..+ .+|..++... T Consensus 195 ~l---------kd-----~~G~~l~~~~~~~l~G~PV~~~~~~~~-----~~~~~----~~~~gd~~~~-~~~~~~~~~i 250 (304) T protein:vir:94 195 NA---------LD-----ANDRPLFDANGNEIMGLPLSYTGADVY-----DKKKS----LALMGDWDYA-RYGILQGIEY 250 (304) T ss_pred Hh---------hc-----cCCcEeecCCCccccceeeEEeccccc-----CCCCc----EEEEEehhhE-EEEEecceEE Confidence 42 22 234799999999999999987766632 11111 2455554432 2443333333 Q ss_pred cceechh----hccchhHHHHH-HHHhhhhcccCC-cccEEE-----EEeeeee Q lcl|NC_019917. 
319 DWEETVK----DYGNEPAICAG-FIAGMKKARFNS-KDFGVI-----SIDTAAK 361 (364) Q Consensus 319 ~w~Ee~~----D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi-----~idta~~ 361 (364) ...+|-. .+.+--+-.+. +-.++-+.|... -|+.|. ++-+.+. T Consensus 251 ~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 251 AISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 2222210 00000000000 001111111100 011111 1112222 No 100 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=84.12 E-value=0.063 Score=27.17 Aligned_cols=292 Identities=12% Similarity=0.055 Sum_probs=133.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) ||.+....+.-.....++..+.......|+... +. + .|. . .+..+++... ......+|-+.+... T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~-l~-~----~i~----~---~~~~~~ip~~~~~~~a~wv~Eg~~~~- 66 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAK-LS-P----EQP----T---IFGPVKGAVFSGVPRAKIVGEGEVKP- 66 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhh-hc-c----eee----c---CCCceEEEEEeCCcceEEeeCCcccc- Confidence 999998888889999999999888877776543 31 1 110 0 1122343322 222233444333333 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHN----IRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDF 155 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~d----lr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~ 155 (364) +...+|..-++..-....-+.+..++- +.+..| |+..-.+.|.+-+++..|+.+|.-. 
|..+ T Consensus 67 -~s~~~f~~v~l~~~kl~~~~~iS~ell-~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~-~~~~----------- 132 (315) T protein:vir:80 67 -SASVDVSAFTAQPIKVVTQQRVSDEFM-WADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-DPAT----------- 132 (315) T ss_pred -ccccceeeeEeeeeeEEeeehhhHHHh-hcCchhHHHHHHHHHHHHHHHHHHHHHhhheeecc-CCCC----------- Confidence 445566665555555544444433221 123444 6676777777777777787776321 1000 Q ss_pred cccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHH Q lcl|NC_019917. 156 TGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQAT 235 (364) Q Consensus 156 ~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~ 235 (364) + + +|.. +.....+....++.++. ....+.++........ + ...-..+|||..+. T Consensus 133 -~---~---~~~~---~~~~~~~~~~~~~~~~~-~~~d~~~~~~~~~~~~-----------~----~~~~~~imn~~~~~ 186 (315) T protein:vir:80 133 -G---K---AASA---VHTSLNKTKNIVDATDS-ATADLVKAVGLIAGAG-----------L----QVPNGVALDPAFSF 186 (315) T ss_pred -C---c---cccc---cccccccccceeecccc-chHHHHHHHHHHhhcc-----------C----ccceEEEEcHHHHH Confidence 0 0 0110 00011111122222332 2233444433221110 0 11124678999999 Q ss_pred HHhhcCCHHHHHHHHHhhhhhccCCC----eeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeee Q lcl|NC_019917. 236 DMRTAAGGTWIDFQKAAAAAEGRNNP----IFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYG 311 (364) Q Consensus 236 ~Lr~~~d~~w~~~qk~A~~~~g~~nP----lF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g 311 (364) .||+-.+.. ......+| +..|+.+.+.|.+++..+.|......+.+..+ -+++|--.- +.|| T Consensus 187 ~L~~l~~~~---------g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~----~~~~GDfs~-~~~g 252 (315) T protein:vir:80 187 ALSTEVYPK---------GSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV----KAIVGDFSR-VHWG 252 (315) T ss_pred HHHHHhhcc---------CCcccccccccccccCCCceecceeeEecCcCCccccccccccc----EEEEeeccc-EEEE Confidence 998532211 11112233 44667789999999988877544333333322 234454332 2344 Q ss_pred cCCCCCccceechhhccchhHHHHH-HHHhhhhccc---------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
312 TANGLRFDWEETVKDYGNEPAICAG-FIAGMKKARF---------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 312 ~~~g~~~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rf---------~~~DfGvi~idta~~~~~ 364 (364) ..+++.+...++ +...+.... +-.++-..|. +.+-|-++..-++.+... T Consensus 253 ~~~~~~i~i~~~----~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~ 311 (315) T protein:vir:80 253 FQRNFPIELIEY----GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNP 311 (315) T ss_pred EecCeeEEEecc----ccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCC Confidence 444555554432 222111111 1112112221 112233332223323333 No 101 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=84.08 E-value=0.063 Score=27.16 Aligned_cols=284 Identities=12% Similarity=0.087 Sum_probs=127.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) |+.+.-..|-...+..|+..+.......+++.. +.-. + . . .|..+.+..... ....+|-+.+ ... T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~-~~~~-----~-~---~---~~~~~~~~~~~~~~~a~wv~E~~-~~~ 172 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQ-EATV-----I-T---V---GGSDYKKLVNLGGTASGWVGETD-TRS 172 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhh-hcee-----e-e---c---CCCceEEEEecCCccceeecccc-ccC Confidence 666655555556678888888876655555432 3210 0 0 0 111122221111 1111221111 111 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ..+..+|..-++.+-....-+.+..++-+ .+.+||-...+..|.+=+....|+.+|.- +|+ + .| .|+. T Consensus 173 ~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~ai~~~~~~~~l~G-~G~-~-------~p--~Gil 240 (401) T protein:vir:44 173 QTATSRLGLIEPFMGEIYGNPQATQKMLD-DAFFNVEAWINSELATEFAEQEEIAFTTG-DGT-K-------KP--KGFL 240 (401) T ss_pred ccccccceeeeeehhheeeehhhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHhhhhcc-CCC-C-------cc--ceee Confidence 11223444445555555555655444333 47889999999999999999999888832 222 1 11 1221 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ..+ ...+.+ ...+.+........+++.++++-|.++....... - ...=+.+|+|..+..|++ T Consensus 241 ~~~-~~~~~~-~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~-------------~---~~~a~~v~n~~~~~~L~~ 302 (401) T protein:vir:44 241 AYE-STEESD-KARAFGKLQHIVSGEATAVTADAIIKLIYTLRKA-------------H---RTGAKFMMNNNSLFAIRL 302 (401) T ss_pred ccc-cccccc-cccccccccccccccccccCHHHHHHHHHhcchh-------------h---hcCCEEEEcHHHHHHHHH Confidence 110 000000 0000011001111244557777776665432111 0 111256799998888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |. +..+|||. |.-+.+.|.+++....++-. +++. ..+++|--.-++.++... 
T Consensus 303 l---------kd-----~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~---~~~~-----~~i~~Gd~~~~~~i~~~~ 360 (401) T protein:vir:44 303 L---------KD-----TEGNYLWRPGLELGQPSSLAGYGIAENEQMPDI---AADA-----KAIAFGNFKRGYTIVDRI 360 (401) T ss_pred h---------hc-----cCCceeecCCcCCCCCceecceeeEEecCcCCc---cCCc-----cEEEEeehhccEEEEEec Confidence 2 22 23468885 44567889999887766421 1111 245566432222222223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeee Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta 359 (364) +.... .+ +|-.+-.+++..+ .||. .+-|-.+.+-+| T Consensus 361 ~~~~~-~~---~~~~~~~v~~~a~-----~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 361 GTRIL-RD---PYTNKPFVGFYTT-----KRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ceEEe-ee---ccccCCcEEEEEE-----EEeccEEecccceEEEEeecC Confidence 33322 11 2222111111110 1332 234433333333 No 102 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=83.58 E-value=0.067 Score=27.01 Aligned_cols=287 Identities=13% Similarity=0.058 Sum_probs=127.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEE-eeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDL-SVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L-~~~L~G~gv~Gd~~leG 79 (364) ||.+. ++ ......+.++.....+.+.+. ++.. .| . . .+-.+++.. ........|-+++... 
T Consensus 1 ma~~g-G~---lvp~~~~~~ii~~~~~~s~i~-~l~~-----~~-~---~---~~~~~~ip~~~~~~~a~~v~E~~~~~- 62 (298) T protein:vir:16 1 MVLNK-GT---LFDPTLVTDLISKVAGKSSIA-RLSA-----QK-P---I---PFNGEKVFTFTMDSEIDVVAESGKKT- 62 (298) T ss_pred CcccC-cc---eechhHHHHHHHHHHhhhhhh-hhcc-----ee-e---c---cCCceEEEEEecCcceEEecCCcccc- Confidence 99443 33 344566777776665555544 3421 11 0 0 111123322 1122233343322222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRK--RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +..++|.+-++.+-....-+.+..++-++ -+.++|-..-++.|++-+.+..|+.+|.-.....|...++ .+ T Consensus 63 -~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~------~~ 135 (298) T protein:vir:16 63 -HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAV------IG 135 (298) T ss_pred -ccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccc------cc Confidence 44566655555555555555444443221 3457898889999999999999988883211011111000 00 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) .+. ..+..+.....++...-....|.++........ 
.+.=..+|||..+..| T Consensus 136 --------~~~----~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~l 187 (298) T protein:vir:16 136 --------TNH----FDSKVTQKVEAPRGIADPNGAIENAVELLTGVD----------------ADVTGIAINPSFRSAL 187 (298) T ss_pred --------ccc----cccccccccccccccccHHHHHHHHHHHhhhcC----------------CCccEEEEcHHHHHHH Confidence 000 001111111111111222445555544332111 1112477899999998 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++- |.+ ..+|||. |.-+.+.|.+++-...+.-. ...+. -.+|+|-=.-++.|+- T Consensus 188 ~~l---------kd~-----~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~---~~~~~----~~~~~GDfs~~~~~~~ 246 (298) T protein:vir:16 188 AKQ---------KDL-----QDNALFPELKWGATPDTINGLPVDVNKTVSDM---SLTQR----DRAIIGDFANGFKWGY 246 (298) T ss_pred HHh---------hcc-----CCCeeecCcccCCCCceecceeeEEecccccc---cCCCc----cEEEEeeccceEEEEE Confidence 742 222 2367774 44578999998877665421 11111 1466776443345555 Q ss_pred CCCCCccceechhhccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) ..+..+.+.++..+++. .+.. +-.++-..|..- =|+.|+-=..-+++-. 
T Consensus 247 ~~~~~~~~~~~~~~~~~--~~~~-f~~~~v~~ra~~r~d~~v~~~~a~~~l~~ 296 (298) T protein:vir:16 247 AKEVPLEVIQYGDPDNS--GLDL-KGYNQVYIRAELFLGWGILDATKFARVTE 296 (298) T ss_pred ecCceEEEeeccCCcCc--chhh-hhcCcEEEEEEEEEccEeecccceEEEee Confidence 55667777655333321 1110 111222222110 1222211111112212 No 103 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=83.29 E-value=0.07 Score=26.92 Aligned_cols=281 Identities=11% Similarity=0.068 Sum_probs=126.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) |+.|....+.......|+..++....+.++... +.- .| . . .+.++++..... ....+|-..+.. T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~-~~~-----~~-~---~---~~~~~~~p~~~~~~~a~~v~E~~~~-- 78 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQ-FAQ-----KV-P---M---GTTGQKIPHWIGDVSAQWIGEGDMK-- 78 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhh-hcc-----ee-e---c---cCCceEEEEEeCCcceEEecCCccc-- Confidence 776665554446678899999988877776543 321 11 0 0 123344443321 122333333332 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|.+-++.+.....-+.+..++- +.+.+||....++.|.+-+++..|+.+|.- .|+- . + ....+. 
T Consensus 79 ~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G-~g~~-~--~----~~~~~~- 148 (320) T protein:vir:10 79 PITKGNMTSQNIAPHKIATIFVASAETV-RANPANYLGTMRTKVATAFAMAFDSAALNG-TDSP-F--P----TYLAQT- 148 (320) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHH-hcChHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCC-C--C----cccccc- Confidence 2556777776777766666666644433 357889999999999999999999998622 2210 0 0 000000 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHH-HHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPI-VIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~-~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .+ +........++.++.-.++ .+.++...... ...+.-+++|||..+..|+ T Consensus 149 ~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~v~n~~~~~~L~ 200 (320) T protein:vir:10 149 TK------------SVSLADPGGATASDLTAYDAVAVNGLSLLVN----------------AKKKWTHTLLDDIVEPILN 200 (320) T ss_pred cc------------cccceecccccccccccHHHHHHHHHhhhhc----------------ccCCCcEEEEcHHHHHHHH Confidence 00 0001111222233332222 22222221110 0123347789999999998 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee-----cCe-----EEEcCEEEEecCcccccccccCCcccccchheeeccchheE Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK-----GGL-----GMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVI 308 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~-----g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~ 308 (364) +-.| + ..+|||. |.- +.+.|++++..+.+. .+.. .+++|--+-++ T Consensus 201 ~lkd---------~-----~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~-------~~~~----~~~~gd~~~~~ 255 (320) T protein:vir:10 201 GAKD---------K-----NGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVA-------DGTT----VGYMGDFRNVI 255 (320) T ss_pred Hhhc---------c-----CCceeeccccccCccccccCceeeeeeeEecCCCC-------CCce----EEEEeecceEE Confidence 5222 1 2245553 222 234455555443331 1111 23344332221 Q ss_pred eeecCCCCCccceech-----hhccc---------hhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
309 AYGTANGLRFDWEETV-----KDYGN---------EPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 309 A~g~~~g~~~~w~Ee~-----~D~g~---------~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) +|..++......+|. .|++. .+.+-+.+.+|.+-. +.+-|-+|.-=++-+| T Consensus 256 -~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~--~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 256 -WGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNN--DKDAFVKLTNVVTPDA 320 (320) T ss_pred -EEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEe--cccceEEEEeccCCCC Confidence 333333333333221 01111 111111111111111 2234555443344444 No 104 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=82.97 E-value=0.072 Score=26.84 Aligned_cols=271 Identities=10% Similarity=0.042 Sum_probs=122.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeec-----CCCc-cEEEEeecCCCCCceEEEEEeeccccCceecC Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGT-----SENA-VIQRKTELESDAGDTISFDLSVHLRGKPTYGD 74 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~-----g~~~-~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd 74 (364) -+.|....+.......++..+.......+++.. +.+. ++.. +|.+.+ ..-...+|... T Consensus 121 ~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~v~Eg 184 (415) T protein:vir:94 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK-YVTVKRVTNGSGKYPVVRQS---------------EVAALEKVEEL 184 (415) T ss_pred hhccccccccccCcHHHHHHHHHHHHhhhhhhh-hcceeeccCCceeEEEEeec---------------CCccceecccc Confidence 223333445555667788888777666666643 4321 1111 111110 00111122111 Q ss_pred ceeecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Q lcl|NC_019917. 75 ARTEGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPD 154 (364) Q Consensus 75 ~~leGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~ 154 (364) .... 
.....+|.+-++.+....+-+.+..++- +-+.+||...-++.|.+-+....|+.++..+...- +. T Consensus 185 ~~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~---------~~ 253 (415) T protein:vir:94 185 EENP-ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS---------TG 253 (415) T ss_pred cccc-ccccccceeeEeeheeeeeechhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc---------cc Confidence 1111 0112234444444444444444433322 24678898888999999999988887775432110 00 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) .... + ........+.++..+++.|.+++...... . .+.=.++|||.-+ T Consensus 254 ~~~~--~--------------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~------------~----~~~~~~vmn~~~~ 301 (415) T protein:vir:94 254 STSS--G--------------FEKEGKKLEVKKAKSLDDIKDAINLNVKP------------N----YEHNVAIVSQTMF 301 (415) T ss_pred cccc--c--------------ccccccccccccccchHHHHHHHHhhhhh------------c----cCCCEEEEcHHHH Confidence 0000 0 00111122233445666666665433211 0 1112467899888 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec--cchhe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG--RQAGV 307 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG--aqA~~ 307 (364) ..|++- |. ...+|||. |.-+.+.|.+++-.+.++- +.++.+ .+++| .+++. 
T Consensus 302 ~~l~~l---------kd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~----~i~~gd~~~~~~ 359 (415) T protein:vir:94 302 AKLDKM---------KD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVL----GQKGNN----TLIIGNLKDAIV 359 (415) T ss_pred HHHHHh---------hc-----cCCCeeeccCcCCCCCceecceeeEEeccccc----CCCCcc----EEEEEehhccEE Confidence 888742 22 22367774 3447899999887766542 222222 46777 45443 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeeccC Q lcl|NC_019917. 308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKKHS 364 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~~~ 364 (364) +. -..+....|..+ ++. ...+ ++. .||. .+-|-++.+.++++--. T Consensus 360 ~~--~~~~~~v~~~~~--~~~-~~~~-----r~~--~r~d~~~~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:94 360 LF--DRSQYQASWTDY--MHF-GECL-----MIA--VRQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred EE--eecceEEEEecc--ccC-ceEE-----EEE--EEeccEEeccccEEEEEEeccCCCCC Confidence 32 224556665532 221 1111 111 2332 22333333333332222 No 105 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=82.10 E-value=0.08 Score=26.60 Aligned_cols=290 Identities=14% Similarity=0.149 Sum_probs=134.4 Q ss_pred CceeecccCCch---HHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCcee Q lcl|NC_019917. 1 MTTTVIPFGDPK---AVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDART 77 (364) Q Consensus 1 Ma~T~~~~~dp~---a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l 77 (364) |++-+--.+ -| ...+||+.+..---++ +.+. -|.++.|. +.||+|.++=+ |.++.+|... 
T Consensus 1 ~~~~n~ts~-~qafi~~EiWsa~il~~l~~~--Lv~~--------~~~~~~d~--g~GDtV~InsI----g~~tV~dY~~ 63 (322) T protein:vir:31 1 MSTGNNTSN-TQALIVSEIWADEIEDILHEK--LLDV--------NIARVVDF--PDGDKLTIPSV----GTPVVRSRPE 63 (322) T ss_pred CCCCCCccc-ceEEeehhhhHHHHHHHhhhh--hhhh--------hhhccccc--CCCCeEEeccc----cccccccccC Confidence 887662222 22 1469988885433232 2211 12233333 46999999844 4444455444 Q ss_pred ecc--hhhhhhcccEEEEeccc---ceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccccccccc Q lcl|NC_019917. 78 EGT--EENLRFYTDQVKIDQVR---HPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLS-GARGINLDFVE 151 (364) Q Consensus 78 eGn--ee~L~~~~~~v~Idq~R---~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~-ga~g~~~~~~~ 151 (364) .+. -+.|.-.+.+|.|||.. +.|+- .+ -....||+..+-...++=+++-.|+-..-.|. |+...+ ... T Consensus 64 ~~~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~---~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~--~~~ 137 (322) T protein:vir:31 64 QGDFTFDNLDTGEISIILRDEVYAGNAISK-KL---RQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFA--GQN 137 (322) T ss_pred CCCcccccCCCceEEEEEehhhhhccccch-hH---HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh--ccC Confidence 433 57788889999999976 44543 23 23578999999999999999988887754443 221000 000 Q ss_pred cccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEec Q lcl|NC_019917. 152 TPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSE 231 (364) Q Consensus 152 ~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p 231 (364) .| . -.|.. | +.++. +.+.-++.++.|.++..++.... 
-| .+--++++.| T Consensus 138 ~p--~--vin~~--~---~~iv~--------~gt~~~~ay~~lv~l~~kLdkan--VP------------~~gR~vVV~P 186 (322) T protein:vir:31 138 DP--N--VINGV--P---HRFVG--------TGTDQTMDVTDFSRVNYVMTQSK--MP------------MGGMIGIIDP 186 (322) T ss_pred Cc--c--eecCC--c---cceec--------cCCCchhhHHHHHHHHHHhcccc--CC------------CCCeEEEeCc Confidence 00 0 00110 1 11221 11233577888888766553332 22 1235678899 Q ss_pred hhHHHHhh-------cCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccc----cccCCccc-ccchhe Q lcl|NC_019917. 232 YQATDMRT-------AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFN----DYGAGANV-EAARAL 299 (364) Q Consensus 232 ~q~~~Lr~-------~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~----~~~~~~~v-~v~ral 299 (364) .....|.. ..|++|-.+..+.- .+|- .| +|-+-|+-|+....+..-+ +++.+... +..+.+ T Consensus 187 ~~~~~L~~i~~~~~l~~D~rf~~i~~sG~-a~g~---~~---Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~ 259 (322) T protein:vir:31 187 SVAHHLETITNISNISNNPRWEGIVESGI-APDM---QF---VRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNM 259 (322) T ss_pred hhhhhhhhhhhhhhhhccccccccccccc-hhhH---HH---HHHHhceeeeeeccccccccccccCcccccccceeecc Confidence 99887743 23556654444321 1121 12 6777788888877663221 11111110 001111 Q ss_pred e-e----ccchheEeeecCCCCCcccee----chhhccc--hhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 300 F-M----GRQAGVIAYGTANGLRFDWEE----TVKDYGN--EPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 300 l-l----GaqA~~~A~g~~~g~~~~w~E----e~~D~g~--~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) | | |.-..+- .|.| |.|=+.+ .-+...-+.+|-.=.|- |--+|+. 
=++++-+- T Consensus 260 f~~~~~~~~~~~~~----------~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~--e~l~~~~-a~~~~~~~ 322 (322) T protein:vir:31 260 FMNVSDMGLLPFVV----------AWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRD--ENLVCVL-ANADKVTF 322 (322) T ss_pred cccccchhhhhhhh----------HhhhhhhhhcccCccccccceeeeeeecceeecc--cceEEEE-eccccccC Confidence 1 1 1111111 1211 1111111 11222222333221221 2222211 12222222 No 106 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=79.71 E-value=0.1 Score=26.02 Aligned_cols=282 Identities=13% Similarity=0.064 Sum_probs=125.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) |+++....+ ......++..+.....+.++... +.- +.. . .+..+++.... ......|.+.+ +- T Consensus 20 ~~~~~~~~g-~~ip~~~~~~ii~~~~~~s~i~~-~~~--------~~~-~---~~~~~~~p~~~~~~~a~~v~Eg~--~~ 83 (326) T protein:vir:42 20 AQTGDSMFE-GYLEPEQAQDYFAEAEKISIVQQ-FAQ--------KIP-M---GTTGQKIPHWTGDVSASWIGEGD--MK 83 (326) T ss_pred eeccccCCc-ceechhhHHHHHHHHHhcchhhh-hcc--------eee-c---cCCceEEEEEeCCcceEEecCCc--cc Confidence 444433333 24678888998888777776543 211 000 0 11123332111 11222332222 33 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|..-++.+.....-|.+..++-+ .|.+||-..-++.|.+-+.+..|+.+|.- +|+ + .| .++. 
T Consensus 84 ~~~~~~f~~i~~~~~k~~~~v~iS~ell~-~s~~~~~~~i~~~l~~a~~~~~d~a~l~G-~gs-~-------~p--~gi~ 151 (326) T protein:vir:42 84 PITKGNMTSQTIAPHKIATIFVASAETVR-ANPANYLGTMRTKVATAFAMAFDNAAING-TDS-P-------FP--TFLA 151 (326) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHHHHHHHHHhhcc-cCC-C-------cc--cccc Confidence 35667888878888877777777655443 47899999999999999999999988722 221 1 11 1111 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .+ ++........+.. .+..++...+........... . ....-+.+|||..+..|++ T Consensus 152 ~~----~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~-----------~---~~~~a~~v~n~~~~~~L~~ 207 (326) T protein:vir:42 152 QT----TKEVSLVDPDGTG------SNADLTVYDAVAVNALSLLVN-----------A---GKKWTHTLLDDITEPILNG 207 (326) T ss_pred cc----ccccceeeccccc------ccccchhHHHHHHHHHhhhhh-----------h---ccCccEEEEeHHHHHHHHH Confidence 11 0000000000011 111222222222222221111 0 0122346789999999984 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecC----------eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGG----------LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~----------~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) - |.+ ..+|||... .+.+.|+++.-.+.+..... .+++|-=.-+ . T Consensus 208 l---------kd~-----~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-----------~~~~Gd~s~~-~ 261 (326) T protein:vir:42 208 A---------KDK-----SGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTV-----------VGYQGDFRQL-V 261 (326) T ss_pred h---------hcc-----CCceeeccccccCccccccCceeeeeeEEEcCCCCCCce-----------EEEEeecceE-E Confidence 2 221 235677532 13567777776655432111 1122211110 1 Q ss_pred eecCCCCCccceech-----hhcc---------chhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
310 YGTANGLRFDWEETV-----KDYG---------NEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~-----~D~g---------~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) +|-.++..+.-.+|. .|++ +.+.+-+.+.++.+.. +.+-|-+|.--+++++ T Consensus 262 ~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~--~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 262 WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCN--DKDAFVKLTNVDATEA 326 (326) T ss_pred EEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe--cccceEEEeeccccCC Confidence 222222222211111 0111 1122222222222221 2244655555555555 No 107 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=78.92 E-value=0.11 Score=25.85 Aligned_cols=280 Identities=11% Similarity=0.060 Sum_probs=131.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) +..|....+.......|+.+++......++... +... +. . .|..+++.... .-....|.+++... T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~-l~~~--------~~-~---~~~~~~ip~~~~~~~a~~v~Eg~~~~- 92 (324) T protein:vir:93 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGKY--------EP-M---EGTEKKFTFWADKPGAYWVGEGQKIE- 92 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhh-hcce--------ee-c---cCCceEEEEEecCcceeeecCCcccc- Confidence 444444444446678899999888877777654 3211 00 0 11223332211 11122332222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +..++|.+-++.+-....-+.+..++-++ +.+||...-++.|++=+++..|+.+|.- .|+.+ ...+.. T Consensus 93 -~~~~~f~~i~~~~~k~~~~~~iS~ell~d-s~~~l~~~i~~~l~~aia~~~d~a~l~G-~g~~~---------~~~~~~ 160 (324) T protein:vir:93 93 -TSKATWVNATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKSIA 160 (324) T ss_pred -ccccceeEEEEEeEEEEEeehhhHHHHhc-chHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCC---------cCcccc Confidence 45577777777777777777665554443 5689999999999999999999988632 12110 001111 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) . .. ...+..+.+..+.+.|.++........ .+.=.++|+|..+..|++ T Consensus 161 ~--------------~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~----------------~~~~~~v~n~~~~~~L~~ 208 (324) T protein:vir:93 161 Q--------------SI--EKTNKVIKGDFTQDNIIDLEALLEDDE----------------LEANAFISKTQNRSLLRK 208 (324) T ss_pred c--------------cc--cccceeccccccHHHHHHHHHhhhhcc----------------CCCCEEEEcHHHHHHHHH Confidence 0 00 001112234567777777765443221 111147899999998875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| ...+|+|. +.-+.+.|.+++-.+.. .... 
-.+++|-=.- +.+|-.++..+ T Consensus 209 l~d--------------~~G~~~~~~~~~~~l~G~PVv~~~~~-------~~~~----~~i~~gdfs~-~~~~~~~~~~i 262 (324) T protein:vir:93 209 IVD--------------PETKERIYDRNSDSLDGLPVVNLKSS-------NLKR----GELITGDFDK-LIYGIPQLIEY 262 (324) T ss_pred hhC--------------CCCCeeecCCCCCcccceeeEeecCC-------CCCc----ceEEEEecce-EEEEEecCcEE Confidence 322 23478887 44577888887643221 1111 1234444222 12333334433 Q ss_pred cceechh--hccchhHHHHH-HHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 319 DWEETVK--DYGNEPAICAG-FIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~--D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) ...++-. +..+..+.... +-.++-.+|... =|++++--...|+++. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~ 312 (324) T protein:vir:93 263 KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) T ss_pred EEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEec Confidence 3333210 00011111111 122333333321 1333332222223322 No 108 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=78.46 E-value=0.11 Score=25.75 Aligned_cols=284 Identities=12% Similarity=0.117 Sum_probs=116.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) |+.|....+.......++..+.......++... +... |. . .+.++++.... .....+|-+.+... 
T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~-~~~~-----~~----~---~~~~~~ip~~~~~~~a~~v~Eg~~~~- 79 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQ-FAQK-----VP----M---GTTGQKIPHWVGDVSAQWIGEGDMKP- 79 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhh-hcce-----ee----c---cCCceEEEEEeCCcceEEecCCcccc- Confidence 776666555556778889999877766665543 4211 10 1 11223333211 11222332222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +...+|..-++.+-....-+.+..++- +.|.+||-..-++.|.+-+.+..|+.+|.- .|+ ..| .++. T Consensus 80 -~~~~~f~~i~~~~~k~~~~~~iS~e~l-~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G-~g~--------~~~--~~~~ 146 (318) T protein:vir:24 80 -ITKGNMTSQTIAPHKIATIFVASAETV-RANPANYLGTMRTKVATAFAMAFDGAAMHG-TDS--------PFP--TYIG 146 (318) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHh-hcChHHHHHHHHHHHHHHHHHHHHHhhhcc-cCC--------CCC--cccc Confidence 345566554555544444444433322 347789999999999999999999988622 111 001 1111 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) . ... .......++++....+.+.++.... .. 
...+.-+++|||..+..|++ T Consensus 147 ~-~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~--~~--------------~~~~~~~~v~n~~~~~~L~~ 197 (318) T protein:vir:24 147 Q-TTK------------AISIADTTGATTVYDQVAVNGLSLL--VN--------------DGKKWTHTLLDDITEPILNG 197 (318) T ss_pred c-ccc------------cccccccccccchHHHHHHHHHHhh--cc--------------ccCCCCEEEEcHHHHHHHHH Confidence 0 000 0000011111112222222222211 11 01223367999999999984 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeec-----CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKG-----GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G-----~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) -.| + ..+|||.. ....+.|-.+.-.|-+ ..+....+.. .+++|--+. +.+|..+ T Consensus 198 lkd---------~-----~G~~l~~~~~~~~~~~~~~~~~i~g~pv~--~~~~~~~~~~----~~~~gdfs~-~~~~~~~ 256 (318) T protein:vir:24 198 AKD---------Q-----NGRPLFIESTYGEAASPFRSGRIVARPTI--LSDHVVEGTT----VGFMGDFSQ-LIWGQIG 256 (318) T ss_pred hhc---------c-----CCceeecCccccCccccccCceEEEEeeE--EeCCCCCCcc----EEEEeecce-EEEEEec Confidence 222 1 22455542 2222223233322222 1111111211 234443332 1234333 Q ss_pred CCCccceechhhccchhHHHHH------HHHhhhhcc----c-----CCcccEEEEEeeeeeccC Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAG------FIAGMKKAR----F-----NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~------~i~G~~K~r----f-----~~~DfGvi~idta~~~~~ 364 (364) +......+| +.-+.+..-+ +-.++-.+| | +.+-|-+|..=+++.+.. 
T Consensus 257 ~l~i~~~~~---~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 257 GLSFDVTDQ---ATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred CeEEEEeec---cceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 443332222 1111111000 111222221 1 123344443333333222 No 109 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=77.78 E-value=0.12 Score=25.61 Aligned_cols=286 Identities=13% Similarity=0.056 Sum_probs=130.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) ||.+. +. .....++..+.......++.. ++... | . + .+.++++.... .....+|-+.+.. T Consensus 1 ma~~g-G~---lip~~~~~~ii~~~~~~s~i~-~~~~~-----~-~---~---~~~~~~~p~~~~~~~a~~v~Eg~~~-- 61 (298) T protein:vir:94 1 MVLNK-GT---LFDPELVTDLISKVAGKSSIA-RLSAQ-----K-P---I---PFNGEKVFTFTMDSEIDVVAESGKK-- 61 (298) T ss_pred Ceecc-cc---ccChhHHHHHHHHHHhhchhh-hhcce-----e-e---c---cCCceEEEEEecCcceEEeeCCccc-- Confidence 98744 33 345667777776665555543 33210 0 0 0 11123333221 1122333333222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRK--RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) .+...+|.+-++.+-....-+.+..++-++ -+..+|...-+..|.+-+.+..|+.+|.......|...++..... 
T Consensus 62 ~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~--- 138 (298) T protein:vir:94 62 THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH--- 138 (298) T ss_pred cccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccc--- Confidence 255667777777666666666554443221 345788888999999999999999888431111111111100000 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) ..+..+......+.+....+.|.++........ .+.-+.+|||..+..| T Consensus 139 ---------------~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~l 187 (298) T protein:vir:94 139 ---------------FDSKVTQKVEAPRGIADPNGAIENAVELLTGVD----------------ADVTGIAINPSFRSAL 187 (298) T ss_pred ---------------cccccccccccccccccHHHHHHHHHHhhhhcC----------------CCccEEEEcHHHHHHH Confidence 001111111222223334555666655432211 1222589999999998 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++- |.+ ..+|||. |..+.+.|.+++-.+.++. +...+. . .+|+|--+-++.|+- T Consensus 188 ~~l---------kd~-----~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~----~~~~~~--~-~~~~Gdfs~~~~~~~ 246 (298) T protein:vir:94 188 AKQ---------KDL-----QGNALFPELKWGATPDTINGLPVDVNKTVSD----MSLTQR--D-RAIIGDFANGFKWGY 246 (298) T ss_pred HHh---------hcc-----CCCeeecCcccCCCCceecceeeEEeccccc----ccCCCc--c-EEEEeeccceEEEEE Confidence 742 221 2366764 4457888998886665531 111111 1 367786665566766 Q ss_pred CCCCCccceechhhccchhHHHHH-HHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 
313 ANGLRFDWEETVKDYGNEPAICAG-FIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) .++..+.+.++ +..-+..+. +-.++--.|..- =|+.+.-=...+++-. T Consensus 247 ~~~~~~~~~~~----~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~ 296 (298) T protein:vir:94 247 AKEVPLEVIQY----GDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE 296 (298) T ss_pred ecCceEEEeec----CCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEe Confidence 56666665542 211111110 001111111100 0111111111111111 No 110 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=76.44 E-value=0.14 Score=25.34 Aligned_cols=265 Identities=13% Similarity=0.067 Sum_probs=123.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec--cccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH--LRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~--L~G~gv~Gd~~le 78 (364) +..+....+.......+...+.......++..+ ++.. + . . .+..+++..... ....+|...+.. T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~-~~~~-----~-~---~---~~~~~~~~~~~~~~~~a~~v~Eg~~~- 178 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRD-LIGS-----G-R---T---DSALIEYVQETGFVNNAAIVAEGALK- 178 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhh-hcce-----e-e---c---cCCceEEEEEecCCcceeeecCCccc- Confidence 444444444445666788888777766666543 3221 0 0 0 011222222111 122233322222 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+..++|.+-++.+.....-+.+..++-+. 
+ .++-..-++.|..-+....|+.++.- .|+ + ....++ T Consensus 179 -~~~~~~~~~i~~~~~k~~~~~~is~ell~d-~-~~~~~~i~~~l~~~~~~~~d~a~l~G-~g~-~--------~~~~Gi 245 (390) T protein:vir:81 179 -PESSLKFAKKTDTTHVIAHTMKATRQILSD-A-PQLASYMNNRLIRGLKVKEDAEILRG-TGA-N--------DGLLGL 245 (390) T ss_pred -ccccceeeEEEEeeeEEEEeehhhHHHHHh-H-HHHHHHHHHHHHHHHHHHHHHHHHhc-CCC-C--------Ccccce Confidence 245567777777776666666665555443 3 47888888889998999999877621 111 0 111222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .. .........+.++..+++.|..+...+.... ...-..+|||..+..|+ T Consensus 246 ~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~v~~~~~~~~l~ 295 (390) T protein:vir:81 246 IP--------------QATTYAAPTTIAGATRVDQLRLAMLQASLAE----------------YNPSGIVINPIDWAAIE 295 (390) T ss_pred ee--------------cccccccccccccchhHHHHHHHHHhhcccc----------------CCCCEEEEcHHHHHHHH Confidence 11 1111111222334455666666654432111 11124678999998887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) +-.| + ..+|||. |..+.+.|++++..+.++.. .+++|--.-++-++-.+ T Consensus 296 ~lkd---------~-----~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~-------------~~~~gd~~~~~~~~~~~ 348 (390) T protein:vir:81 296 LAKD---------A-----NNQYLIGNARGTLTPTLWGLPVVATQAMAPG-------------EFLVGAFDLAAQIFDQW 348 (390) T ss_pred Hhhc---------C-----CCceeecCcccccCceecceeeEEcCCCCCC-------------cEEEEehhceEEEEEec Confidence 4222 1 1245553 44567889999888766421 13445332211112223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhc----ccCC---cccEEEEEeee Q lcl|NC_019917. 
315 GLRFDWEETVKDYGNEPAICAGFIAGMKKA----RFNS---KDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~----rf~~---~DfGvi~idta 359 (364) +....|..+..+ .-.+.-.+ ||.. ..-+++.|..| T Consensus 349 ~~~v~~~~~~~~----------~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 349 DARVEIGYVGED----------FQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ceEEEEecccch----------hhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 444444432111 11121112 2211 11122222222 No 111 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=75.03 E-value=0.15 Score=25.08 Aligned_cols=290 Identities=10% Similarity=0.027 Sum_probs=119.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) |..+....|-......|+..+.......+++.. +... . -... ..+++... ......+|.+.+... T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~-l~~~--------~-~~~~---~~~~~~~~~~~~~a~wv~E~~~~~- 195 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQ-LCRV--------Q-PVSK---AGFSKLFNMGGTTSGWVGEASQRP- 195 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhh-hcee--------e-eccC---CceEEEEEcCCcceeeeccccccc- Confidence 665555555556778888888777666665543 3210 0 0000 01111110 011111221111110 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) ......|..-++.+-....-|.+..++- +-+.+||-..-+..|..=+....|+.++.- .|. + . -.|+. 
T Consensus 196 ~~~~~~f~~v~~~~~k~~~~i~iS~ell-~ds~~~l~~~i~~~la~ai~~~~d~~~l~G-~G~---~-----~--p~Gil 263 (425) T protein:vir:10 196 QTNAATFQPLSFASGEIYANPAATQQIL-DDAEIDLESWLATEVQTEFAKQEGKAFLAG-DGT---N-----K--PNGLL 263 (425) T ss_pred cccccccceeeeeheeeEeehHhHHHHH-hcchhHHHHHHHHHHHHHHHHHHHhhhhcc-cCC---C-----C--cceee Confidence 0011234444444444444444433322 235688888888888888888888876621 221 1 1 11222 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ..+ ...+. -.-.+.+........+++.++.+.|-++....... + ...-+.+|||..+..|++ T Consensus 264 ~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~------------~----~~~a~~vmn~~~~~~L~~ 325 (425) T protein:vir:10 264 TYI-AGGAN-AAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSA------------F----TGNARFAMNRNTQRQVRK 325 (425) T ss_pred ecc-ccccc-cccccccccccccccccccccHHHHHHHHhhhhhh------------h----ccCCEEEEchHHHHHHHH Confidence 111 10000 00000000011111234456666555554322110 0 122356899988888864 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |. +..+|||. |.-+++.|.+++..+.++-. +++. ..+++|--.-++..+-.. T Consensus 326 l---------kD-----~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~---~~~~-----~~i~~Gd~~~~~~i~~~~ 383 (425) T protein:vir:10 326 L---------KD-----GQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDV---AANS-----TPILFGDFQQTYLIIDRI 383 (425) T ss_pred h---------hc-----CCCceeeccCccCCCCceecceeeEEecCcCCc---cCCc-----cEEEEEehhccEEEEEec Confidence 2 21 23367774 45578889998887766421 1111 235667433222223223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) +....+ +..+.+ +.+.+-+...++.+.. +.+-|-++. -+|+- T Consensus 384 ~~~v~~-d~~~~~-~~~~~~~~~r~d~~v~--~~~A~~~l~--~~as~ 425 (425) T protein:vir:10 384 GVRVLR-DPYTAK-PYVLFYTTKRVGGGLL--NPEPMRAMK--VAASE 425 (425) T ss_pred ceEEEe-cccccC-CcEEEEEEEEeccEee--cccceEEEE--eeccC Confidence 333221 111111 1111111111111110 123332222 22222 No 112 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=74.04 E-value=0.16 Score=24.90 Aligned_cols=267 Identities=12% Similarity=0.024 Sum_probs=120.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEE-eeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDL-SVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L-~~~L~G~gv~Gd~~leG 79 (364) |+.+....|.......++..+.......++... +++.- . +....| ++.+.- .......+|...... . T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~-~~~~~------~---~~~~~~-~~~~~~~~~~~~a~~v~Eg~~~-~ 190 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQ-YVTVE------P---VTTRSG-TRLLEKNADMVPFSPVEELGNL-P 190 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHh-hccee------e---ccCCce-eEEEEEecCCcceeeecccccc-c Confidence 777777777777778888888877766666543 43210 0 000011 011110 011111222221111 0 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .....+|..-++.+.....-|.+..++- +-+.+||...-+..|++=+.+..|+.++.- .|+ T Consensus 191 ~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~l~~~i~~~l~~~~~~~~d~~il~G-~g~----------------- 251 (397) T protein:vir:12 191 EIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQAIMTYVAKWFAKKSVVTRNNLILAA-IAS----------------- 251 (397) T ss_pred ccccccceeEEeeheeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc----------------- Confidence 0112344444444444545554433322 346789988889999998999999887732 110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .+ |+ .+.+++-|..+.. +.+. |-. ...-+.+|||..+..|++ T Consensus 252 ~~----~~-------------------g~~~~~~i~~~~~-~~l~-----------~~~---~~~a~~~~n~~~~~~L~~ 293 (397) T protein:vir:12 252 LK----KV-------------------DIDGLDGIKKALN-VTLD-----------PMV---APGSIVLTNQDGYDWLDT 293 (397) T ss_pred cc----cc-------------------ccccHHHHHHHHh-hccc-----------hhh---hCCCEEEEcHHHHHHHHH Confidence 00 11 1112222322211 0010 000 111346899999888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |.+ ..+|||. |.-+.+.|.+++..++. ....+++ ...+++|-=.-++.++-.. 
T Consensus 294 l---------kd~-----~G~~l~~~~~~~g~~~~l~G~pv~~~~~~---~~~~~~~----~~~~~~gd~~~~~~~~~~~ 352 (397) T protein:vir:12 294 L---------KDG-----TGRYLLQPDPTNPTKKLLDGRPVVPFTNR---VLKTQKG----KAPLIIGNLKEAIVLFDRE 352 (397) T ss_pred h---------hcc-----CCceeecccccCCCCccccceeeEEeccc---ccccCCC----ccEEEEEehhceEEEEeec Confidence 2 222 2356664 44567889988765431 1111112 2246677422222222223 Q ss_pred CCCccceechhhc--cchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 315 GLRFDWEETVKDY--GNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 315 g~~~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) +..+.|.++..++ .+.+.+-+..-++.+-. ..-+++.+.-+|+ T Consensus 353 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~----~~~a~~~~~~t~~ 397 (397) T protein:vir:12 353 QQSIASTDTGAGAFETNSTKVRGIEREDVRKW----DEDAVVFGQITVE 397 (397) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEe----cccceEEEEEeeC Confidence 4556665543221 11111111111111110 1224444555555 No 113 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=73.67 E-value=0.17 Score=24.84 Aligned_cols=268 Identities=12% Similarity=0.103 Sum_probs=120.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) +..|..+.+. .........+..+..++++.+..+. +. ..-..|..+.+..... ....+|.+++.. 
T Consensus 111 ~~~t~~~~g~-~~~~~~~~~~i~~~~~~~~~l~~~~---------~~--~~~~~~~~~~~p~~~~~~~a~wv~E~~~~-- 176 (390) T protein:vir:62 111 RDGTKAGNPN-VLSRTLYGQLIAQAVERSAIMRGGA---------TT--FTTSDANPLDFTVITGRSSASIVGETAEI-- 176 (390) T ss_pred hcccccCCCc-cccccchHHHHHHHHhhhhhhhhcc---------ee--eecCCCceeEEEEEcCCcceeeecccccc-- Confidence 3333333222 2233444445555555555443211 00 0011223344432222 122344333333 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|..-++.+....+-+.+..++-+ -+.+||-..-+..|+.-+....|+.++ .|+ | .| .++. T Consensus 177 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~l---~G~-G-------~p--~Gi~ 242 (390) T protein:vir:62 177 PESYPATAQRSMGGFKYGFASVVSYEFAT-DQVLDLVGFLVSDAGPAIGDAMGRHFI---TGT-G-------QP--RGIL 242 (390) T ss_pred cccccceeeeEeeeeeEEeehHHHHHHHh-hhhHHHHHHHHHHHHHHHHHHHHhhhh---ccC-C-------cc--cccc Confidence 35667777777777777777766555443 377899999999999999999999877 332 1 11 2222 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .++ ++ .+....-..++.++++.|-++...+... 
++ ..-+.+|||..+..|++ T Consensus 243 ~~~--~~----------~~~~~~~~~~~~~~~~~l~~~~~~l~~~------------~~----~~a~~vmn~~~~~~L~~ 294 (390) T protein:vir:62 243 TDA--SP----------ATATFLATDTDSKVSDALIDLFHEVPSA------------YR----ANAKYVVNDLRAAQMRK 294 (390) T ss_pred ccc--cc----------cccceecccccccchHHHHHHHHhhhhh------------hh----cCCEEEEchHHHHHHHH Confidence 111 00 0111111234556666665554322111 00 11256889998888863 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecC-----eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGG-----LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~-----~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) .|. +..+|||..+ -+++.|.+++..+.++- + .+++|-=... ..+-.+ T Consensus 295 ---------lkd-----~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~--------~-----~i~~gd~s~~-~i~~~~ 346 (390) T protein:vir:62 295 ---------LKD-----ANGQYLWQSGLTVGAPSLFNGKVVETDDGMPA--------D-----KILFADLSKY-RVRFAG 346 (390) T ss_pred ---------hhc-----cCCCeeecCCcCCCccceecccceEEecCCCC--------c-----cEEEeeccce-eEEeec Confidence 122 2347887533 35788888876554431 1 1344431111 112223 Q ss_pred CCCccceech-hhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeee Q lcl|NC_019917. 315 GLRFDWEETV-KDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAA 360 (364) Q Consensus 315 g~~~~w~Ee~-~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~ 360 (364) ++...+..+. ++++. +.+ .. -.||. 
.+-|-++.|=.+| T Consensus 347 ~~~v~~~~~~~~~~~~-~~~-----~~--~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 347 SLRVDRSVDAKFSTDQ-IVY-----RF--LQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ceEEEeeccccccCCc-EEE-----EE--EEEeCcEeechhheEEEEeecCC Confidence 3333333221 11111 000 00 01221 2233333332222 No 114 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=73.33 E-value=0.17 Score=24.78 Aligned_cols=271 Identities=14% Similarity=0.055 Sum_probs=132.3 Q ss_pred Cceeeccc----CCchHH---HHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceec Q lcl|NC_019917. 1 MTTTVIPF----GDPKAV---KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYG 73 (364) Q Consensus 1 Ma~T~~~~----~dp~a~---~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~G 73 (364) ||.|+.-. +.|+.. .++++.+ + ... ++.| |.|+.-+ ..|+++++. .-...|+. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i-----~--~L~-~~Lg------i~r~~p~--a~G~tIt~p-K~~~tgda--- 60 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNI-----N--DLL-KLLG------VTRRETL--TNDLKIQTY-KWEVTLDQ--- 60 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhH-----H--HHH-HHhc------ccccccc--ccCCeEEee-eeeeeccc--- Confidence 99886543 223211 1222111 0 011 1233 3333322 459999987 34444443 Q ss_pred Cceeecchhhhhhcc------cEEEEecccceeeccchhhh-hhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_019917. 74 DARTEGTEENLRFYT------DQVKIDQVRHPVSAGGRMSR-KRSVH-NIRRIARDRLGDYFYKFTDELLFIYLSGARGI 145 (364) Q Consensus 74 d~~leGnee~L~~~~------~~v~Idq~R~~V~~~~~m~~-qrs~~-dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~ 145 (364) .+..||-+-+|+-.+ .++.++..|.++ +-+. |++-+ |=.-++-..|..-+++..|..+|-.|..+... 
T Consensus 61 ~dVaEGe~Iplskvt~~~~~t~t~kikK~rK~t----TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t 136 (295) T protein:vir:99 61 TDPGEGETIPLSKVTRTKDKDYTVKWFKKRRAT----TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK 136 (295) T ss_pred ccccCCcccchhhheeeeeeeeEEEeeeecccc----cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee Confidence 347788776666655 778899999988 3344 46666 44778889999999999999999888643210 Q ss_pred cccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecC-cee Q lcl|NC_019917. 146 NLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDG-DDH 224 (364) Q Consensus 146 ~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g-~~~ 224 (364) .+ ..++ ..+++.+..+.. +..+. +.. T Consensus 137 -------------------------------~t-g~~l----q~a~a~~~~al~-----------------~f~Ee~~~~ 163 (295) T protein:vir:99 137 -------------------------------VK-GVGL----QKALSASWAKLA-----------------TFNEFEGSP 163 (295) T ss_pred -------------------------------ee-hhhH----HHHHHHhhhhhh-----------------hcccccCCc Confidence 00 0011 112322222211 11111 246 Q ss_pred EEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCE-EEEecCcccccccccCCc-ccccchheeec Q lcl|NC_019917. 225 YVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNV-VLHKHRNVIRFNDYGAGA-NVEAARALFMG 302 (364) Q Consensus 225 yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngv-ii~e~~~~~~~~~~~~~~-~v~v~ralllG 302 (364) +|+|++|.++.+||.++.-.|+..+-. +...-.| |.| + .|...+++.....++.+. |...+-+=.=| T Consensus 164 ~V~FVnP~D~a~yl~~A~~~~~~a~~f--G~~~L~n--fLG-------~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~ 232 (295) T protein:vir:99 164 LVSFVSPLDVANYLGDTKVGADASNVF--GMTLLKN--FLG-------MQNVIVMPSVPEGKIYSTAVENLVFASLNVKG 232 (295) T ss_pred eEEEEehHHHHHHHhccccccchhhhh--hhhhhhh--hhc-------cceEEEcccCCCceEEEeeccceEEEEecCCc Confidence 899999999999999887778644332 1111122 444 3 233444566555444322 22110000000 Q ss_pred cchheEeeecCC-CCCccceechhhccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 
303 RQAGVIAYGTAN-GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 303 aqA~~~A~g~~~-g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) ++ ++.+|.... .+-+=-.....++.+ ++.+.+.=..-+-|+. .| |||..---+++.+ T Consensus 233 g~-l~~~f~~~~D~tglIg~~h~~~~~~---~t~et~~~~~~~lfpE~~d-giv~~tI~~~~~~ 291 (295) T protein:vir:99 233 GD-LGGLFADFTDETGLIAAARNRQLSN---LTYESVFFGANVLFAEIPE-GVVEATIEAAAVP 291 (295) T ss_pred hh-hhhhhhhccCcccceEEEeccccce---eeehhhhHhHHHhcccccc-eEEEEEEecCcCC Confidence 00 112222110 000111111222322 1222222222244554 45 7777666666666 No 115 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=71.18 E-value=0.2 Score=24.43 Aligned_cols=272 Identities=10% Similarity=0.015 Sum_probs=120.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeecc Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEGT 80 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn 80 (364) |+.+..+.|.......++..+.......++... +++. .-..... ..+.+...-.+.+. ..-.-||- T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~-~~~~---------~~~~~~~---~~~~~~~~~~~~~~-a~~v~E~~ 181 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-YVRV---------ESVSTSN---GSRVYEKWTDVTPL-TVMDAEDG 181 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhh-hcce---------eeccCCc---ceEEEeeccccccc-eeeecCcc Confidence 666666666666778888888888777776653 4321 0011111 11111111111110 11111211 Q ss_pred ---h-hhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 
81 ---E-ENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 81 ---e-e~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) + ....|..-++.+....+-+.+..+|-+ -+.+||...-++.|.+=+....|+.++.-.... . T Consensus 182 ~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~---------~---- 247 (408) T protein:vir:10 182 KIPDLDNPQLTIIKYLIKRYAGIITATNTSLK-DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA---------P---- 247 (408) T ss_pred ccccccCcceeeEEeeeeeEEeeehhHHHHHh-hchHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---- Confidence 1 224555555556666666655544433 378899888888888888888888776322110 0 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) |. ..+ .+.+.|-.+.... +... ++ ..=+.+|||..+.. T Consensus 248 ---------~~-------------~~~-----~~~~~l~~~~~~~-~~~~----------~~----~~a~~v~n~~~~~~ 285 (408) T protein:vir:10 248 ---------KK-------------PTI-----AKFDDVITMINTA-VDPA----------II----ATSSLLTNQSGLNK 285 (408) T ss_pred ---------cc-------------ccc-----ccHHHHHHHHHHh-hhhh----------hc----cCCEEEEcHHHHHH Confidence 00 000 1222222222111 1100 00 11256899999888 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeee Q lcl|NC_019917. 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYG 311 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g 311 (364) |++-.| + ..+|||. |.-+.+.|.+++-.++..... ..++. 
..+++|-=.-++.++ T Consensus 286 l~~lkd---------~-----~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~--~~~~~----~~i~~gd~~~~~~~~ 345 (408) T protein:vir:10 286 LALVKT---------A-----EGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN--TGSTV----YPLYYGDMSQAITLF 345 (408) T ss_pred HHHhhc---------c-----CCceEeccCcCCCCCceecceeeEEecccccCc--cCCCc----eEEEEEehhccEEEE Confidence 875322 1 2356664 445688898877654321111 11111 246677433222223 Q ss_pred cCCCCCccceechhhc-c-chhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 312 TANGLRFDWEETVKDY-G-NEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 312 ~~~g~~~~w~Ee~~D~-g-~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) -..++...|..+..+. . +...+-+..-++++-. +.+-|-++.+-+++++-. T Consensus 346 ~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~--~~~a~~~~~~~~~~~~~~ 398 (408) T protein:vir:10 346 DRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT--DSEALVAGSFSAIADQVG 398 (408) T ss_pred EecceEEEEcccccchhhcCceEEEEEEeeccEEe--ccccEEEEEeeccccCCC Confidence 3345555555442211 0 1111111111111111 122233333333322222 No 116 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=66.96 E-value=0.26 Score=23.80 Aligned_cols=261 Identities=9% Similarity=0.080 Sum_probs=121.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecccc--CceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRG--KPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G--~gv~Gd~~le 78 (364) +..+....+.......|+..+.......++... +...- .-.+.+..+.....-++ .++.+ ..-. 
T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~E-~~~~ 176 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLST-LVTKT------------PVTTPKGTYPILKRATDRFSSVAE-LAEN 176 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhh-hceee------------eccCCceEEEEEecCCCccccccc-cccc Confidence 444444445556678888888777766665543 32210 01122233332222211 12211 1111 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) ......+|..-++.+....+-+.+..++-+ .+.+||...-++.|.+=+....|+.++..+... T Consensus 177 ~~~~~~~~~~v~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~---------------- 239 (394) T protein:vir:10 177 PALAEPEFEQVDWSVSTYRGAIPLSEEAIA-DSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSF---------------- 239 (394) T ss_pred cccccccceeEEeeeeeeEeeehhHHHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------------- Confidence 113446666666767666666655544333 366788888888888888877777665322100 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) . |. +.+ ...+++.|..+... .+.. . .+ =+.+|||.-+..|+ T Consensus 240 --~----~~--------~~~--------~~~~~d~l~~~~~~-~~~~-----------~-~~----a~~vmn~~~~~~l~ 280 (394) T protein:vir:10 240 --T----AK--------ATT--------TDTLVDSLKHILNV-DLDP-----------A-YS----RALVVTQSLFNTLD 280 (394) T ss_pred --c----cc--------ccc--------ccccHHHHHHHHHh-hhhh-----------h-cc----CEEEecHHHHHHHH Confidence 0 00 000 11223333333211 1110 0 01 14779998888887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCe---------EEEcCEEEEecCcccccccccCCcccccchheeecc--chhe Q lcl|NC_019917. 
239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGL---------GMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR--QAGV 307 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~---------g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGa--qA~~ 307 (364) +- | .+..+|||.... +.+.|++++-.+...- ....+ .-.+++|- +++. T Consensus 281 ~l---------k-----d~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~---~~~~~----~~~i~~gd~s~~~~ 339 (394) T protein:vir:10 281 TL---------K-----DKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALL---GSAAG----DQKAFVGDLKRGVL 339 (394) T ss_pred Hh---------h-----ccCCCeeeeccccccccCCcccccccceeEEeccccc---CCCCC----ceEEEEeeccccEE Confidence 42 1 223467775332 4677887765433211 11122 22467773 3333 Q ss_pred EeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeeccC Q lcl|NC_019917. 308 IAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKKHS 364 (364) Q Consensus 308 ~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~~~ 364 (364) +. ...+....|..+ ++..+ . +.++ .||. .+-|-.+.+.++++... T Consensus 340 ~~--~~~~~~v~~~~~--~~~~~-~-----~~~~--~r~d~~~~~~~ai~~~~~~~~~~~~~ 389 (394) T protein:vir:10 340 FA--DRQQVTLAWEDS--KIYGR-Y-----LGAA--FRFGVKQADSNAGYFVTNTDAASGST 389 (394) T ss_pred EE--eecceEEEEecc--cccce-e-----EEEE--EEeccEEeccccEEEEEeecccCCCC Confidence 22 224455555443 22221 1 2222 2442 34555555555555444 No 117 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=65.08 E-value=0.29 Score=23.54 Aligned_cols=277 Identities=12% Similarity=0.083 Sum_probs=119.3 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) |+.|....+.-..+...+.+++....+.++... +.. + ... .+.+++|.... 
.-...+|-+.+ +- T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~-l~~--------~-~~~---~~~~~~ip~~~~~~~a~wv~Eg~--~~ 74 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQR-VAQ--------K-IPM---GATGIVIPHWTGDVSAQWIGEGD--MK 74 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhh-hcc--------e-eec---cCCceEEEEEcCCcceEEecCCc--cc Confidence 665554433223455667888888877776543 321 1 111 12234443222 11223332222 23 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|.+-++.+-....-|.+..++-+ .+.+||...-++.|.+-+++..|+.+|. |.- . T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~---G~g--------t------- 135 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATAIAMAFDNAALH---GTN--------A------- 135 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhh---ccc--------C------- Confidence 35567777777777776666766555433 4679999999999999999999998872 210 0 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |.....+.. ... .....+...+.+.+.++...+... 
....-..+|||..+..|++ T Consensus 136 ------~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~l~~~----------------~~~~a~~vmn~~~~~~L~~ 190 (397) T protein:vir:23 136 ------PSAFQGYLD-QSN--KTQSISPNAYQGLGVSGLTKLVTD----------------GKKWTHTLLDDTVEPVLNG 190 (397) T ss_pred ------Ccccccccc-ccc--ceeeecccchhHHHHHHHHhhhhc----------------ccCCCEEEEcHHHHHHHHH Confidence 010001100 000 011111222333333333322211 1122357899998888874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecC----------eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGG----------LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~----------~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) -.| ...+|||... .+.+.|+++...+.+... .+ .+++|--+-++ T Consensus 191 lkd--------------~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g-------~~----~~~~gDfs~~~- 244 (397) T protein:vir:23 191 SVD--------------ANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG-------DV----VGYAGDFSQII- 244 (397) T ss_pred hhc--------------cCCceeecccccccccccccCceeeeeeEEEeCCCCCC-------ce----EEEEeecceEE- Confidence 221 1235555422 256788888877666421 11 12233222111 Q ss_pred eecCCCCCccceech-hhcc-------------chhHHHHHHHHhhhhcccCCcccEEEEEeee---------------- Q lcl|NC_019917. 310 YGTANGLRFDWEETV-KDYG-------------NEPAICAGFIAGMKKARFNSKDFGVISIDTA---------------- 359 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~-~D~g-------------~~~~i~i~~i~G~~K~rf~~~DfGvi~idta---------------- 359 (364) ++..++..+.-.+|. +-.+ +...+-+.+.++++-.+ .+-|-.+...+. T Consensus 245 i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~--~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (397) T protein:vir:23 245 WGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLIND--VNAFVKLTFDPVLTTYALDLDGASAGNF 322 (397) T ss_pred EEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceec--ccceEEEeeccccceeeecccccCcceE Confidence 222222222222110 0000 01111111111111000 111111111111 Q ss_pred ----------eeccC Q lcl|NC_019917. 360 ----------AKKHS 364 (364) Q Consensus 360 ----------~~~~~ 364 (364) .-+|. 
T Consensus 323 ~~~~~~~~~~~~~~~ 337 (397) T protein:vir:23 323 TLSLDGKTSANIAYN 337 (397) T ss_pred EEEecCccccCcccc Confidence 11111 No 118 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=64.43 E-value=0.3 Score=23.45 Aligned_cols=289 Identities=11% Similarity=0.074 Sum_probs=118.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEE-eeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDL-SVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L-~~~L~G~gv~Gd~~leG 79 (364) ...|....+.-.....++..+.......++.. .+... + . + .|....+.. ...-...+|......+. T Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~-~~~~~-----~-~---~---~~~~~~~~~~~~~~~a~~v~e~~~~~~ 228 (458) T protein:vir:10 162 NQSSSVEVSSESYETIFSQRIIRDLQKELVVG-ALFEE-----L-P---M---SSKILTMLVEPDAGKATWVAASTYGTD 228 (458) T ss_pred hhcccCccccceehhhHhHHHHHHHHhhhhHH-hhcce-----e-e---c---CCcceEEEEecCCcceeeccccccccc Confidence 11223333444566778888877665555543 23210 0 0 0 111122221 11111222222222211 Q ss_pred c----hhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc Q lcl|NC_019917. 80 T----EENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDF 155 (364) Q Consensus 80 n----ee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~ 155 (364) . ....+|..=++.+....+-|.+..++- +-+.+||-..-++.|..-+.+..|+.+|. 
|+ |.+ .| T Consensus 229 ~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell-~ds~~~~~~~i~~~l~~~i~~~~d~~~l~---G~-G~~-----~p-- 296 (458) T protein:vir:10 229 TTTGEEVKGALKEIHFSTYKLAAKSFITDETE-EDAIFSLLPLLRKRLIEAHAVSIEEAFMT---GD-GSG-----KP-- 296 (458) T ss_pred ccccccccccceeeEeeeeeEEeeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhc---CC-CCC-----cc-- Confidence 1 112334333444444444444433322 22568898888899999999999987763 31 111 11 Q ss_pred cccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHH Q lcl|NC_019917. 156 TGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQAT 235 (364) Q Consensus 156 ~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~ 235 (364) .++... ++... ...+...+-...+.++++.|-++........ ...-+.+|||..+. T Consensus 297 ~Gi~~~----~~~~~----~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~----------------~~~~~~v~~~~~~~ 352 (458) T protein:vir:10 297 KGLLTL----ASEDS----AKVVTEAKADGSVLVTAKTISKLRRKLGRHG----------------LKLSKLVLIVSMDA 352 (458) T ss_pred ceeeec----ccccc----cceeecccccccccccHHHHHHHHHhhhhhh----------------cCCCEEEEcHHHHH Confidence 222211 11000 0011111222345567777766654332110 12234688998888 Q ss_pred HHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 236 ~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) .|++-.|...+-+- .....+....|.-+.+.|++++....++.. +..+ .+++|.=.-.+.++-..+ T Consensus 353 ~l~~lkd~~G~~i~-----~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~---~~~~------~~~~~~f~~~~~~~~~~~ 418 (458) T protein:vir:10 353 YYDLLEDEEWQDVA-----QVGNDSVKLQGQVGRIYGLPVVVSEYFPAK---ANSA------EFAVIVYKDNFVMPRQRA 418 (458) T ss_pred HHHhhcccCCceee-----ccccccccccCcCceecceeeEEccccccc---cCCc------ceEEEEecccEEEEEeec Confidence 87643221100000 000112233455567889999988776421 1111 123332111112222233 Q ss_pred CCccceechhhccchhHHHHHHHHhhhhcccCC---cccEEEEEeeeee Q lcl|NC_019917. 
316 LRFDWEETVKDYGNEPAICAGFIAGMKKARFNS---KDFGVISIDTAAK 361 (364) Q Consensus 316 ~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~---~DfGvi~idta~~ 361 (364) +.. +..+|..+-.+++- ...||.. ..-|++.+..+++ T Consensus 419 ~~v----~~d~~~~~~~~~~~-----~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 419 VTV----ERERQAGKQRDAYY-----VTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred eEE----EeecccCCCceEEE-----EEEEecceEecccceEEEeeccC Confidence 322 12234322111110 0112221 2336656555555 No 119 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=62.99 E-value=0.33 Score=23.26 Aligned_cols=265 Identities=11% Similarity=0.062 Sum_probs=123.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec--cccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH--LRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~--L~G~gv~Gd~~le 78 (364) +..+....+.......++..+.......+++.+ ++..- . . .+..+++..... ....+|...+.. T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~-~~~~~------~---~---~~~~~~~~~~~~~~~~a~~v~Eg~~~- 178 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRD-LIGSG------R---T---DSALIEYVQETGFVNNAAIVAEGALK- 178 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHh-hccee------e---c---cCCceEEEEEecCCcceeeecCCccc- Confidence 444444444456667788888888777777654 32210 0 0 111222222111 112233322222 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 
79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+...+|..-++.+.....-+.+..++-+ -+ .+|-..-+..|..=+....|+.+|.- +|+- ....++ T Consensus 179 -~~~~~~~~~i~~~~~k~~~~~~is~ell~-ds-~~l~~~i~~~la~a~~~~~d~a~l~G-~g~~---------~~p~Gi 245 (390) T protein:vir:97 179 -PESSLKFAKKTDTTHVIAHTMKATRQILS-DA-PQLASYMNNRLIRGLKVKEDAEILRG-TGAN---------DGLLGL 245 (390) T ss_pred -cccccceeEEEEeeeeEEEeehhhHHHHH-hH-HHHHHHHHHHHHHHHHHHHHHHHhhc-CCCC---------ccccce Confidence 24556676666777666666666555433 23 47877778888888888888876631 2210 011222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ..++ +++.. ....+...+++.|..+...+... .. +.-+++|||..+..|+ T Consensus 246 ~~~~-------------~~~~~-~~~~~~~~~~d~~~~~~~~~~~~-------------~~---~~~~~v~n~~~~~~L~ 295 (390) T protein:vir:97 246 IPQA-------------TTYAA-PTTIAGATRVDQLRLAMLQASLA-------------EY---PASGIVINPIDWAAIE 295 (390) T ss_pred eecc-------------ccccc-cccccccchHHHHHHHHHhhccc-------------cC---CCCEEEEcHHHHHHHH Confidence 2110 11111 11122234455565654433211 11 1125678999998887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) +-. . ...+|||. +.-+.+.|+++...+.++. + .+++|--..++-++-.. 
T Consensus 296 ~lk---------d-----~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~--------~-----~~~~gd~~~~~~~~~~~ 348 (390) T protein:vir:97 296 LAK---------D-----ANNQYLIGNARGTLTPTLWGLPVVATQAMAP--------G-----EFLVGAFDLAAQIFDQW 348 (390) T ss_pred Hhh---------c-----CCCceeecCccCCCCceecceeeEEcCCCCC--------C-----cEEEEeccceEEEEEec Confidence 422 1 12356663 4456788999888765531 1 13445322111112223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcc----cCC---cccEEEEEeee Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKAR----FNS---KDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~r----f~~---~DfGvi~idta 359 (364) +....|..+..++. .|+-..| |.. ..-+++.|+.| T Consensus 349 ~~~i~~~~~~~~f~----------~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 349 DARVEIGYVNDDFQ----------RNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred ceEEEEeecccccc----------cCcEEEEEEEeeccEEeccccEEEEEeC Confidence 33334432211111 1221122 211 23345555555 No 120 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=58.75 E-value=0.41 Score=22.73 Aligned_cols=263 Identities=8% Similarity=0.079 Sum_probs=125.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecccc--CceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRG--KPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G--~gv~Gd~~le 78 (364) |+.+....+.-..+..|+..+.......++... ++.. + . . .+.+.++.....-.+ ..+. ...-. 
T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~-~~~~-----~-~---~---~~~~~~~~~~~~~~~~~~~~~-E~~~~ 174 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLST-LVTK-----T-P---V---TTPKGTYPILKRATDRFSSVA-ELAEN 174 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHh-hcce-----e-e---c---cCCeeEEEEEecCCCcccccc-ccccc Confidence 666665555555677888888777766665543 3221 0 0 0 111233332222111 1121 11111 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) ......+|..-++.+....+-+.+..++- +-+.+||...-++.|.+-+....|..++-.+.+.. T Consensus 175 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--------------- 238 (389) T protein:vir:10 175 PKLAEPEFNKVDWSVATYRGAIPLSEEAI-ADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT--------------- 238 (389) T ss_pred cccccccceeeeeeheeeEeeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc--------------- Confidence 11234566666666666666666654443 24678888888888888888888877764332210 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) |. + .+...+.+.+..+.... .. |.. . =+.+|||..+..|+ T Consensus 239 -------~~--------~--------~~~~~~~d~l~~~~~~~-~~-----------~~~---~--a~~~~n~~~~~~L~ 278 (389) T protein:vir:10 239 -------AK--------K--------TTTDTLVDSLKHILNVD-LD-----------PAY---S--RALVVTQSLFNTLD 278 (389) T ss_pred -------cc--------c--------ccccccHHHHHHHHHhh-hh-----------hhh---C--cEEEecHHHHHHHH Confidence 00 0 01112333333322110 00 110 0 14689999888887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecC---------eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 
239 TAAGGTWIDFQKAAAAAEGRNNPIFKGG---------LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~---------~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) +- |. +..+|||... -+++.|.+|+-.+...- ...++.. .+++|--.-++. T Consensus 279 ~l---------kd-----~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~---~~~~~~~----~~~~gd~~~~~~ 337 (389) T protein:vir:10 279 TL---------KD-----KNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLL---GSLAGDQ----KAFVGDLKRGVL 337 (389) T ss_pred Hh---------hc-----cCCCeeeecCcccccccccccccccceeEEeccccc---CCCCCce----EEEEeeccccEE Confidence 42 21 2337888533 24688988875443211 1122222 567774322222 Q ss_pred eecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeeeeccC Q lcl|NC_019917. 310 YGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAAKKHS 364 (364) Q Consensus 310 ~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~~~~~ 364 (364) .+-.+++...|..+. +-.. .+.++ .||. .+-|-.+.+.+++.+.. T Consensus 338 ~~~~~~~~i~~~~~~--~~~~------~~~~~--~r~d~~~~~~~a~~~~~~~~~~~~~~ 387 (389) T protein:vir:10 338 FTDRQQVTLAWEDSK--IYGK------YLGAA--FRFGVQKADSKAGYFVTNTDVPGSAL 387 (389) T ss_pred EEeecceEEEeeccc--cccc------eEEEE--EEeccEEecccceEEEEeeccCCCCC Confidence 233345566665432 2111 11222 3443 34444555555555554 No 121 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=56.50 E-value=0.46 Score=22.46 Aligned_cols=296 Identities=10% Similarity=-0.008 Sum_probs=129.9 Q ss_pred Cceeeccc------CCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceec Q lcl|NC_019917. 1 MTTTVIPF------GDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYG 73 (364) Q Consensus 1 Ma~T~~~~------~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~G 73 (364) |+.+.-+- ++-.....++.+++......++... +.- ++. + .+.++++..... 
-....|.+ T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~-~~~--------~~~-~---~~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLR-MGE--------QIP-I---SYGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhh-hcc--------eee-c---cCCceEEEEEeCCceeEeecC Confidence 33222111 1114567788888888877776643 321 110 1 112223322211 12222322 Q ss_pred Cceee------cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019917. 74 DARTE------GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINL 147 (364) Q Consensus 74 d~~le------Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~ 147 (364) .+... -.+...+|..-++..-....-+.+..++-+ -+..||-..-++.|.+=+.+..|+.+|. -.|+.+- T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~-~s~~~~~~~i~~~la~ai~~~~d~~~l~-G~g~~~~-- 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFAR-MNPSGLYTKLQGDLAYAIGRGIDLAVFH-GKSPLTG-- 152 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHHHHHHHHHHhc-ccCCCCC-- Confidence 22211 123445666556655555555555444322 4778999999999999999999988873 1222110 Q ss_pred cccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEE Q lcl|NC_019917. 148 DFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVV 227 (364) Q Consensus 148 ~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~ 227 (364) ..+.+...+...+ ..+.....++....+++.|.++........ ..+.=.. 
T Consensus 153 -----~~~~g~~~~~~~~----------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~---------------~~~~~~~ 202 (333) T protein:vir:78 153 -----SALQGIDTDNVIA----------NTTNVDYLQETGDPLLDRLLDGYDLVSANT---------------DVEFNGW 202 (333) T ss_pred -----ccccccccccccc----------ccccccccccccchhHHHHHHHHHhhcccc---------------ccCceEE Confidence 1111111110000 011222233444466766666654321110 0111256 Q ss_pred EEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec Q lcl|NC_019917. 228 VMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG 302 (364) Q Consensus 228 ~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG 302 (364) +|+|..+..|+... ..+.+ ..+|||. |..+.+.|++++..+.+.-....+.+.. .-+++| T Consensus 203 vmn~~~~~~L~~~~------~~~d~-----~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~----~~~~~g 267 (333) T protein:vir:78 203 AVDPRFRAHLLRAQ------AYRDA-----NGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSK----TRIIGG 267 (333) T ss_pred EEcchHHHHHHHHh------hhcCC-----CCceeecCccccCCCceeeceeeEEccccCCCccccCCCc----cEEEEE Confidence 77998888886311 11211 2245553 5668999999998877753322222221 134555 Q ss_pred cchheEeeecCCCCCccceechhhccchhHHHHH-HHHhhhhc----ccC-----CcccEEEEEeeee Q lcl|NC_019917. 303 RQAGVIAYGTANGLRFDWEETVKDYGNEPAICAG-FIAGMKKA----RFN-----SKDFGVISIDTAA 360 (364) Q Consensus 303 aqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~-~i~G~~K~----rf~-----~~DfGvi~idta~ 360 (364) =-.- +.+|-.+++.....++. ++.+.-+..+. +-.++-.. ||. 
.+-|-++.--++- T Consensus 268 D~~~-~~~g~~~~~~i~~~~~~-~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 268 DFSQ-LKFGFADEIRIKMSDTA-TLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred eccc-EEEEEeeccEEEEeccc-cccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 3332 22443344544444331 11111111110 11111111 111 1223333222222 No 122 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=54.64 E-value=0.5 Score=22.24 Aligned_cols=276 Identities=11% Similarity=0.062 Sum_probs=116.5 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc-cCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR-GKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~-G~gv~Gd~~leG 79 (364) .+.|..+.+. .........+..+..++++.+..+ .+ .+. -..|..+.+.....-. ..+|...+.. T Consensus 111 ~~~t~~~~g~-~~~~~~~~~~i~~~~~~~~~l~~~-~~----~~~------~~~~~~~~~~~~~~~~~a~~v~E~~~~-- 176 (392) T protein:vir:13 111 RDGTKAGNPN-VLSRTLYGQLIAQAVERSAIMRGG-AS----TFT------TSDANPMDFTVITGRATAGIVGETAEI-- 176 (392) T ss_pred hcccccCCCc-cccccchHHHHHHHHhhhhhhhhc-ce----eee------cCCCceeEEEEEcCCcceeeecccccc-- Confidence 3333333221 223334455555555555554322 10 000 0112223333222211 1233222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+....|..-++.+-....-|.+..++-+ -+.+||-..-+..|.+-+....|..+|. |. | +..| .++. 
T Consensus 177 ~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~~i~~~~d~~~l~---G~-G-----t~~p--~Gil 244 (392) T protein:vir:13 177 PESYPATTQRSMGGFKYGFASVVSYEFAT-DQVLDLVGFLVSDAGPAIGDAMGRHFLT---GT-G-----TGQP--RGIL 244 (392) T ss_pred cccccceeeEEeeeeeEEeeehhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhc---cc-C-----Cccc--cccc Confidence 35566777766666666665655444333 3677888888888888888888887772 31 1 1111 1221 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) ..+ + +++....-.+++.++++.|-++...+..+ . ...-+.+|||..+..|+. T Consensus 245 ~~~----~--------~~~~~~~~~~~~~~~~d~l~~~~~~l~~~---------~-------~~~a~~v~n~~~~~~l~~ 296 (392) T protein:vir:13 245 TDA----T--------GANAAFGEADADSKVSDALIDLFHEVPSA---------Y-------RKNAKFVVNDLRAAQMRK 296 (392) T ss_pred ccc----c--------cccccccccccccccHHHHHHHHHhhhhh---------h-------hcCCEEEEcHHHHHHHHH Confidence 111 0 01111111234556666665554322111 0 011245779988877763 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) .|. +..+|||. |.-+.+.|.+++..+.++. + .+++|--.- +.++-.+ T Consensus 297 ---------lkd-----~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~-------~------~i~~Gdf~~-~~i~~~~ 348 (392) T protein:vir:13 297 ---------LKD-----ANGQYLWQSALTVGAPDTFNGKVVETDDGMPA-------D------KVLFADLSK-YRVRFAG 348 (392) T ss_pred ---------hhc-----cCCceeecCCcCCCCCceecceeeEEcCCCCC-------C------cEEEeeccc-eeEEeec Confidence 122 23367775 3345788999887665531 1 234454221 1122233 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeec Q lcl|NC_019917. 
315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKK 362 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~ 362 (364) ++...+.++..=--+...+-+...+|.+-. +.+-|-++.+ .++| T Consensus 349 ~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~--~~~A~~~~~~--~~aa 392 (392) T protein:vir:13 349 SLRVDRSVDAKFSTDQIVYRFLQRADGLLV--DARGAKVLTV--TPAA 392 (392) T ss_pred ceEEEeeccccccCCcEEEEEEEEeccEEe--cccceEEEEe--eccC Confidence 444443332110001111111111111100 1222333333 2233 No 123 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=53.30 E-value=0.53 Score=22.09 Aligned_cols=293 Identities=12% Similarity=0.117 Sum_probs=126.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) |..+....|-......++..+.......+++. ++ | .+..+ ...| .++++.... -...+|-..... T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~-~~-~------~~~~~---~~~~-~~~~p~~~~~~~a~~v~E~~~~-- 197 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVR-KL-G------ARTLP---LSNG-NITIPRLKGGAIVGYIGADTDI-- 197 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhh-hh-c------ceeee---cCCC-ceEEEEEeCCcceeeeccCccc-- Confidence 33333333434556677777766554445443 23 1 00001 1112 133332211 112223222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVH--NIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~--dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) .+...+|..-++.+.....-|.+..++-+. +.+ +|-..-++.|...+.+..|+.++ .|+ | .. 
....+ T Consensus 198 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~d-s~~~~~l~~~i~~~l~~ai~~~~d~a~l---~G~-G-----~~-~~p~G 266 (435) T protein:vir:14 198 PTTQQQFDDLKLTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFI---RDD-G-----TA-NTPKG 266 (435) T ss_pred cccccceeEEEeeeEEEEEeehhhHHHHHh-hccCHHHHHHHHHHHHHHHHHHHHHHhh---ccC-C-----CC-ccccc Confidence 245566777777777777777665554333 333 47788888899999999998886 221 1 00 01122 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) +.. . ..++ .+ .+.. ...+.+ .....+.+++...+... .. ...-+++|+|..+..| T Consensus 267 i~~-~-~~~~--~~-----~~~~-~~~~~~-~~~~~~~~l~~~~~~~~-----------~~---~~~~~~v~n~~~~~~L 321 (435) T protein:vir:14 267 LRF-W-ALPS--NV-----ITAS-DASTLQ-KIETDLGKVILALENAD-----------AN---LTQPGWIMAPRTFRFL 321 (435) T ss_pred eee-c-cccc--ce-----eccc-cccchh-hHHHHHHHHHHHhhhcc-----------cc---ccCCEEEEcHHHHHHH Confidence 210 0 0000 00 0000 001111 12233444443332110 00 1123568899999888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeeec-CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFKG-GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~G-~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) +.- |. +..+|||.. .-|.+.|.+++..+.++.... .+++. -.+++|-=.- +.++--++. 
T Consensus 322 ~~l---------kd-----~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~--~~~~~---~~i~~gd~s~-~~i~~~~~~ 381 (435) T protein:vir:14 322 EGL---------RD-----GNGNKVYPELANGMLKGYPVGKTTQVPINLG--ETGKE---SEIYFTDFGD-VFIGEEETL 381 (435) T ss_pred HHh---------hc-----cCCceeccCCCCCeeecceeEeecccccccc--CCCcc---ceEEEeeccc-EEEEEeccc Confidence 742 21 234688852 347888999988776643321 12211 1244452221 112333444 Q ss_pred Cccceechh-hccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 317 RFDWEETVK-DYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~Ee~~-D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) ...+..+.. ..+.-.. -..+..++.-+|... =||+++-=...+.+.. T Consensus 382 ~~~~~~~~~~~~~~~~~-~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~ 430 (435) T protein:vir:14 382 EIDYSKEATYKDADGHM-VSAFQRDQTLIRVIAKNDFGPRHVESIAVLAG 430 (435) T ss_pred EEEEeccccccccccch-hhhhhcChhheeeeeeeCceeecccceEEEec Confidence 555444321 1111111 122344444444432 2444433222233333 No 124 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=46.55 E-value=0.73 Score=21.33 Aligned_cols=291 Identities=16% Similarity=0.147 Sum_probs=122.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) |+.+.-..+. .....++..+.......++.. ++.- + ..+ .+.++++... .......|...+.. 
T Consensus 10 ~~~~t~~~g~-~i~~~~~~~ii~~~~~~s~l~-~~~~------~---~~~---~~~~~~~p~~~~~~~a~~v~Eg~~~-- 73 (330) T protein:vir:77 10 QVALTGDFSA-FLTPEQSQDYFAEIEKTSIVQ-RIAR------K---VPM---GPTGISIPHWTGAVSASWTGEAERK-- 73 (330) T ss_pred hccccCCCcc-eechhHHHHHHHHHHhccchh-hhcc------e---eec---cCCceEEEEEcCCcceeEecCCCcc-- Confidence 4333323232 344566777776665555443 3321 0 001 1222333321 12222333322222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+...+|.+-++.+-....-+.+..++- +.+.+||-...++.|++=+++..|+.+|. =.|+.. ...++. T Consensus 74 ~~~~~~f~~i~~~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~ai~~~~~~~~l~-G~g~~~---------~~~g~~ 142 (330) T protein:vir:77 74 PITKGSFGKQELEPVKITTIFAESAEVV-RLNPLNYLNTMRTKIAEAIALKFDAAAIH-GIDKPS---------AFKGYL 142 (330) T ss_pred ccccceeeEEEEeEEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhc-ccCCCC---------cccccc Confidence 2455666555555555555555444333 24678999999999999999999987771 122110 001110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |.+ .... .. . .+.....++.....++.+.++....... 
+.+.-..+|||..+..|++ T Consensus 143 -~~~--~~~~-~~-~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~----------------~~~~~~~vmn~~~~~~l~~ 199 (330) T protein:vir:77 143 -AET--TKVV-SL-A--DTNLTTASGPQGNAYLAVNNALSLLVNS----------------GKKWTGTLLDNVTEPILNT 199 (330) T ss_pred -ccc--cccc-ee-e--cccccccccccchhHHHHHHHHHhhhhc----------------CCCccEEEEcHHHHHHHHH Confidence 100 0000 00 0 0011111122223344444443322111 0122257899999998874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeec----------CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEe Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKG----------GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G----------~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A 309 (364) - |. ...+|||.. .-+.+.|.+++..+.++. ..+++. ..+++|--.-. . T Consensus 200 l---------kd-----~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~---~~~~~~----~~~~~gd~s~~-~ 257 (330) T protein:vir:77 200 A---------VD-----GNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVN---GTVGNR----VVGVMGDFSQV-I 257 (330) T ss_pred H---------hc-----cCCceeecCccccccccccCCceecceeeEEeccccC---CCCCCc----cEEEEEecceE-E Confidence 2 22 223566652 124677888888777753 112221 23666643322 2 Q ss_pred eecCCCCCccceec-hhhccchhHHHH-----H-HHHhhhhcccC-CcccEE------EEEeeeeeccC Q lcl|NC_019917. 310 YGTANGLRFDWEET-VKDYGNEPAICA-----G-FIAGMKKARFN-SKDFGV------ISIDTAAKKHS 364 (364) Q Consensus 310 ~g~~~g~~~~w~Ee-~~D~g~~~~i~i-----~-~i~G~~K~rf~-~~DfGv------i~idta~~~~~ 364 (364) ++-.+++.....+| .+++|....... . 
+-.++-..|.- .=|+.| +.| +.+.+-+ T Consensus 258 i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i-~~~~~~~ 325 (330) T protein:vir:77 258 WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKL-TDQVAGT 325 (330) T ss_pred EEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEE-EeccCCc Confidence 34434555544333 223332211111 0 11111111110 002222 111 1122222 No 125 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=43.95 E-value=0.82 Score=21.04 Aligned_cols=292 Identities=11% Similarity=0.018 Sum_probs=125.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) ||++..+- -.....++..+.......|.... +.. .| . + .+.++++... ......+|.+++... T Consensus 1 mat~~~gg--~lvP~~~~~~ii~~~~~~s~i~~-~~~-----~i-~---~---~~~~~~~p~~~~~~~a~wv~Eg~~~~- 64 (311) T protein:vir:81 1 MVALATGT--FQLPKHLVPGVWQKAQGQSVLAR-LSM-----AE-P---Q---EFGEQQYMTLTAPPRGEVVGEGAQKS- 64 (311) T ss_pred CceecCCc--eEcchhHHHHHHHHHHhcchhhh-hcc-----ee-e---c---CCCceEEEEEeCCceeEEeecCcccc- Confidence 99876643 25667788888887777666543 321 11 0 1 1112444332 122233343333332 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRK--RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +...+|.+-++.+.....-+++..++-++ -+..+|-..-++.|.+-+.+..|+.+|.--....|. 
.+.+ T Consensus 65 -~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~--------~~~g 135 (311) T protein:vir:81 65 -ESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA--------ALSG 135 (311) T ss_pred -cccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCc--------cccc Confidence 45667766666665555545443332211 245778888999999999999999887332100110 0111 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhccccc-HHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMA-PIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATD 236 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s-~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) ...+. .++ +.....++++... ...|+++........ .+.-..+|||..+.. T Consensus 136 i~~~~--~~~----------~~~~~~~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~vmn~~~~~~ 187 (311) T protein:vir:81 136 SPAKI--LDT----------TNIVELTTGTSATPDLAVEAAVGLVLGDN----------------LSPDGVALDNTFSFM 187 (311) T ss_pred ccccc--ccc----------ceeeeecccccchHHHHHHHHHHHhhhcC----------------CCceEEEEcHHHHHH Confidence 10000 000 0111112222222 234555543332111 111136889999988 Q ss_pred HhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcc----cccchh-eeeccchh Q lcl|NC_019917. 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGAN----VEAARA-LFMGRQAG 306 (364) Q Consensus 237 Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~----v~v~ra-lllGaqA~ 306 (364) |++- |+ +..+|||. |.-+.+.|.+++....+.--...+.... ...... +++|=-.- T Consensus 188 l~~l---------kd-----~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~ 253 (311) T protein:vir:81 188 LATQ---------RD-----SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA 253 (311) T ss_pred HHhh---------hc-----cCCCeeecCccccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEeccc Confidence 8742 22 22466664 4568899999887655532211111100 000111 22221111 Q ss_pred eEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccC-CcccEEEEEeeeeeccC Q lcl|NC_019917. 
307 VIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-SKDFGVISIDTAAKKHS 364 (364) Q Consensus 307 ~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-~~DfGvi~idta~~~~~ 364 (364) +.+|-.++..+...++....+ .. --+-.++-..|.. -=|+.|+--..-++++. T Consensus 254 -~~i~~~~~~~~~~~~~~~~~~-~~---~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~ 307 (311) T protein:vir:81 254 -FRWGVQVSIPLELIEFGDPDG-LG---DLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) T ss_pred -EEEEEeccceEEEeccCCCCc-ch---hhhhcCcEEEEEEEEeccEeecccceEEEEe Confidence 222333344444333321111 00 0011222222211 11333322222223333 No 126 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=41.08 E-value=0.94 Score=20.72 Aligned_cols=277 Identities=10% Similarity=0.068 Sum_probs=129.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) +..|....+.......++..++....+.++... +.- ++ .+ .|.++++...... ...+|.+.+. - T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-~~~--------~~-~~---~~~~~~ip~~~~~~~a~~v~Eg~~--~ 91 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-LGK--------YE-PM---EGTEKKFTFWADKPGAYWVGEGQK--I 91 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhh-hcc--------ee-ec---cCCceEEEEEecCcceeEeccCcc--c Confidence 333333335556678888888888777776543 321 11 11 1222443322211 1223333322 2 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..+.|.+-++.+.....-+.+..++-+ .+.+||-..-++.|.+-+.+..|+.+|.- .|+.+ .-.+.. 
T Consensus 92 ~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~G-~g~~~---------~~~gi~ 160 (324) T protein:vir:97 92 ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKSIA 160 (324) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCc---------cCcccc Confidence 35667788777777777777766554332 46789999999999999999999988732 22110 000100 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +. ....+..+....+.+.|.++........ ...=+++|+|.-+..|++ T Consensus 161 -~~---------------~~~~~~~~~~~~~~~~i~~~~~~l~~~~----------------~~~~~~v~n~~~~~~L~~ 208 (324) T protein:vir:97 161 -QS---------------IEKTNKVIKGDFTQDNIIDLEALLEDDE----------------LEANAFISKTQNRSLLRK 208 (324) T ss_pred -cc---------------ccccceeccccCCHHHHHHHHHhhhhcc----------------CCCCEEEEcHHHHHHHHH Confidence 00 0011112334567777777755432211 011136789999988875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeec-CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKG-GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G-~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| +..+|+|.+ ..+.+.|.+++-.+... ... -.+++|-=+-+ .+|-.++..+ T Consensus 209 lkd--------------~~g~~~~~~~~~~tl~G~PV~~~~~~~------~~~-----~~~~~gd~~~~-~i~~~~~~~i 262 (324) T protein:vir:97 209 IVD--------------PETKERIYDRNSDTLDGLPVVNLKSSN------LKR-----GELITGDFDKL-IYGIPQLIEY 262 (324) T ss_pred hhc--------------CCCceeecCCCCccccceeeEeecCCC------CCc-----ceEEEEecccE-EEEEecCcEE Confidence 332 123678874 35778888876442210 000 01344432211 1232233444 Q ss_pred cceechh-----hccchhHHHHHHHHhhhhccc---------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVK-----DYGNEPAICAGFIAGMKKARF---------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~-----D~g~~~~i~i~~i~G~~K~rf---------~~~DfGvi~idta~~~~~ 364 (364) ...+|.. |.+...- --+..++-.+|. +.+-|-++..=++.. .. T Consensus 263 ~~~~~~~~~~~~~~~~~~~--~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~-~~ 319 (324) T protein:vir:97 263 KIDETAQLSTVKNEDGTPV--NLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT-DS 319 (324) T ss_pred EEeecccccccccccccch--hhhhcCcEEEEEEEEeccEEecccceEEEEeccCCC-CC Confidence 4333311 1110000 001112222221 122333333222211 11 No 127 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=280 Identities=10% Similarity=0.062 Sum_probs=127.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~Gd~~leG 79 (364) +..|....++......++.+++......++... +... + .+ .|.++++...... ....|-..+.. T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~-l~~~--------~-~~---~~~~~~~p~~~~~~~a~~v~Eg~~~-- 91 (324) T protein:vir:96 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGKY--------E-PM---EGTEKKFTFWADKPGAYWVGEGQKI-- 91 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhh-hcce--------e-ec---cCCceEEEEEecCcceeeecCCccc-- Confidence 222222223345667888999888877777654 3211 0 01 1122333322111 12233222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..++|.+-++.+-....-+.+..++-+ -+..||...-++.|.+=+.+..|+.+|.. .|+.. . 
T Consensus 92 ~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~aia~~~d~~~l~G-~g~~~----------~---- 155 (324) T protein:vir:96 92 ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP----------F---- 155 (324) T ss_pred cccccceeEEEEEeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhhc-CCCCC----------c---- Confidence 24567888888877777777766554433 36789999999999999999999988843 11100 0 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) |. .+... +....-.+....+.+.|.++........ .+.=.++|||..+..|++ T Consensus 156 ------~~---~~~~~--~~~~~~~~~~~~~~~~i~~~~~~i~~~~----------------~~~~~~i~n~~~~~~L~~ 208 (324) T protein:vir:96 156 ------GK---SIAQS--IKKTNKVIKGDFTQDNIIDLEALLEDDE----------------LEANAFISKTQNRSLLRK 208 (324) T ss_pred ------Cc---ccccc--ccccceecccccchHHHHHHHHhhhhcc----------------CCCCEEEEcHHHHHHHHH Confidence 00 00000 0011112233456777777655432110 011136899999888875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| ...+|+|. |.-+.+.|.+++-.+.. ... .--+++|--.- +.+|-.++..+ T Consensus 209 lkd--------------~~G~~~~~~~~~~~l~G~PV~~~~~~-------~~~----~~~~~~gd~s~-~~~~~~~~~~i 262 (324) T protein:vir:96 209 IVD--------------PETKERIYDRNSDSLDGLPVVNLKSS-------NLK----RGELITGDFDK-LIYGIPQLIEY 262 (324) T ss_pred hhC--------------CCCCeeecCCCCCcccceeeEeecCC-------CCC----cceEEEEecce-EEEEEecCcEE Confidence 322 23478886 44677888887643211 000 00234443222 12333333333 Q ss_pred cceechh--hccchhHHHHH-HHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVK--DYGNEPAICAG-FIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~--D~g~~~~i~i~-~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) .-.++-. ++.+.-+.... +-.++-.+|... =|++++--...+++.. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~ 312 (324) T protein:vir:96 263 KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) T ss_pred EEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEec Confidence 3322210 00000000000 111222222211 1444333222223332 No 128 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=40.32 E-value=0.98 Score=20.64 Aligned_cols=270 Identities=11% Similarity=0.110 Sum_probs=133.6 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeec- Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEG- 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG- 79 (364) || .| -.++|+..|+.....++.+..-+++. |...+.. .-|.+|.++-+.-. .|+.--++-.| T Consensus 1 Ma-----in---~~~k~~~~ld~~~~~~~~~~~l~~~~--n~~~~~~-----~gak~VkIp~ist~--~gl~dY~R~~g~ 63 (285) T protein:vir:79 1 MT-----VV---LDSKDLARIDEEYKADSQVWSYLTGG--NGVTQRF-----RGHNEVRINKLSGF--VDATAYKRGQDN 63 (285) T ss_pred Cc-----ch---hhHHHHHHHHHHHHHhhhhhhhcccC--CcceeEe-----cCCCEEEEeeeccc--ccccccccccCc Confidence 55 22 24789999999998888776533332 3333333 24788988744222 22322222333 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRS-VHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs-~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) +..+++....+..++|-|----.=+.|+..-+ ...+-...+.-..+...-..|.-.|-.|+..-+ T Consensus 64 ~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~-------------- 129 (285) T protein:vir:79 64 ARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA-------------- 129 (285) T ss_pred cccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc-------------- Confidence 56677788888888888832111123332111 111111111111111222445444555543110 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .....+|+++. -++.|+.+.++.+..+ +. +..|||++|.-..-|+ T Consensus 130 ------------------~~~~~~~T~~n--v~~~i~~~~~~lde~~-------------vp--~~rvl~vTp~~~~~Lk 174 (285) T protein:vir:79 130 ------------------KKATDSITKDN--ALDAYDTAEAYMFDNE-------------VP--GGFVMFVSSAYYTALK 174 (285) T ss_pred ------------------cccccccCHHH--HHHHHHHHHHHHHHcC-------------CC--CceEEEEChHHHHHHH Confidence 11222455544 3777888777655442 11 2368999999999998 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCe--eecCeEEEcC-EEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPI--FKGGLGMINN-VVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPl--F~G~~g~~ng-vii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) ++. + +++.-.-.+ ..+. 
+.+.++.+|| |.|++.|.- |+..++.+.+ ..+|+-...+.++.-++.- T Consensus 175 ~s~--~---~~r~~~~~~--~~~~~~i~~~V~~lDg~v~ii~Vps~-r~kt~~~~k~----Infiiv~~~a~i~~~K~~~ 242 (285) T protein:vir:79 175 QSA--A---VTRTFSTDG--TMVINGIDRRVAQLDGGVPIVRVSSD-RLKGLGITNH----VNFILTPLSAIAPIVKYDS 242 (285) T ss_pred hhh--h---hheeccccc--ceeccceeeeeccccceeEEEEcchh-hccCcCcchh----ccEEEecCceeccceeeee Confidence 654 2 333210000 0111 4455799998 999998776 6665554443 3677777777777665432 Q ss_pred CCcc----------ceechhhccchhHHHHHHHHhhhhcccCCcccEEEEEeeeee Q lcl|NC_019917. 316 LRFD----------WEETVKDYGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAK 361 (364) Q Consensus 316 ~~~~----------w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~ 361 (364) .+.+ |.=+..-|.+-.- ++++-=|+.+--.|+. T Consensus 243 ~~~f~P~~~~~~d~~~~~~R~Y~d~fv-------------~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 243 VSVIDPSTDRSGNRWTIKGLSYYDAIV-------------LDNAKKGIYVAATAGV 285 (285) T ss_pred eEeECCCCCCCcceeeeeeeeeeeeee-------------hhhccceeeeeecccC Confidence 2222 1111111111100 1222225544333333 No 129 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=39.91 E-value=1 Score=20.59 Aligned_cols=278 Identities=10% Similarity=0.071 Sum_probs=131.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) +..|....+.......++.+++......++... ++- ++. . .|.++++..... -....|-..+... 
T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-l~~--------~~~-~---~~~~~~~p~~~~~~~a~~v~Eg~~~~- 92 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-LGK--------YEP-M---EGTEKKFTFWADKPGAYWVGEGQKIE- 92 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhh-hcc--------eee-c---cCCceEEEEEecCcceeEecCCcccc- Confidence 444444444556778889999888888877654 321 110 1 122344332211 1222332222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +..++|..-++.+.....-+.+..++-+ -+.+||...-++.|++=+.+..|+.+|.- .|+-+ ...++. T Consensus 93 -~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~la~ai~~~~d~a~l~G-~g~~~---------~~~gi~ 160 (324) T protein:vir:78 93 -TSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKSIA 160 (324) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCC---------cCcccc Confidence 4557777777777777777766544333 45689999999999999999999888732 11100 001110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + . +...+-.+.+..+.+.|.++...+.... . 
+.=+++|+|..+..|++ T Consensus 161 -~-------------~--~~~~~~~~~~~~t~~~i~~~~~~l~~~~-------------~---~~~~~vmn~~~~~~L~~ 208 (324) T protein:vir:78 161 -Q-------------S--IEKTNKVIKGDFTQDNIIDLEALLEDDE-------------L---EANAFISKTQNRSLLRK 208 (324) T ss_pred -c-------------c--ccccceeccccccHHHHHHHHHhhhhcc-------------C---CCCEEEEcHHHHHHHHH Confidence 0 0 0011112234467888877765443211 0 11146899998888875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| +..+|+|. |.-+.+.|.+++..+.+. .. .--+++|--.-+ .+|-.++..+ T Consensus 209 l~d--------------~~G~~~~~~~~~~~l~G~PV~~~~~~~-------~~----~~~~~~gd~~~~-~~g~~~~~~i 262 (324) T protein:vir:78 209 IVD--------------PETKERIYDRNSDSLDGLPVVNLKSSN-------LK----RGELITGDFDKL-IYGIPQLIEY 262 (324) T ss_pred hhc--------------cCCCeeecCCCCCcccceeeEeeCCCC-------CC----cceEEEEecceE-EEEEecCcEE Confidence 332 22367776 456778888876543211 11 112344432221 1333334444 Q ss_pred cceechh-----hccchhHHHHHHHHhhhhcccCC-cccEEE-----EEeeeeeccC Q lcl|NC_019917. 319 DWEETVK-----DYGNEPAICAGFIAGMKKARFNS-KDFGVI-----SIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~-----D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi-----~idta~~~~~ 364 (364) ...++-. |++...--. +..++-..|... =|++|. 
++-+.+.+-+ T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~--f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~ 317 (324) T protein:vir:78 263 KIDETAQLSTVKNEDGTPVNL--FEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred EEeecccccccccccccchhh--hhcCcEEEEEEEEEccEEecccceEEEecccccC Confidence 3333211 122111000 111111122110 122222 1222222222 No 130 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=39.91 E-value=1 Score=20.59 Aligned_cols=278 Identities=10% Similarity=0.071 Sum_probs=131.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) +..|....+.......++.+++......++... ++- ++. . .|.++++..... -....|-..+... T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-l~~--------~~~-~---~~~~~~~p~~~~~~~a~~v~Eg~~~~- 92 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-LGK--------YEP-M---EGTEKKFTFWADKPGAYWVGEGQKIE- 92 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhh-hcc--------eee-c---cCCceEEEEEecCcceeEecCCcccc- Confidence 444444444556778889999888888877654 321 110 1 122344332211 1222332222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +..++|..-++.+.....-+.+..++-+ -+.+||...-++.|++=+.+..|+.+|.- .|+-+ ...++. 
T Consensus 93 -~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~la~ai~~~~d~a~l~G-~g~~~---------~~~gi~ 160 (324) T protein:vir:96 93 -TSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKSIA 160 (324) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCC---------cCcccc Confidence 4557777777777777777766544333 45689999999999999999999888732 11100 001110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) + . +...+-.+.+..+.+.|.++...+.... . +.=+++|+|..+..|++ T Consensus 161 -~-------------~--~~~~~~~~~~~~t~~~i~~~~~~l~~~~-------------~---~~~~~vmn~~~~~~L~~ 208 (324) T protein:vir:96 161 -Q-------------S--IEKTNKVIKGDFTQDNIIDLEALLEDDE-------------L---EANAFISKTQNRSLLRK 208 (324) T ss_pred -c-------------c--ccccceeccccccHHHHHHHHHhhhhcc-------------C---CCCEEEEcHHHHHHHHH Confidence 0 0 0011112234467888877765443211 0 11146899998888875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| +..+|+|. |.-+.+.|.+++..+.+. .. .--+++|--.-+ .+|-.++..+ T Consensus 209 l~d--------------~~G~~~~~~~~~~~l~G~PV~~~~~~~-------~~----~~~~~~gd~~~~-~~g~~~~~~i 262 (324) T protein:vir:96 209 IVD--------------PETKERIYDRNSDSLDGLPVVNLKSSN-------LK----RGELITGDFDKL-IYGIPQLIEY 262 (324) T ss_pred hhc--------------cCCCeeecCCCCCcccceeeEeeCCCC-------CC----cceEEEEecceE-EEEEecCcEE Confidence 332 22367776 456778888876543211 11 112344432221 1333334444 Q ss_pred cceechh-----hccchhHHHHHHHHhhhhcccCC-cccEEE-----EEeeeeeccC Q lcl|NC_019917. 
319 DWEETVK-----DYGNEPAICAGFIAGMKKARFNS-KDFGVI-----SIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~-----D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi-----~idta~~~~~ 364 (364) ...++-. |++...--. +..++-..|... =|++|. ++-+.+.+-+ T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~--f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~ 317 (324) T protein:vir:96 263 KIDETAQLSTVKNEDGTPVNL--FEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred EEeecccccccccccccchhh--hhcCcEEEEEEEEEccEEecccceEEEecccccC Confidence 3333211 122111000 111111122110 122222 1222222222 No 131 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=39.65 E-value=1 Score=20.57 Aligned_cols=306 Identities=11% Similarity=0.074 Sum_probs=122.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee--ccccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV--HLRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~--~L~G~gv~Gd~~le 78 (364) |+.+....+.......|+..+.......+++.. ++..-. - .+.++++.... .-...+|-.+.... T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~-l~~~~~-----------~-~~~~~~~~~~~~~~~~a~wv~E~~~~~ 217 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLAD-LISSRP-----------V-TSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHh-hccccc-----------c-CCCceEEEEEcCCCCcceeeccCcccc Confidence 666666666667788899998877766665543 332100 0 11123333211 11223443333332 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) +...+|..-++.+....+-+.+..++-+. 
++ +|-..-++.|..=+....|+.+|.- +|+ + ...++ T Consensus 218 --~s~~~f~~i~~~~~k~a~~~~iS~ell~d-~~-~l~~~i~~~l~~~i~~~~d~~~l~G-~G~-~---------~p~Gi 282 (497) T protein:vir:78 218 --FSSEEFARVYEQVGKVANALTITDEGLRD-AP-ELFNFVQGRLLEGIQRKEEVQLLAG-GGY-P---------GVNGL 282 (497) T ss_pred --cccccceeeEeeeeeeEeecHhHHHHHHh-HH-HHHHHHHHHHHHHHHHHHHHHhhcC-CCc-c---------ccccc Confidence 44566666666666665555554444332 33 4666666666666777777665521 221 1 11122 Q ss_pred ccCccc-CCCCCcEEeeccccch------hhhhhcccccHHHHHHHHHHHHhcccCCCC--CcceeeeEec--------- Q lcl|NC_019917. 159 AGNPLE-APDVDHLLYGGVATSK------ASLAATDIMAPIVIERAVEKAAMMQAENPE--TANMVPVSID--------- 220 (364) Q Consensus 159 ~~N~~~-apt~~r~~~~~~at~~------~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~--~~~i~Pv~~~--------- 220 (364) ..++-. +.+............. .........+.+.+..+...+......... .....|.... T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:78 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 111100 0000000000000000 000011122223333332222111111100 0000000000 Q ss_pred -------CceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecC-----------eEEEcCEEEEecCccc Q lcl|NC_019917. 221 -------GDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGG-----------LGMINNVVLHKHRNVI 282 (364) Q Consensus 221 -------g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~-----------~g~~ngvii~e~~~~~ 282 (364) ....=..+|||..+..||.-.| +..+|||... -..+.|.+++..+.|. 
T Consensus 363 ~~~~~~~~~~~~~~vmn~~~~~~l~~lkd--------------~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~ 428 (497) T protein:vir:78 363 VDIQLTLFQTPNAVVMNPRDWELLRLTKD--------------ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) T ss_pred hhhhhhcccCCCeEEEchHHHHHHHHhhc--------------CCCceeccCcccccccccccCCceeeceeeEecCCCC Confidence 0000157789988888774222 1223444321 1245588888877764 Q ss_pred ccccccCCcccccchheeeccchh-eEeeecCCCCCccceechhhc-c-chhHHHHHHHHhhhhcccCCcccEEEEEeee Q lcl|NC_019917. 283 RFNDYGAGANVEAARALFMGRQAG-VIAYGTANGLRFDWEETVKDY-G-NEPAICAGFIAGMKKARFNSKDFGVISIDTA 359 (364) Q Consensus 283 ~~~~~~~~~~v~v~ralllGaqA~-~~A~g~~~g~~~~w~Ee~~D~-g-~~~~i~i~~i~G~~K~rf~~~DfGvi~idta 359 (364) . + .+++|--.. .+..+-..+....|.++..|+ . |.+.+-+..=+++. -.+.+-|-++-+-++ T Consensus 429 ~-------~------~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~--v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 429 L-------G------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL--VYRPSAFQLIQLKKG 493 (497) T ss_pred C-------C------ceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecce--eeccccEEEEEecCC Confidence 2 1 123443221 111122235566666554332 1 11111111111110 002345555555444 Q ss_pred eecc Q lcl|NC_019917. 360 AKKH 363 (364) Q Consensus 360 ~~~~ 363 (364) +.+- T Consensus 494 ~~~~ 497 (497) T protein:vir:78 494 ATGS 497 (497) T ss_pred ccCC Confidence 4444 No 132 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=39.65 E-value=1 Score=20.57 Aligned_cols=306 Identities=11% Similarity=0.074 Sum_probs=122.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee--ccccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV--HLRGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~--~L~G~gv~Gd~~le 78 (364) |+.+....+.......|+..+.......+++.. ++..-. - .+.++++.... 
.-...+|-.+.... T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~-l~~~~~-----------~-~~~~~~~~~~~~~~~~a~wv~E~~~~~ 217 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLAD-LISSRP-----------V-TSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHh-hccccc-----------c-CCCceEEEEEcCCCCcceeeccCcccc Confidence 666666666667788899998877766665543 332100 0 11123333211 11223443333332 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) +...+|..-++.+....+-+.+..++-+. ++ +|-..-++.|..=+....|+.+|.- +|+ + ...++ T Consensus 218 --~s~~~f~~i~~~~~k~a~~~~iS~ell~d-~~-~l~~~i~~~l~~~i~~~~d~~~l~G-~G~-~---------~p~Gi 282 (497) T protein:vir:10 218 --FSSEEFARVYEQVGKVANALTITDEGLRD-AP-ELFNFVQGRLLEGIQRKEEVQLLAG-GGY-P---------GVNGL 282 (497) T ss_pred --cccccceeeEeeeeeeEeecHhHHHHHHh-HH-HHHHHHHHHHHHHHHHHHHHHhhcC-CCc-c---------ccccc Confidence 44566666666666665555554444332 33 4666666666666777777665521 221 1 11122 Q ss_pred ccCccc-CCCCCcEEeeccccch------hhhhhcccccHHHHHHHHHHHHhcccCCCC--CcceeeeEec--------- Q lcl|NC_019917. 159 AGNPLE-APDVDHLLYGGVATSK------ASLAATDIMAPIVIERAVEKAAMMQAENPE--TANMVPVSID--------- 220 (364) Q Consensus 159 ~~N~~~-apt~~r~~~~~~at~~------~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~--~~~i~Pv~~~--------- 220 (364) ..++-. +.+............. .........+.+.+..+...+......... .....|.... 
T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:10 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 111100 0000000000000000 000011122223333332222111111100 0000000000 Q ss_pred -------CceeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeeecC-----------eEEEcCEEEEecCccc Q lcl|NC_019917. 221 -------GDDHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGG-----------LGMINNVVLHKHRNVI 282 (364) Q Consensus 221 -------g~~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~-----------~g~~ngvii~e~~~~~ 282 (364) ....=..+|||..+..||.-.| +..+|||... -..+.|.+++..+.|. T Consensus 363 ~~~~~~~~~~~~~~vmn~~~~~~l~~lkd--------------~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~ 428 (497) T protein:vir:10 363 VDIQLTLFQTPNAVVMNPRDWELLRLTKD--------------ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) T ss_pred hhhhhhcccCCCeEEEchHHHHHHHHhhc--------------CCCceeccCcccccccccccCCceeeceeeEecCCCC Confidence 0000157789988888774222 1223444321 1245588888877764 Q ss_pred ccccccCCcccccchheeeccchh-eEeeecCCCCCccceechhhc-c-chhHHHHHHHHhhhhcccCCcccEEEEEeee Q lcl|NC_019917. 283 RFNDYGAGANVEAARALFMGRQAG-VIAYGTANGLRFDWEETVKDY-G-NEPAICAGFIAGMKKARFNSKDFGVISIDTA 359 (364) Q Consensus 283 ~~~~~~~~~~v~v~ralllGaqA~-~~A~g~~~g~~~~w~Ee~~D~-g-~~~~i~i~~i~G~~K~rf~~~DfGvi~idta 359 (364) . + .+++|--.. .+..+-..+....|.++..|+ . |.+.+-+..=+++. -.+.+-|-++-+-++ T Consensus 429 ~-------~------~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~--v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 429 L-------G------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL--VYRPSAFQLIQLKKG 493 (497) T ss_pred C-------C------ceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecce--eeccccEEEEEecCC Confidence 2 1 123443221 111122235566666554332 1 11111111111110 002345555555444 Q ss_pred eecc Q lcl|NC_019917. 
360 AKKH 363 (364) Q Consensus 360 ~~~~ 363 (364) +.+- T Consensus 494 ~~~~ 497 (497) T protein:vir:10 494 ATGS 497 (497) T ss_pred ccCC Confidence 4444 No 133 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=39.26 E-value=1 Score=20.52 Aligned_cols=280 Identities=10% Similarity=0.069 Sum_probs=125.9 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) +..|............|+..++......++... +.- ++. . .+.++++..... ....+|.+.+.. T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~-~~~--------~~~-~---~~~~~~~p~~~~~~~a~~v~Eg~~~-- 91 (324) T protein:vir:10 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGK--------YEP-M---EGTEKKFTFWADKPGAYWVGEGQKI-- 91 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhh-hcc--------eee-c---cCCceEEEEEeCCcceeEeccCccc-- Confidence 222222222335678889999888877776653 321 000 1 112233332221 112233222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..++|..-++.+.....-+.+..++- +-+.+||-..-++.|.+=+.+..|+.+|.. 
.|..+ ...+ T Consensus 92 ~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G-~g~~~---------~~~~-- 158 (324) T protein:vir:10 92 ETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKS-- 158 (324) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhhc-CCCCc---------cCcc-- Confidence 2455677777776666666666544433 246789999999999999999999877732 12100 0000 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +..... ..+-.+...++.+.|.++........ .+.=+++|||..+..|++ T Consensus 159 ------------i~~~~~--~~~~~~~~~~t~~~i~~~~~~l~~~~----------------~~~~~~v~n~~~~~~L~~ 208 (324) T protein:vir:10 159 ------------IAQSIE--KTNKVIKGDFTQDNIIDLEALLEDDE----------------LEANAFISKTQNRSLLRK 208 (324) T ss_pred ------------cccccc--ccceeccccCCHHHHHHHHHhhhhcc----------------CCCCEEEEcHHHHHHHHH Confidence 010000 01112234567777777755432110 011146789999999875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecC-eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGG-LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~-~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| +..+|+|.+. -+.+.|++++-.+.+ .... . -+++|--.- +.+|-.++..+ T Consensus 209 l~d--------------~~g~~~~~~~~~~~l~G~PV~~~~~~-------~~~~---~-~~~~gd~~~-~~~~~~~~~~i 262 (324) T protein:vir:10 209 IVD--------------PETKERIYDRNSDTLDGLPVVNLKSS-------NLKR---G-ELITGDFDK-LIYGIPQLIEY 262 (324) T ss_pred hhc--------------cCCceeecCCCCccccceeEEeecCC-------CCCc---c-eEEEEeccc-EEEEEecCcEE Confidence 222 2236888754 467788887643211 1111 1 133332222 23444444555 Q ss_pred cceechh-h-ccchhHHHHH-HHHhhhhccc---------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVK-D-YGNEPAICAG-FIAGMKKARF---------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~-D-~g~~~~i~i~-~i~G~~K~rf---------~~~DfGvi~idta~~~~~ 364 (364) ...+|-. . +.+.-+-... +-.++-..|. +.+-|-++..=++...-. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:10 263 KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred EEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 4443311 0 0001111111 1122222221 112222222211111101 No 134 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=36.65 E-value=1.2 Score=20.23 Aligned_cols=269 Identities=11% Similarity=0.083 Sum_probs=112.1 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~leG 79 (364) |.+++...+-.+....+...-|.+..+..+...+| | .+++. ...| .++|.-. ..-...+|.++... T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~------~~~~~---~~~g-~~~ip~~~~~~~a~wv~E~~~~-- 423 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-G------ARMLP---GLVG-DVDIPKKTSGANFYWIGEDEDV-- 423 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhh-c------ceEee---cCCc-ceEEEEEeCCceeEeecCCccc-- Confidence 44545444445555555444555555555444332 2 11111 1122 2333321 11122223222222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .+..++|..-++.+-....-|.+..+|-+ .+.+|+-..-++.|...+....|+.+|.- +|+-+ .| .++. 
T Consensus 424 ~~s~~~f~~i~l~~~k~~~~v~iS~ell~-ds~~~~~~~i~~~l~~a~~~~~d~a~l~G-~G~~~--~p-------~Gi~ 492 (632) T protein:vir:96 424 QDSDFDFTTLSFSPKTIAGAVPVTRKLRK-QSSIHVENLIREDLIEGIGVALDLAMLTG-TGLAN--DP-------VGLL 492 (632) T ss_pred cccccceeeEEeeeeEEEEehhhHHHHHh-ccchHHHHHHHHHHHHHHHHHHHHHhhcc-cCCCC--cc-------ceee Confidence 23445555444444433333333222211 35789999999999999999999988721 22111 00 1211 Q ss_pred cCcccCCCCCcEEeeccccchhhh-hhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASL-AATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i-~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) |....+ ++ .++...+.+.|.++......+. .+ ....+.+|+|.....|+ T Consensus 493 -~~~~~~---------------~~~~~~~~~~~~~i~~~~~~i~~~~-------------~~-~~~~~~~~~~~~~~~l~ 542 (632) T protein:vir:96 493 -NMTGVP---------------ALTYPAGGVDWASVVDMETKISTFN-------------AD-AGRLAYLTSVTQRGAAK 542 (632) T ss_pred -eccccc---------------ceecccccCCHHHHHHHHHHHhhcc-------------cc-cCccEEEEchhHHHHHH Confidence 100000 11 1223355656655544332221 11 12345678888777776 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) ... .+ -+..+|||.+ +..+|.+++....++.. .+++|--+-+ -+|.-+++.+ T Consensus 543 ~~~-------l~-----d~~G~~i~~~--~~l~G~pv~~s~~ip~~-------------~~~~gd~s~~-~i~~~~~~~i 594 (632) T protein:vir:96 543 KAQ-------VF-----DNTGERIWQN--NEVNGYRAEASNQIPAD-------------TWIFGDWSQI-VIAMWGVLDL 594 (632) T ss_pred HHh-------cc-----CCCCceeecC--CeecccceEeccccccC-------------cEEEeecceE-EEEEecceEE Confidence 311 11 1234888876 57778887766554311 1233322211 1222223333 Q ss_pred cceechhhccchhHHHHHHHHhhhhccc---CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVKDYGNEPAICAGFIAGMKKARF---NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~D~g~~~~i~i~~i~G~~K~rf---~~~DfGvi~idta~~~~~ 364 (364) ....+.. . --| .+.| ..-|++|..-..-+.+-- T Consensus 595 ~~~~~~~-~----------~~~--~v~~~~~~~~d~~v~~~~af~~~k~ 630 (632) T protein:vir:96 595 KVDPYTK-A----------ASD--GLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) T ss_pred EEccccc-c----------ccC--ceEEEEEeecCceeechhhhhheee Confidence 2222110 0 000 0112 112343332211111111 No 135 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=34.17 E-value=1.3 Score=19.94 Aligned_cols=277 Identities=10% Similarity=0.065 Sum_probs=114.6 Q ss_pred Ccee-ecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceec-Ccee Q lcl|NC_019917. 1 MTTT-VIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYG-DART 77 (364) Q Consensus 1 Ma~T-~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~G-d~~l 77 (364) ++.| .++.|-......++..+.......++. .++ ++ .+.. .| .+++.....- ...++.+ .+.- T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i-~~~-~~----~~~~-------~~-~~~~p~~~~~~~a~~~~~~~e~~ 206 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFL-RRL-GT----GVKT-------KE-NIKYPVLVKKAEAQGHKNERTNN 206 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhh-hhh-cc----eecc-------CC-ceEEEEEecCCcccceecccccc Confidence 2222 122333455677777777655444443 222 11 1110 11 1344322111 1112111 1111 Q ss_pred ecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 
78 EGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 78 eGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +..+...+|..-++.+....+-+.+..++-+ -+.+||-..-+..|++=+....|+.++.- .|+.+ | ..+ T Consensus 207 ~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~~~~d~~~l~G-~G~~~--------~-~~g 275 (434) T protein:vir:62 207 EMPETDIEFDEIELSPTEFDALATVTKKLLA-RTGLPIEQIVMDELKKAYVRKETQYMVNG-DEANN--------I-NDG 275 (434) T ss_pred cccccccceeeEEeeheeeEeehhhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCc--------c-ccc Confidence 1123345555555555555555555444333 26788888888888888888888877721 11110 0 001 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) +.. .......+++..+.+.|-++....... + ...=+.+|+|..+..| T Consensus 276 --------------~~~---~~~~~~~~~~~~~~d~l~~l~~~l~~~------------~----~~~a~~v~n~~~~~~L 322 (434) T protein:vir:62 276 --------------ALA---KKAVEFKTDEKNLYDALVKMKNTPVKE------------V----RKKARWVLNTAALTKI 322 (434) T ss_pred --------------eee---cccccccccccchhhHHHHHHhhcchh------------h----hcCCEEEEcHHHHHHH Confidence 110 011122334455566665553322110 1 1112457899999888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeeec-------CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEee Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFKG-------GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAY 310 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~G-------~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~ 310 (364) ++-.| +..+|||.. .-.++.|.+++....+.- +.+++. ..+++|-=.-.. . 
T Consensus 323 ~~lkd--------------~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~----~~~~~~---~~i~~Gdfs~~~-i 380 (434) T protein:vir:62 323 ETMKT--------------DDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDI----PDSPDT---PVFYFGDFSKFY-I 380 (434) T ss_pred HHhhc--------------cCCCEeeccCCCccCCCCceecceeeEEecCccC----ccCCCc---eEEEEeeccceE-E Confidence 75222 234788742 223688999988766632 222222 124444221110 1 Q ss_pred ecC-CCCCccceechhhccchhHHHHHHHHhhhhccc------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 311 GTA-NGLRFDWEETVKDYGNEPAICAGFIAGMKKARF------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 311 g~~-~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf------~~~DfGvi~idta~~~~~ 364 (364) +.. +++.+...-+. ...-+.+......|| ...+.-|+.+---++.-+ T Consensus 381 ~~~~g~~~i~~~~~~-------~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 381 QDVIGSLEVQKLVEL-------FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred EEeeceeEEEeehhh-------hcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 111 11222211111 111111222222222 233333332221111111 No 136 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=31.86 E-value=1.5 Score=19.67 Aligned_cols=284 Identities=13% Similarity=0.156 Sum_probs=126.2 Q ss_pred CceeecccCCc-------------hHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc Q lcl|NC_019917. 
1 MTTTVIPFGDP-------------KAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR 67 (364) Q Consensus 1 Ma~T~~~~~dp-------------~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) |+ -.++.+.- +..+..+.- ..|+...-+...+| .++..-+-|.+..|.=+.-++ T Consensus 59 m~-G~~p~~eV~~~e~mtt~~a~IliP~vis~v-~~Eaaepl~~~~kl-----------~qk~~L~~Grsm~F~~~g~~R 125 (393) T protein:vir:79 59 ME-GETPTNEVNLREFMATPSAQILIPRVIVGT-MREAAEPLYIGTKM-----------LQKIRLKSGQSMIFPSIGIMR 125 (393) T ss_pred hc-CCCchhheehhhhhcCCCcceechhhhhhh-hhhcccchhHHHHH-----------HHHHhhhcCcceeccchheee Confidence 22 22222100 111111111 11111111110000 011112345556665444455 Q ss_pred cCceecCceeecchhhhh-hcccEEEEecccceeeccchhhhh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019917. 68 GKPTYGDARTEGTEENLR-FYTDQVKIDQVRHPVSAGGRMSRK---RSVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) Q Consensus 68 G~gv~Gd~~leGnee~L~-~~~~~v~Idq~R~~V~~~~~m~~q---rs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~ 143 (364) ..-|-....++ +.+|+ +..+.|.+.+.|-++.+ .++|| +|-.|+...+-....+-|+++.|+..+..+-- . T Consensus 126 a~~IgEGgE~~--~~sld~~T~dsv~~~~gK~G~~I--a~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~-~ 200 (393) T protein:vir:79 126 AYDVAEGQEIP--EDSIDWQTHESPEIRVGKSGIRL--RFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRS-H 200 (393) T ss_pred ecccccccccc--ccchhhhcCCceeEEechhhhhh--hhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhc-c Confidence 55443333333 67888 77889999999999877 55655 68899999999999999999999988855421 1 Q ss_pred cccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCce Q lcl|NC_019917. 144 GINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDD 223 (364) Q Consensus 144 g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~ 223 (364) .+..|-++.-|++.-|+. 
-+..+ .-.+++|+.-+.++...+ +....+| T Consensus 201 -------ghtvfDa~st~t~ahptG------r~~~~----~qNGTlSleDllDm~~av-~~~hyt~-------------- 248 (393) T protein:vir:79 201 -------GHTVFDNYSTNKLAHTTG------LDKNG----VQNDTFSAEDFLDLIIAV-MANEYTP-------------- 248 (393) T ss_pred -------cceeeeccccCccceeec------CCccc----cccccccHHHHHHHHHHH-hcccCCc-------------- Confidence 123344444444433432 11111 356889998888875433 3322222 Q ss_pred eEEEEEechhHHHHhhcCCHHHHHHHHHhhh---hhccCCC------eeecCeEEEcCEEEEecC----ccccccccc-C Q lcl|NC_019917. 224 HYVVVMSEYQATDMRTAAGGTWIDFQKAAAA---AEGRNNP------IFKGGLGMINNVVLHKHR----NVIRFNDYG-A 289 (364) Q Consensus 224 ~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~---~~g~~nP------lF~G~~g~~ngvii~e~~----~~~~~~~~~-~ 289 (364) =+++|||..|+...+++ ..-..|.+|.. .++.+.- +-+|-+-.==||++-+.. .-.||.-++ . T Consensus 249 -svi~MHPLAWnv~AKna--~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd 325 (393) T protein:vir:79 249 -SDLMMHPLAWTVFAKNE--LMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVD 325 (393) T ss_pred -ceEEEcCchhhhhhhhh--hhcceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEee Confidence 37899999998887654 22223333321 0110000 001111000122222210 011221111 1 Q ss_pred CcccccchheeeccchheEeeecCCCCCccceechhhccchhHHHHHHHHhhhhcccCC--------cccEEEEEeeeee Q lcl|NC_019917. 290 GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFNS--------KDFGVISIDTAAK 361 (364) Q Consensus 290 ~~~v~v~ralllGaqA~~~A~g~~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~--------~DfGvi~idta~~ 361 (364) -+++ .+|| -..+..--.|.|+ +.|+.|++... +-+||.+.---.. T Consensus 326 ~Nnv----gvlL---------V~D~i~tdq~ddk--------------~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~ 378 (393) T protein:vir:79 326 RNNV----GVLL---------VRDDLKTDQWDEK--------------ARGLQNIKMIERYGIGILNEGKAIAVAKNISM 378 (393) T ss_pred cCCc----eEEE---------EecCcceeccccc--------------cccceeeeeeeeeceeeeeCCceEEEEeccee Confidence 1222 2233 1111223344433 45566665432 2333322110001 Q ss_pred ccC Q lcl|NC_019917. 
362 KHS 364 (364) Q Consensus 362 ~~~ 364 (364) ..+ T Consensus 379 ~k~ 381 (393) T protein:vir:79 379 DKS 381 (393) T ss_pred ecc Confidence 111 No 137 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=30.65 E-value=1.6 Score=19.53 Aligned_cols=299 Identities=11% Similarity=-0.005 Sum_probs=133.8 Q ss_pred Cceeeccc------CCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc-ccCceec Q lcl|NC_019917. 1 MTTTVIPF------GDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL-RGKPTYG 73 (364) Q Consensus 1 Ma~T~~~~------~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~G~gv~G 73 (364) |+.+.-.- +.-...+.|+.++.......++... +... ..+ .+.++++.....- ....|.+ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~-l~~~---------~~~---~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLR-LGEN---------IPI---SYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhh-hcce---------eec---cCCceEEEEEecCccceeecc Confidence 32221111 1125678899999888877777654 3211 011 1122333321111 0111111 Q ss_pred Cce---eec---chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019917. 74 DAR---TEG---TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINL 147 (364) Q Consensus 74 d~~---leG---nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~ 147 (364) ++. -|| .+..++|.+-++.+-....-+.+..++-+ -+.+||-..-++.|++=+.+..|+.+|.- .|+.+. 
T Consensus 77 ~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~~~~~i~~~la~a~~~~~d~~~l~G-~g~~~~-- 152 (338) T protein:vir:78 77 GTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFAR-MNPSGLYTKLQADLAYAIGRGIDLAVFHG-KSPLTG-- 152 (338) T ss_pred cccccccccccccccccceeEEEEEEEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCCcc-- Confidence 111 111 24456666666666555555555444333 26789999999999999999999887721 221110 Q ss_pred cccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEE Q lcl|NC_019917. 148 DFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVV 227 (364) Q Consensus 148 ~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~ 227 (364) ....++..+...+.+ +......+.....++.+.++....... .. .+.-+. T Consensus 153 -----~~~~gi~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~-~~~~~~ 202 (338) T protein:vir:78 153 -----SALQGIDTNNVIVNT----------TNVDYLQTGTTPLLDRFLDGYDLVSAN--------------TD-VDFNGW 202 (338) T ss_pred -----ccccccccccccccc----------cccccccccchhhHHHHHHHHHHhhhh--------------cc-ccceEE Confidence 111222211111111 111111111223345555544332110 00 123357 Q ss_pred EEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeec Q lcl|NC_019917. 228 VMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG 302 (364) Q Consensus 228 ~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllG 302 (364) +|+|..+..|+. +.+. ..+..+|||. |.-+.+.|.+++-.+.++.....+.+.. ..+++| T Consensus 203 ~m~~~~~~~L~~--------~~~l---~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~----~~~~~g 267 (338) T protein:vir:78 203 AADPRYRARLLR--------SQAY---RDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSK----VRVVGG 267 (338) T ss_pred EEchHHHHHHHH--------Hhhh---ccCCCceeecccccCCCCceeeeeeEEEccccCccccccCCcc----cEEEEE Confidence 889988877753 1111 1123356653 5568899999988777753322222221 134555 Q ss_pred cchheEeeecCCCCCccceechh--hccchhHHHHH-HHHhhhhccc---------CCcccEEEEEeeeeec Q lcl|NC_019917. 
303 RQAGVIAYGTANGLRFDWEETVK--DYGNEPAICAG-FIAGMKKARF---------NSKDFGVISIDTAAKK 362 (364) Q Consensus 303 aqA~~~A~g~~~g~~~~w~Ee~~--D~g~~~~i~i~-~i~G~~K~rf---------~~~DfGvi~idta~~~ 362 (364) --+. ..||-.+++.....++.. |.....+..+. +-.++-..|+ +.+-|-++.--++..| T Consensus 268 dfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 268 DFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred ecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 4433 224444455554444311 11111111111 1112222222 1244666666666666 No 138 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=29.40 E-value=1.7 Score=19.37 Aligned_cols=293 Identities=14% Similarity=0.120 Sum_probs=123.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEee-ccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSV-HLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~G~gv~Gd~~leG 79 (364) +..+....|-......|+..+.......++.. ++ + .+.++ ...| .+++.... .-...+|..++... T Consensus 132 ~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~-~~-~------~~~v~---~~~~-~~~~p~~~~~~~a~~v~E~~~~~- 198 (435) T protein:vir:80 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVR-KL-G------ARTLP---LSNG-NITIPRLKGGAIVGYIGADTDIP- 198 (435) T ss_pred hcccCCCCCccccchhHHHHHHHHHhhhchhh-hc-c------ceeee---cCCC-ceEEEEEeCCcceeeeccCcccc- Confidence 22333333434556777777765554445443 33 1 00000 0111 12222111 11122232333222 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 
80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSV--HNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~--~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) +...+|..-++.+.....-|.+..++-++ +. .+|...-++.|..-+....|+.+|.- +|+- + .-.+ T Consensus 199 -~~~~~f~~i~~~~~k~~~~~~is~ell~d-s~~~~~l~~~i~~~l~~a~~~~~d~a~l~G-~G~~--~-------~p~G 266 (435) T protein:vir:80 199 -TTQQQFDDLKLTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFIRD-DGTA--N-------TPKG 266 (435) T ss_pred -ccccceeeEEEeeEEEEEeehhhHHHHHh-hcccHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCC--C-------cccc Confidence 44556666666666666666554444322 33 36888888999999999999977721 2210 0 0112 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) +..+. . .....+++...+.+.+......+-......+ +. ...-+++|||..+..| T Consensus 267 i~~~~----~-----------~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~------~~----~~~~~~vmn~~~~~~L 321 (435) T protein:vir:80 267 LRFWA----L-----------PGNVITASDGSTLQKIETDLGKAILALENAD------AN----LTQPGWIMAPRTFRFL 321 (435) T ss_pred eeecc----c-----------ccceeecccccchhhHHHHHHHHHHHhhccc------cc----cccCEEEEcHHHHHHH Confidence 11110 0 0001111222222222222222211100000 00 1223568899998888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeeec-CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCC Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFKG-GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~G-~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~ 316 (364) ++-.| +..+|||.. .-|.+.|.+++....++.... .+++. . .+++|==+- +.+|.-++. 
T Consensus 322 ~~lkd--------------~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~--~~~~~--~-~i~~gd~s~-~~i~~~~~~ 381 (435) T protein:vir:80 322 EGLRD--------------GNGNKVYPELANGMLKGYPVGKTTQVPINLG--EAGKE--S-EIYFTDFGD-VFIGEEETL 381 (435) T ss_pred Hhhhc--------------cCCceeccCCCCCeEeeeeeEEecccccccc--CCCCc--c-eEEEEEccc-EEEEeecce Confidence 64221 233678852 346888999988877654221 11111 1 234442221 123333455 Q ss_pred Cccceechh-hccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeeeccC Q lcl|NC_019917. 317 RFDWEETVK-DYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAKKHS 364 (364) Q Consensus 317 ~~~w~Ee~~-D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~~~~ 364 (364) .+.+..+.. .-++...+ --+..++-.+|... =||.|+-=...+.+.. T Consensus 382 ~i~~~~~~~~~~~~~~~~-~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~ 430 (435) T protein:vir:80 382 EIDYSKEATYKDADGHMV-SAFQRDQTLIRVIAKNDFGPRHVESIAVLSG 430 (435) T ss_pred EEEEeccccccccccchh-hhhhcCcceeeeeeeeCcEeecccceEEEec Confidence 555554421 11111111 11333443333322 2444443333333333 No 139 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=28.95 E-value=1.7 Score=19.32 Aligned_cols=280 Identities=13% Similarity=0.046 Sum_probs=138.7 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccccCceecCceeec- Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLRGKPTYGDARTEG- 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG- 79 (364) ||-| +. -.++|+..|-..+.+.+.+.. |. +.+.-| +. 
.-|.+|.++ +++-.|..-=+|-.| T Consensus 1 Mant-l~-----ya~~~~~~LD~~~~~~~~s~~-l~--~~~~~v-~~-----~ggktVkIp---~i~~~gl~DY~R~~g~ 62 (312) T protein:vir:10 1 MANT-LA-----YGQVLQQGLDKQATQELLTGW-MD--SNAKQI-KY-----EGGKEVKIG---KLSTDGLGDYSRGSAN 62 (312) T ss_pred CCcc-hh-----HHHHHHHHHHHHHHhhhcccc-cc--CCCceE-EE-----ecCcEEEEE---eeecccccccccccCC Confidence 9944 23 347899999888877776653 42 222223 33 258899988 333344422233344 Q ss_pred --chhhhhhcccEEEEecccceeeccchhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccccccccccccc Q lcl|NC_019917. 80 --TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSV--HNIRRIARDRLGDYFYKFTDELLFIYLSG-ARGINLDFVETPD 154 (364) Q Consensus 80 --nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~--~dlr~~ar~~L~~w~~~~~D~~~~~~l~g-a~g~~~~~~~~~~ 154 (364) +..+++....+..++|-|----.=.+|+...|- +.+-...+.-..+...-..|.-.|..|+. +-+.+ T Consensus 63 ~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~-------- 134 (312) T protein:vir:10 63 AYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIK-------- 134 (312) T ss_pred ccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccc-------- Confidence 334688888899999988322221445544432 22333333333333334556666666653 11100 Q ss_pred ccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhH Q lcl|NC_019917. 155 FTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQA 234 (364) Q Consensus 155 ~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~ 234 (364) ... ..+...+++++. -++.|+.+.++.+..+ +. +-.||+++|.-. 
T Consensus 135 ------------~~~------~~~~~~~~T~~n--i~~~i~~~~~~lde~~-------------vp--~~rvl~vTp~~~ 179 (312) T protein:vir:10 135 ------------GDT------NVEYSYSVNSST--IINKIKTGIKIIRENG-------------YN--GPLVCHLTYDSM 179 (312) T ss_pred ------------ccc------ccccccccCHHH--HHHHHHHHHHHHHHcc-------------CC--CceEEEeChHHH Confidence 000 001111223333 2455666666554332 11 246899999988 Q ss_pred HHHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCccccc------ccccC---------Ccccccchhe Q lcl|NC_019917. 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRF------NDYGA---------GANVEAARAL 299 (364) Q Consensus 235 ~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~------~~~~~---------~~~v~v~ral 299 (364) .-|+++ .. ++.. ..-..+-...+.++.+|||.|+|.|.- || .++.. .+..+-...+ T Consensus 180 ~lLk~~--~~----~~~~--~~~~~~~~i~~~V~~iDgv~Ii~VPs~-r~~t~~~f~dG~t~~~~~gg~~~~~~ak~INf 250 (312) T protein:vir:10 180 FAIEEK--VL----EKLT--AVTFAQGGIQTQVPSIDGCALIKTPQN-RMYSSILLNDGTTSNQTAGGYLKGTKALDTNF 250 (312) T ss_pred HHHhhh--hh----ceec--ccccccceeeeeeeeecccEEEEchhh-hccceeeeccCcccccccCceeecCcccccce Confidence 888852 11 1111 111223356899999999999998875 44 33311 1222345688 Q ss_pred eeccchheEeeecCCCCCccceech----------hhccchhHHHHHHHHhhhhcccCCcccEE-EEEeeeeeccC Q lcl|NC_019917. 300 FMGRQAGVIAYGTANGLRFDWEETV----------KDYGNEPAICAGFIAGMKKARFNSKDFGV-ISIDTAAKKHS 364 (364) Q Consensus 300 llGaqA~~~A~g~~~g~~~~w~Ee~----------~D~g~~~~i~i~~i~G~~K~rf~~~DfGv-i~idta~~~~~ 364 (364) ||=...+.+|.-++.-.+.+=.+.. .-|.+-.- ++++-=|+ +.+.+|-+- . 
T Consensus 251 iiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv-------------~~nk~~~Iyv~~k~a~~~-~ 312 (312) T protein:vir:10 251 IIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWV-------------TDNKANSVYANFKDAKPV-G 312 (312) T ss_pred EEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeee-------------eccccCeEEEEeecccCC-C Confidence 8888888888876544433311111 11111000 11222233 122111111 1 No 140 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=28.67 E-value=1.7 Score=19.28 Aligned_cols=288 Identities=10% Similarity=0.053 Sum_probs=128.0 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEE-eeccccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDL-SVHLRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L-~~~L~G~gv~Gd~~leG 79 (364) ||.|...-+-......++..++......++... +.- ++. . .+.++++.. .......+|-+++..+. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~-l~~--------~~~-~---~~~~~~~p~~~~~~~a~wv~E~~~~~~ 67 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLS-AFQ--------NVN-M---GTKTTHLPVLATLPEADWVGESATDPK 67 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhh-hcc--------eee-c---cCCcEEEEEEeCCcceEEeeccccccc Confidence 999988887788889999999988888877653 321 111 1 122344432 22223445544455444 Q ss_pred ch---hhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Q lcl|NC_019917. 80 TE---ENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFT 156 (364) Q Consensus 80 ne---e~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~ 156 (364) .+ ...+|..-++.+-....-+.+..+|-+ -+.+||-..-++.|++=+++..|+.+|. |. |- + .+.+. 
T Consensus 68 ~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~~~~a~~~d~a~~~---G~-g~--~---~~~~~ 137 (305) T protein:vir:25 68 GVKPTSKVTWANRTLVAEEIAVIIPVHENVID-DATVAVLTEVAELGGQAIGKKLDQAVIF---GT-DK--P---ASWVS 137 (305) T ss_pred ccccccccceeeEEeeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHhhhhee---cc-CC--C---CCccc Confidence 32 355666667777777777766555443 3678999999999999999999998882 31 00 0 00000 Q ss_pred ccccCcccCCCCCcEEeeccccchhhhh-hcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHH Q lcl|NC_019917. 157 GFAGNPLEAPDVDHLLYGGVATSKASLA-ATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQAT 235 (364) Q Consensus 157 ~~~~N~~~apt~~r~~~~~~at~~~~i~-~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~ 235 (364) . + +.+...+.....+ ++-......+..+...+.... . -.+...=-++|||..+. T Consensus 138 ~---~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-----------~~~~~~~~~v~~~~~~~ 192 (305) T protein:vir:25 138 P---A----------LIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAV-A-----------SAGWAPDTLLSSLALRY 192 (305) T ss_pred c---c----------cccccccccccccccccchhhhHHHHHHHHHHHhh-h-----------hcccccceeEecHHHHH Confidence 0 0 0000000001111 111111121222222221110 0 00001012677999988 Q ss_pred HHhhcCCHHHHHHHHHhhhhhccCCCeeecCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCC Q lcl|NC_019917. 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) Q Consensus 236 ~Lr~~~d~~w~~~qk~A~~~~g~~nPlF~G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g 315 (364) .|++-.| +..+|||.. +.+.|.+++-.+.+.. .... . .+++|--.- +.+|-.++ T Consensus 193 ~l~~lkd--------------~~G~~i~~~--~~l~G~Pv~~~~~~~~----~~~~-~----~~~~gd~s~-~~i~~~~~ 246 (305) T protein:vir:25 193 EVANIRD--------------ANGNPVFRD--DSFAGFRTFFNRNGAW----DADA-A----IEVIADSSR-VKIGVRQD 246 (305) T ss_pred HHHHhhc--------------cCCceeecC--CcccccceEEcCccCC----CCCc-c----EEEEEecce-EEEEEecC Confidence 8874222 134688865 4667777765444321 1111 1 233443221 12333344 Q ss_pred CCccceech-hhccchhHHHHHHHHhhhhcccC-CcccEE-----EEEeeeeeccC Q lcl|NC_019917. 
316 LRFDWEETV-KDYGNEPAICAGFIAGMKKARFN-SKDFGV-----ISIDTAAKKHS 364 (364) Q Consensus 316 ~~~~w~Ee~-~D~g~~~~i~i~~i~G~~K~rf~-~~DfGv-----i~idta~~~~~ 364 (364) ..+...+|- ++-+... +. -+-..+-.+|.- --||+| ++.-+..+.-. T Consensus 247 ~~i~~~~~~~~~~~~~~-~~-~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~ 300 (305) T protein:vir:25 247 ITVKFLDQATLGTGENQ-IN-LAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) T ss_pred eEEEEeeeeeeecCCce-ee-eeecCcEEEEEEEeecceeeCcccEEEEccccccc Confidence 444433321 1111100 00 000111111110 012222 12222211111 No 141 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=27.39 E-value=1.8 Score=19.12 Aligned_cols=296 Identities=11% Similarity=0.124 Sum_probs=120.0 Q ss_pred Cc-eeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEe-eccccCceecCceee Q lcl|NC_019917. 1 MT-TTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLS-VHLRGKPTYGDARTE 78 (364) Q Consensus 1 Ma-~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~le 78 (364) ++ .|....|-......++..+.......++. .++ |. ++++ ...|+ ++++.. ......+|...+.. T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l-~~~-~~------~~~~---~~~g~-~~~p~~~~~~~a~~v~Eg~~~- 191 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIV-RKL-GA------RSIP---LPNGN-MSLPRLAGGATASYTGENQDA- 191 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchh-hhh-cc------eeee---cCCcc-eEEEEEeCCcceeeeccCccc- Confidence 22 23333333445567777766544334433 223 10 0111 11122 233221 11122233222222 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 
79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) .+...+|..-++.+....+-|.+..+|-+ -+.+||-..-++.|.+=+....|+.+| .|+ ++ .....|+ T Consensus 192 -~~~~~~f~~i~~~~~k~~~~v~is~ell~-ds~~~l~~~i~~~l~~ai~~~~d~~~l---~G~------G~-~~~p~Gi 259 (428) T protein:vir:10 192 -KVSEARFDDVKLTAKTMIAMVPISNALIG-RAGFNVEQLVLQDILTAISVREDKAFM---RDD------GT-GDTPIGM 259 (428) T ss_pred -cccccceeeEEeeeEEEEEeehhhHHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHHh---ccC------CC-Ccccccc Confidence 24456666666666666666655444422 356788888888888888888888776 231 10 0111233 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) .... +....+....+ .-..+++.++.......+....... . ...-+.+|+|..+..|+ T Consensus 260 ~~~~----~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~------~----~~~~~~v~n~~~~~~L~ 317 (428) T protein:vir:10 260 KARA----TQWNRLLPWAA--------DAAVNLDTIDTYLDSIILMSMDGNS------N----MISSGWGMSNRTYMKLF 317 (428) T ss_pred cccc----ccccccccccc--------cccccHHHHHHHHHHHHHhhhcccc------c----cccCEEEEcHHHHHHHH Confidence 2111 11111111111 1122333333322222111100000 0 11235678999988887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeeec-CeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFKG-GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLR 317 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~G-~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~ 317 (364) +-. . +..+|||.. .-|.+.|.+++....++... +.+++. ..+++|-=+- +-+|..++.. 
T Consensus 318 ~lk---------d-----~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~--~~~~~~---~~i~~gd~s~-~~i~~~~~i~ 377 (428) T protein:vir:10 318 GLR---------D-----GNGNKVYPEMAQGMLKGYPIQRTSAIPANL--GEGGKE---SEIYFADFND-VVIGEDGNMK 377 (428) T ss_pred Hhh---------c-----cCCceeccCCCCCeeeceeeEEeccccccc--cCCCcc---ceEEEEecce-EEEEEecceE Confidence 422 1 233578742 23578899988776654322 111111 1234443221 1122223344 Q ss_pred ccceechhhccchhHHHHHHHHhhhhcccCC-cccEEEEEeeeee-ccC Q lcl|NC_019917. 318 FDWEETVKDYGNEPAICAGFIAGMKKARFNS-KDFGVISIDTAAK-KHS 364 (364) Q Consensus 318 ~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~~-~DfGvi~idta~~-~~~ 364 (364) .....+..-......+...+..++-.+|... =||+|.. +.|.+ +.. T Consensus 378 i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~-p~a~~~~t~ 425 (428) T protein:vir:10 378 VDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRH-PEGLVLGTG 425 (428) T ss_pred EEeecccccccccccccchhhcchhheeeeeeeCceeec-cceEEEEec Confidence 4333321100011111123334444444321 1333321 11111 111 No 142 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=24.71 E-value=2.1 Score=18.77 Aligned_cols=264 Identities=11% Similarity=0.061 Sum_probs=120.4 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeecc--ccCceecCceee Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHL--RGKPTYGDARTE 78 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L--~G~gv~Gd~~le 78 (364) ++.+....| ...+..+...+.......++..+ ++.. .. . .+.++++.....- ...+|... 
-+ T Consensus 114 ~~~~~~~~g-~~~~~~~~~~ii~~~~~~~~l~~-~~~~--------~~-~---~~~~~~~~~~~~~~~~a~~v~Eg--~~ 177 (390) T protein:vir:10 114 STDAAGSAG-ALTTPNRLPGFITQPDARLTVRD-LIGS--------GR-T---DSALIEYVQETGFVNNAAIVAEG--AL 177 (390) T ss_pred hcccccccc-cccchhHHHHHHHHHHhhchhhh-hcce--------ee-c---cCCceEEEEEecCCcceeeecCC--cc Confidence 333333322 34455667777777666665543 3210 00 0 1222333322211 12222222 23 Q ss_pred cchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_019917. 79 GTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGF 158 (364) Q Consensus 79 Gnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~ 158 (364) -.+..++|..-++.+.....-+.+..++-+ -+ .+|-..-++.|...++...|+.++.- +|+. ....++ T Consensus 178 ~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~l~~~i~~~l~~~~~~~~~~~il~G-~G~~---------~~p~Gi 245 (390) T protein:vir:10 178 KPESSLKFAKKTDTTHVIAHTMKATRQILS-DA-PQLASYMNNRLIRGLKVKEDAEILRG-TGAN---------DGLLGL 245 (390) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHH-hH-HHHHHHHHHHHHHHHHHHHHHHHhhc-CCCC---------cccccc Confidence 335566777777777776666666555433 24 48888999999999999999877622 2211 112222 Q ss_pred ccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHh Q lcl|NC_019917. 159 AGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMR 238 (364) Q Consensus 159 ~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) ... .+.+.. ..+.+...+.+.+..++..+.... 
.+.-+++|||..+..|+ T Consensus 246 ~~~-------------~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~----------------~~~~~~v~n~~~~~~L~ 295 (390) T protein:vir:10 246 IPQ-------------ATTYAA-PTTIAGATRVDQLRLAMLQASLAE----------------YPASGIVINPIDWAAIE 295 (390) T ss_pred ccc-------------cccccc-cccccccchHHHHHHHHHhhcccc----------------CCCCEEEEcHHHHHHHH Confidence 211 011110 011112234455555544332110 11224679999888887 Q ss_pred hcCCHHHHHHHHHhhhhhccCCCeee----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 239 TAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 239 ~~~d~~w~~~qk~A~~~~g~~nPlF~----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) +-.| +..+|||. +.-+.+.|++++..+.++.. .+++|--.-++-.+-.. T Consensus 296 ~lkd--------------~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~-------------~~~~gdf~~~~~~~~~~ 348 (390) T protein:vir:10 296 LAKD--------------ANNQYLIGNARGTLTPTLWGLPVVATQAMAPG-------------EFLVGAFDLAAQIFDQW 348 (390) T ss_pred Hhhc--------------CCCceeecCCcCcCCceecceeeEEcCCCCCC-------------cEEEEeccceEEEEEec Confidence 4222 12245554 44567789998887766421 13444322111112223 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhc----ccCC---cccEEEEEeee Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKA----RFNS---KDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~----rf~~---~DfGvi~idta 359 (364) +....|..+. ++ +-.++-.. ||.. 
..-+++.|.-| T Consensus 349 ~~~i~~~~~~-~~---------~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 349 DARVEIGYVN-DD---------FQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ceEEEEeecc-cc---------cccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 4444444321 11 11122222 2211 22334444444 No 143 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=24.44 E-value=2.2 Score=18.73 Aligned_cols=280 Identities=10% Similarity=0.083 Sum_probs=126.8 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeec-cccCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVH-LRGKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~G~gv~Gd~~leG 79 (364) +..|.....+......|+.++.....+.++... +... +. . .+.++++..... -...+|.+++... T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~-~~~~--------~~-~---~~~~~~~p~~~~~~~a~~v~Eg~~~~- 92 (324) T protein:vir:99 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMR-LGKY--------EP-M---EGTEKKFTFWADKPGAYWVGEGQKIE- 92 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhh-hcce--------ee-c---cCCceEEEEEecCcceeEeccCcccc- Confidence 222222223335678888888888877776643 3211 00 0 112233332211 1222333333322 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) +..++|..-++.+.....-+.+..++-+ -+.+||-..-+..|.+=+.+..|+.+|.. 
.|..+ .-.+ T Consensus 93 -~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~ai~~~~d~~~l~G-~g~~~---------~~~~-- 158 (324) T protein:vir:99 93 -TSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNP---------FGKS-- 158 (324) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhhc-CCCCc---------cCcc-- Confidence 5566777777777777766766544332 35689989999999998988899887732 12110 0000 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) +....+ ...-.+.+.++.+.|.++....... .... . +++|+|.-+..|++ T Consensus 159 ------------~~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~-------------~~~~-~--~~v~n~~~~~~L~~ 208 (324) T protein:vir:99 159 ------------IAQSIE--KTNKVIKGDFTQDNIIDLEALLEDD-------------ELEA-N--AFISKTQNRSLLRK 208 (324) T ss_pred ------------cccccc--ccceeccccCCHHHHHHHHHhhhhc-------------cCCC-C--EEEEcHHHHHHHHH Confidence 010000 0111233456777787775543221 1111 1 46789999999875 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeeecC-eEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCCCCCc Q lcl|NC_019917. 240 AAGGTWIDFQKAAAAAEGRNNPIFKGG-LGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~G~-~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~g~~~ 318 (364) -.| +..+|+|.+. -+.+.|.+++-.+.+ .... . -+++|=-.- +.++-.++..+ T Consensus 209 l~d--------------~~g~~~~~~~~~~~l~G~PVv~~~~~-------~~~~---~-~~i~gd~~~-~~~~~~~~~~i 262 (324) T protein:vir:99 209 IVD--------------PETKERIYDRNSDTLDGLPVVNLKSS-------NLKR---G-ELITGDFDK-LIYGIPQLIEY 262 (324) T ss_pred hhc--------------CCCceeecCCCCccccceeEEeecCC-------CCCc---c-eEEEEeccc-EEEEEecCcEE Confidence 222 2236888754 467888887543221 1111 0 133443222 22343334444 Q ss_pred cceechh--hccchhHHHHH-HHHhhhhccc---------CCcccEEEEEeeeeeccC Q lcl|NC_019917. 
319 DWEETVK--DYGNEPAICAG-FIAGMKKARF---------NSKDFGVISIDTAAKKHS 364 (364) Q Consensus 319 ~w~Ee~~--D~g~~~~i~i~-~i~G~~K~rf---------~~~DfGvi~idta~~~~~ 364 (364) ...+|.. .+.+.-+.... +..++-.+|. +.+-|-++..=+++..-. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~ 320 (324) T protein:vir:99 263 KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred EEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 3333311 00000011110 1222222222 112222222211111111 No 144 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=24.24 E-value=2.2 Score=18.70 Aligned_cols=255 Identities=10% Similarity=0.075 Sum_probs=113.1 Q ss_pred Cc-eeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc--cCceecCcee Q lcl|NC_019917. 1 MT-TTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR--GKPTYGDART 77 (364) Q Consensus 1 Ma-~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~--G~gv~Gd~~l 77 (364) |. .+....+....+..|+..+.......+.+.+ ++..- . . .+..++++....-. ..++.+.... T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~~------~---~---~~~~~~~~~~~~~~~~~~~~~E~~~~ 199 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKP-FTNVF------Q---A---STQKGTYPTVANATTKMVTVAELEKN 199 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhh-cceeE------e---c---cCcceEEEEEecCCCccccccccccc Confidence 22 2233335566778888888777666555543 33210 0 0 11123333222211 1222221111 Q ss_pred ecchhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_019917. 
78 EGTEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTG 157 (364) Q Consensus 78 eGnee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~ 157 (364) ......+|..-++.+....+-|.+..++- +-+.+||...-++.|.+-+....|+.++.-+.+. T Consensus 200 -~~~~~~~f~~i~~~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~--------------- 262 (400) T protein:vir:38 200 -PAMAKPEFKPVNWSVETYRQALPVSQESI-DDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF--------------- 262 (400) T ss_pred -cccccccceeeEeehhheeeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc--------------- Confidence 12234555555555555555554433221 2355666666666666666666665444211100 Q ss_pred cccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHH Q lcl|NC_019917. 158 FAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDM 237 (364) Q Consensus 158 ~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) +++ ...+.+-|-.+.... .. |. ..-+.+|||.-+..| T Consensus 263 ------~~~--------------------~~~~~~~~~~~~~~~-~~-----------~~-----~~a~~v~~~~~~~~l 299 (400) T protein:vir:38 263 ------TAK--------------------TISSVDDLKHINNVD-LD-----------PA-----YSRVIIASQSFYNFL 299 (400) T ss_pred ------ccc--------------------ccccHHHHHHHHHhh-hh-----------hh-----hCcEEEEcHHHHHHH Confidence 000 011222222221100 00 00 123678899988888 Q ss_pred hhcCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeec Q lcl|NC_019917. 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) Q Consensus 238 r~~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~ 312 (364) ++- |. ...+|||. |.-+++.|.+++-.+.++- +.++ +..+|+|-=.-++-.+. 
T Consensus 300 ~~l---------kd-----~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~----~~~g----~~~~~~gd~s~~~~~~~ 357 (400) T protein:vir:38 300 DTV---------KD-----GNGRYLLQDSILTPSGKSVLGMPIAVVSDDTL----GAAG----EAHAFLGDIKRAILFAN 357 (400) T ss_pred HHh---------hc-----cCCCeeeecCcCCCCccccccceeEEeccccc----CCCC----ceEEEEEeccccEEEEe Confidence 742 21 23467885 3446899999987765532 1222 22456665332222222 Q ss_pred CCCCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeeee Q lcl|NC_019917. 313 ANGLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTAA 360 (364) Q Consensus 313 ~~g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta~ 360 (364) ..+..+.|..+ |+-.+ . +.+. .||. .+-|-.|.+=++| T Consensus 358 ~~~~~~~~~~~--~~~~~-~-----~~~~--~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 358 RADFMVRWVDD--QIYGQ-F-----LQAG--MRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ecceEEEEecc--cccce-e-----EEEE--EEeccEEecccceEEEEeecCC Confidence 24566667654 33221 1 2222 2442 3445555543333 No 145 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=23.86 E-value=2.2 Score=18.65 Aligned_cols=255 Identities=8% Similarity=0.063 Sum_probs=107.2 Q ss_pred CceeecccCCchHHHHHHHHHHHHHHhhcccccceeecCCCccEEEEeecCCCCCceEEEEEeeccc-cCceecCceeec Q lcl|NC_019917. 1 MTTTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDTISFDLSVHLR-GKPTYGDARTEG 79 (364) Q Consensus 1 Ma~T~~~~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~g~~~~I~~~~dL~k~~Gd~v~f~L~~~L~-G~gv~Gd~~leG 79 (364) ++.+....+.-.....++..+.. -...++..+ ++..- + -.+.+..+++...-+ +.+..+...... 
T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~-~~~~~---~---------~~~~~~~~~~~~~~~~~~~~~~E~~~~~ 197 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLE-PKDIVDLSK-YVRSV---P---------VNSASGKFPVISKSGSKMATVQQLEKNP 197 (397) T ss_pred hhcccccccccchhHHHHHHHHH-hhhhhhHHH-hhhhc---c---------ccccceeEEEEeccCCcccccccccccc Confidence 44333333333444555555542 222222211 11100 0 011122333222111 111112211122 Q ss_pred chhhhhhcccEEEEecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_019917. 80 TEENLRFYTDQVKIDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFVETPDFTGFA 159 (364) Q Consensus 80 nee~L~~~~~~v~Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ga~g~~~~~~~~~~~~~~~ 159 (364) .....+|..-++.+....+-+.+..++- +-+.+||-..-++.|.+=+....|..++.- .|. T Consensus 198 ~~~~~~~~~i~~~~~~~~~~~~~s~ell-~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g-~g~----------------- 258 (397) T protein:vir:96 198 QLANPKMVEIDYSVATRRGYIPISQEMI-DDASYDVTGLIADEIQDQSLNTKNADIAAV-LKT----------------- 258 (397) T ss_pred ccccccccceeecHhHhhcchhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc----------------- Confidence 2334555555555555555554433322 235667766666666666666666655521 110 Q ss_pred cCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEecCceeEEEEEechhHHHHhh Q lcl|NC_019917. 160 GNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSIDGDDHYVVVMSEYQATDMRT 239 (364) Q Consensus 160 ~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) .+ |+ ...+.+.|.++...+ .. |.. + -+.+|||..+..|++ T Consensus 259 ~~----~~-------------------~~~~~d~~~~~~~~~-~~-----------~~~-~----a~~v~n~~~~~~l~~ 298 (397) T protein:vir:96 259 AT----AK-------------------SVVGVDGLKDLINKE-IK-----------KVY-D----VKLFISASMYSELDK 298 (397) T ss_pred cc----cc-------------------cccchHHHHHHHHHh-hh-----------hhc-C----cEEEEcHHHHHHHHH Confidence 00 11 112233333333211 11 110 1 257999999988874 Q ss_pred cCCHHHHHHHHHhhhhhccCCCeee-----cCeEEEcCEEEEecCcccccccccCCcccccchheeeccchheEeeecCC Q lcl|NC_019917. 
240 AAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) Q Consensus 240 ~~d~~w~~~qk~A~~~~g~~nPlF~-----G~~g~~ngvii~e~~~~~~~~~~~~~~~v~v~ralllGaqA~~~A~g~~~ 314 (364) - |. +..+|||. |.-+.+.|.+++..+...- .+..+ ...+|+|--.-++-++... T Consensus 299 l---------kd-----~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~---~~~~~----~~~~~~gd~~~~~~~~~~~ 357 (397) T protein:vir:96 299 L---------KD-----KNGRYLLQDSITAASGKQLLGKEVVVLDDDVI---GKSVG----NVVGFIGDAKAFASFFDRK 357 (397) T ss_pred h---------hc-----cCCCeEeccCccCCCcccccccceEEeccccc---CCCCC----ceEEEEeehhcceEeEeec Confidence 2 22 22367774 3346888998886543211 11122 3356777433222233334 Q ss_pred CCCccceechhhccchhHHHHHHHHhhhhcccC-----CcccEEEEEeee Q lcl|NC_019917. 315 GLRFDWEETVKDYGNEPAICAGFIAGMKKARFN-----SKDFGVISIDTA 359 (364) Q Consensus 315 g~~~~w~Ee~~D~g~~~~i~i~~i~G~~K~rf~-----~~DfGvi~idta 359 (364) +....|..+ ++... .+.++ .||. .+-|-++.+-+| T Consensus 358 ~~~~~~~~~--~~~~~------~~~~~--~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 358 QVSVSWVDN--NIYGQ------LLAGI--IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ceEEEEecc--cccce------eEEEE--EEEccEEecccceEEEEeecC Confidence 555666543 33221 12333 2442 344555555444 No 146 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=20.89 E-value=2.7 Score=18.22 Aligned_cols=280 Identities=12% Similarity=0.037 Sum_probs=110.3 Q ss_pred Cc-eeecc-------------cCCchHHHHHHHHHHHHHHhhcccccceeec-CC-CccEEEEeecCCCCCceEEEEEee Q lcl|NC_019917. 1 MT-TTVIP-------------FGDPKAVKRWSADLAVDVRKKSYFEQRFIGT-SE-NAVIQRKTELESDAGDTISFDLSV 64 (364) Q Consensus 1 Ma-~T~~~-------------~~dp~a~~~w~~~l~~~~~~~s~f~~~~~G~-g~-~~~I~~~~dL~k~~Gd~v~f~L~~ 64 (364) |+ -|.+- .++|... ... ..+..++..+..+|.-+ +. ++.+ |.|.-.. 
T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I---~~~-i~e~~~~~~iad~lf~~~~a~~~~~-------------v~f~~~~ 63 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWI---PTA-LKKMMVNQFISESLFRNGGANPNGV-------------VAYNEGN 63 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhH---HHH-HHHHHhccchhhhhhhcccccccce-------------eEEEecc Confidence 44 12111 1233311 111 13444555555543222 22 2222 2232222 Q ss_pred ccccCceecCceeecchh---hhhhcccEEE-EecccceeeccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019917. 65 HLRGKPTYGDARTEGTEE---NLRFYTDQVK-IDQVRHPVSAGGRMSRKRSVHNIRRIARDRLGDYFYKFTDELLFIYLS 140 (364) Q Consensus 65 ~L~G~gv~Gd~~leGnee---~L~~~~~~v~-Idq~R~~V~~~~~m~~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l~ 140 (364) +.-..+- =.+..||-|= ...+...+|. +.-..-.+++..+ .+.|..+|.-..+-.+|..=+.++.|++.+-.|. T Consensus 64 p~~~~~d-~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~E-m~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~ 141 (318) T protein:vir:10 64 PSFLEDD-VADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKE-MIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQ 141 (318) T ss_pred cccccCc-HhhccCcccccccCCCCCchhhhhhehhccceeccHH-HHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2211110 0112233221 1222222221 1222233333323 3567788888888899999999999999998886 Q ss_pred ccccccccccccccccccccCcccCCCCCcEEeeccccchhhhhhcccccHHHHHHHHHHHHhcccCCCCCcceeeeEec Q lcl|NC_019917. 141 GARGINLDFVETPDFTGFAGNPLEAPDVDHLLYGGVATSKASLAATDIMAPIVIERAVEKAAMMQAENPETANMVPVSID 220 (364) Q Consensus 141 ga~g~~~~~~~~~~~~~~~~N~~~apt~~r~~~~~~at~~~~i~~~D~~s~~~i~~a~~~a~~~~~~~~~~~~i~Pv~~~ 220 (364) .+-. +-+.++.. .+..+....|++ .|+..+..+. +. +.|-... T Consensus 142 sa~t----------------~~~~~s~~---------w~~~~~~~~d~~------~A~e~v~~a~---~~---~~~a~~~ 184 (318) T protein:vir:10 142 SPIV----------------PTLAVPTA---------WDNGGKVRTDIA------IAIEQISTAA---PT---AYPAGVG 184 (318) T ss_pred cccc----------------ccccCCcC---------CCCcccccccch------hhhhhhhhhh---hh---hhhhhhh Confidence 4321 11111111 000011111221 1222221111 00 0111111 Q ss_pred Cc------eeEEEEEechhHHHHhhcCCHHHHHHHHHhhhhhccCCCee-----ecCe-EEEcCEEEEecCccccccccc Q lcl|NC_019917. 
221 GD------DHYVVVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIF-----KGGL-GMINNVVLHKHRNVIRFNDYG 288 (364) Q Consensus 221 g~------~~yV~~l~p~q~~~Lr~~~d~~w~~~qk~A~~~~g~~nPlF-----~G~~-g~~ngvii~e~~~~~~~~~~~ 288 (364) .+ ..=+++|||.++..|+++ +.|++... ++.||+| +|.+ |+.-|+-+...|.++... T Consensus 185 ~~~~~~GY~pdtIVlhP~~~~~l~~n--~~~~~~y~------~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~--- 253 (318) T protein:vir:10 185 SSDEYFGFIPDTIVMHYALLPILMDN--ENFMKVYE------RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDR--- 253 (318) T ss_pred hhhhccCccceeeEECHHHHHHHhcc--hhhhhhhh------ccchhhhhcccccccccceeeceEEeecCccCCCe--- Confidence 11 124899999999999974 46655432 2347776 4443 556778777777665422 Q ss_pred CCcccccchheeeccchheEeeecCCC-CCccceechhh-ccchhHHHHHHHHhhhhcccCCcccEEEEEeeeeeccC Q lcl|NC_019917. 289 AGANVEAARALFMGRQAGVIAYGTANG-LRFDWEETVKD-YGNEPAICAGFIAGMKKARFNSKDFGVISIDTAAKKHS 364 (364) Q Consensus 289 ~~~~v~v~ralllGaqA~~~A~g~~~g-~~~~w~Ee~~D-~g~~~~i~i~~i~G~~K~rf~~~DfGvi~idta~~~~~ 364 (364) +++|=.++.+.- +-..+ +...|..|.-| +|-+..- .++.- .|| .-+||.-=-.+++++. T Consensus 254 ---------alvlq~g~vG~~-~d~~pl~~t~~~~egg~~~g~~~~s--~~~~~---~~~--~~~~V~~PkA~~~itg 314 (318) T protein:vir:10 254 ---------VLIMERGTVGFY-SDTRPLQFTALYPEGNGPNGGPTES--YRADA---SHK--RALAVDQPKAALWLTG 314 (318) T ss_pred ---------eEEEecCCccee-eccccceeeecccCCCCCCCCcchh--hheeh---hee--eeeeeeCcceeEEEee Confidence 466655544321 11111 11223322100 0111110 11111 111 1111111112222222 Done!