Query lcl|NC_011802.1_cdsid_YP_002455891.1 [gene=orf55] [protein=coat protein] [protein_id=YP_002455891.1] [location=29627..30919] Match_columns 430 No_of_seqs 53 out of 63 Neff 5.5 Searched_HMMs 1612 Date Thu Nov 7 13:22:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_55 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_55_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:2106 Length: 430 # 100.0 3E-176 2E-179 983.1 36.5 430 1-430 1-430 (430) 2 protein:vir:100939 Length: 430 100.0 3E-175 2E-178 977.8 36.3 430 1-430 1-430 (430) 3 protein:vir:9265 Length: 430 # 100.0 3E-175 2E-178 977.8 36.3 430 1-430 1-430 (430) 4 protein:vir:108303 Length: 418 100.0 2E-120 1E-123 677.4 32.7 400 1-430 1-417 (418) 5 protein:vir:105522 Length: 423 100.0 4E-118 2E-121 664.3 32.8 399 1-429 1-423 (423) 6 protein:vir:105374 Length: 423 100.0 6E-116 3E-119 652.4 31.2 400 1-429 1-423 (423) 7 protein:vir:174 Length: 423 # 100.0 6E-115 4E-118 646.9 31.4 400 1-429 1-423 (423) 8 protein:vir:3525 Length: 423 # 100.0 2E-114 1E-117 644.5 31.1 401 1-429 1-423 (423) 9 protein:vir:99075 Length: 392 100.0 6E-40 3.7E-43 235.6 23.0 372 1-410 1-392 (392) 10 protein:vir:102605 Length: 273 100.0 2.1E-32 1.3E-35 194.3 19.7 266 1-429 1-273 (273) 11 protein:vir:105822 Length: 273 100.0 2.1E-32 1.3E-35 194.3 19.7 266 1-429 1-273 (273) 12 protein:vir:94622 Length: 341 100.0 5.4E-32 3.3E-35 192.0 16.5 316 1-430 1-339 (341) 13 protein:vir:7990 Length: 273 # 99.9 2.3E-30 1.4E-33 183.1 16.5 267 1-304 1-273 (273) 14 protein:vir:80180 Length: 381 99.9 3E-27 1.9E-30 166.0 17.1 345 1-430 1-381 (381) 15 protein:vir:1541 Length: 347 # 99.8 1.6E-21 1E-24 134.6 16.7 299 1-430 1-345 (347) 16 protein:vir:3364 Length: 347 # 99.8 3.3E-21 2.1E-24 132.8 15.5 298 1-430 1-345 (347) 17 protein:vir:94711 Length: 347 99.8 8.5E-21 5.3E-24 130.6 13.5 303 1-430 1-346 (347) 18 protein:vir:78739 Length: 332 99.7 5.2E-20 3.2E-23 126.3 12.6 291 1-427 19-332 (332) 19 protein:vir:10450 Length: 344 99.7 8.1E-20 5E-23 125.2 12.7 297 1-429 1-344 (344) 20 protein:vir:94576 Length: 347 99.7 1.6E-18 9.7E-22 118.2 13.9 300 1-430 1-347 (347) 21 protein:vir:8885 Length: 347 # 99.7 2.7E-18 1.6E-21 116.9 15.0 300 1-430 1-347 (347) 22 protein:vir:2201 Length: 345 # 99.6 6.6E-18 4.1E-21 114.8 14.3 298 1-429 1-345 (345) 23 protein:vir:93742 Length: 274 99.6 6.8E-17 4.2E-20 109.2 17.6 258 1-430 1-270 (274) 24 protein:vir:80930 Length: 278 99.6 4.5E-17 2.8E-20 110.2 16.0 264 1-430 1-277 (278) 25 protein:vir:96123 Length: 274 99.6 2.1E-16 1.3E-19 106.5 18.3 258 1-430 1-270 (274) 26 protein:vir:1239 Length: 274 # 99.6 2.8E-16 1.7E-19 105.8 17.5 258 1-430 1-270 (274) 27 protein:vir:3613 Length: 272 # 99.6 1.4E-16 8.5E-20 107.6 15.7 260 1-313 1-272 (272) 28 protein:vir:3136 Length: 322 # 99.5 2.3E-16 1.4E-19 106.3 15.8 286 1-306 1-322 (322) 29 protein:vir:100057 Length: 375 99.5 1.2E-15 7.3E-19 102.4 17.9 321 1-430 1-371 (375) 30 protein:vir:99675 Length: 324 99.5 2.4E-16 1.5E-19 106.2 12.5 272 26-430 1-297 (324) 31 protein:vir:103323 Length: 364 99.5 2.2E-15 1.4E-18 100.9 17.0 299 1-430 1-340 (364) 32 protein:vir:94494 Length: 274 99.4 1.2E-14 7.2E-18 97.0 17.0 260 1-324 1-274 (274) 33 protein:vir:97433 Length: 274 99.4 1.2E-14 7.2E-18 97.0 17.0 260 1-324 1-274 (274) 34 protein:vir:96833 Length: 275 99.4 1.6E-14 9.9E-18 96.2 16.8 257 1-430 3-271 (275) 35 protein:vir:95898 Length: 274 99.4 1.5E-14 9.2E-18 96.4 16.6 260 1-308 1-274 (274) 36 protein:vir:96262 Length: 274 99.4 1.5E-14 9.2E-18 96.4 16.6 260 1-308 1-274 (274) 37 protein:vir:9820 Length: 272 # 99.4 8.3E-14 5.1E-17 92.3 19.1 256 1-429 1-272 (272) 38 protein:vir:3033 Length: 272 # 99.4 8.3E-14 5.1E-17 92.3 19.1 256 1-429 1-272 (272) 39 protein:vir:80213 Length: 334 99.4 1.1E-14 7E-18 97.1 13.1 288 1-430 1-333 (334) 40 protein:vir:105334 Length: 276 99.3 1E-12 6.3E-16 86.3 17.7 258 1-430 1-271 (276) 41 protein:vir:102655 Length: 322 99.1 1.2E-11 7.5E-15 80.4 17.4 286 1-430 1-321 (322) 42 protein:vir:739 Length: 231 # 99.1 1.1E-12 6.9E-16 86.1 11.6 225 38-313 1-231 (231) 43 protein:vir:97031 Length: 402 99.1 1.5E-12 9.3E-16 85.4 10.3 290 1-430 1-334 (402) 44 protein:vir:1781 Length: 221 # 98.8 7E-11 4.4E-14 76.2 11.3 203 79-399 1-221 (221) 45 protein:vir:7019 Length: 401 # 98.8 5.5E-11 3.4E-14 76.8 10.1 291 1-430 1-334 (401) 46 protein:vir:78935 Length: 335 98.8 2.4E-10 1.5E-13 73.4 13.4 292 1-358 1-335 (335) 47 protein:vir:6324 Length: 335 # 98.7 9.2E-10 5.7E-13 70.1 13.9 292 1-358 1-335 (335) 48 protein:vir:105645 Length: 400 98.7 3.1E-10 1.9E-13 72.7 10.9 291 1-430 1-334 (400) 49 protein:vir:79008 Length: 299 98.6 1.1E-08 6.8E-12 64.2 17.5 281 1-337 1-299 (299) 50 protein:vir:107120 Length: 329 98.6 3.1E-08 1.9E-11 61.8 18.9 259 1-430 30-306 (329) 51 protein:vir:95107 Length: 270 98.6 9E-09 5.6E-12 64.7 14.9 256 1-320 1-270 (270) 52 protein:vir:97331 Length: 319 98.3 6E-07 3.7E-10 54.7 18.5 287 1-340 19-319 (319) 53 protein:vir:94800 Length: 319 98.3 6E-07 3.7E-10 54.7 18.5 287 1-340 19-319 (319) 54 protein:vir:78920 Length: 290 96.9 0.00025 1.6E-07 40.3 17.1 269 1-340 1-290 (290) 55 protein:vir:79712 Length: 285 96.8 0.00019 1.2E-07 41.0 13.9 272 1-302 1-285 (285) 56 protein:vir:102335 Length: 312 94.8 0.0033 2.1E-06 34.2 16.8 288 1-337 1-312 (312) 57 protein:vir:95875 Length: 401 94.8 0.0035 2.2E-06 34.0 14.2 312 1-430 16-400 (401) 58 protein:vir:105464 Length: 346 91.1 0.018 1.1E-05 30.2 17.5 311 1-356 1-346 (346) 59 protein:vir:4339 Length: 395 # 86.4 0.046 2.8E-05 27.9 17.2 266 1-367 117-395 (395) 60 protein:vir:78090 Length: 302 84.7 0.058 3.6E-05 27.3 12.5 263 1-302 1-302 (302) 61 protein:vir:9927 Length: 295 # 84.3 0.061 3.8E-05 27.2 10.2 278 1-338 1-295 (295) 62 protein:vir:99523 Length: 311 64.9 0.29 0.00018 23.5 13.3 275 1-333 8-311 (311) 63 protein:vir:1638 Length: 298 # 63.7 0.31 0.00019 23.4 16.6 271 1-366 1-298 (298) 64 protein:vir:108211 Length: 318 61.1 0.36 0.00022 23.0 12.0 267 1-315 1-318 (318) 65 protein:vir:102944 Length: 330 57.6 0.43 0.00027 22.6 12.4 281 1-322 1-330 (330) 66 protein:vir:7771 Length: 330 # 55.7 0.47 0.00029 22.4 21.9 288 1-430 1-323 (330) 67 protein:vir:8187 Length: 311 # 50.3 0.61 0.00038 21.7 16.6 281 1-368 1-311 (311) 68 protein:vir:94771 Length: 298 47.8 0.69 0.00043 21.5 16.4 275 1-366 1-298 (298) 69 protein:vir:95451 Length: 313 46.2 0.74 0.00046 21.3 14.6 266 1-306 1-313 (313) 70 protein:vir:5974 Length: 324 # 44.4 0.81 0.0005 21.1 16.1 294 1-358 1-324 (324) 71 protein:vir:2430 Length: 318 # 35.4 1.2 0.00076 20.1 17.6 270 1-430 14-313 (318) 72 protein:vir:101508 Length: 120 33.2 1.4 0.00085 19.8 7.2 109 82-213 1-120 (120) 73 protein:vir:80684 Length: 315 30.4 1.6 0.00098 19.5 18.1 289 1-373 1-315 (315) 74 protein:vir:3426 Length: 117 # 29.0 1.3 0.00081 19.9 4.8 111 170-340 1-117 (117) 75 protein:vir:78523 Length: 338 28.4 1.8 0.0011 19.2 18.8 291 1-430 10-335 (338) 76 protein:vir:104085 Length: 320 27.8 1.8 0.0011 19.2 18.3 281 1-430 1-317 (320) 77 protein:vir:6212 Length: 434 # 27.8 1.8 0.0011 19.2 14.3 274 1-350 141-434 (434) 78 protein:vir:9704 Length: 394 # 25.8 2 0.0012 18.9 13.5 239 1-315 131-394 (394) 79 protein:vir:9875 Length: 296 # 23.7 2.3 0.0014 18.6 10.0 266 1-334 1-296 (296) 80 protein:vir:1886 Length: 385 # 23.6 2.3 0.0014 18.6 17.1 266 1-368 105-385 (385) 81 protein:vir:191 Length: 385 # 23.6 2.3 0.0014 18.6 17.1 266 1-368 105-385 (385) 82 protein:vir:3870 Length: 400 # 22.0 2.5 0.0016 18.4 16.6 255 1-368 137-400 (400) 83 protein:vir:7449 Length: 123 # 21.0 2.7 0.0017 18.2 8.7 100 82-200 1-123 (123) No 1 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=100.00 E-value=2.7e-176 Score=983.08 Aligned_cols=430 Identities=98% Similarity=1.384 Sum_probs=424.0 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCcCccccceeEEEec Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~~di~e~sv~v~ld 80 (430) ||+|+|++|||++||+|+.|||+||||++|++||||+++|+|+|||||||+|+++++++|++++++++|++|++||++|| T Consensus 1 Ma~~~~~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t~~~~~~~e~~v~~~~~ 80 (430) T protein:vir:21 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccccchhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccccccccCCCccceeeeEeEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhcC Q lcl|NC_011802. 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~f~~t~keL~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++++++||+|||||++|||+||+||+++++++++||.+.+.+.++.+.+.|+++++++++|++++| T Consensus 81 ~~~~V~~~~~~kEl~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:21 81 EPDNDFFQLRADDLRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIMFSREL 160 (430) T ss_pred eeccceEEeehhHhcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999987666677777899999999999999999 Q ss_pred CCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccccccccccce Q lcl|NC_011802. 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~ 240 (430) |++++|++||||+++++|+++|++++++++.+++|||+|+|||+|+||||+|+|+++|+|++|+++++||+||+|+|+++ T Consensus 161 P~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:21 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred CCCCCcEEEeChHHHHHHhhhhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeecccccccccc Q lcl|NC_011802. 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~ 320 (430) ++|+++|+++++|++..++++|+||+||+||+|||+|||+|||+|||+||++|||+|++++++++|+|||+|+|++++++ T Consensus 241 ~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~ 320 (430) T protein:vir:21 241 WQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeecccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeeccc Q lcl|NC_011802. 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~ 400 (430) +++++|||||||+||+|++|||+|++++++||+||||||+|+|||||+|+|+++++.++++++|++|||+|++||||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:21 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 401 LFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ++.+||||+|||+++|||||+||+|+||+| T Consensus 401 ~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEEeecCccccCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 2 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=100.00 E-value=2.6e-175 Score=977.76 Aligned_cols=430 Identities=99% Similarity=1.391 Sum_probs=423.7 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCcCccccceeEEEec Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~~di~e~sv~v~ld 80 (430) ||+|++++||+.++|+|+.|||+|||++++++||||+++|+|+|||||||+|+++++++|++++++++|++|++||++|| T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhcC Q lcl|NC_011802. 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~f~~t~keL~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++++++||+|||||++|||+||+||+++++++++||.+.+...++...+.|+|+++++++|++++| T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:10 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999987666677777899999999999999999 Q ss_pred CCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccccccccccce Q lcl|NC_011802. 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~ 240 (430) |++++|++||||+++++|+++|++|+++++.+++|||+|+|||+|+||||+|+|+++|+|++|+++++|||||+|+++++ T Consensus 161 P~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:10 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred CCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeecccccccccc Q lcl|NC_011802. 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~ 320 (430) ++|+++|+++++|++.+++++|+||+||+||+|||+|||+|||+|||+++++|||+||+++++++|+|||+|+|++++++ T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:10 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeeccc Q lcl|NC_011802. 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~ 400 (430) +++++|||||+++||+|++|||+|++++++||+||||||+|+|||||+|+|+++++.++++++|++||++|+++|||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:10 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 401 LFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ++.+||||+|||+++|||||+||+|+||+| T Consensus 401 ~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEeeeccceecCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 3 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=100.00 E-value=2.6e-175 Score=977.76 Aligned_cols=430 Identities=99% Similarity=1.391 Sum_probs=423.7 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCcCccccceeEEEec Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~~di~e~sv~v~ld 80 (430) ||+|++++||+.++|+|+.|||+|||++++++||||+++|+|+|||||||+|+++++++|++++++++|++|++||++|| T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhcC Q lcl|NC_011802. 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~f~~t~keL~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++++++||+|||||++|||+||+||+++++++++||.+.+...++...+.|+|+++++++|++++| T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:92 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999987666677777899999999999999999 Q ss_pred CCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccccccccccce Q lcl|NC_011802. 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~ 240 (430) |++++|++||||+++++|+++|++|+++++.+++|||+|+|||+|+||||+|+|+++|+|++|+++++|||||+|+++++ T Consensus 161 P~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:92 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred CCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeecccccccccc Q lcl|NC_011802. 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~ 320 (430) ++|+++|+++++|++.+++++|+||+||+||+|||+|||+|||+|||+++++|||+||+++++++|+|||+|+|++++++ T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:92 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeeccc Q lcl|NC_011802. 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~ 400 (430) +++++|||||+++||+|++|||+|++++++||+||||||+|+|||||+|+|+++++.++++++|++||++|+++|||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:92 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 401 LFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ++.+||||+|||+++|||||+||+|+||+| T Consensus 401 ~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEeeeccceecCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 4 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.5e-120 Score=677.43 Aligned_cols=400 Identities=20% Similarity=0.214 Sum_probs=349.3 Q ss_pred Ccccccchhh--hhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCcCccccceeEEE Q lcl|NC_011802. 1 MALNEGQIVT--LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 ma~~~~~~~t--~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~~di~e~sv~v~ 78 (430) ||+++|+||| +..+|+|+.||++|||+++| ||+|+.||.|+||||+||+|..+++++|.+. ++++++|++++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv--~r~y~~e~~~~GDTV~I~vp~~~~v~dg~~~--~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCV--YRNYEKTFGKVGDTIRLKLPYRVKSASGRTL--VKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhh--cCCCchHHhhCCCEEEEeeCCceeecccCCc--cccccccceEEEE Confidence 9999999999 55599999999999999999 9999999999999999999999999999876 4679999999999 Q ss_pred eccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHH Q lcl|NC_011802. 79 MGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMF 156 (430) Q Consensus 79 ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~ 156 (430) ||++|++.|.++++| +++.++.++++++|+++||++||.+|++++...++.+++ +++ ....|+++++++++|+ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt-~gt----~~~~~~~i~~a~~~Ld 151 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGT-PGV----RPGAFIDFANAGAKQT 151 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-CCc----CcchHHHHHHHHHHHH Confidence 999999999999999 679999999999999999999999999999888766654 233 2256999999999999 Q ss_pred hhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccc-cccccccccc Q lcl|NC_011802. 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA-TGITVSGAQS 235 (430) Q Consensus 157 ~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~-t~~tv~gA~~ 235 (430) +++||++++|.+|++|+.++.|++++..+++ +...+++||+|+||| ++||+ +|+++++|.|++|++ ++.+|+|+.+ T Consensus 152 ~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~-~~~~~~~lr~G~IG~-i~GF~-V~~S~nip~~tag~~~~t~~v~ga~~ 228 (418) T protein:vir:10 152 TYAVPQDGMRHAVLDPFTCASLSDEVTKLFK-ESMVEQAYKMGYRGN-VAAYE-VYESQNLPKHTVGDHGGTPLVNGTVV 228 (418) T ss_pred hcCCCCCCceEEEeCHHHHHHHhhhcccccc-ccccchhhheeeeee-eeceE-EEEecCCCcccccccccceeeecccc Confidence 9999996679999999999999999888765 446778999999998 89998 577899999999974 4578999976 Q ss_pred cccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeecc-----CceeEEee Q lcl|NC_011802. 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD-----GTHVEITP 310 (430) Q Consensus 236 ~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~-----a~tv~I~P 310 (430) .+ .++..++. +.+.+|+|++||+|||+||++||++|||+++.+|||+|+++++ +++|+||| T Consensus 229 ~~---~~~~~~~~-----------t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p 294 (418) T protein:vir:10 229 NG---DTVGFDGG-----------TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISP 294 (418) T ss_pred cc---eeEEEeec-----------ceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecc Confidence 33 22221221 4566799999999999999999999999999999999999863 46899999 Q ss_pred ccccccccccc-----cccccccccccccccCceeEEeccCCc--ccceeeccceeeEEeeccccCCCcchheeeEEEec Q lcl|NC_011802. 311 KPVALDDVSLS-----PEQRAYANVNTSLADAMAVNILNVKDA--RTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSI 383 (430) Q Consensus 311 ai~~~~~~~~~-----~~~~~~~nVta~~A~~aavTv~~~~s~--~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~ 383 (430) +|++...+... ...++|+||+++||++++|||+|++++ ++||+||||||+|+||||++|+|.... .+..+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~---~~~~~ 371 (418) T protein:vir:10 295 SLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIK---SRAAD 371 (418) T ss_pred ccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcc---eEEEe Confidence 99765544422 345799999999999999999998655 689999999999999999999886432 34556 Q ss_pred CcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 384 PDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 384 p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |.+|+|+|++++||+++++.+||||+|||+++|||||+ |+|.||-+ T Consensus 372 ~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~-~~~~g~~~ 417 (418) T protein:vir:10 372 PETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELA-LRLWGAAS 417 (418) T ss_pred ccCCeEEEEEEcccccccceEEEEEeecCceeecccce-EEEEeecC Confidence 77899999999999999999999999999999999995 89999999 No 5 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=3.8e-118 Score=664.33 Aligned_cols=399 Identities=17% Similarity=0.213 Sum_probs=339.7 Q ss_pred Cccccc-chhhhhHHHHHHHHhhhcccchhcccCCCchhhh--hccCcEEEEecCcccccccCC--cccCC-cCccccce Q lcl|NC_011802. 1 MALNEG-QIVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEGW--DLTDK-ATGLLELN 74 (430) Q Consensus 1 ma~~~~-~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~--~k~GdTV~i~~P~~~~~~~g~--~~s~~-~~di~e~s 74 (430) ||++.. -+.++..+|+|+.||++|||+++| +|+|+.|| +|+||||+||+|..++.+++. +.+.+ +++++|++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV--~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~ 78 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTV--DRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAK 78 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhh--ccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccce Confidence 998864 567777899999999999999999 89998886 689999999999999888764 34444 57999999 Q ss_pred eEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~ 152 (430) |+++||++|++.|+|+++|++ +.++ +|+|+|||++||++||++|+..+...++++.+++++.+ +.|+++++++ T Consensus 79 v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~----~a~~~~a~a~ 153 (423) T protein:vir:10 79 ATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTPI----KKWSDVAQTA 153 (423) T ss_pred EEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc----ccHHHHHHHH Confidence 999999999999999999964 5555 89999999999999999999777777777776655533 5699999999 Q ss_pred HHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccc-cccchhhhhhhhcCCcccccCcccc-cccc Q lcl|NC_011802. 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTAT-GITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~v~~~t~gt~t-~~tv 230 (430) ++|+++++|+ ++|.+|++|+.+++|++++..+++.++..+++||+|+| || ++|||+ |+|+++|.||+|+.+ ..++ T Consensus 154 ~~L~~~~vP~-~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~-~~GFdi-~~Sn~vp~~T~g~~~ga~~~ 230 (423) T protein:vir:10 154 SFLKDLGINS-GENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGN-FGGIRA-LMSNGLASRTQGAFGGKLTV 230 (423) T ss_pred HHHhhccCCc-CCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhccccee-ecceEE-EEecCCcccccccccceeee Confidence 9999999999 48999999999999999999999999999999999998 65 899986 678999999999865 5666 Q ss_pred ccccccccceeeeeeccccc--cccceeeeEEeeccceeecccEEEEcceeeecccccc-----cccCcceEEEEeecc- Q lcl|NC_011802. 231 SGAQSFKPVAWQLDNDGNKV--NVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRVVD- 302 (430) Q Consensus 231 ~gA~~~~~~~~~v~~~g~~~--~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~-----~~~~l~~fvVta~~~- 302 (430) +|+.. ++.++.+. ..++...+++.+.+|+||+||+|||+|||+|||+||| +++++|||+|++++. T Consensus 231 ~~~~~-------vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~ 303 (423) T protein:vir:10 231 KGTPE-------VNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANA 303 (423) T ss_pred eeeeE-------EEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccc Confidence 66543 22222222 3445556667777899999999999999999999999 579999999998752 Q ss_pred ----CceeEEeeccccccccccccccccccccccccccCceeEEeccCCc--ccceeeccceeeEEeeccccCCCcchhe Q lcl|NC_011802. 303 ----GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDA--RTNVFWADDAIRIVSQPIPANHELFAGM 376 (430) Q Consensus 303 ----a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~--~~Nl~Fhr~A~aLat~pl~~p~g~~~~~ 376 (430) +++|+|||+|+++ ..+++|+||+|+||++++|||+|++++ ++||+|||+||+|+|+|||+|++.+++. T Consensus 304 ~a~~~~tv~i~p~~~~~------~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~ 377 (423) T protein:vir:10 304 HSSGDVTVKISGVPIFD------AGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSAV 377 (423) T ss_pred cccCceEEEeccccccc------cCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCccceee Confidence 4589999999863 357899999999999999999999876 5899999999999999999875544332 Q ss_pred eeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 377 KTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 377 ~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) ++..|+|+|++++||+++++.+||||+|||+++|||||++ +|.|-- T Consensus 378 ------~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:10 378 ------ATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGG-QFFGNP 423 (423) T ss_pred ------cccccceEEEEEeeeccccceEEEEEeecceeeeccceEE-EEEecC Confidence 2334999999999999999999999999999999999985 555555 No 6 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=5.6e-116 Score=652.42 Aligned_cols=400 Identities=16% Similarity=0.194 Sum_probs=333.1 Q ss_pred Cccc-ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhh--hccCcEEEEecCcccccccCC--ccc-CCcCccccce Q lcl|NC_011802. 1 MALN-EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEGW--DLT-DKATGLLELN 74 (430) Q Consensus 1 ma~~-~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~--~k~GdTV~i~~P~~~~~~~g~--~~s-~~~~di~e~s 74 (430) ||+| +-.+-++..+|+|+.||++|||+++| +|+|+.|| +|+||||+||+|..++..+.. +++ .+++|++|++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lV--nr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~ 78 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhh--cccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccce Confidence 9977 34445777799999999999999999 89999887 689999999999988877653 433 4689999999 Q ss_pred eEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~ 152 (430) ++++||++|++.|+|+++|+. +.++ +|+|+|||++||++||.+|+++++..++++.+++++.+ ..|+++++++ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~----~a~~~i~~a~ 153 (423) T protein:vir:10 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPI----TKWSDVAQTA 153 (423) T ss_pred eEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCccc----chHHHHHHHH Confidence 999999999999999999964 4554 89999999999999999999999998888877655543 5699999999 Q ss_pred HHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccc-cccc Q lcl|NC_011802. 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG-ITVS 231 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~-~tv~ 231 (430) ++|++++||++ +|++|++|+.+++|+++...+++.++..+++||+|+|.++++|||+ |+|+++|.||+|++++ .++. T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv-~~Snnip~~T~gt~~~t~~~~ 231 (423) T protein:vir:10 154 SFLKDLGVNEG-ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRA-LMSNGLASRTQGAFGGTLTVK 231 (423) T ss_pred HHHHhccCCcC-CCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEE-EEeCCCccccccccccceeee Confidence 99999999994 8999999999999999999999999999999999998334899985 7789999999998643 2222 Q ss_pred cccccccceeeeeecccccccccee--eeEEeeccceeecccEEEEcceeeecccccc-----cccCcceEEEEeecc-- Q lcl|NC_011802. 232 GAQSFKPVAWQLDNDGNKVNVDNRF--ATVTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRVVD-- 302 (430) Q Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~--~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~-----~~~~l~~fvVta~~~-- 302 (430) .+. ++...+.+.+.+... ...+++.+++||+||+|||+|||+|||+||| +++++|||+|++++. T Consensus 232 ~~~-------~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~ 304 (423) T protein:vir:10 232 TQP-------TVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSD 304 (423) T ss_pred ecc-------eeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeec Confidence 221 222222111211111 2234566899999999999999999999999 779999999999873 Q ss_pred ---CceeEEeeccccccccccccccccccccccccccCceeEEeccCCc--ccceeeccceeeEEeeccccCCCcchhee Q lcl|NC_011802. 303 ---GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDA--RTNVFWADDAIRIVSQPIPANHELFAGMK 377 (430) Q Consensus 303 ---a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~--~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~ 377 (430) +++|+|+|+|+++ ..+++|+||+++||++++|||+|++++ ++||+||||||+|+|||||+|++.+++ T Consensus 305 ~~g~~tv~i~p~~i~~------~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~-- 376 (423) T protein:vir:10 305 SGGDVTVTLSGVPIYD------TTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSA-- 376 (423) T ss_pred cCCceeeeccCccccc------cCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCcccee-- Confidence 3589999999875 357899999999999999999999876 489999999999999999987544333 Q ss_pred eEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 378 TTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 378 ~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) .++..|+|+|++++||+++++.+||||+|||+++|||||++ +|.|-- T Consensus 377 ----~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:10 377 ----VATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGG-QFFGNP 423 (423) T ss_pred ----eccccCceEEEEEeeeccccceEEEEEeecceeeeccceEE-EEEecC Confidence 22334999999999999999999999999999999999985 555555 No 7 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=5.7e-115 Score=646.91 Aligned_cols=400 Identities=16% Similarity=0.194 Sum_probs=333.6 Q ss_pred Cccc-ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhh--hccCcEEEEecCcccccccC--CcccC-CcCccccce Q lcl|NC_011802. 1 MALN-EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEG--WDLTD-KATGLLELN 74 (430) Q Consensus 1 ma~~-~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~--~k~GdTV~i~~P~~~~~~~g--~~~s~-~~~di~e~s 74 (430) ||+| +..+-++..+|+|+.||++|||+++| +|+|+.|+ +|+||||+||+|..++..+. ++++. ++++++|++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lV--nr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~ 78 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhh--cccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccce Confidence 9977 34445777799999999999999999 89999887 68999999999988777654 44444 579999999 Q ss_pred eEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~ 152 (430) ++++||++|++.|+|+++|+. +.++ +|+|+|||++||++||.+|+++++..++++.+++++.+ +.|+++++++ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~----~a~~~i~~a~ 153 (423) T protein:vir:17 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPI----TKWSDVAQTA 153 (423) T ss_pred eEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCccc----ccHHHHHHHH Confidence 999999999999999999964 5555 89999999999999999999999898888777666543 4699999999 Q ss_pred HHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccc-ccccc Q lcl|NC_011802. 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT-GITVS 231 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t-~~tv~ 231 (430) ++|++++||++ +|.+|++|+.+++|+++...+++.++..+++||+|+|.++++|||+ |+|+++|.||+|+.+ +.++. T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv-y~Snnip~~T~gt~~~t~~~~ 231 (423) T protein:vir:17 154 SFLKDLGVNEG-ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRA-LMSNGLASRTQGAFGGTLTVK 231 (423) T ss_pred HHHHhccCCcC-CCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEE-EEeCCCccccccceeceeeec Confidence 99999999994 8999999999999999999999999999999999999434899985 778999999999854 33333 Q ss_pred cccccccceeeeeecccccccc--ceeeeEEeeccceeecccEEEEcceeeecccccc-----cccCcceEEEEeecc-- Q lcl|NC_011802. 232 GAQSFKPVAWQLDNDGNKVNVD--NRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRVVD-- 302 (430) Q Consensus 232 gA~~~~~~~~~v~~~g~~~~~d--~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~-----~~~~l~~fvVta~~~-- 302 (430) .+.+ +...+.+.... ......+.+.+++|++||+|||+|||+|||+||+ +++++|||+|+++++ T Consensus 232 ~~~~-------v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~ 304 (423) T protein:vir:17 232 TQPT-------VTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSD 304 (423) T ss_pred cccc-------ccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEeccccc Confidence 3322 11111111111 1123345677899999999999999999999999 669999999999773 Q ss_pred ---CceeEEeeccccccccccccccccccccccccccCceeEEeccCCc--ccceeeccceeeEEeeccccCCCcchhee Q lcl|NC_011802. 303 ---GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDA--RTNVFWADDAIRIVSQPIPANHELFAGMK 377 (430) Q Consensus 303 ---a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~--~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~ 377 (430) +++|+|||+|+++ ..+.+|+||+++||++++|||+|++++ ++||+||||||+|+|||||+|++.+++ T Consensus 305 a~~~~tv~i~p~~i~~------~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~-- 376 (423) T protein:vir:17 305 SSGDVTVTLSGVPIYD------TTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSA-- 376 (423) T ss_pred ccCceEEEecCccccc------cCCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCcccee-- Confidence 4589999999975 356889999999999999999999877 489999999999999999977543332 Q ss_pred eEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 378 TTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 378 ~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) +++..|||+|++++||+++++.+||||+|||+++|||||++ +|.|-- T Consensus 377 ----~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:17 377 ----VATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGG-QFFGNP 423 (423) T ss_pred ----ecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEE-EEEecC Confidence 23345999999999999999999999999999999999985 555555 No 8 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=1.6e-114 Score=644.47 Aligned_cols=401 Identities=16% Similarity=0.197 Sum_probs=333.0 Q ss_pred Cccc-ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhh--hccCcEEEEecCcccccccCCc---ccCCcCccccce Q lcl|NC_011802. 1 MALN-EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEGWD---LTDKATGLLELN 74 (430) Q Consensus 1 ma~~-~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~--~k~GdTV~i~~P~~~~~~~g~~---~s~~~~di~e~s 74 (430) ||+| +..+-++..+|+|+.||++|||+++| +|+|+.|+ +|+||||+||+|..++++++.. ...++++++|.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV--~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~ 78 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTV--DRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhc--ccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccce Confidence 9966 33345666799999999999999999 89998887 6999999999999998887743 344678999999 Q ss_pred eEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~ 152 (430) |+++||++|++.|+|+++|+ ++.++ +|+|+|+|++||++||.+|++.++...++..+++++. ...|+++++++ T Consensus 79 v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~----~~~~~~i~~a~ 153 (423) T protein:vir:35 79 ATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGSPNTA----IKKWADVAQTA 153 (423) T ss_pred eeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC----cchHHHHHHHH Confidence 99999999999999999996 45665 8999999999999999999997777666666554443 25699999999 Q ss_pred HHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccc-cccchhhhhhhhcCCcccccCccc-ccccc Q lcl|NC_011802. 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTA-TGITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~v~~~t~gt~-t~~tv 230 (430) ++|++++||++ +|++|++|+.++.|++++..+++.++..+++||+|+| |+ ++|||+ |+|+++|.||+|+. ++.++ T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~-i~GFdv-~~Snnvp~~T~gt~~~~~~v 230 (423) T protein:vir:35 154 SFIKDIGIKTG-ENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGN-FGGIRA-LMSNGLASRKQGDFDGAITV 230 (423) T ss_pred HHHHHhcCCcC-CCEEEeCHHHHHHHhccccceeccccchhHHHhhccceee-ecceEE-EEcCCCccccccccccceee Confidence 99999999994 8999999999999999999999999999999999998 65 899985 67899999999985 44555 Q ss_pred ccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccccc-----ccCcceEEEEeec---- Q lcl|NC_011802. 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV-----LAQDATFSVVRVV---- 301 (430) Q Consensus 231 ~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~-----~~~l~~fvVta~~---- 301 (430) +++... ..+.. .+......+ ....+++.+++|++||+|||+|||+|||+||++ ++++|||+|+++. T Consensus 231 ~~a~~v-~~~a~---~~~~~~~~~-~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a 305 (423) T protein:vir:35 231 KTAPNV-DYLSV---KDSYQFTVA-LTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTA 305 (423) T ss_pred cccccc-ccccc---cccccceee-eeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccccccc Confidence 555421 11111 111111111 233467788999999999999999999999994 7999999999876 Q ss_pred -cCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcc--cceeeccceeeEEeeccccCCCcchheee Q lcl|NC_011802. 302 -DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDAR--TNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 302 -~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~--~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~ 378 (430) .+++|+|||+|+++ ....+|+||+|+||++++|||+|+++++ +||+||||||+|+|+|||+|++.+++... T Consensus 306 ~g~~~v~i~p~~~~~------~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~~~~~ 379 (423) T protein:vir:35 306 SGDVTVKLSGVPIYD------EKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHSLDSAVAT 379 (423) T ss_pred cCceeEEcccccccc------CCCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccCCccceeecc Confidence 24689999999875 2457899999999999999999998774 89999999999999999988554443222 Q ss_pred EEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 379 TSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 379 ~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) ..|+|+|++++||+++++.+||||+|||+++|||||++ +|.|-- T Consensus 380 ------~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:35 380 ------YEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGG-QFFGNP 423 (423) T ss_pred ------ccCceEEEEEeeccccCceEEEEEeecceeeecccceE-EEEecC Confidence 34999999999999999999999999999999999985 555555 No 9 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=6e-40 Score=235.62 Aligned_cols=372 Identities=13% Similarity=0.081 Sum_probs=213.4 Q ss_pred Ccccccchhh--hhHHHHHHHHhhhcccchhcccCCCchhhhh-ccCcEEEEecCccccccc------CCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVT--LAVDEIIETISAITPMAQKAKKYTPPAASMQ-RSSNTIWMPVEQESPTQE------GWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t--~~~~evi~~len~lvmA~~V~~~r~~~~~~~-k~GdTV~i~~P~~~~~~~------g~~~s~~~~di~ 71 (430) ||++ +++ +..+|+|+.|+++|||+++| ||+|+.||. |+||||+||+|..++..+ +.....+++++. T Consensus 1 Ma~~---~~~p~~~a~~~l~~l~~~lv~~~lv--~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) T protein:vir:99 1 MANA---FSKPTAVVDTAIQMLQNELILTNLV--WLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFT 75 (392) T ss_pred Cccc---cccHHHHHHHHHHHHHhhccchhhh--ccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccc Confidence 9954 333 44489999999999999999 999999985 789999999998876553 334455678999 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) +.+++++||++|.+.|.++++|+ .+.++.++++++++++||++||.+|++++......+.+.. ...+....++.+. T Consensus 76 ~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~--~~~~~~~~~~~i~ 153 (392) T protein:vir:99 76 EDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAV--HEVAPDEFFKGVN 153 (392) T ss_pred cceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc--cccChhhhHHHHH Confidence 99999999999999999999995 5789999999999999999999999999887665544321 1223345688999 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHh--hhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcc--c Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKST--A 225 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~--~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt--~ 225 (430) ++++.|+++++|. +|+++++|+.++.|+.+ +...++.+....+++|+|.||+ +.||++ |.++++|.++.-. . T Consensus 154 ~a~~~L~~~~vP~--~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~-i~G~~v-~~s~~~~~~t~~a~~~ 229 (392) T protein:vir:99 154 GARRALNELYIPQ--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYEI-VESTLIPHGDAYLYHP 229 (392) T ss_pred HHHHHHhhcCCCC--CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeee-eeeeEE-Eeecccccccceeeec Confidence 9999999999996 49999999999998754 3444455555667899999997 889985 6788888765311 1 Q ss_pred cccc-cccccccccceeeeeeccccccccc-eeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEee--c Q lcl|NC_011802. 226 TGIT-VSGAQSFKPVAWQLDNDGNKVNVDN-RFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRV--V 301 (430) Q Consensus 226 t~~t-v~gA~~~~~~~~~v~~~g~~~~~d~-~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~--~ 301 (430) +..+ ..+++.. +.+.. .+.....++ .....+....++ .+.|.+++..+....-.++..... |..... . T Consensus 230 ~a~~~at~a~v~-~~~~~---~~~s~s~~~~v~~~~~~~~~~t-~~s~~~~v~~~~g~~~v~~~~~~~---~~~~~~~~~ 301 (392) T protein:vir:99 230 TAFIMATRAPAP-PMGAV---RSTAISGDQRIAMRWLVDYDST-ITSNRSLIDTYFGLKVVEDPNGVG---FVRARKIHL 301 (392) T ss_pred cccccccccccc-ccccc---ceeEEecccceecceeecccce-eeccccccceeEEEEEEeeccccc---eeeeeeeee Confidence 1111 1111100 00000 000000000 000111111111 123444444333222222221111 111110 1 Q ss_pred cCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEE Q lcl|NC_011802. 302 DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSF 381 (430) Q Consensus 302 ~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~ 381 (430) ...++++.|....................+..|.++.. .++++-|.-+-=..|++.- .|.--+.. T Consensus 302 ~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~--------~~~~vtw~Ssn~~vAtV~~---~G~Vt~v~---- 366 (392) T protein:vir:99 302 IPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDD--------VTALCDFESSATDKATVAA---GGLVTGVA---- 366 (392) T ss_pred ecceeeeeeeecccceeEeeeccceeEEEEEEecCCcc--------ccceEEEEEcCCeeEEEcC---CceEEEEe---- Confidence 12233333321111111111111111222223332221 1234556655556666652 11111111 Q ss_pred ecCcceEE-EEEEEeeecccceeEEEEEee Q lcl|NC_011802. 382 SIPDVGLN-GIFRTQGDISTLFRLCRIALW 410 (430) Q Consensus 382 ~~p~~Gls-lrv~~~yd~~~~~~~~riDvL 410 (430) -|-. |.+.....-......|.+.|+ T Consensus 367 ----~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 367 ----AGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred ----cceEEEEEEEEcCCCcEEEEEEEEeC Confidence 1222 222211111234566777777 No 10 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.96 E-value=2.1e-32 Score=194.26 Aligned_cols=266 Identities=18% Similarity=0.166 Sum_probs=189.8 Q ss_pred Ccccccchh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccccc--CCcccCCcCccccceeEE Q lcl|NC_011802. 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 ma~~~~~~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~--g~~~s~~~~di~e~sv~v 77 (430) ||++ +++ ++.-+++++.|++.++|+++| +|+|+.++ +.||||.||+|......+ +......++++.+..+++ T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 9995 433 556699999999999999999 89998886 679999999998876654 223344567899999999 Q ss_pred EeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHH Q lcl|NC_011802. 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELM 155 (430) Q Consensus 78 ~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L 155 (430) +||+++.+.|.+++.|. ...++ +.++++++.+||++||.+++.++.....-+. ++.+.+....++.+.++++.| T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~---~~~~~~~~~~~~~i~~a~~~l 151 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPTDADDAFDLIAKALKEL 151 (273) T ss_pred EEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc---cccccchhHHHHHHHHHHHHh Confidence 99999999999997663 44444 6789999999999999999998876544332 223334445678899999999 Q ss_pred HhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccch-hhHHHHhccccccchhhhhhhhcCCcccccCcccccccccccc Q lcl|NC_011802. 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) Q Consensus 156 ~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~ 234 (430) ++..||. ++|.++++|+.+..|...-..+.+.... ....+|+|.||+ +.||++ ++++++|.+... T Consensus 152 d~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-i~G~~v-~~s~~lp~~~~~----------- 217 (273) T protein:vir:10 152 TKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDE----------- 217 (273) T ss_pred hhcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE-EeceEE-EEecccccCCcc----------- Confidence 9999999 5899999999999987643323333322 345799999998 899975 566777641000 Q ss_pred ccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeecccc Q lcl|NC_011802. 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVA 314 (430) Q Consensus 235 ~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~ 314 (430) + T Consensus 218 ---------------------------------------------------------------~---------------- 218 (273) T protein:vir:10 218 ---------------------------------------------------------------Q---------------- 218 (273) T ss_pred ---------------------------------------------------------------E---------------- Confidence 0 Q ss_pred ccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEE Q lcl|NC_011802. 315 LDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRT 394 (430) Q Consensus 315 ~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~ 394 (430) + ++||++||.++.+-... ....++. . T Consensus 219 ---------------------------~---------~~~~~~A~~~a~q~~~~----------e~~r~~~-~------- 244 (273) T protein:vir:10 219 ---------------------------F---------VAFHPSAAAYVSQIDTV----------EALRDQD-S------- 244 (273) T ss_pred ---------------------------E---------EEEeccceeeeeeeehh----------hcccCCC-c------- Confidence 0 56889999888764321 1111111 1 Q ss_pred eeecccceeEEEEEeeccceecCcceeEEecC-CCC Q lcl|NC_011802. 395 QGDISTLFRLCRIALWYGVNATRPEAIGVGLP-GQT 429 (430) Q Consensus 395 ~yd~~~~~~~~riDvLyG~~~v~PElagv~l~-~q~ 429 (430) --...+-+..||++++|||-. |+|. .=. T Consensus 245 ------~~~~v~~~~~yg~~v~~~~~~-~~l~~~g~ 273 (273) T protein:vir:10 245 ------FSDRIRALHVYGGKVVRPTGV-VVFNKTGS 273 (273) T ss_pred ------ceeeeeeeeeeeeeEeccceE-EEEeccCC Confidence 111223456799999999965 3442 111 No 11 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.96 E-value=2.1e-32 Score=194.26 Aligned_cols=266 Identities=18% Similarity=0.166 Sum_probs=189.8 Q ss_pred Ccccccchh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccccc--CCcccCCcCccccceeEE Q lcl|NC_011802. 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 ma~~~~~~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~--g~~~s~~~~di~e~sv~v 77 (430) ||++ +++ ++.-+++++.|++.++|+++| +|+|+.++ +.||||.||+|......+ +......++++.+..+++ T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 9995 433 556699999999999999999 89998886 679999999998876654 223344567899999999 Q ss_pred EeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHH Q lcl|NC_011802. 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELM 155 (430) Q Consensus 78 ~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L 155 (430) +||+++.+.|.+++.|. ...++ +.++++++.+||++||.+++.++.....-+. ++.+.+....++.+.++++.| T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~---~~~~~~~~~~~~~i~~a~~~l 151 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPTDADDAFDLIAKALKEL 151 (273) T ss_pred EEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc---cccccchhHHHHHHHHHHHHh Confidence 99999999999997663 44444 6789999999999999999998876544332 223334445678899999999 Q ss_pred HhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccch-hhHHHHhccccccchhhhhhhhcCCcccccCcccccccccccc Q lcl|NC_011802. 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) Q Consensus 156 ~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~ 234 (430) ++..||. ++|.++++|+.+..|...-..+.+.... ....+|+|.||+ +.||++ ++++++|.+... T Consensus 152 d~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-i~G~~v-~~s~~lp~~~~~----------- 217 (273) T protein:vir:10 152 TKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDE----------- 217 (273) T ss_pred hhcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE-EeceEE-EEecccccCCcc----------- Confidence 9999999 5899999999999987643323333322 345799999998 899975 566777641000 Q ss_pred ccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeecccc Q lcl|NC_011802. 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVA 314 (430) Q Consensus 235 ~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~ 314 (430) + T Consensus 218 ---------------------------------------------------------------~---------------- 218 (273) T protein:vir:10 218 ---------------------------------------------------------------Q---------------- 218 (273) T ss_pred ---------------------------------------------------------------E---------------- Confidence 0 Q ss_pred ccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEE Q lcl|NC_011802. 315 LDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRT 394 (430) Q Consensus 315 ~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~ 394 (430) + ++||++||.++.+-... ....++. . T Consensus 219 ---------------------------~---------~~~~~~A~~~a~q~~~~----------e~~r~~~-~------- 244 (273) T protein:vir:10 219 ---------------------------F---------VAFHPSAAAYVSQIDTV----------EALRDQD-S------- 244 (273) T ss_pred ---------------------------E---------EEEeccceeeeeeeehh----------hcccCCC-c------- Confidence 0 56889999888764321 1111111 1 Q ss_pred eeecccceeEEEEEeeccceecCcceeEEecC-CCC Q lcl|NC_011802. 395 QGDISTLFRLCRIALWYGVNATRPEAIGVGLP-GQT 429 (430) Q Consensus 395 ~yd~~~~~~~~riDvLyG~~~v~PElagv~l~-~q~ 429 (430) --...+-+..||++++|||-. |+|. .=. T Consensus 245 ------~~~~v~~~~~yg~~v~~~~~~-~~l~~~g~ 273 (273) T protein:vir:10 245 ------FSDRIRALHVYGGKVVRPTGV-VVFNKTGS 273 (273) T ss_pred ------ceeeeeeeeeeeeeEeccceE-EEEeccCC Confidence 111223456799999999965 3442 111 No 12 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.96 E-value=5.4e-32 Score=192.02 Aligned_cols=316 Identities=12% Similarity=0.043 Sum_probs=196.0 Q ss_pred Cccc-------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCC-cccCC Q lcl|NC_011802. 1 MALN-------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGW-DLTDK 66 (430) Q Consensus 1 ma~~-------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~-~~s~~ 66 (430) ||.- .+-+-++.-+|+++.|+.+++++.++ |+|+.++ +.||||+||++...++.+-. ..... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~---~d~~~~~-~~Gdtv~ip~~g~~~~~d~~~~~~i~ 76 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV---KTWGAQV-KKGDTFHVPRISELGVEDKATDVPVG 76 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc---ccccccc-cCCceEEEeccCcceeeeecCCCccc Confidence 5543 11222444588899999999999988 6776655 45999999999877665521 12234 Q ss_pred cCccccceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC----CCCC Q lcl|NC_011802. 67 ATGLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA----IGTN 140 (430) Q Consensus 67 ~~di~e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t----~~~~ 140 (430) ++++.+..++++||+++...|.+++.|. ...++.+++++++.++||+++|.+++.+++.........+.+ ..++ T Consensus 77 ~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~ 156 (341) T protein:vir:94 77 VQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITG 156 (341) T ss_pred cccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccC Confidence 6688899999999999999999997663 567888899999999999999999998876654332221111 1111 Q ss_pred c--chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcc Q lcl|NC_011802. 141 T--ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLP 218 (430) Q Consensus 141 ~--~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~ 218 (430) . ...+..+.++++.|++.+||. ++|.+|++|+.+..|.. ...+.+.+...+..+++|.||+ +.||++ ++++++| T Consensus 157 ~~~~~~~~~i~~a~~~Lde~~VP~-~gR~lvv~P~~~~~Ll~-~~~~~~~~~~g~~~l~~G~ig~-i~G~~V-~~Sn~lp 232 (341) T protein:vir:94 157 NGQAFSFAVFLAARRLLLEADVPE-EKIVLLISPGQESALFT-IPQFISKDFINNAPIAQGQIGS-LMGVRV-IRTSLIG 232 (341) T ss_pred chhhhhHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHhh-chhhhhhhccccchhheeeeee-EeceEE-EEecccc Confidence 1 123677889999999999999 57999999999999875 3455555555666799999997 899975 6788888 Q ss_pred cccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEE Q lcl|NC_011802. 219 VLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVV 298 (430) Q Consensus 219 ~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVt 298 (430) .++... + +.+++. ....+..-.|.|+.. T Consensus 233 ~~~~~~---~-~~~~~~------------------------------~~~~~~~~~i~~~~~------------------ 260 (341) T protein:vir:94 233 NNSATG---W-RNGAPT------------------------------IAPAEATPGFTGSRY------------------ 260 (341) T ss_pred cccccc---c-cccccc------------------------------eeccccccccccccc------------------ Confidence 744211 1 111110 000111112222100 Q ss_pred eeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEe-eccccCCCcchhee Q lcl|NC_011802. 299 RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVS-QPIPANHELFAGMK 377 (430) Q Consensus 299 a~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat-~pl~~p~g~~~~~~ 377 (430) .+.|+ +..-. ..=|+|||+|+..+- ..++. ...... T Consensus 261 -------------------------~~~~~--------~~~~~-------~~gl~~~~~av~~~k~~~~~~---~~~~~~ 297 (341) T protein:vir:94 261 -------------------------LPKQD--------SFTSL-------PATFTGNSRPVHTAVMCHMDW---AAAVVS 297 (341) T ss_pred -------------------------ccccc--------ccccc-------EEEEEEecccccceeeecchh---hhcccc Confidence 01110 11111 223999999976652 22211 000000 Q ss_pred eEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 378 TTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 378 ~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . -+++...|++.......+=..+||++.+|||.+ |-|.=--+ T Consensus 298 ~----------~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~-v~~~~~~~ 339 (341) T protein:vir:94 298 K----------APRVTQSFENREQVWLMVGRQAYGARLYRPLHA-VNIHTTGD 339 (341) T ss_pred c----------cccccccchhhhhhhhhhhhhhhcccccCccee-EEEecCcC Confidence 0 022222333333233333455899999999997 44432222 No 13 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.95 E-value=2.3e-30 Score=183.11 Aligned_cols=267 Identities=17% Similarity=0.131 Sum_probs=176.3 Q ss_pred Ccccccchh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccccc--CCcccCCcCccccceeEE Q lcl|NC_011802. 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 ma~~~~~~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~--g~~~s~~~~di~e~sv~v 77 (430) ||++ +++ ++.-+++++.|++.++|+++| +|+|+.+ .+.||||.||++......+ +.......+++.+..+++ T Consensus 1 MA~~--~~~pei~~~~v~~~~~~~lv~~~l~--~~~~~~~-~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:79 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhccchhhh--hcccccc-ccCCcEEEEeecCcccccccccCCCccCccccccceEEE Confidence 9995 343 666699999999999999999 8888775 4789999999997766553 222334567899999999 Q ss_pred EeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHH Q lcl|NC_011802. 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELM 155 (430) Q Consensus 78 ~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L 155 (430) +||+++.+.|.+++.|. ...++ ++++++++.+||+++|.+++.++.....-.. .+.+.+....++.+.++++.| T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~---~~~~~~~~~~~~~i~~a~~~l 151 (273) T protein:vir:79 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPSDADDAFDLIASALKEL 151 (273) T ss_pred EEeeecccceeeccHHHHhhcccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccccchhhHHHHHHHHHHHh Confidence 99999999999997664 44554 6788999999999999999988876543221 122333345578899999999 Q ss_pred HhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccch-hhHHHHhccccccchhhhhhhhcCCcccccCcccccccccccc Q lcl|NC_011802. 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) Q Consensus 156 ~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~ 234 (430) ++..||. ++|.++++|+.+..|...-..+.+.... ....+++|.||| +.||++ ++++++|.++..+ .+++.. T Consensus 152 d~~~vP~-~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~i-~~s~~lp~~~~~~----~~a~~~ 224 (273) T protein:vir:79 152 TKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDEQ----FVAFHP 224 (273) T ss_pred hhccCCc-cCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeE-EeceEE-EecccccccCceE----EEEEec Confidence 9999999 4899999999999987643333333333 345699999998 899985 6789998754322 122222 Q ss_pred ccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCc Q lcl|NC_011802. 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) Q Consensus 235 ~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~ 304 (430) .....+.++... + ..+-. ..=+++..|+.. -|++=++|.. +|+-..+++ T Consensus 225 ~A~~~a~~~~~~---e--~~r~~----~~~~~~v~~~~~--yg~~v~~p~~----------vv~~~~~g~ 273 (273) T protein:vir:79 225 SAAAYVSQIDTV---E--ALRDQ----DSFSDRIRALHV--YGGKVVRPTG----------VVVFNKTGS 273 (273) T ss_pred cceeeeeehhhh---h--cccCc----ccceeeeeeeee--eeeEEecCce----------EEEEeccCC Confidence 111111111000 0 00000 001233344433 3444444443 222223333 No 14 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.92 E-value=3e-27 Score=165.98 Aligned_cols=345 Identities=12% Similarity=0.026 Sum_probs=208.1 Q ss_pred Cc--------------cc-ccchh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccccc---CC Q lcl|NC_011802. 1 MA--------------LN-EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GW 61 (430) Q Consensus 1 ma--------------~~-~~~~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~---g~ 61 (430) || +. -++++ ++.-+|+++.|++++++.++++ +++|+. +.||||.||++......+ |. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~-~~~~~~---~~GdTV~ip~~g~~~a~d~~~g~ 76 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATK-KIPFEG---KKGDLIHIPNISRAAVYDKQPQT 76 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccc-ccccee---ecCceEEeeccCcceeeeecCCC Confidence 33 32 24444 5666999999999999999984 456653 569999999887654433 32 Q ss_pred cccCCcCccccceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCC-- Q lcl|NC_011802. 62 DLTDKATGLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI-- 137 (430) Q Consensus 62 ~~s~~~~di~e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~-- 137 (430) .+ ..+++.+.+++++||+.+...+.+++.|. ..-+..+++++.+..+||+++|.+++.++............+. T Consensus 77 ~i--~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~ 154 (381) T protein:vir:80 77 PV--NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDT 154 (381) T ss_pred cc--cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Confidence 33 45678889999999999999999997664 4667778899999999999999999887755432211110000 Q ss_pred ------------CCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccc Q lcl|NC_011802. 138 ------------GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQV 205 (430) Q Consensus 138 ------------~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~ 205 (430) ..+....+..+.++++.|++..||. ++|.+|++|+.+..|+.. ..+.+.+......+++|.||+ + T Consensus 155 ~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~-egR~lvv~P~~~~~Ll~~-~~~~~ad~~~~~~l~~G~Ig~-i 231 (381) T protein:vir:80 155 TLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQ-EGRIVMVSPAQYIDLLSI-NQFISVDFSQVKPVTSGVVGT-I 231 (381) T ss_pred cccccccccccccchhhHHHHHHHHHHHHHhhcCCCc-CCcEEEeCHHHHHHHhhc-hhhhhhhhccchhhhceeeeE-E Confidence 0011124567899999999999998 589999999999998764 344455555677899999997 8 Q ss_pred hhhhhhhhcCCcccccCccccccc-cccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccc Q lcl|NC_011802. 206 AGFDDVLRSPKLPVLTKSTATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQM 284 (430) Q Consensus 206 ~Gfd~~~~~~~v~~~t~gt~t~~t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~ 284 (430) .||++ ++++++|.... +... ..|+... ....+ .+.. +.+.. +-..++.+++|.-|.-+......|+-- T Consensus 232 ~G~~V-v~Sn~lp~~~~---t~~~~~agap~~--~~~~~--~~~~--~~g~~-s~~a~av~~~k~yd~~~~~~~~~~~~~ 300 (381) T protein:vir:80 232 LGMEV-IVTTQIGINSL---TGYVNGQGAPTQ--PTPGV--LGSP--YLPDQ-AGTANVVNTGSASDLAVSLSYFGLPVF 300 (381) T ss_pred cceEE-Eeecccccccc---cceeeecccccc--ccccc--cccc--ccccc-ccceeeeeeeeeeceeeeeeeccceee Confidence 99975 67888887322 1111 1112110 00000 1111 11111 111234556666666665543332211 Q ss_pred ccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEee Q lcl|NC_011802. 285 AKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQ 364 (430) Q Consensus 285 tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~ 364 (430) ++.. .+.+...+++-+++ .|.|.+-++++- T Consensus 301 ~g~~--------~~~~~~~~~~~~~~------------------------------------------~~~~~~~~~~~~ 330 (381) T protein:vir:80 301 SGAG--------ATAADGGQTLGSFG------------------------------------------GANRWATAVVCH 330 (381) T ss_pred ecce--------eeecCCCceeeeeh------------------------------------------hhhhhhhhcccc Confidence 1110 11111122222221 155555444444 Q ss_pred ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 365 PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 365 pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |==..-|++.. .++ |-+..+++|.| ....|+. ||++.+||.++ |-|----- T Consensus 331 ~~~~~~~~~~~--~~~------~~~~~~~~~~~----~~~~~~~--~~~~~~~~~~~-~~~~~~~~ 381 (381) T protein:vir:80 331 PDWLAVGVQQN--VKS------ESSRETMYLAD----AFVTSCV--YGAKVFRPDHC-VLLHTSGI 381 (381) T ss_pred cccccccceeE--eec------ccchhheeehh----hhhhhhh--hccccccchhh-hhhhhcCC Confidence 31111122111 111 45777888888 6666654 99999999984 55532211 No 15 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.80 E-value=1.6e-21 Score=134.57 Aligned_cols=299 Identities=12% Similarity=0.069 Sum_probs=186.3 Q ss_pred Cccc-------------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc--- Q lcl|NC_011802. 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 ma~~-------------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~--- 58 (430) ||+. ++-.|+.+-.|++..|+...+|..++.++ -.+.|++|.|++=.+.+.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~------~~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR------SIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc------cccccceeEeeeccceeeeeec Confidence 5543 44567777799999999999999999432 3467999999866554444 Q ss_pred cCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee------ Q lcl|NC_011802. 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLV------ 130 (430) Q Consensus 59 ~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v------ 130 (430) .|..+..+++++......++||+++-..|.+.+-| ....+.-..+.+.+..+||..+|+.++........+. T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 35555556677777888999999999988886433 3344566668889999999999999987655432111 Q ss_pred ----eecC-----CCCCCCcc-------hhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhH Q lcl|NC_011802. 131 ----ITSP-----DAIGTNTA-------DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 131 ----~~~~-----~t~~~~~~-------~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~ 194 (430) +... ...++... .-++.+-++++.|++..||. .+|.+|++|+.+..|+... .+...+..... T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~-~gR~~vv~P~~y~~LL~~~-~~~~~d~~~~~ 232 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAAL-MPNAANYQALI 232 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHhccc-ccccccccccc Confidence 0000 00000000 11455667889999999998 4799999999999988654 34455555667 Q ss_pred HHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEE Q lcl|NC_011802. 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~T 274 (430) .+++|.|++ +.||++ |+|.++|....+......+.|... .+.++...+ T Consensus 233 ~~~~G~Vg~-i~G~~V-~~Sn~lp~~~~t~~~~~~~~g~~~------------------------------~~~~~~~~~ 280 (347) T protein:vir:15 233 DHERGTIRN-VMGFEV-VEVPHLTAGGAGDTREDAPADQKH------------------------------AFPATSSTT 280 (347) T ss_pred cccceEEEE-EeceEE-Eecccccccccccccccccccccc------------------------------cccccccce Confidence 799999986 889975 678888863221111100011100 000111111 Q ss_pred EcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceee Q lcl|NC_011802. 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~F 354 (430) +.+ .|.+ ..=|+| T Consensus 281 ~~~------------------------------------------------~f~~-------------------~~~l~~ 293 (347) T protein:vir:15 281 VKV------------------------------------------------ALDN-------------------VVGLFQ 293 (347) T ss_pred eee------------------------------------------------cccc-------------------ceeeee Confidence 110 0000 112999 Q ss_pred ccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 355 ADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 355 hr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ||+|...+..-.. ++-+.||.+..-...+--..||.+.+|||.+ |.|.=+|. T Consensus 294 h~~A~g~v~~~~~-----------------------~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~a-v~~~~~~~ 345 (347) T protein:vir:15 294 HRSAVGTVKLKDL-----------------------ALERARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred ccceeeeeEeece-----------------------eeeecccchhhhhhhehhhhcCCceeccccE-EEEecCCC Confidence 9998876653221 1111122222233444456799999999997 56776776 No 16 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.78 E-value=3.3e-21 Score=132.83 Aligned_cols=298 Identities=13% Similarity=0.087 Sum_probs=183.5 Q ss_pred Cccc-------------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc--- Q lcl|NC_011802. 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 ma~~-------------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~--- 58 (430) ||++ +.-+++.+-.|++..|+...+++.+|.. |+ .+.|++|.|+.=.+.+.. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~-r~-----~~~G~sv~i~~iG~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-RS-----IASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhcc-cc-----ccccceeEeeeccceeeeeec Confidence 6633 1235666668899999999999999953 32 466999999865444433 Q ss_pred cCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee------ Q lcl|NC_011802. 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLV------ 130 (430) Q Consensus 59 ~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v------ 130 (430) .|..+..+++++......++||+++-..|.+.+-| ....+.-..+.+.+..+||.++|+.++.......... T Consensus 75 ~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 36666556677778888999999998887777433 3444555567788999999999999975543221110 Q ss_pred ----ee---cCCCCC-CCcc--------hhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhH Q lcl|NC_011802. 131 ----IT---SPDAIG-TNTA--------DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 131 ----~~---~~~t~~-~~~~--------~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~ 194 (430) +. .+...+ +... ..++.+-++++.|+++.||. .+|.+|++|+.+..|+... .+.+.+....+ T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~-~~~~~d~~~~~ 232 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAAL-MPNAANYQALL 232 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCcEEEeCHHHHHHHhccc-ccccccccccc Confidence 00 000000 0011 12455778999999999998 4799999999999988643 34444444556 Q ss_pred HHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEE Q lcl|NC_011802. 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~T 274 (430) .+++|.|++ +.||++ ++|.++|...........+.|+.. . T Consensus 233 ~~~~G~V~~-i~G~~V-~~Sn~lp~~~~~~~~~~~~ag~~~--------------------------------------~ 272 (347) T protein:vir:33 233 DPERGTIRN-VMGFEV-VEVPHLTAGGAGDTREDAPADQKH--------------------------------------A 272 (347) T ss_pred ccccceeEE-EeceeE-EEecccccCccccccccccccccc--------------------------------------c Confidence 799999986 889975 677888874322111111111110 0 Q ss_pred EcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceee Q lcl|NC_011802. 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~F 354 (430) |-++. +.++. .++ +-..=|+| T Consensus 273 --------------------~~~~~-----~~~~~---------------~a~-------------------~~~~gl~~ 293 (347) T protein:vir:33 273 --------------------FPATS-----STTVK---------------VAL-------------------DNVVGLFQ 293 (347) T ss_pred --------------------ccCCc-----cccee---------------ccc-------------------cceeeeee Confidence 00000 00000 000 00113999 Q ss_pred ccceeeEEee-ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 355 ADDAIRIVSQ-PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 355 hr~A~aLat~-pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ||+|+..+.. ++ ++-+.||.+..-..++--+.||++++|||.+ |.|.=+|. T Consensus 294 h~~A~g~v~~~~~------------------------~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~a-v~i~~~~~ 345 (347) T protein:vir:33 294 HRSAVGTVKLKDL------------------------ALERARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred cchhheeeeeece------------------------eeeeccchhhhhHhhhhhhhcCCceecccce-EEEecCCC Confidence 9998865432 22 1112223333334445567799999999997 56666666 No 17 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.75 E-value=8.5e-21 Score=130.62 Aligned_cols=303 Identities=17% Similarity=0.122 Sum_probs=179.3 Q ss_pred Cccc--------------c----cchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccc---cc Q lcl|NC_011802. 1 MALN--------------E----GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT---QE 59 (430) Q Consensus 1 ma~~--------------~----~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~---~~ 59 (430) ||.- . .-.|+.+.-|+...|++..++..++..+ -.+.|++|.||.=-+.+. .. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r------~i~~G~sv~i~~iG~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVR------TIQNGKSAQFPVMGRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc------cccccceEEEecccceeeeeecC Confidence 4432 1 1233444466666788888888888422 247799999986655444 35 Q ss_pred CCcccCCcCccccceeEEEeccccccceEecH--HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCC Q lcl|NC_011802. 60 GWDLTDKATGLLELNVAVNMGEPDNDFFQLRA--DDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI 137 (430) Q Consensus 60 g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~--keL~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~ 137 (430) |..+.++++++......|+||+++-..|.+.+ +.....+....+.+.+..+||..+|+.++.++....++........ T Consensus 75 G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~ 154 (347) T protein:vir:94 75 GERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENI 154 (347) T ss_pred CCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 77777777788888899999999977777764 3344555666788999999999999999877765443322111000 Q ss_pred C--------------C--C----cchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHH Q lcl|NC_011802. 138 G--------------T--N----TADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYR 197 (430) Q Consensus 138 ~--------------~--~----~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r 197 (430) + . . ....++.+-++++.|++..||.+ +|.+|++|+.+..|+.. ..+..........++ T Consensus 155 ~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~-~R~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 155 AGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAG-DRYFYTTPDNYSAILAA-LMPNAANYAALIDPE 232 (347) T ss_pred CCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCC-CcEEEeCHHHHHHHhcc-chhhhhhcccccccc Confidence 0 0 0 01125556788999999999995 89999999999988753 333333333445689 Q ss_pred hccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcc Q lcl|NC_011802. 198 DGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTG 277 (430) Q Consensus 198 ~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaG 277 (430) +|.|++ +.||+ +++|.++|....+. ++. .+..-+.+| T Consensus 233 ~G~Vg~-i~G~~-V~~Sn~lp~~~~t~----~~~-------------------------------------~~~~~~~aG 269 (347) T protein:vir:94 233 TGNIRN-VMGFV-VVEVPHLVQGGAGE----TRG-------------------------------------DDGITIASG 269 (347) T ss_pred ccceEE-EeceE-EEecCccccccccc----ccc-------------------------------------cCcceecCc Confidence 999987 88997 57788888622111 000 111111222 Q ss_pred eeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccc Q lcl|NC_011802. 278 VKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADD 357 (430) Q Consensus 278 V~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~ 357 (430) -. .-|.=+. +..| .+ ..+.+.=|+|||+ T Consensus 270 ~~-------------~~~~~~~------------------------~~~~-----------~~----~~~~~~~l~~h~~ 297 (347) T protein:vir:94 270 QK-------------HAFPATA------------------------SSDV-----------KV----TMDNVVGLFSHRS 297 (347) T ss_pred cc-------------ccccccc------------------------hhhh-----------cc----cccceeEEEeehh Confidence 00 0000000 0000 00 0011123999999 Q ss_pred eeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 358 AIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 358 A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |...+-.-.+.. + .+||.+..-...+==+.||++.+|||.+++.-+. .| T Consensus 298 A~~~v~~~~~~~---------------------e--~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 298 AVGTVKLRDLAL---------------------E--RDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred hhhhhhcccccc---------------------c--chhchhhHHHHhhhhhhhcCcccccceeEEEEec-CC Confidence 987554321111 1 1122111111111124599999999999877666 66 No 18 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.72 E-value=5.2e-20 Score=126.32 Aligned_cols=291 Identities=12% Similarity=0.077 Sum_probs=177.6 Q ss_pred Ccccc---cchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccccc---CCcccCCcCccccce Q lcl|NC_011802. 1 MALNE---GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLELN 74 (430) Q Consensus 1 ma~~~---~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~---g~~~s~~~~di~e~s 74 (430) =|..| .-+|+...-|++..|++..+|+.++.. |+ .+.|+||.|+.=...+..+ |..+..+ .++.... T Consensus 19 ~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~-r~-----i~~G~tv~i~~ig~~~~~~~~~g~~l~~~-~~~~~~~ 91 (332) T protein:vir:78 19 NADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YD-----LRGGKSKQFMFTGKLSAGYHTPGTPIVGD-AGIKANE 91 (332) T ss_pred ccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc-cc-----ccccceEEEEeccceeEeeecCCCCCCCC-CCCCCce Confidence 12223 346677778999999999999999953 43 3579999999776655543 4334332 2566677 Q ss_pred eEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeee-----------cCCCCCCCc Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT-----------SPDAIGTNT 141 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~-----------~~~t~~~~~ 141 (430) +.+++|+.|-..|.+.+-| ....+...++.+.+..+||..+|..++.++......... .+.+...+. T Consensus 92 ~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~~~ 171 (332) T protein:vir:78 92 KTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDA 171 (332) T ss_pred EEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccccccCCccccCH Confidence 8899999999888886422 344556666788999999999999999888775432110 011111111 Q ss_pred chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhh-hhhcccch-hhHHHHhcc-ccccchhhhhhhhcCCcc Q lcl|NC_011802. 142 ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLT-KRDIFGRI-PEEAYRDGT-IQRQVAGFDDVLRSPKLP 218 (430) Q Consensus 142 ~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~-~l~~~~~~-~~~a~r~g~-igr~~~Gfd~~~~~~~v~ 218 (430) ...++.+-++++.|++..||. ++|.+|++|+.+..|+...- .+.+.... ....+++|. |+ .+.||+ +|+|.++| T Consensus 172 ~~~~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~-~i~G~~-V~~Sn~lp 248 (332) T protein:vir:78 172 QAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY-SIAGIR-ILKSNNLA 248 (332) T ss_pred HHHHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeee-EEeeeE-EEecCccc Confidence 124566889999999999998 47999999999999876321 22222222 234578876 65 589996 68889998 Q ss_pred cccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEE Q lcl|NC_011802. 219 VLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVV 298 (430) Q Consensus 219 ~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVt 298 (430) ..+....... + ++|+ -.-|.++ T Consensus 249 ~~~g~~~~~~----~-----------------------------------------~~~~-------------~n~~~~~ 270 (332) T protein:vir:78 249 GLYGQDLSSA----A-----------------------------------------VTGE-------------NNDYQVD 270 (332) T ss_pred cCcccccccc----c-----------------------------------------cccc-------------ccccccc Confidence 5221110000 0 0110 0011111 Q ss_pred eeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheee Q lcl|NC_011802. 299 RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 299 a~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~ 378 (430) . +.+.=|+|||+|.+++..- T Consensus 271 ~-----------------------------------------------~~~~~~~~h~~a~~~v~~~------------- 290 (332) T protein:vir:78 271 A-----------------------------------------------SALAGLIFHREAAGCIQSV------------- 290 (332) T ss_pred c-----------------------------------------------ccceEEeecccceeeeeee------------- Confidence 1 1111389999987666421 Q ss_pred EEEecCcceEEEEEEE-eeecccceeEEEEEeeccceecCcceeEEecCC Q lcl|NC_011802. 379 TSFSIPDVGLNGIFRT-QGDISTLFRLCRIALWYGVNATRPEAIGVGLPG 427 (430) Q Consensus 379 ~~~~~p~~Glslrv~~-~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~ 427 (430) ++.+++.+ .|+.+......+=-+.||++.+|||.+++.-+- T Consensus 291 --------~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 291 --------APTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred --------ccchhhhhcccchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 11122111 112111112222335699999999999877766 No 19 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.71 E-value=8.1e-20 Score=125.24 Aligned_cols=297 Identities=14% Similarity=0.142 Sum_probs=182.4 Q ss_pred Cccc--------------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccc--- Q lcl|NC_011802. 1 MALN--------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT--- 57 (430) Q Consensus 1 ma~~--------------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~--- 57 (430) ||+. ..-+|+..--|++..|+..++|+.++.. | ..+.|+++.||.=-+.+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~-r-----~i~~g~s~~~~~iG~~~~~~~ 74 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R-----SISSGKSAQFPVLGRTQAAYL 74 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccccee-e-----eecccceEEEEeeceeEEEee Confidence 7744 1125666668999999999999999953 3 246699999886533333 Q ss_pred ccCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee----- Q lcl|NC_011802. 58 QEGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLV----- 130 (430) Q Consensus 58 ~~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v----- 130 (430) ..|..+.+..+|+.-..+.|++|+.+-..|.+.+-| ....+.-..+-+.+.++||..+|+.++....+...+. T Consensus 75 ~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 75 APGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred ecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 346666666667777789999999999888888544 3455555667788999999999998876655422211 Q ss_pred ----------eec--CCCCCCCc----chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhH Q lcl|NC_011802. 131 ----------ITS--PDAIGTNT----ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 131 ----------~~~--~~t~~~~~----~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~ 194 (430) ... .++..+.. ...++.+-.+++.|+++.||. .+|.+|++|+.+..|... ..+.+....... T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~-~~~~~~~~~~~~ 232 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPS-SDRVFYCDPDSYSAILAA-LMPNAANYAALI 232 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeChHHHHHHhhc-cccccccccccc Confidence 000 00001111 113556778999999999998 479999999999988653 233333344566 Q ss_pred HHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEE Q lcl|NC_011802. 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~T 274 (430) .+++|.|+. +.||. +|++.++|....+++... +.|+ ... T Consensus 233 ~~~~G~V~~-v~G~~-V~~Sn~lp~~~~~~~~~~-~tg~--------------------------------------~~~ 271 (344) T protein:vir:10 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTSREG-TTGQ--------------------------------------KHA 271 (344) T ss_pred ceeeeEEEE-EeceE-EEeccccccccCCccccc-ccCc--------------------------------------ccc Confidence 689999987 88995 899999885222111110 0000 000 Q ss_pred EcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceee Q lcl|NC_011802. 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~F 354 (430) +.+ + ++....+ ..+.+.=|+| T Consensus 272 ~~~--------------------~-------------------------------------~~~~~~~--~~s~~~~l~~ 292 (344) T protein:vir:10 272 FPA--------------------T-------------------------------------KSGNDKV--AKDNVIGLFM 292 (344) T ss_pred ccC--------------------C-------------------------------------cccceee--ecceeEEEee Confidence 000 0 0000001 0011123899 Q ss_pred ccceeeEEee-ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 355 ADDAIRIVSQ-PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 355 hr~A~aLat~-pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) ||+|...+-. +|.. ..+. ...++.| ...++ +.||.+.+|||.+|++..=+| T Consensus 293 h~~A~~~v~~~~~~~----------e~~r--------~~~~~~d----~i~g~--~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 293 HRSAVGTVKLRDLAL----------ERAR--------RANFQAD----QIIAK--YAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred chhhhhhhhhcccee----------eccc--------chhHHHH----HHHHH--hhcccceecccceEEEEeecC Confidence 9999865432 2211 1100 0111222 22222 459999999999999999999 No 20 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.66 E-value=1.6e-18 Score=118.21 Aligned_cols=300 Identities=15% Similarity=0.135 Sum_probs=175.8 Q ss_pred Cccc-------------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc--- Q lcl|NC_011802. 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 ma~~-------------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~--- 58 (430) ||+. +.=+|+..--||+..|+..++|..++..+ -.+.|+++.||.=-+.+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r------ti~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVR------SIQSGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhhe------eccccceEEeeeccceeEeeee Confidence 5532 11245555578999999999999999432 2467999999865444433 Q ss_pred cCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC Q lcl|NC_011802. 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA 136 (430) Q Consensus 59 ~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t 136 (430) .|..+-+.-+|+.-....|++|+.+-..|.+.+-| ....+.-..+.+.+..+||..+|+-++..+.....+...+... T Consensus 75 ~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:94 75 PGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNEN 154 (347) T ss_pred cCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 45444333346666778899999998888777533 3344455557788999999999998876665433321110000 Q ss_pred C-----------------CCC----cchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHH Q lcl|NC_011802. 137 I-----------------GTN----TADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEA 195 (430) Q Consensus 137 ~-----------------~~~----~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a 195 (430) . ..+ ....++.+-++...|+++.||.+ +|.+|++|+.+..|+...-.. ..+...... T Consensus 155 ~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~-~R~~vv~P~~y~~LLk~~~~~-~~~~~~~~~ 232 (347) T protein:vir:94 155 IAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSS-DRVFYTTPDNYSAILAALMPN-AANYQALID 232 (347) T ss_pred cccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCC-CCEEEeChHHHHHHHHhhccc-ccccccccc Confidence 0 000 11226678889999999999984 899999999999988532222 222224445 Q ss_pred HHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEE Q lcl|NC_011802. 196 YRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISF 275 (430) Q Consensus 196 ~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Ti 275 (430) +++|.|+. +.||. +|++.++|....+. ++.+ +|. ++ T Consensus 233 ~~~G~V~~-v~G~~-V~~Sn~~p~~~~~~------~~~~----------------------------------~~~--~~ 268 (347) T protein:vir:94 233 PSTGSIRN-VMGFE-VIEVPHLTAGGAGD------NRAE----------------------------------EGV--AP 268 (347) T ss_pred cccceeEE-eeceE-EEEcCccccccCcc------cccc----------------------------------ccc--cc Confidence 88999997 88996 68888888733211 1110 111 22 Q ss_pred cceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeec Q lcl|NC_011802. 276 TGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWA 355 (430) Q Consensus 276 aGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fh 355 (430) ++- .+-|..+. +..|. +++ +-+..|+|| T Consensus 269 ~~~-------------~~~~~~~~------------------------~~~y~-----------~d~----~~~~~l~~~ 296 (347) T protein:vir:94 269 TNQ-------------KHAFPDTA------------------------SGDTR-----------VAL----DNVVGLFNH 296 (347) T ss_pred ccc-------------cccccccc------------------------ccccc-----------ccc----cceEEEEec Confidence 220 00011000 00010 011 113379999 Q ss_pred cceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccc--eeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 356 DDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTL--FRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 356 r~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~--~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |+|...+ -+-. +.++. +||.... .-.++ ..||...+|||.+++.+. -+| T Consensus 297 ~~A~~tv--~~~~-------------------~~~e~--~~~~~~~~~~i~~~--~a~G~g~~rPe~a~~i~~-~~a 347 (347) T protein:vir:94 297 RSAVGTV--KLKD-------------------MALER--ARRANFQADQIIAK--YAMGHGGLRPEACGALVF-KKA 347 (347) T ss_pred hhhhhhh--hhcc-------------------cceee--eechhhhhhhhhhh--hhhcCcccccceeEEEEe-cCC Confidence 9976543 2211 11111 1232221 22222 349999999999865554 455 No 21 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.66 E-value=2.7e-18 Score=116.95 Aligned_cols=300 Identities=15% Similarity=0.132 Sum_probs=179.2 Q ss_pred Cccc-----------------c--cchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc--- Q lcl|NC_011802. 1 MALN-----------------E--GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 ma~~-----------------~--~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~--- 58 (430) ||+. | .-+|+....|++..|+...+|..++.+ | -.+.|+++.||.=-+.+.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-r-----~i~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R-----TIQNGKSASFPVMGRTKGYYLA 74 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccc-c-----cccCcceEEEeeecceeeeeec Confidence 6633 1 234566668899999999999999943 2 2477999999866444433 Q ss_pred cCCcccCCcCccccceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeee---- Q lcl|NC_011802. 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) Q Consensus 59 ~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~---- 132 (430) .|..+...-+|+.-..+.++||+.+-..|.+.+-|- ...+.-..+.+.+.++||..+|+-++......+..... T Consensus 75 ~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:88 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) T ss_pred cccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 344443333456667899999999998888886552 33455556778899999999999887665543321100 Q ss_pred cCC-------C--CCCC-------cchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHH Q lcl|NC_011802. 133 SPD-------A--IGTN-------TADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) Q Consensus 133 ~~~-------t--~~~~-------~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~ 196 (430) +++ . .+.. ....++.+-++++.|+++.||.+ +|.+|++|+.+..|+... .....+......+ T Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~-gR~~vv~P~~y~~Ll~~~-~~~~~~~~~~~~~ 232 (347) T protein:vir:88 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG-DRRFYCAPEDYSAILSAL-MPNAANYAALIDP 232 (347) T ss_pred cCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCC-CCEEEeCHHHHHHHhcch-hhhhhhhccccch Confidence 000 0 0000 01125667889999999999995 799999999998887533 2223333344468 Q ss_pred HhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEc Q lcl|NC_011802. 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) Q Consensus 197 r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Tia 276 (430) ++|.++. +.||. ++++.++|....+. .. .|+.+... T Consensus 233 ~~G~vg~-i~G~~-V~~s~nlp~~~~~~---~~---------------------------------------~~~~~~~t 268 (347) T protein:vir:88 233 ETGNIRN-VMGFE-VIEVPHLTVGGAGD---NN---------------------------------------PADGVAPT 268 (347) T ss_pred hcceeee-eccce-EEEeeccccccccc---cc---------------------------------------cccccccc Confidence 9999986 88996 67888888522221 00 01111111 Q ss_pred ceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeecc Q lcl|NC_011802. 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) Q Consensus 277 GV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr 356 (430) + +. ..| .....-.+ .+ ..+-+.-|+||| T Consensus 269 ~---------~~----~~~-----~~~~~~~~--------------------------------~~--d~~~~~~l~~~~ 296 (347) T protein:vir:88 269 N---------QK----HIF-----PATATGDD--------------------------------RV--AQNNVVGLFNHR 296 (347) T ss_pred c---------cc----ccc-----cccccccc--------------------------------cc--ccCcEEEEEech Confidence 1 00 000 00000000 00 011123599999 Q ss_pred ceeeEEe-eccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEE--eeccceecCcceeEEecCCCCC Q lcl|NC_011802. 357 DAIRIVS-QPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 357 ~A~aLat-~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riD--vLyG~~~v~PElagv~l~~q~~ 430 (430) +|+..+. .+|.. .. +||.+ ...+.|+ +.||++.+|||.++++-..-.| T Consensus 297 ~a~g~v~~~d~~~----------e~--------------~r~~~--~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 297 SAVGTVKLKDMAL----------ER--------------ARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhhheeccccee----------ee--------------eechh--hHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 9997764 33321 11 12211 1111222 4599999999999877777777 No 22 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.63 E-value=6.6e-18 Score=114.78 Aligned_cols=298 Identities=15% Similarity=0.132 Sum_probs=179.6 Q ss_pred Cccc--------------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc-- Q lcl|NC_011802. 1 MALN--------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-- 58 (430) Q Consensus 1 ma~~--------------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~-- 58 (430) ||.+ ..-+|+.+--|++..|+..+++..++.. | ..+.|+++.||+=-+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r-----~i~~gks~~~~~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R-----SISSGKSAQFPVLGRTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-e-----eccccceEEEeeecceEEEee Confidence 4433 1334555558899999999999999953 3 3355999988865443333 Q ss_pred -cCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeec-- Q lcl|NC_011802. 59 -EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS-- 133 (430) Q Consensus 59 -~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~-- 133 (430) .|..+...+.|+......|++|+.+-..|.+.+-| ....+.-..+-+.+.++||..+|+-++....+.+.+.... T Consensus 75 ~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~ 154 (345) T protein:vir:22 75 APGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNE 154 (345) T ss_pred ecCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 45555444545555567799999999888888544 3445555567788999999999998876655433221110 Q ss_pred ---------------CCC---CCCCc-chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhH Q lcl|NC_011802. 134 ---------------PDA---IGTNT-ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 134 ---------------~~t---~~~~~-~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~ 194 (430) .++ .+..+ ...|..+-++++.|++..||.+ +|.+|++|+.+..|.... .+......... T Consensus 155 ~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~-~R~~vv~P~~y~~Ll~~~-~~~~~~~~~~~ 232 (345) T protein:vir:22 155 NIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAA-DRVFYCDPDSYSAILAAL-MPNAANYAALI 232 (345) T ss_pred cccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCcc-CCEEEeChHHHHHHhccc-ccccccccccc Confidence 011 11111 1247778899999999999994 799999999999886533 23333344566 Q ss_pred HHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEE Q lcl|NC_011802. 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~T 274 (430) .+++|.|+. +.||. ++++.++|....++.... + .++... T Consensus 233 ~~~~G~V~~-i~G~~-V~~sn~lp~~~~~~~~~~---~------------------------------------~~~~~~ 271 (345) T protein:vir:22 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTAREG---T------------------------------------TGQKHV 271 (345) T ss_pred ccccceEEE-EeceE-EEecccccccccCccccC---c------------------------------------cccccc Confidence 689999987 89995 899998886322221111 0 111111 Q ss_pred EcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceee Q lcl|NC_011802. 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~F 354 (430) +... ....++. + ..+.+.=|+| T Consensus 272 ~~~~------------~g~~~~~--------------------------------------------~--~~~~~~~l~~ 293 (345) T protein:vir:22 272 FPAN------------KGEGNVK--------------------------------------------V--AKDNVIGLFM 293 (345) T ss_pred cccc------------ccceeee--------------------------------------------e--ccCceEEEEE Confidence 1110 0000000 0 0011123999 Q ss_pred ccceeeEEee-ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCC Q lcl|NC_011802. 355 ADDAIRIVSQ-PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 355 hr~A~aLat~-pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~ 429 (430) ||+|...+-. +|.. ..+. +..+|.| ...++ ..||++.+|||.++++..--+ T Consensus 294 h~~A~~~v~~~~~~~----------e~~r--------~~~~~~d----~I~~~--~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 294 HRSAVGTVKLRDLAL----------ERAR--------RANFQAD----QIIAK--YAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ehhheeeeeeeccee----------eeee--------chhHHHH----HHHHH--HhcCCcccccceeEEEEEeeC Confidence 9997764432 1210 1111 0112222 22111 459999999999998887777 No 23 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.60 E-value=6.8e-17 Score=109.23 Aligned_cols=258 Identities=14% Similarity=0.064 Sum_probs=167.6 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc-c---cccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-P---TQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~-~---~~~g~~~s~~~~di~ 71 (430) ||++.=++..+.+ +.+++.+++.+++++++ .++++-++ +.||||+||+-... . ..+|.++ .++.+. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~--~~~~~l~g-~~G~tv~ip~~~~~g~~~~~~eg~~i--~~~~it 75 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccc--cccccccC-CCCCEEEEEeeccCCCcccccCCCcc--cccccc Confidence 9999766666555 55677799999999999 45554443 57999999985322 1 2234333 356777 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++. .-.|.+++.+ ++..+...+..+.+.+.||+++|.+++..+......+.+ ....++.+. T Consensus 76 ~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~--------~~~~~d~i~ 146 (274) T protein:vir:93 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------DITKLNGLQ 146 (274) T ss_pred cceeEEEeeee-cccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--------cccCHHHHH Confidence 88888888664 4468888766 345677888889999999999999999887665433221 123467788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|++.+. ..|.++++|+.+..|..+....|.. .......+++|.||+ +.||+ ++.+.++|. T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~--------- 212 (274) T protein:vir:93 147 SAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA--------- 212 (274) T ss_pred HHHHHhhhccC---CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccce-ecCee-EEEcCCCCc--------- Confidence 89999999864 3688999999998886543222211 111223355566655 45554 222222210 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:93 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) . .-++||+.||.+...+.. T Consensus 213 --------------------------------------~--t~~l~~~gai~~~~~~~~--------------------- 231 (274) T protein:vir:93 213 --------------------------------------G--TAILAKKGAVKLILKRDF--------------------- 231 (274) T ss_pred --------------------------------------c--eEEEEeCCeEEEEecCCc--------------------- Confidence 0 015777788887654321 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . +-.++|.+......+.+..||++.++|+-. |.|.-..| T Consensus 232 ~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~-v~~t~~~~ 270 (274) T protein:vir:93 232 F--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred c--cccccchhhcccEEEEEEEEEEEEEcCCce-EEEeeCcc Confidence 1 112234555556666777899999999986 67765555 No 24 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.60 E-value=4.5e-17 Score=110.18 Aligned_cols=264 Identities=14% Similarity=0.110 Sum_probs=166.6 Q ss_pred Cccc---ccchh--hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccc----cccCCcccCCcCccc Q lcl|NC_011802. 1 MALN---EGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~---~~~~~--t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~----~~~g~~~s~~~~di~ 71 (430) ||+. .++++ ++.-+.+++.|++.+++++++..-+.++ .+.||||+||+-.... ..+|.++ .++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~---g~~G~tv~ip~~~~~g~a~~~~~g~~i--~~~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLE---GQPGSEITVPKYKYIGDAQDVAEGAAI--DYSALE 75 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceeccccc---CCCCCEEEEeeeccCCcceeecCCCcC--cccccc Confidence 8854 33332 2233667788999999999985444443 3579999999854321 2224333 355778 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcc-hhhhHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTA-DAWNFV 148 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~-~~~~~~ 148 (430) ..+..+++++.+. .|++++.+. +..+...+..+++.+.|+.++|.+|+..+......+. ++.+..+. ..+..+ T Consensus 76 ~~~~~~~i~~~~~-a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~---~~~t~~~~~~~~~~~ 151 (278) T protein:vir:80 76 TESVKHGIKKAGK-GVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVK---GAINIGLIDKIENTF 151 (278) T ss_pred cceeeEeeehhhc-cccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---cccccchhhhHHHHH Confidence 8888899877554 788887663 4577888888999999999999999988876543332 22222233 235667 Q ss_pred HHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhc-ccchhhHHHHhccccccchhhhhhhhcCCcccccCccccc Q lcl|NC_011802. 149 ADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDI-FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 149 a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~-~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~ 227 (430) ..+...|++..+|. .+.++++|+.+..|.......+. ........+++|.||+ +.||+. +.+.++|. T Consensus 152 ~da~~~l~~~~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~-~~G~~V-i~s~~~p~-------- 219 (278) T protein:vir:80 152 TDAPDAIEDESITT--TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGE-LLGWEI-VRTKKLAD-------- 219 (278) T ss_pred HHHHHhhcccCCCc--ccEEEECHHHHHHHHhhhhhhccccccccccceeecccee-ecceeE-EEcCCCCc-------- Confidence 88999999999997 36699999999887653322222 1122233456677765 566653 33233220 Q ss_pred cccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeE Q lcl|NC_011802. 228 ITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) Q Consensus 228 ~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~ 307 (430) | T Consensus 220 -------------------------------------------------~------------------------------ 220 (278) T protein:vir:80 220 -------------------------------------------------G------------------------------ 220 (278) T ss_pred -------------------------------------------------c------------------------------ Confidence 0 Q ss_pred EeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcce Q lcl|NC_011802. 308 ITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVG 387 (430) Q Consensus 308 I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~G 387 (430) .-++||++||.+..... T Consensus 221 ------------------------------------------t~~l~~~gAi~~~~~~~--------------------- 237 (278) T protein:vir:80 221 ------------------------------------------NALAVKAGALKTFLKRN--------------------- 237 (278) T ss_pred ------------------------------------------eEEEEeccceeeeecCC--------------------- Confidence 01567777776643221 Q ss_pred EEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 388 LNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 388 lslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) +++ -.++|.+......+.+..||++.++||-. |.|.=+-+ T Consensus 238 ~~v--E~~Rd~~~~~d~i~~~~~yg~~v~~~~~~-v~it~~a~ 277 (278) T protein:vir:80 238 LLA--ESGRDMDHKLTKFNADQHYAVALVDETKA-VKVVPVAG 277 (278) T ss_pred ccc--ccccchhhccceeeeeeEEEEEEEcCcce-EEEeeccC Confidence 111 12234444555666778899999999986 66654444 No 25 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.58 E-value=2.1e-16 Score=106.50 Aligned_cols=258 Identities=13% Similarity=0.071 Sum_probs=166.8 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcc-cccc---cCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE-SPTQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~-~~~~---~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +-+++.|++.+++++++. ++++-+ .+.||||+||.-.. .... +|.++ .++.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~--~~~~l~-g~~G~tv~ip~~~~~g~~~~~~~g~~i--~~~~it 75 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFAD--IDSTLV-GQPGDTLTFPAFTYSGDAQVIAEGEKI--PVDQIG 75 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhccccc--cccccc-CCCCCEEEEEeeccCCCccccCCCCcC--chhhcc Confidence 9987666665555 445666999999999884 444333 35799999997532 1222 24333 355778 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++ ..-.|.+++.+ +...+...+..+++.+.||+++|.+++..+...... .......++.+. T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~--------~~~~~~~~d~i~ 146 (274) T protein:vir:96 76 TSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT--------VEADITKLDGLQ 146 (274) T ss_pred cceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------cCcccccHHHHH Confidence 8888888876 45568888766 346778888889999999999999999877553321 111223467889 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhccc-chhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG-RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~-~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|++... ..|.++++|+.+..|.......|... ......+|+|.||+ +.||+ ++.+.++|. T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~-~~G~~-Vi~s~~~p~--------- 212 (274) T protein:vir:96 147 TAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAV-IVRSNKLNK--------- 212 (274) T ss_pred HHHHHhcccCC---CceEEEeCHHHHHHHHhcccccccccccccccceeecccce-ecCee-EEEcCCCCc--------- Confidence 99999999875 35899999999988765433222221 12233455666665 55654 333232220 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:96 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) . ..++||+.||.++...-+ T Consensus 213 --------------------------------------~--t~~l~~~gA~~~~~~~~~--------------------- 231 (274) T protein:vir:96 213 --------------------------------------G--EALLAKKGAVKLITKRDF--------------------- 231 (274) T ss_pred --------------------------------------c--eEEEEeCcceeeeecCCc--------------------- Confidence 0 026778888887654321 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) .+ -.++|.+......+.+..||++.++|+=. |+|.-..| T Consensus 232 ~v--E~~Rd~~~~~d~i~~~~~yg~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:96 232 FL--EKDRDASRKSTALYSDKHYVAYLYDESKV-VKITKGAG 270 (274) T ss_pred cc--ccccchhhcccEEEEeeEEEEEEEcCccE-EEEEcCcc Confidence 11 12234444555666677799999999986 78887777 No 26 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.56 E-value=2.8e-16 Score=105.85 Aligned_cols=258 Identities=14% Similarity=0.068 Sum_probs=168.6 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc----ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES----PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~----~~~~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +.++++|++.+++++++ .++++-++ +.||||+||.-... ...+|.++ .++.+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~--~~d~~l~g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccc--eecccccC-CCCCEEEEeeecCCCccccccCCCcc--chhhcc Confidence 9998766666655 55677799999999999 55655443 57999999974321 12234444 345677 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++.+ ..-.|.+++.+ +...+...+..+++...||+++|.+++..+......+. ..+..++.+. T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~--------~~a~~~d~i~ 146 (274) T protein:vir:12 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN--------ADITKLNGLQ 146 (274) T ss_pred cceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------ccccCHHHHH Confidence 7777888866 44468888766 34567788888888899999999999988766443221 1123567788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|.+.|++... ..|.++++|+.+..|.......|.. .+.....+++|.||+ +.||+ ++.+.++|. T Consensus 147 dA~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~--------- 212 (274) T protein:vir:12 147 SAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRSNKLEA--------- 212 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhhhhhccccccccccceeccccee-ecCee-EEEeCCCCc--------- Confidence 89999998753 4699999999998887644322222 222333456666665 55664 333222220 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:12 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) .+ -++|++.||.+...+. + T Consensus 213 --------------------------------------~t--~~l~~~gA~~~~~~~~---------------------~ 231 (274) T protein:vir:12 213 --------------------------------------GT--AILAKKGAVKLILKRD---------------------F 231 (274) T ss_pred --------------------------------------ce--EEEEeccceeeeecCC---------------------c Confidence 00 1556666666544321 1 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . +-.++|++......+-+..||++.++|+-. |+|.-..| T Consensus 232 ~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:12 232 F--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred e--eccccchhhcccEEEeeeEEEEEEEcCCce-EEEEcCCc Confidence 1 122334555556666778899999999996 78887777 No 27 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.56 E-value=1.4e-16 Score=107.55 Aligned_cols=260 Identities=17% Similarity=0.110 Sum_probs=151.8 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc----ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES----PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~----~~~~g~~~s~~~~di~ 71 (430) ||+.+=++..+.+ +-+++.++..++++.++...+.++ .+.||||+||+-... ...+|.++ .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~---g~~G~ti~iP~~~~~gda~~~~eg~~i--~~~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQ---GQPGNTLKFPAFTYIGDAADVAEGGEI--SLDKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccc---cCCCCEEEEeeeccCccccccCCCCcc--ChhhcC Confidence 9988666666555 556677999999999996555554 357999999974222 13345444 345566 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++.+ ..-.|.+++.+ +...+...+..+...+.||+++|.+|+..+..... ..+....++.+. T Consensus 76 ~~~~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~---------~~~~~~~~d~i~ 145 (272) T protein:vir:36 76 TTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ---------TVSTKANVDGVQ 145 (272) T ss_pred CcceeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cccccccHHHHH Confidence 6677777755 44468888755 34577777788888889999999999877654321 122234577889 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc- Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI- 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~- 228 (430) +|+..|.+...|. |.++++|..+..|.......+.........+++|.||+ +.|++ ++.+.++|..++ ....+ T Consensus 146 ~A~~~lgd~~~~~---~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~-~~G~~-Vv~s~~~p~~~~-~~~~~~ 219 (272) T protein:vir:36 146 AALDIFNDEDAQA---YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD-VLGAQ-IVRSKKLAEGSA-LMFKIV 219 (272) T ss_pred HHHHHhhhcCCCc---eEEEEcHHHHHHHhcccccccccccccccceeeeccce-ecCee-EEEeCCCCCCce-eEEEEE Confidence 9999999998773 78999999999887654333333344566789999997 88997 677888885322 11111 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) ...||-...... .+. .+ ..|. -.+..|.|.-- +.|.+--.-...-+.+ T Consensus 220 ~~~gA~~~~~~~-~~~----vE--~~R~---------~~~~~d~i~~~----------------~~y~~~v~~~~~vv~~ 267 (272) T protein:vir:36 220 SNSPALKLVLKR-GVQ----VE--TDRD---------IVTKTTVITAD----------------EHYAAYLYDLTKVVNI 267 (272) T ss_pred ecccceeeeecC-Ccc----cc--cccc---------hhhcCcEEEEE----------------EEEEEEEEcCccEEEE Confidence 112222100000 000 00 0000 01111222111 1121111111111222 Q ss_pred eeccc Q lcl|NC_011802. 309 TPKPV 313 (430) Q Consensus 309 ~Pai~ 313 (430) +=+.+ T Consensus 268 t~~g~ 272 (272) T protein:vir:36 268 TFTGV 272 (272) T ss_pred eecCC Confidence 22222 No 28 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.55 E-value=2.3e-16 Score=106.35 Aligned_cols=286 Identities=13% Similarity=0.039 Sum_probs=151.7 Q ss_pred Ccccccch-------hhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccC---CcccCCcCcc Q lcl|NC_011802. 1 MALNEGQI-------VTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG---WDLTDKATGL 70 (430) Q Consensus 1 ma~~~~~~-------~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g---~~~s~~~~di 70 (430) |++-.|.= .++--+++++.|++.|+...+++ +.+. +.||||.|+...+.++++= .++ ..+++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~--~~d~----g~GDtV~InsIg~~tV~dY~~~~~i--~~d~l 72 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIAR--VVDF----PDGDKLTIPSVGTPVVRSRPEQGDF--TFDNL 72 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhc--cccc----CCCCeEEeccccccccccccCCCCc--ccccC Confidence 98864432 34556999999999999877763 2222 5699999998877777653 333 34556 Q ss_pred ccceeEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--------ceee-ecCC---C Q lcl|NC_011802. 71 LELNVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMG--------SLVI-TSPD---A 136 (430) Q Consensus 71 ~e~sv~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~--------~~v~-~~~~---t 136 (430) ....+.++||+.|--.|.+++ |+. -.++.....+.+..+||..+|..++++.+..+ +++. +.+. + T Consensus 73 tt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~ 151 (322) T protein:vir:31 73 DTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVG 151 (322) T ss_pred CCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceec Confidence 777899999999999999999 653 23444444566778899999988877655433 1111 1111 1 Q ss_pred CCCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHH--hhhhhhcccc---hhhHHHHhcc--ccccchhhh Q lcl|NC_011802. 137 IGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGR---IPEEAYRDGT--IQRQVAGFD 209 (430) Q Consensus 137 ~~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~--~~~~l~~~~~---~~~~a~r~g~--igr~~~Gfd 209 (430) .+++....|+.+-.++..|++..||+ .+|-+|++|+-+..|.+ ..+.+.+..+ ....-..+|+ +|+ ++||| T Consensus 152 ~gt~~~~ay~~lv~l~~kLdkanVP~-~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~-~~GF~ 229 (322) T protein:vir:31 152 TGTDQTMDVTDFSRVNYVMTQSKMPM-GGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRS-VYGID 229 (322) T ss_pred cCCCchhhHHHHHHHHHHhccccCCC-CCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHH-Hhcee Confidence 12333468999999999999999999 58999999998876622 1222222211 1111112232 776 78996 Q ss_pred hhhhcCCccc--ccCccccccccccccccccceeeeeeccccccc-cceeeeEEeecccee--ecccEEEEcceeeeccc Q lcl|NC_011802. 210 DVLRSPKLPV--LTKSTATGITVSGAQSFKPVAWQLDNDGNKVNV-DNRFATVTLSATTGL--KRGDKISFTGVKFLGQM 284 (430) Q Consensus 210 ~~~~~~~v~~--~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~-d~~~~~~t~s~tgtl--kaGDv~TiaGV~~v~~~ 284 (430) +|.|.+++. ++-=.+....++++++..- ...+...+.+... ..+.+- .+...-. ..+|-+----||.---+ T Consensus 230 -V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~-f~~~~~~~~~~~~~~~~~l~--~~e~~r~~~~~~d~~~~~~~~g~g~~ 305 (322) T protein:vir:31 230 -LFVSNLLADANETINAGGDARSTTAGKCNM-FMNVSDMGLLPFVVAWKEMP--TTKSFIDDYNDDLNTATTARWGNGLV 305 (322) T ss_pred -eeeeccccccccccccCcccccccceeecc-cccccchhhhhhhhHhhhhh--hhhcccCccccccceeeeeeecceee Confidence 788888752 2211222333344433211 1111111110000 000000 0000000 01121111111111111 Q ss_pred ccccccCcceEEEEeeccCcee Q lcl|NC_011802. 285 AKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 285 tk~~~~~l~~fvVta~~~a~tv 306 (430) -.+.+. +|++..+..+. T Consensus 306 r~e~l~-----~~~a~~~~~~~ 322 (322) T protein:vir:31 306 RDENLV-----CVLANADKVTF 322 (322) T ss_pred cccceE-----EEEeccccccC Confidence 111110 12232222333 No 29 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.53 E-value=1.2e-15 Score=102.43 Aligned_cols=321 Identities=16% Similarity=0.093 Sum_probs=180.5 Q ss_pred Cc---------cc-------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc Q lcl|NC_011802. 1 MA---------LN-------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ 58 (430) Q Consensus 1 ma---------~~-------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~ 58 (430) |+ .| ..-+|+.+--|++..|+..+++..++.+ | ..+-|+++.|+.=-+.+.. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~-r-----ti~~Gksv~f~~iG~~t~~ 74 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTK-R-----TLKNGKSLQFIYTGRMTSS 74 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccc-c-----ccccCceEEEEeeeeeEEe Confidence 21 11 1234555557888999999999999843 3 3356999988865444333 Q ss_pred ---cCCcccCCc-CccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeee Q lcl|NC_011802. 59 ---EGWDLTDKA-TGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT 132 (430) Q Consensus 59 ---~g~~~s~~~-~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~ 132 (430) .|..+.+++ .|+--....++||+.+-..|.+.+-| ....+....+.+.+.++||..+|+.++..+.+.+.+.-. T Consensus 75 ~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p 154 (375) T protein:vir:10 75 FHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASP 154 (375) T ss_pred eecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 354443443 13222335599999999888888543 445566666778899999999999998888754322110 Q ss_pred --------cC--------CCCC---CCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhh--hhhhcccch Q lcl|NC_011802. 133 --------SP--------DAIG---TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL--TKRDIFGRI 191 (430) Q Consensus 133 --------~~--------~t~~---~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~--~~l~~~~~~ 191 (430) .. ++.. .+....++.+-.+.+.|+++.||. .+|.++++|+.+..|+..+ ..+-+.+-. T Consensus 155 ~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~-~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~ 233 (375) T protein:vir:10 155 VSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSS-QGRCAVLNPRQYYALIQDIGSNGLVNRDVQ 233 (375) T ss_pred cccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCC-CCCEEEeChHHHHHHHhcCCccceeeeccc Confidence 00 0000 011124677889999999999998 5899999999998887543 123233222 Q ss_pred hhHHHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeeccc Q lcl|NC_011802. 192 PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGD 271 (430) Q Consensus 192 ~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGD 271 (430) ....+.+|.+++ +.||. +++|.++|..+... .. .|+.. ..+..-+.++ T Consensus 234 ~~~~~~~g~v~~-i~Gv~-V~~Sn~lP~~~~~~---~~-~g~~~--------------------------~~~a~~~~~~ 281 (375) T protein:vir:10 234 GSALQSGNGVIE-IAGIH-IYKSMNIPFLGKYG---VK-YGGTT--------------------------GETSPGNLGS 281 (375) T ss_pred ccceeccceEEE-EeceE-EEEecccccccccc---cc-ccccc--------------------------cccchhhhhc Confidence 444567888886 88995 89999999864321 00 01100 0000001111 Q ss_pred EEEEcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccc Q lcl|NC_011802. 272 KISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTN 351 (430) Q Consensus 272 v~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~N 351 (430) .+.+-. .-..+++ .....|. ..+.. .+-+.= T Consensus 282 ~~~~~~---------------~~~~~~~----------------------g~~~~y~-----------~d~~~-~~~~~~ 312 (375) T protein:vir:10 282 HIGPTP---------------ENANATG----------------------GVNNDYG-----------TNAEL-GAKSCG 312 (375) T ss_pred cccccC---------------Ccceeec----------------------ccccccc-----------ccccc-cCceEE Confidence 111110 0000000 0000110 00000 011223 Q ss_pred eeeccceeeEEee-ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 352 VFWADDAIRIVSQ-PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 352 l~Fhr~A~aLat~-pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |+|||+|..-+-- +|.+ ... +=..++.+|.| ....+. .||...+|||.|+..=.+-+| T Consensus 313 ~~~~~~A~g~v~~~~~~~--------~~~-------~~~~~~~~q~~----~i~~~~--a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 313 LIFQKEAAGVVEAIGPQV--------QVT-------NGDVSVIYQGD----VILGRM--AMGADYLNPAAAVELYIGATA 371 (375) T ss_pred EEEchhheeeeeeecccc--------ccc-------cchhhheeeee----eeeeee--eeccCccCceeEEEEecCcCc Confidence 9999999765411 1110 000 11245556665 444444 499999999998766556555 No 30 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.51 E-value=2.4e-16 Score=106.19 Aligned_cols=272 Identities=14% Similarity=0.126 Sum_probs=157.7 Q ss_pred cchhcccCCCchhhhhccCcEEEEecCcc---cccccCCcccCCcCccccceeEEEeccccccceEecHHH--hhhHHHH Q lcl|NC_011802. 26 MAQKAKKYTPPAASMQRSSNTIWMPVEQE---SPTQEGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAY 100 (430) Q Consensus 26 mA~~V~~~r~~~~~~~k~GdTV~i~~P~~---~~~~~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~ 100 (430) |-|.+ +.|+++.||.=-. .....|..+-++++++......|+||+.+-..|.+.+-| ....+.- T Consensus 1 ~vr~i-----------~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr 69 (324) T protein:vir:99 1 MTRTI-----------TSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVR 69 (324) T ss_pred Ceeee-----------ecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccch Confidence 44444 7899999886533 223347666666777878888899999999888888544 4455566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccc--------eeeecCCCCC---C-Cc------c-hhhhHHHHHHHHHHhhcCC Q lcl|NC_011802. 101 RHRIQSAARKLANNVELKVANMAAEMGS--------LVITSPDAIG---T-NT------A-DAWNFVADAEELMFSRELN 161 (430) Q Consensus 101 ~~~i~~Am~~LAn~Id~dl~~~~~~~~~--------~v~~~~~t~~---~-~~------~-~~~~~~a~a~~~L~~~~aP 161 (430) ..+.+.+.++||..+|+-++.++..... ...+..++.. + +. + ..++.+-++++.|+++.|| T Consensus 70 ~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP 149 (324) T protein:vir:99 70 SEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIP 149 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCC Confidence 6778899999999999988766543221 1111111100 0 00 0 2256677899999999999 Q ss_pred CCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccccccccccccccee Q lcl|NC_011802. 162 RDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAW 241 (430) Q Consensus 162 ~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~ 241 (430) . .+|.+|++|+.+..|.+ ...+.+........+++|.|++ +.||. +|+|.++|... ++.....-+++ T Consensus 150 ~-~gR~~vv~P~~y~~Ll~-~~~~~~~~~~~~~~~~~G~V~~-i~Gf~-V~~Sn~lp~~~-~t~~~~a~~~~-------- 216 (324) T protein:vir:99 150 A-GDRTFYTDPDTYSAILA-ALMPNAANYAALIDPETGNIRN-VMGFE-VVETPHMTAQM-VTNPTDAFDGT-------- 216 (324) T ss_pred C-CCCEEEeChHHHHHHhh-cccccccccccccceecceEEE-EeceE-EEecCCccccc-ccccccccccc-------- Confidence 8 58999999999987754 3444443344556699999997 89995 89999999722 11100000000 Q ss_pred eeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccc Q lcl|NC_011802. 242 QLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLS 321 (430) Q Consensus 242 ~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~ 321 (430) |=.++++| .+... T Consensus 217 ----------------------------~~~~~~~~----------------~~~~~----------------------- 229 (324) T protein:vir:99 217 ----------------------------GHIFPATG----------------DSTTT----------------------- 229 (324) T ss_pred ----------------------------cccccccc----------------ccccc----------------------- Confidence 00111111 00000 Q ss_pred cccccccccccccccCceeEEeccCCcccceeecccee-eEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeeccc Q lcl|NC_011802. 322 PEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI-RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDIST 400 (430) Q Consensus 322 ~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~-aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~ 400 (430) +.|. . -++.+.=|+||++|. +.-..++.. ..+. ...+|.|.-. T Consensus 230 ---~ky~--------~-------d~~~~~gl~~~~~a~~tv~~~~~~~----------e~~~--------~~~~~~d~i~ 273 (324) T protein:vir:99 230 ---GKMT--------V-------GADNVVGLFVHRSAVATLKLKDMAL----------ERAR--------RPEYQADQII 273 (324) T ss_pred ---cccc--------c-------ccCceeEEEEehhheEEEeeeccee----------ccee--------chhhHHHhhh Confidence 0000 0 001112389999965 222223311 1111 1112333111 Q ss_pred ceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 401 LFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . =..||++.+|||.++++-..-.+ T Consensus 274 ~------~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 274 A------KYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred h------hhhhcCcccccceEEEEEEccCc Confidence 1 13499999999999877643333 No 31 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.49 E-value=2.2e-15 Score=100.90 Aligned_cols=299 Identities=15% Similarity=0.035 Sum_probs=169.6 Q ss_pred Cccc--------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc---cCCcc Q lcl|NC_011802. 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) Q Consensus 1 ma~~--------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~---~g~~~ 63 (430) |..- +.=.|+.+.-||...|+..+++..++.. |. .+.|+++.||.=-..+.. .|..+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-rt-----i~~gkS~q~~~iG~~~~~~~~~G~~l 74 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-QE-----VVGTNSVSNKYIGETELQVLSPGKSP 74 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEeeeeeeeEEeeeccCccc Confidence 3321 2334455557888999999999988843 32 578999998876443332 24333 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccc-ceeeecC----C Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMG-SLVITSP----D 135 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~-~~i~~Am~~LAn~Id~dl~~~~~~~~-~~v~~~~----~ 135 (430) .++.+.-.+..|++|+-+-..+.+-+-| +..-+..+ .+-+.+.++||...|+-++.+++..+ .+..... . T Consensus 75 --d~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~ 152 (364) T protein:vir:10 75 --DASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRV 152 (364) T ss_pred --CCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcc Confidence 3455667789999999887666665433 33222223 34467788999999999987665432 1111000 0 Q ss_pred C-------CCCCcch-------hhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc--cchhhHHHHhc Q lcl|NC_011802. 136 A-------IGTNTAD-------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF--GRIPEEAYRDG 199 (430) Q Consensus 136 t-------~~~~~~~-------~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~--~~~~~~a~r~g 199 (430) . +..+..+ ....+-.+.+.|++..||.+ +|.++++|+-+..|+.. ..|.+. +......|++| T Consensus 153 ~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~-~R~~vv~P~~y~~Ll~~-~~lvn~d~~~~~~~~~~~G 230 (364) T protein:vir:10 153 AGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTS-ELCGLMPWTAFNCLRDA-DRIVDKSYTIAASDNTVDG 230 (364) T ss_pred cCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCcc-ccEEEeChHHHHHHhcC-CccccccccccCCCccccc Confidence 0 0011111 13345578899999999994 79999999999888763 223221 22234568999 Q ss_pred cccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEccee Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~ 279 (430) .++. +.||. +++|.++|.... +++. .|... +=-++-+| T Consensus 231 ~v~~-v~Gv~-Vv~Sn~lP~~~~-~~~~---t~~~t----------------------------------~h~ls~~~-- 268 (364) T protein:vir:10 231 FVLK-SWNTP-IVPSNRFPKLSD-NTEG---TGNTK----------------------------------HHKLSNAG-- 268 (364) T ss_pred eeEE-EeceE-EEeccccccccc-cccc---ccccc----------------------------------cccccccc-- Confidence 9986 89995 899999987421 1100 00000 00001111 Q ss_pred eecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeecccee Q lcl|NC_011802. 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~ 359 (430) ..+.|-|+++- +.+.=++|||+|. T Consensus 269 -----------~g~~y~v~~d~---------------------------------------------~~~~~~~f~~~Al 292 (364) T protein:vir:10 269 -----------NGNRYDVTAGQ---------------------------------------------TSAQAVLFTQDAL 292 (364) T ss_pred -----------CCccccccccc---------------------------------------------ceeEEEEEecceE Confidence 11223333210 0011278999865 Q ss_pred eEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 360 aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . +.-+-- .....+.++ .|..++-| ++ ..||...+|||.++++.+++.+ T Consensus 293 ~--tv~~~~-------~t~e~~~~~-----~~~~~~id-------a~--~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 293 L--VGRTIS-------ITGDIFYEK-----KEKTWYID-------TF--LAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred E--EEEEec-------ceeeeeecc-----ceeeeeee-------ee--hcccCcccCccceEEEEecCCC Confidence 5 332210 011111111 12222222 12 2399999999999999999988 No 32 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.44 E-value=1.2e-14 Score=96.97 Aligned_cols=260 Identities=14% Similarity=0.065 Sum_probs=159.3 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcc---c-ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE---S-PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~---~-~~~~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +.+++.+++.+++++.+. ++++-++ +.||||+||+-.. . ...+|.++ .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~--~d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAE--VDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccce--ecccccC-CCCCEEEEeeecCCCccccccCCCcc--cccccc Confidence 9998666665555 566777999999999994 4444343 5799999997322 1 12234444 355677 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++.. -.|++++.+ +...+...+..+++.+.||+++|.+++..+......+.+ ....++.+. T Consensus 76 ~~~~~~~i~~~~-~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--------~~~~~d~i~ 146 (274) T protein:vir:94 76 TKKREAKIRKIA-KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------DITKLNGLQ 146 (274) T ss_pred cceeEEEeeeec-ceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--------cccCHHHHH Confidence 778888886644 458888766 345678888888999999999999999887665433211 123467788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|++.+. ..|.++++|+.+..|..+....|.. .......+++|.||+ +.||+ ++.+.++|..+ ++ T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~~t-----~~ 216 (274) T protein:vir:94 147 SAIDKFNDEDL---EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEAGT-----AI 216 (274) T ss_pred HHHHHhhccCC---CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCcce-----EE Confidence 89999999865 3588999999999988654333322 233445689999998 88996 66778888421 22 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeecc--ceeecccEEEEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT--TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~t--gtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) +-|.+.. .. +.+.. +. ..+ --.+.-|.+..-- .|.|.-.-...-+ T Consensus 217 -l~~~gA~-----~~---~~~~~-------~~-vE~~Rd~~~~~d~i~~~~----------------~y~~~~~~~~~vv 263 (274) T protein:vir:94 217 -LAKKGAV-----KL---ILKRD-------FF-LEVARDASTKTTALYSDK----------------HYVAYLYDESKAV 263 (274) T ss_pred -EEeCcce-----Ee---eecCC-------ce-eccccchhhcccEEEEEE----------------EEEEEEEcCCceE Confidence 2222111 00 00000 00 011 1123445555443 3444332234455 Q ss_pred EEeecccccccccccccc Q lcl|NC_011802. 307 EITPKPVALDDVSLSPEQ 324 (430) Q Consensus 307 ~I~Pai~~~~~~~~~~~~ 324 (430) .|+++.- +.+. T Consensus 264 ~~t~~~~-------~~~~ 274 (274) T protein:vir:94 264 KITKGSG-------SLEM 274 (274) T ss_pred EEecCcc-------cccC Confidence 5555432 1111 No 33 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.44 E-value=1.2e-14 Score=96.97 Aligned_cols=260 Identities=14% Similarity=0.065 Sum_probs=159.3 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcc---c-ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE---S-PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~---~-~~~~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +.+++.+++.+++++.+. ++++-++ +.||||+||+-.. . ...+|.++ .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~--~d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAE--VDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccce--ecccccC-CCCCEEEEeeecCCCccccccCCCcc--cccccc Confidence 9998666665555 566777999999999994 4444343 5799999997322 1 12234444 355677 Q ss_pred cceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++.. -.|++++.+ +...+...+..+++.+.||+++|.+++..+......+.+ ....++.+. T Consensus 76 ~~~~~~~i~~~~-~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--------~~~~~d~i~ 146 (274) T protein:vir:97 76 TKKREAKIRKIA-KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------DITKLNGLQ 146 (274) T ss_pred cceeEEEeeeec-ceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--------cccCHHHHH Confidence 778888886644 458888766 345678888888999999999999999887665433211 123467788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|++.+. ..|.++++|+.+..|..+....|.. .......+++|.||+ +.||+ ++.+.++|..+ ++ T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~~t-----~~ 216 (274) T protein:vir:97 147 SAIDKFNDEDL---EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEAGT-----AI 216 (274) T ss_pred HHHHHhhccCC---CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCcce-----EE Confidence 89999999865 3588999999999988654333322 233445689999998 88996 66778888421 22 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeecc--ceeecccEEEEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT--TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~t--gtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) +-|.+.. .. +.+.. +. ..+ --.+.-|.+..-- .|.|.-.-...-+ T Consensus 217 -l~~~gA~-----~~---~~~~~-------~~-vE~~Rd~~~~~d~i~~~~----------------~y~~~~~~~~~vv 263 (274) T protein:vir:97 217 -LAKKGAV-----KL---ILKRD-------FF-LEVARDASTKTTALYSDK----------------HYVAYLYDESKAV 263 (274) T ss_pred -EEeCcce-----Ee---eecCC-------ce-eccccchhhcccEEEEEE----------------EEEEEEEcCCceE Confidence 2222111 00 00000 00 011 1123445555443 3444332234455 Q ss_pred EEeecccccccccccccc Q lcl|NC_011802. 307 EITPKPVALDDVSLSPEQ 324 (430) Q Consensus 307 ~I~Pai~~~~~~~~~~~~ 324 (430) .|+++.- +.+. T Consensus 264 ~~t~~~~-------~~~~ 274 (274) T protein:vir:97 264 KITKGSG-------SLEM 274 (274) T ss_pred EEecCcc-------cccC Confidence 5555432 1111 No 34 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.42 E-value=1.6e-14 Score=96.22 Aligned_cols=257 Identities=14% Similarity=0.073 Sum_probs=158.6 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc----ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES----PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~----~~~~g~~~s~~~~di~ 71 (430) ||+. =++..+.+ +-+++.++..+++++++..-+.++ .+.||||+||+.... ...+|.++ .++.+. T Consensus 3 ~~~~-T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~---g~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 76 (275) T protein:vir:96 3 LENM-TKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLV---GQPGNTITFPAFVYSGDAKVVPEGEEI--PIDLIE 76 (275) T ss_pred Cccc-chhhhhhchHHHHHHHHHHHHHhhhhcccceeccccc---CCCCCEEEeeeeccCCccccccCCCCc--chhhcc Confidence 5443 44444333 567778999999999985444443 357999999975432 12334444 344666 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++.+ +.-.|.+++.+. ...+...+.++.....||+++|.+++..+.....-+ . .....++.+. T Consensus 77 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~-------~-~~~~~~d~i~ 147 (275) T protein:vir:96 77 TKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKV-------E-ADITKLAGLQ 147 (275) T ss_pred cceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------c-ccccCHHHHH Confidence 6677777744 455688887663 456788888888889999999999998776543221 1 1223567788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|.+... ..|.++++|+.+..|.......|.. .......+++|.||+ +.||+ ++.+.++|. T Consensus 148 dA~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~--------- 213 (275) T protein:vir:96 148 TAIDKFNDEDL---EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGE-ALGAI-IVRSNKIKE--------- 213 (275) T ss_pred HHHHHhccccC---CccEEEeCHHHHHHHHhcccccccccccccccceeccccce-ecCee-EEEeCCCCc--------- Confidence 89999987653 4688999999998886543222221 111223355666655 45554 222221110 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) T Consensus 214 -------------------------------------------------------------------------------- 213 (275) T protein:vir:96 214 -------------------------------------------------------------------------------- 213 (275) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) .+ -++|++.||.+...+- + T Consensus 214 --------------------------------------~t--~~i~~~gA~~~~~~~~---------------------~ 232 (275) T protein:vir:96 214 --------------------------------------GE--AILAKRGAVKLITKRD---------------------F 232 (275) T ss_pred --------------------------------------ce--EEEEeccceeeeecCC---------------------c Confidence 00 1567777777765431 1 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) . +-.+.|++......+-+.-||++.++|+-. |.|.=.-+ T Consensus 233 ~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 271 (275) T protein:vir:96 233 F--LETERHASHKSTALFSDKHYVAYLYDESKV-VKITKSAS 271 (275) T ss_pred c--cccccchhhcCcEEEEeEEEEEEEEcCccE-EEEEeccc Confidence 1 112235555566777788899999999985 55543333 No 35 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.42 E-value=1.5e-14 Score=96.41 Aligned_cols=260 Identities=15% Similarity=0.066 Sum_probs=157.9 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc-c---cccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-P---TQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~-~---~~~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +.+++.++..+++++++..-+.|+ .++||||+||+.... . ..+|.++ .++.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~---g~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV---GQPGDTLTFPAFIYSGDAKVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc---CCCCCEEEeeeecCCCccccccCCCcc--chhhcc Confidence 9998666666555 556677999999999985555554 257999999976432 1 2234333 345778 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++ ..-.|.+++.+. ...+...+.++++...||+++|.+|+..+......+.. ....++.+. T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~--------~~~~~d~i~ 146 (274) T protein:vir:95 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA--------DITKLTGLQ 146 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--------cccCHHHHH Confidence 8888888866 455688887663 45788888889999999999999999888765433211 113467788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|.+... ..|.++++|+.+..|.......|.. .+.....+++|.||+ +.||+ ++.+.++|..+ ++ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~~~~t-----~~ 216 (274) T protein:vir:95 147 TAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAV-IVRSNKLEAGT-----AI 216 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeE-EEEeCCCCCce-----EE Confidence 89999998763 4699999999999987654333332 233445689999998 89997 56678877422 22 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccc--eeecccEEEEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg--tlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) +-|.+..+ . +.+.. +. ..+. -.+.-|.+...-+|.++-.... -+|..-..++++ T Consensus 217 -l~~~gA~~-----~---~~~~~-------~~-vE~~Rd~~~~~d~i~~~~~y~~~~~~~~-------~~v~~tk~~~~~ 272 (274) T protein:vir:95 217 -LAKKGAVK-----L---ITKRD-------FF-LETDRDPSTKTTALYSDKHYVAYLYDES-------KAVKITKGSGSL 272 (274) T ss_pred -EEecccee-----e---eecCC-------cc-cccccccccccCEEEEeEEEEEEEEcCC-------cEEEEEcCCccc Confidence 22222111 0 00000 00 0111 1234455554433322111110 112221223333 Q ss_pred EE Q lcl|NC_011802. 307 EI 308 (430) Q Consensus 307 ~I 308 (430) +. T Consensus 273 ~~ 274 (274) T protein:vir:95 273 EM 274 (274) T ss_pred cC Confidence 33 No 36 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.42 E-value=1.5e-14 Score=96.41 Aligned_cols=260 Identities=15% Similarity=0.066 Sum_probs=157.9 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc-c---cccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-P---TQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~-~---~~~g~~~s~~~~di~ 71 (430) ||+..=++..+.+ +.+++.++..+++++++..-+.|+ .++||||+||+.... . ..+|.++ .++.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~---g~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV---GQPGDTLTFPAFIYSGDAKVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc---CCCCCEEEeeeecCCCccccccCCCcc--chhhcc Confidence 9998666666555 556677999999999985555554 257999999976432 1 2234333 345778 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) ..+..+++++ ..-.|.+++.+. ...+...+.++++...||+++|.+|+..+......+.. ....++.+. T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~--------~~~~~d~i~ 146 (274) T protein:vir:96 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA--------DITKLTGLQ 146 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--------cccCHHHHH Confidence 8888888866 455688887663 45788888889999999999999999888765433211 113467788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|.+... ..|.++++|+.+..|.......|.. .+.....+++|.||+ +.||+ ++.+.++|..+ ++ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~~~~t-----~~ 216 (274) T protein:vir:96 147 TAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAV-IVRSNKLEAGT-----AI 216 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeE-EEEeCCCCCce-----EE Confidence 89999998763 4699999999999987654333332 233445689999998 89997 56678877422 22 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccc--eeecccEEEEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg--tlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) +-|.+..+ . +.+.. +. ..+. -.+.-|.+...-+|.++-.... -+|..-..++++ T Consensus 217 -l~~~gA~~-----~---~~~~~-------~~-vE~~Rd~~~~~d~i~~~~~y~~~~~~~~-------~~v~~tk~~~~~ 272 (274) T protein:vir:96 217 -LAKKGAVK-----L---ITKRD-------FF-LETDRDPSTKTTALYSDKHYVAYLYDES-------KAVKITKGSGSL 272 (274) T ss_pred -EEecccee-----e---eecCC-------cc-cccccccccccCEEEEeEEEEEEEEcCC-------cEEEEEcCCccc Confidence 22222111 0 00000 00 0111 1234455554433322111110 112221223333 Q ss_pred EE Q lcl|NC_011802. 307 EI 308 (430) Q Consensus 307 ~I 308 (430) +. T Consensus 273 ~~ 274 (274) T protein:vir:96 273 EM 274 (274) T ss_pred cC Confidence 33 No 37 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.40 E-value=8.3e-14 Score=92.31 Aligned_cols=256 Identities=16% Similarity=0.171 Sum_probs=152.0 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc----ccccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ----ESPTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~----~~~~~~g~~~s~~~~di~ 71 (430) ||+..-++-.+.+ +.+++.|++.+++.+++. +++.-+ .+.|++|.||+-. ...+.+|.+. .++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~--~~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAE--VDTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhcccc--cccccc-CCCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 9965444444434 666777999999999884 333322 2579999999742 2223345333 345777 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) .+++.+++.+.. .-|.+++.+. +..+....+.+...+.+++++|.+++..+......+ +....++.+. T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~---------~~~~t~d~i~ 145 (272) T protein:vir:98 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV---------EATATVDGVS 145 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---------ccccCHHHHH Confidence 888888887754 4588887774 456777777888999999999999998765432211 1123467888 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) ++...|.+.+.+ .|.++++|..+..|.......+.. .+.....+++|.+| T Consensus 146 da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-------------------------- 196 (272) T protein:vir:98 146 KALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-------------------------- 196 (272) T ss_pred HHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccch-------------------------- Confidence 899999888644 578999999887775322111000 00001112222222 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) +|.|+ .|.. T Consensus 197 ---------------------------------------------~i~G~-----------------~Vi~--------- 205 (272) T protein:vir:98 197 ---------------------------------------------EVLGV-----------------QIVR--------- 205 (272) T ss_pred ---------------------------------------------hhcCe-----------------eEEE--------- Confidence 23441 1111 Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) ++.+ | . . ..++|++.||.++.+.- + T Consensus 206 s~~~-p---------------------~----------~--t~~~~~~~a~~~~~~~~---------------------~ 230 (272) T protein:vir:98 206 SRKC-P---------------------K----------G--TAYMVRKGALRIMLKRN---------------------T 230 (272) T ss_pred cCCC-C---------------------c----------c--eEEEEcCCeEEEEecCC---------------------c Confidence 0000 0 0 0 02678888888775421 1 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEec----CCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGL----PGQT 429 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l----~~q~ 429 (430) . +-.++|.+......+...-||++.++|+-. |.+ |||| T Consensus 231 ~--ve~~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a~~~ 272 (272) T protein:vir:98 231 M--VETDRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDAAKK 272 (272) T ss_pred e--eeeccccccceeEEEEEEEEEEEEEcCCce-EEEEecccccC Confidence 1 112234445556666677799999999964 555 4445 No 38 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.40 E-value=8.3e-14 Score=92.31 Aligned_cols=256 Identities=16% Similarity=0.171 Sum_probs=152.0 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc----ccccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ----ESPTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~----~~~~~~g~~~s~~~~di~ 71 (430) ||+..-++-.+.+ +.+++.|++.+++.+++. +++.-+ .+.|++|.||+-. ...+.+|.+. .++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~--~~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAE--VDTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhcccc--cccccc-CCCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 9965444444434 666777999999999884 333322 2579999999742 2223345333 345777 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) .+++.+++.+.. .-|.+++.+. +..+....+.+...+.+++++|.+++..+......+ +....++.+. T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~---------~~~~t~d~i~ 145 (272) T protein:vir:30 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV---------EATATVDGVS 145 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---------ccccCHHHHH Confidence 888888887754 4588887774 456777777888999999999999998765432211 1123467888 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) ++...|.+.+.+ .|.++++|..+..|.......+.. .+.....+++|.+| T Consensus 146 da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-------------------------- 196 (272) T protein:vir:30 146 KALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-------------------------- 196 (272) T ss_pred HHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccch-------------------------- Confidence 899999888644 578999999887775322111000 00001112222222 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) +|.|+ .|.. T Consensus 197 ---------------------------------------------~i~G~-----------------~Vi~--------- 205 (272) T protein:vir:30 197 ---------------------------------------------EVLGV-----------------QIVR--------- 205 (272) T ss_pred ---------------------------------------------hhcCe-----------------eEEE--------- Confidence 23441 1111 Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) ++.+ | . . ..++|++.||.++.+.- + T Consensus 206 s~~~-p---------------------~----------~--t~~~~~~~a~~~~~~~~---------------------~ 230 (272) T protein:vir:30 206 SRKC-P---------------------K----------G--TAYMVRKGALRIMLKRN---------------------T 230 (272) T ss_pred cCCC-C---------------------c----------c--eEEEEcCCeEEEEecCC---------------------c Confidence 0000 0 0 0 02678888888775421 1 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEec----CCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGL----PGQT 429 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l----~~q~ 429 (430) . +-.++|.+......+...-||++.++|+-. |.+ |||| T Consensus 231 ~--ve~~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a~~~ 272 (272) T protein:vir:30 231 M--VETDRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDAAKK 272 (272) T ss_pred e--eeeccccccceeEEEEEEEEEEEEEcCCce-EEEEecccccC Confidence 1 112234445556666677799999999964 555 4445 No 39 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.38 E-value=1.1e-14 Score=97.06 Aligned_cols=288 Identities=15% Similarity=0.139 Sum_probs=165.7 Q ss_pred Ccccc----------------cchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccc---ccCC Q lcl|NC_011802. 1 MALNE----------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT---QEGW 61 (430) Q Consensus 1 ma~~~----------------~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~---~~g~ 61 (430) |++=- .=+|+..--||+..|+..+++..++.. |. .+.|+|+.|+.=-+.+. ..|. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~-r~-----i~~G~s~~~~~iG~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNV-RS-----LRGTNQLRVDRVGASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcccee-ee-----ccccceEEEeeecceeeeeecCCC Confidence 55431 223344446888889999999999853 43 47799999996544333 3455 Q ss_pred cccCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceee----ec-- Q lcl|NC_011802. 62 DLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI----TS-- 133 (430) Q Consensus 62 ~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~----~~-- 133 (430) .+.++ .+.-.++.|+||+.+-..|.+.+-| +..-+.-..+-+...++||...|+.++....+.+.+.- .. T Consensus 75 ~l~~~--~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~ 152 (334) T protein:vir:80 75 ELVVQ--KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAF 152 (334) T ss_pred CCCCC--CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 55444 4666789999999999888887544 33334444566788899999999988766554432210 00 Q ss_pred -CC-----CCCCCcchh-------hhHHHHHHHHHHhhcCCCC--CCcEEEeChHHHHHHHHh--hhhhhcccchhhHHH Q lcl|NC_011802. 134 -PD-----AIGTNTADA-------WNFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAY 196 (430) Q Consensus 134 -~~-----t~~~~~~~~-------~~~~a~a~~~L~~~~aP~~--~~R~~vl~p~~~~~~~~~--~~~l~~~~~~~~~a~ 196 (430) +| ...++..+. .+.+-.+++.|++..+|.+ .+|.+|++|+-+..|+.. +...++.......-| T Consensus 153 ~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~ 232 (334) T protein:vir:80 153 HDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSF 232 (334) T ss_pred cCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccc Confidence 00 011111111 2345589999999999942 379999999999998875 222222222223347 Q ss_pred HhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEc Q lcl|NC_011802. 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) Q Consensus 197 r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Tia 276 (430) ..|.+++ +.||. +|++.++|... ++.... | T Consensus 233 ~~g~i~~-v~G~~-V~~Sn~~P~~~-~t~~~~-----------------------------------------g------ 262 (334) T protein:vir:80 233 VGGRIAM-LNGVR-VVETPRFPQSA-ITANAL-----------------------------------------G------ 262 (334) T ss_pred cceeEEE-EeceE-EEeecCCCCcc-cccccc-----------------------------------------c------ Confidence 8888886 78995 67888887521 110000 0 Q ss_pred ceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeecc Q lcl|NC_011802. 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) Q Consensus 277 GV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr 356 (430) | .|-+++ +.+ +.++=++||+ T Consensus 263 ~----------------~~~~~a-------------------------------------gd~-------t~~~~~~~~~ 282 (334) T protein:vir:80 263 A----------------DFNVTD-------------------------------------AEV-------RRKMITFIPS 282 (334) T ss_pred c----------------cccccc-------------------------------------ccc-------cceEEEEEeC Confidence 0 000110 000 0112389999 Q ss_pred ceeeEEeec-cccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 357 DAIRIVSQP-IPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 357 ~A~aLat~p-l~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) +|+.-+-.. +. ...+.+ ..+|.| ...+. ..||.+.+|||-++|+----+- T Consensus 283 ~Al~t~~~~~~~----------~e~~~~--------~~~~~d----~i~~~--~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 283 MALISAQVHPVS----------AQFWEE--------KKDFGH----YLDTF--QSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred ceEEEEEEeecc----------eeeeec--------hhhHHH----HHHHH--HHcCCceeccceEEEEEEeeec Confidence 987644321 21 011111 112222 11111 4699999999998776432222 No 40 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.26 E-value=1e-12 Score=86.35 Aligned_cols=258 Identities=15% Similarity=0.087 Sum_probs=155.5 Q ss_pred CcccccchhhhhH-----HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccc----ccccCCcccCCcCccc Q lcl|NC_011802. 1 MALNEGQIVTLAV-----DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES----PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 ma~~~~~~~t~~~-----~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~----~~~~g~~~s~~~~di~ 71 (430) ||...=++..+.+ .-+++.++..+++++++. .+.+-+ .+.|+||+||.-... ...+|.++ .++.+. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~--~~~~l~-g~~G~ti~iP~~~~igda~~~~eg~~i--~~~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFAD--IDSTLV-GQPGDTLTFPAFVYSGDATVVPEGQKI--PVDKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccce--eccccc-CCCCCEEEeeeecCCCccccccCCCcc--Cccccc Confidence 9977555555544 555667999999999994 333333 257999999964222 23455444 344566 Q ss_pred cceeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a 149 (430) .++..+++.+ +.-.|.+++.+. ...+...+.++.....||+++|.+++.........+.. .....+.+. T Consensus 76 ~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~--------~~~t~d~i~ 146 (276) T protein:vir:10 76 TNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSA--------DIGTLAGLE 146 (276) T ss_pred cceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--------cccCHHHHH Confidence 6677777743 566677776663 45788888889999999999999999887664432211 112356788 Q ss_pred HHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcc-cchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIF-GRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~-~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +|...|.+... ..+.++++|..++.|.......|.. .+.....+++|+ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~---------------------------- 195 (276) T protein:vir:10 147 AAIDTFDDEDL---EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGA---------------------------- 195 (276) T ss_pred HHHHHhccccC---cccEEEEcHHHHHHHHHhccccccccccccccceeccc---------------------------- Confidence 89999998764 3588999999998876432111111 111111123333 Q ss_pred ccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEE Q lcl|NC_011802. 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I 308 (430) +-+|.|+ .|..+. T Consensus 196 -------------------------------------------ig~~~G~-----------------~Vi~s~------- 208 (276) T protein:vir:10 196 -------------------------------------------FGEALGA-----------------VIVRSK------- 208 (276) T ss_pred -------------------------------------------cceecce-----------------eEEEcC------- Confidence 3344452 111100 Q ss_pred eeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceE Q lcl|NC_011802. 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Gl 388 (430) . + | ..+ -++|++.||.+..... + T Consensus 209 --~------------------~---p-----------~~t--~~l~~~gAi~~~~~~~---------------------~ 231 (276) T protein:vir:10 209 --K------------------L---D-----------EGE--AILAKRGAVKLITKRD---------------------F 231 (276) T ss_pred --C------------------C---C-----------cce--EEEEeccceeeeecCC---------------------c Confidence 0 0 0 000 1678888888765431 1 Q ss_pred EEEEEEeeecccceeEEEEEeeccceecCcceeEEecC-CCCC Q lcl|NC_011802. 389 NGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLP-GQTA 430 (430) Q Consensus 389 slrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~-~q~~ 430 (430) . +-.+.|++......+-+.-||++.++|+-. |.|. +-.. T Consensus 232 ~--vE~dRd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~~ 271 (276) T protein:vir:10 232 F--LETDRDPSTKTTALYSDKHYVAYLYDESKA-VKVTKGAGT 271 (276) T ss_pred e--eecccchhhcccEEEEeeEEEEEEEcCcce-EEEecCCcC Confidence 1 122345556666777788899999999975 3332 2222 No 41 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.13 E-value=1.2e-11 Score=80.44 Aligned_cols=286 Identities=15% Similarity=0.081 Sum_probs=153.7 Q ss_pred Cccc---c------cchhhhhHHHHHHHHhhhcccc--hhcccCCCchhh--hhccCcEEEEecCccccccc-CC----- Q lcl|NC_011802. 1 MALN---E------GQIVTLAVDEIIETISAITPMA--QKAKKYTPPAAS--MQRSSNTIWMPVEQESPTQE-GW----- 61 (430) Q Consensus 1 ma~~---~------~~~~t~~~~evi~~len~lvmA--~~V~~~r~~~~~--~~k~GdTV~i~~P~~~~~~~-g~----- 61 (430) |+++ - -++-+ -++..|..+..|+ +.=++.|++-.. ..+.++++..+....+.+.. +. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~----~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQ----AFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQS 76 (322) T ss_pred Ccccceeeeeeeeechhhh----HHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccc Confidence 5444 0 12333 3444555555553 111345555332 23556666666554432221 10 Q ss_pred -ccc-CCc-CccccceeEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC Q lcl|NC_011802. 62 -DLT-DKA-TGLLELNVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA 136 (430) Q Consensus 62 -~~s-~~~-~di~e~sv~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t 136 (430) +.. .-| .++-.+...+.+..+ .+.+.+.+.|+. ..+.-..+.+.+..+|+.+.|.-++..+...+. .++.+.+ T Consensus 77 ~d~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~~gt~ 154 (322) T protein:vir:10 77 ADGTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKGTGQP 154 (322) T ss_pred cCcccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccccccc Confidence 111 111 122234445554444 567777776642 344555688899999999999988765544332 1221111 Q ss_pred C---------CCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHH-Hhccccccch Q lcl|NC_011802. 137 I---------GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY-RDGTIQRQVA 206 (430) Q Consensus 137 ~---------~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~-r~g~igr~~~ 206 (430) . ..+....+..+..+++.|+++.||.+.+|.+|++|+.+..|+. -..+...+-...+++ ++|.+|+ +. T Consensus 155 v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~-d~~~ts~D~~~~~~l~~~G~ig~-~l 232 (322) T protein:vir:10 155 VEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQ-ITEATSADYTSAMDLQSKGIITN-WM 232 (322) T ss_pred cccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhc-chhhhhhhcccchhhhhcCeeee-ee Confidence 1 1122344777889999999999998767999999999998774 334444444456665 7899997 88 Q ss_pred hhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccc Q lcl|NC_011802. 207 GFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAK 286 (430) Q Consensus 207 Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk 286 (430) ||.| |.+.++|... ++ -..+| +++ T Consensus 233 Gf~~-i~s~~lp~~~-~t---~~~~~------------------------------------------~~~--------- 256 (322) T protein:vir:10 233 GYTW-IVSTRLDKFD-PT---QWGMA------------------------------------------AED--------- 256 (322) T ss_pred eEEE-EEeccCCccc-cc---ccccc------------------------------------------ccC--------- Confidence 9987 5567777511 00 00000 000 Q ss_pred ccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeec- Q lcl|NC_011802. 287 NVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQP- 365 (430) Q Consensus 287 ~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~p- 365 (430) ++. ...+.=++|||+|+.++..- T Consensus 257 -----------------------------------------------~~~---------~~~~~~~a~~k~Av~~a~~~d 280 (322) T protein:vir:10 257 -----------------------------------------------GPQ---------GDEIWCIAMTDMALGYHSCKD 280 (322) T ss_pred -----------------------------------------------CCC---------ccceeEEEEecCceeEEEeee Confidence 000 00011279999999998752 Q ss_pred cccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 366 IPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 366 l~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) +.. ...-+|+-....+ ++--..||++.++||-. |.|-=-.+ T Consensus 281 v~~----------~i~~~~~~~~a~~-------------I~~~~~~Ga~ri~~~gV-v~i~~~e~ 321 (322) T protein:vir:10 281 IWT----------KVAEDPSASFAWR-------------IYSAFTADCVRVEDEHI-FKLRLKNS 321 (322) T ss_pred eeE----------EeeccCCcchhhh-------------hhhhhhhCceEeccCcE-EEEEEecc Confidence 211 0011112111111 11234578888888875 55544444 No 42 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.12 E-value=1.1e-12 Score=86.13 Aligned_cols=225 Identities=16% Similarity=0.103 Sum_probs=124.1 Q ss_pred hhhhccCcEEEEecC--cccccccCCcccCCcCccccceeEEEeccccccceEecHHH-hh-hHHHHHHHHHHHHHHHHH Q lcl|NC_011802. 38 ASMQRSSNTIWMPVE--QESPTQEGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD-LR-DETAYRHRIQSAARKLAN 113 (430) Q Consensus 38 ~~~~k~GdTV~i~~P--~~~~~~~g~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~ke-L~-~~~~~~~~i~~Am~~LAn 113 (430) .-+.+.||||.+|+= .+-...+|..++ ++.+.-.+...++.+ ..-.|+++|.+ |+ ..+...+.-++....||+ T Consensus 1 ~~~~~~Gdtit~P~~iGda~~v~eG~~i~--~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPNDIGDAADVAEGGEIS--LDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEecccccchhhhcCCCcCC--hhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 446799999999952 111223454442 334555556666633 45569999888 43 566666666777788999 Q ss_pred HHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhc-ccchh Q lcl|NC_011802. 114 NVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDI-FGRIP 192 (430) Q Consensus 114 ~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~-~~~~~ 192 (430) +||.|++..+.... +-. .+...+..+.+|...|.+... ..+.++++|.....|+... .... ..+.. T Consensus 78 kvD~di~~~~~~a~-l~~--------~~~~t~d~i~~A~~~fgde~~---~~~vivv~p~~~~~Lrk~~-~~~~~~~~~g 144 (231) T protein:vir:73 78 KVDDDLLKAAKTTS-QTV--------STKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDA-NAKNIGSEVG 144 (231) T ss_pred hhhHHHHHhhcccc-ccc--------cccccHHHHHHHHHHhccccc---cceEEEEcchHHHhhhhcc-chhhhhhhhc Confidence 99999997666433 111 122446678888888988753 4588999999998886633 3323 23345 Q ss_pred hHHHHhccccccchhhhhhhhcCCcccccCcccccc-ccccccccccceeeeeeccccccccceeeeEEeeccceeeccc Q lcl|NC_011802. 193 EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI-TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGD 271 (430) Q Consensus 193 ~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~-tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGD 271 (430) ...+++|.||+ +.|++ ++.|.++|.-+. ....+ ...||-....-.. -.-+ ..| --+++-| T Consensus 145 ~~i~~~G~iG~-i~G~~-Vi~S~~~~~~~~-~~~~~i~~~gAl~~~~k~~-----~~vE--tdR---------d~~~k~~ 205 (231) T protein:vir:73 145 ANALINGTYAD-VLGAQ-IVRSKKLAEGSA-LMFKIVSNSPALKLVLKRG-----VQVE--TDR---------DIVTKTT 205 (231) T ss_pred cceeeecccce-EcceE-EEEcCCCCCCce-eeeeEEeeccceeeeeccc-----ceee--ccc---------ccccccc Confidence 56789999998 89995 888888885221 11111 1133322100000 0000 000 0112222 Q ss_pred EEEEcceeeecccccccccCcceEEEEeeccCceeEEeeccc Q lcl|NC_011802. 272 KISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPV 313 (430) Q Consensus 272 v~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~ 313 (430) .++.. +.|.|--.-...-+.|+=+.+ T Consensus 206 ~i~~~----------------~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 206 VITAD----------------EHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEEEe----------------EEEEEEEEcCccEEEEEeecC Confidence 22222 223332212223333333333 No 43 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.07 E-value=1.5e-12 Score=85.42 Aligned_cols=290 Identities=12% Similarity=0.014 Sum_probs=162.3 Q ss_pred Cccc--------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCccccc--c-cCCcc Q lcl|NC_011802. 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT--Q-EGWDL 63 (430) Q Consensus 1 ma~~--------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~--~-~g~~~ 63 (430) |..- +.=.|+.+.-||...|+..+++..++.. |. .+.|+++++|.=-..+. . .|..+ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-rt-----i~~GkS~qf~~iG~~~a~y~~~G~~l 74 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEEEEEeeeEEeeecccccc Confidence 3321 2334455557888899999999888843 22 57899998887633322 2 24333 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcccc--ee-------- Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMGS--LV-------- 130 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~-~~i~~Am~~LAn~Id~dl~~~~~~~~~--~v-------- 130 (430) .++.+.-.++.|++|.-.-..+.+-+-| +..-+..+ ++-+...++||...|+-++.+.+..+. +. T Consensus 75 --dg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~ 152 (402) T protein:vir:97 75 --NATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) T ss_pred --CCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcc Confidence 3445666788899998775555554322 33222223 233667789999999988876654331 10 Q ss_pred eecCCCCCC-Cc--------chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhccc--chhhHHHHhc Q lcl|NC_011802. 131 ITSPDAIGT-NT--------ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDG 199 (430) Q Consensus 131 ~~~~~t~~~-~~--------~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~--~~~~~a~r~g 199 (430) ....++.+. .+ ......+-.+...|++..||.+ +|.++++|+-+..|+.. ..|.+.. ......+++| T Consensus 153 ~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~-dRv~vv~P~~y~~Ll~~-~rl~n~d~~~~~~g~~~~G 230 (402) T protein:vir:97 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRDA-DRIVDKTYTISQSGATING 230 (402) T ss_pred cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCcc-ccEEEeChHHHHHHhhc-ccccchhhccccCCccccc Confidence 000001111 00 1123456678899999999995 79999999999988764 2222211 1234558999 Q ss_pred cccccchhhhhhhhcCCccccc-CccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcce Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLT-KSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGV 278 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t-~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV 278 (430) .++. +.||. +|+|.++|... ..++..++ -+| T Consensus 231 ~v~~-v~Gv~-Vv~SnnlP~~a~~it~~~ls---------------------------------------------~a~- 262 (402) T protein:vir:97 231 FVLS-SYNCP-VIPSNRFPTFAQDQAHHLLS---------------------------------------------NED- 262 (402) T ss_pred eeEE-EeceE-EEecCccccccccccccccc---------------------------------------------cCC- Confidence 9986 88995 89999999521 01100000 001 Q ss_pred eeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccce Q lcl|NC_011802. 279 KFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDA 358 (430) Q Consensus 279 ~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A 358 (430) ..+-|-|+++- +.+.=++|||+| T Consensus 263 ------------~G~~y~~t~d~---------------------------------------------t~~~~~~f~~~A 285 (402) T protein:vir:97 263 ------------NGYRYDPIAEM---------------------------------------------NGAVAVLFTSDA 285 (402) T ss_pred ------------CCccCCcCccc---------------------------------------------ceeEEEEEecce Confidence 01122222210 111237899975 Q ss_pred eeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEE--eeccceecCcceeEEecCCCCC Q lcl|NC_011802. 359 IRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 359 ~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riD--vLyG~~~v~PElagv~l~~q~~ 430 (430) . .+.-+. .++.++ +||.... .|=|| ..||...+|||-+||+..-.-+ T Consensus 286 v--~tvk~~-------------------~vT~~~--~~d~r~~--~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (402) T protein:vir:97 286 L--LVGRTI-------------------EVTGDI--FYEKKEK--TYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) T ss_pred E--EEEEee-------------------ccccch--hhchhHH--HHHHHHHHHhCCcccCccceEEEEEeccc Confidence 4 333321 011111 2232111 11133 3599999999999999554422 No 44 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=98.84 E-value=7e-11 Score=76.24 Aligned_cols=203 Identities=16% Similarity=0.101 Sum_probs=111.7 Q ss_pred eccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecC--------CC--CCCCcchh-h Q lcl|NC_011802. 79 MGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSP--------DA--IGTNTADA-W 145 (430) Q Consensus 79 ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~--------~t--~~~~~~~~-~ 145 (430) +|....-+|.+.+-| +..-+.-..+-+.+.++||..+|+.++.++.+.+....... .. ...+.... + T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 666666566666433 44555556677888899999999999988887654322110 00 01111223 4 Q ss_pred hHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhh-hhhhcccchhhH-HHHhc-cccccchhhhhhhhcCCcccccC Q lcl|NC_011802. 146 NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPEE-AYRDG-TIQRQVAGFDDVLRSPKLPVLTK 222 (430) Q Consensus 146 ~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~-~~l~~~~~~~~~-a~r~g-~igr~~~Gfd~~~~~~~v~~~t~ 222 (430) +.+-++++.|++..||. .+|.++++|..+..|+... ..+.+.....+. -++.| .+++ +.||+ +|+|.++|..++ T Consensus 81 dai~~a~~~LdekdVP~-~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~-v~G~~-V~~SnnlP~~~g 157 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPM-DGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYV-NAGIR-IYKSNVLASLYG 157 (221) T ss_pred HHHHHHHHHHhhcCCCC-CCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeee-ecCcE-EEEeccCCcccc Confidence 67888999999999999 5899999998888877432 233332222222 27888 4775 88995 899999997321 Q ss_pred ccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeecc Q lcl|NC_011802. 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 223 gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~ 302 (430) .. + +.. +|+ |.+.+ +. T Consensus 158 t~---~-~~~------------------------------------ag~-~~~~~----~~------------------- 173 (221) T protein:vir:17 158 TN---L-VTD------------------------------------PGD-ATTSG----EN------------------- 173 (221) T ss_pred cc---c-ccC------------------------------------Ccc-ccccc----cc------------------- Confidence 11 1 000 111 11111 00 Q ss_pred CceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEE--eeccccCCCcchheeeEE Q lcl|NC_011802. 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIV--SQPIPANHELFAGMKTTS 380 (430) Q Consensus 303 a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLa--t~pl~~p~g~~~~~~~~~ 380 (430) ...| + +..-.++| |+|||+|..-+ |.|...|+-. +. + T Consensus 174 ---------------------~~~y-r-------~~fs~~~g-------lv~~~~Avgtvkl~~~~~~~~~~----~~-~ 212 (221) T protein:vir:17 174 ---------------------NGSY-R-------PAITDRAG-------LVFHKEAADTVEVLLPPSRPPLV----IS-M 212 (221) T ss_pred ---------------------cccc-c-------ccccceEE-------EEEcchheeeeeeecCCCCCcee----ee-e Confidence 0001 0 00111223 99999987543 3354444321 21 1 Q ss_pred EecCcceEEEEEEEeeecc Q lcl|NC_011802. 381 FSIPDVGLNGIFRTQGDIS 399 (430) Q Consensus 381 ~~~p~~Glslrv~~~yd~~ 399 (430) +|+|- -|-. T Consensus 213 -------~~~~~---~~~~ 221 (221) T protein:vir:17 213 -------FSIRR---PDRR 221 (221) T ss_pred -------eeccC---CCCC Confidence 23321 0100 No 45 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.82 E-value=5.5e-11 Score=76.82 Aligned_cols=291 Identities=14% Similarity=0.043 Sum_probs=160.5 Q ss_pred Cccc--------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc---cCCcc Q lcl|NC_011802. 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) Q Consensus 1 ma~~--------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~---~g~~~ 63 (430) |..- +.-+|+.+.-||...|+..+++..++. .|. .+.|+++.+|.=-..+.. .|..+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~-vRt-----i~~gkS~qf~~~G~s~~~~~~pG~~l 74 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD-VQT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccce-eee-----ecccceEEEEEeeeeEeeeecCCCCc Confidence 4321 223444444678888999998888874 232 688999988866332222 34444 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcccc--e------e-- Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMGS--L------V-- 130 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~-~~i~~Am~~LAn~Id~dl~~~~~~~~~--~------v-- 130 (430) - .+.+.-.++.|++|.-.-..+.+-+-| +..-+..| .+-+...++||...|+-++.+.+..+. . . T Consensus 75 d--~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~ 152 (401) T protein:vir:70 75 A--ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRV 152 (401) T ss_pred C--CCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 3 344556688899998887766665433 33222122 344566789999999988777754321 0 0 Q ss_pred ------eecCCCCCC---CcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccc--hhhHHHHhc Q lcl|NC_011802. 131 ------ITSPDAIGT---NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR--IPEEAYRDG 199 (430) Q Consensus 131 ------~~~~~t~~~---~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~--~~~~a~r~g 199 (430) .+..++... .+......+-.+...|++..||. .|.++|.|..+..++-+-..|.+..- .....|.+| T Consensus 153 ~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~--~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 153 KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDI--SDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQG 230 (401) T ss_pred CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCc--cceEEEcCHHHHHHHHhcCcccchhhccccCCccccc Confidence 001111000 00113344668889999999995 37888877777765533334444331 233558889 Q ss_pred cccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEccee Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~ 279 (430) .+.. +.||. ++++.++|....+ +. |..++-+| T Consensus 231 ~v~~-vaGv~-Vv~SnnlP~~a~~------it--------------------------------------~~~ls~a~-- 262 (401) T protein:vir:70 231 FTLS-SYNCP-VIPSNRFPKYSQG------QT--------------------------------------HHLLSNED-- 262 (401) T ss_pred eEEE-EeceE-EEeeccccccccc------cc--------------------------------------cccccccC-- Confidence 8886 88994 8888888851110 00 00011111 Q ss_pred eecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeecccee Q lcl|NC_011802. 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~ 359 (430) ...-|-++++- +.+.=++|||+|. T Consensus 263 -----------~G~~y~~~~d~---------------------------------------------s~~~~v~f~~~Av 286 (401) T protein:vir:70 263 -----------NGYRYDPLPAM---------------------------------------------NGAIAVLFTADAL 286 (401) T ss_pred -----------CCccCCCCccc---------------------------------------------cceeEEEEehhhe Confidence 01122222210 1112378999954 Q ss_pred eEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEE--eeccceecCcceeEEecCCCCC Q lcl|NC_011802. 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 360 aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riD--vLyG~~~v~PElagv~l~~q~~ 430 (430) .- .-+. .|+.+ .+||.. ...|=|| ..||...+|||.++|+...-+. T Consensus 287 ~t--vk~~-------------------~lt~~--~~~d~r--~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 287 LV--GRSI-------------------DVTGD--IFYEKK--EKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred EE--EEee-------------------ccccc--hhhhhh--hhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 33 2221 01111 233322 2222244 3589999999999998777764 No 46 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.82 E-value=2.4e-10 Score=73.36 Aligned_cols=292 Identities=11% Similarity=0.060 Sum_probs=154.0 Q ss_pred Ccc----------c----ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc---cCCcc Q lcl|NC_011802. 1 MAL----------N----EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) Q Consensus 1 ma~----------~----~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~---~g~~~ 63 (430) |-+ + +.=+|+.+--||+..|+..++++.++.+ | -.|.|+++.+|.=-+.+.. .|..+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-r-----ti~~g~s~~~~~iG~~~~~~~~pG~~l 74 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-R-----DLRGSNVVRLDRLGNVEAKGRRAGEEL 74 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccce-e-----eeccceeEEEeeeeeeeecccccCccc Confidence 221 1 2334555557899999999999999853 3 2488999999976554433 46556 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeec-------C Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS-------P 134 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~-------~ 134 (430) .+++ +...+..|++|..+-..+.+.+-| +...+.-..+-+...++||...|+-++....+.+.+.--+ + T Consensus 75 ~~~~--~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 75 ERSR--VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred CCCC--cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 5554 556778999999887776666433 3333333446677889999999998876555544321100 0 Q ss_pred C-------CCCC--Ccch-hhhHHHHHHHHHHhhcCCCC--CCcEEEeChHHHHHHHHhhhhhhcc---cchhhHHHHhc Q lcl|NC_011802. 135 D-------AIGT--NTAD-AWNFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYDLTKRDIF---GRIPEEAYRDG 199 (430) Q Consensus 135 ~-------t~~~--~~~~-~~~~~a~a~~~L~~~~aP~~--~~R~~vl~p~~~~~~~~~~~~l~~~---~~~~~~a~r~g 199 (430) | +... ..+. ..+.+..+...|++.-+|.. .+|.++++|+-+..|+.. ..+.+. .......+++| T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~-~~l~n~~~~~s~~~~~~~~g 231 (335) T protein:vir:78 153 GVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEH-DKLMSVEYQATGATNDYVKS 231 (335) T ss_pred CcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcc-cccccccccccccccccccc Confidence 0 0000 0111 23457778899999999963 369999999999998864 233332 22233458999 Q ss_pred cccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEccee Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~ 279 (430) .+++ +.||. ++++.++|... +++. ++..+ +|.-..+......+.---..|-.+-..-+ T Consensus 232 ~v~~-v~Gv~-V~~Sn~lP~~~-~t~~--~lg~a-------------~n~~~~d~~~~~~~~~~~~Al~t~~~~~~---- 289 (335) T protein:vir:78 232 RVAI-LNGVK-VLETPRFATKA-ISAH--PLGRH-------------FNVSAEEAERQIALFLPSKTLITAQVAPV---- 289 (335) T ss_pred eeEE-eeceE-EEeeccCCCCC-Cccc--ccccc-------------CCcccccccceEEEEEecceEEEEEEEec---- Confidence 9986 89996 99999998632 2221 11111 11111111111011100011111111111 Q ss_pred eecccccccccCc--ceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccc Q lcl|NC_011802. 280 FLGQMAKNVLAQD--ATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADD 357 (430) Q Consensus 280 ~v~~~tk~~~~~l--~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~ 357 (430) +-+..... +.+++.+..+=|...+-|. ....|+.-|.. +|.+- T Consensus 290 -----~~e~~~~~~~~~~~i~~~~a~G~g~lRPe------------------------~a~~i~~tg~~------~~~~~ 334 (335) T protein:vir:78 290 -----QAKLWEDHDQFSWVLDTFQMYNIGARRPD------------------------TAGAIELKGIE------AFDIT 334 (335) T ss_pred -----ccceeeccchhhHhhhHHHHcCCcccCcc------------------------eEEEEEecCCC------ccccc Confidence 11111111 1122222222233333331 22223332221 22333 Q ss_pred e Q lcl|NC_011802. 358 A 358 (430) Q Consensus 358 A 358 (430) | T Consensus 335 ~ 335 (335) T protein:vir:78 335 A 335 (335) T ss_pred C Confidence 3 No 47 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.73 E-value=9.2e-10 Score=70.13 Aligned_cols=292 Identities=10% Similarity=0.045 Sum_probs=151.5 Q ss_pred Ccc--------------cccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc---cCCcc Q lcl|NC_011802. 1 MAL--------------NEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) Q Consensus 1 ma~--------------~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~---~g~~~ 63 (430) |-+ -++=+|+.+--||+..|+..+++..++.. |. .+.|+++.+|.=-+.+.. .|..+ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-rt-----i~~g~s~~~~~iG~~~~~~~~pG~~l 74 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-RD-----LRGSNVVRLDRLGNVEAKGRRAGEEL 74 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccce-ee-----eccceeEEEeeeeeeeeecccCCcCc Confidence 211 13334455557888999999999988842 32 488999999976554443 46555 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCC---- Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI---- 137 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~---- 137 (430) .+++ +.-.+..+++|...-..+.+.+-| +..-+.-..+-+...++||...|+.++....+.+.+.-.....+ T Consensus 75 ~~~~--~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 75 ERSR--VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred CCCC--ccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 5554 445678899999876666665433 33323333456778889999999988766665544321110000 Q ss_pred --------CCC----cchh-hhHHHHHHHHHHhhcCCCC--CCcEEEeChHHHHHHHHhhhhhhcc--c-chhhHHHHhc Q lcl|NC_011802. 138 --------GTN----TADA-WNFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYDLTKRDIF--G-RIPEEAYRDG 199 (430) Q Consensus 138 --------~~~----~~~~-~~~~a~a~~~L~~~~aP~~--~~R~~vl~p~~~~~~~~~~~~l~~~--~-~~~~~a~r~g 199 (430) .+. ..+. ...+-.+...|++..||.. .+|.++++|+-+..|+.. ..+.+. + ......+.+| T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~-~~l~n~~~~~s~~~~~~~~g 231 (335) T protein:vir:63 153 GVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEH-DKLMNVEYQATGATNDYVKS 231 (335) T ss_pred CcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcc-ccccccccccccccccccCc Confidence 010 1111 1345678899999999962 369999999999998874 333332 2 2233458999 Q ss_pred cccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEccee Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~ 279 (430) .+++ +.||. ++++.++|... +++. ++..+ .|.-..|+....++..--..|-.+-..-++. T Consensus 232 ~v~~-v~Gv~-V~~sn~lP~~~-~t~~--~lg~a-------------~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~-- 291 (335) T protein:vir:63 232 RVAI-LNGVK-VLETPRFATKA-IAAH--PLGRH-------------FNVSAEESERQIALFLPSKTLITAQVAPVQA-- 291 (335) T ss_pred eeEE-eeceE-EEeeccCCCCC-cccc--ccccc-------------CCccccccceeEEEEEecceEEEEEEeeccc-- Confidence 9986 89995 99999998632 2211 11111 1111112211111111111111111111111 Q ss_pred eecccccccccC--cceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccc Q lcl|NC_011802. 280 FLGQMAKNVLAQ--DATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADD 357 (430) Q Consensus 280 ~v~~~tk~~~~~--l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~ 357 (430) +.... -+.+++.+..+=|...+-| .....|+.-|.. +|.+- T Consensus 292 -------e~~~~~~~~~~~i~~~~a~G~g~lRP------------------------e~a~~i~~tg~~------~~~~~ 334 (335) T protein:vir:63 292 -------KLWEDNEKFSWVLDTFQMYNIGARRP------------------------DTAGAIELKGIG------AFDIT 334 (335) T ss_pred -------ceeeccchhhHHhHHHHHcCCccccc------------------------ceEEEEEEcCCC------ceeec Confidence 11111 1112222212222222222 222233332221 22222 Q ss_pred e Q lcl|NC_011802. 358 A 358 (430) Q Consensus 358 A 358 (430) | T Consensus 335 ~ 335 (335) T protein:vir:63 335 A 335 (335) T ss_pred C Confidence 2 No 48 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.71 E-value=3.1e-10 Score=72.70 Aligned_cols=291 Identities=12% Similarity=0.044 Sum_probs=157.5 Q ss_pred Cccc--------------ccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccc---cccCCcc Q lcl|NC_011802. 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP---TQEGWDL 63 (430) Q Consensus 1 ma~~--------------~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~---~~~g~~~ 63 (430) |..- +.=+|+.+.-||...|+..+++..++. .|. .+.|+++.+|.=-..+ ...|..+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~-vRt-----I~~gkS~qf~~lG~s~a~y~~pG~~l 74 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD-VQT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccce-eee-----ecccceEEEEEeeeeEEeeecCCCCc Confidence 3321 223444444678888999988888874 232 6889999988662222 2245444 Q ss_pred cCCcCccccceeEEEeccccccceEecHHH--hhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccc--c------eeee Q lcl|NC_011802. 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMG--S------LVIT 132 (430) Q Consensus 64 s~~~~di~e~sv~v~ld~~k~V~f~~t~ke--L~~~~~~~-~~i~~Am~~LAn~Id~dl~~~~~~~~--~------~v~~ 132 (430) - .+.+.-.+..|++|.-.-..+.+-+-| +..-|.-| .+-+.-..+||...|+-++.+.+... + +.++ T Consensus 75 d--g~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 75 A--ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred C--CCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 3 334666688899988776666555322 32222112 23355567999999998876664432 1 1111 Q ss_pred c--CCCCC--CCcch-------hhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccc--hhhHHHHhc Q lcl|NC_011802. 133 S--PDAIG--TNTAD-------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR--IPEEAYRDG 199 (430) Q Consensus 133 ~--~~t~~--~~~~~-------~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~--~~~~a~r~g 199 (430) . +.++. +.... ....+-.+...|++..||. + |.+++.|..+..++-.-..|.+-.- .....+.+| T Consensus 153 ~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~-~-d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 153 KGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDI-S-DVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQG 230 (400) T ss_pred cccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCc-c-ceEEEcCHHHHHHHHhCCcccchhccccCCCccccc Confidence 0 00110 01111 1223556888899999996 3 6788877777765533333333221 123447888 Q ss_pred cccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEccee Q lcl|NC_011802. 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~ 279 (430) .+.. +.|+ .++++.++|.... + + .+..++-+| T Consensus 231 ~v~~-v~Gv-~Iv~Sn~lP~~a~-~-----~--------------------------------------~~~~lS~a~-- 262 (400) T protein:vir:10 231 FVLS-SYNC-PVIPSNRFPKYSQ-G-----Q--------------------------------------KHHLLSNED-- 262 (400) T ss_pred eEEE-Eece-EEEeeCcCCcccC-c-----c--------------------------------------cccccccCC-- Confidence 8874 7888 4888888884100 0 0 011111111 Q ss_pred eecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeecccee Q lcl|NC_011802. 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~ 359 (430) ....|-++++- +.+.=++|||+|. T Consensus 263 -----------~G~~y~~t~d~---------------------------------------------s~~~av~F~~sAv 286 (400) T protein:vir:10 263 -----------NGYRYDPIAEM---------------------------------------------NGAIAVLFTADAL 286 (400) T ss_pred -----------CCccCCccccc---------------------------------------------cceeEEEEehhhe Confidence 01122222211 1112378999954 Q ss_pred eEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEE--eeccceecCcceeEEecCCCCC Q lcl|NC_011802. 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 360 aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riD--vLyG~~~v~PElagv~l~~q~~ 430 (430) .- .-+. .|+. -.+||... ..|=|| ..||...+|||.++|+..+-++ T Consensus 287 ~t--vk~~-------------------~lt~--~~~~d~r~--~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 287 LV--GRSI-------------------DVIG--DIFYEKKE--KTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred EE--EEee-------------------cccc--ccccchhh--HHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 33 2221 1111 22344222 222244 3589999999999999999888 No 49 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.64 E-value=1.1e-08 Score=64.23 Aligned_cols=281 Identities=10% Similarity=-0.002 Sum_probs=137.8 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhh-hccCcEEEEecCcccccccC-Cc-ccCCcCccccceeEE Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWMPVEQESPTQEG-WD-LTDKATGLLELNVAV 77 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~-~k~GdTV~i~~P~~~~~~~g-~~-~s~~~~di~e~sv~v 77 (430) ||+ .| +.|+.-.++.+.|+..++.+.+. .++++.+- +.-|++|.||.=......+- +. ..-...++.-...++ T Consensus 1 MA~-~n-~a~~~~~~Ld~~~~~~l~~~~L~--~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~ 76 (299) T protein:vir:79 1 MAA-LN-YAKEYSNVLAQAYPYTLNFGDLY--ATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPK 76 (299) T ss_pred Ccc-ch-hHHHHHHHHHHHHHhhceeeeec--cCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEE Confidence 993 22 45777789999999999999888 56665442 23479999996543322221 11 111223455667889 Q ss_pred EeccccccceEecHHHh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHH Q lcl|NC_011802. 78 NMGEPDNDFFQLRADDL-------RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVAD 150 (430) Q Consensus 78 ~ld~~k~V~f~~t~keL-------~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~ 150 (430) +|++.|...|.+.+-|. +......++.+ .+++-+||.......+..+...++...+...+....+..+-. T Consensus 77 ~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~---~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~ 153 (299) T protein:vir:79 77 VLTNQRKWSTLVHPADINQTNYVASIGNITKVYNE---EQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDK 153 (299) T ss_pred EeeccccceeccchhhHHHHhhhhHHHHHHHHHHH---HHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHH Confidence 99999998888883332 12222222222 356777887766554444444433222222223356788999 Q ss_pred HHHHHHhhcCCCCCCcEEEeChHHHHHHHHh--hhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccc Q lcl|NC_011802. 151 AEELMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 151 a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~--~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~ 228 (430) +.+.|++.++|. .+|-++++|+.+..|..+ +......+ ...-.++|.+|+ +.||. +++...- +... .+ T Consensus 154 ~~~~lde~~vP~-~~rvl~vtp~~~~~L~~~~~f~k~~~~~--~~~~~~~g~Vg~-idG~~-Ii~Vps~-r~~t----~~ 223 (299) T protein:vir:79 154 LMEKMTEARVPE-NGRILYVTPVVNTLIKNAKEIQRTVNIK--DAGTSLNRQTTD-IDTVK-IIKVPSN-LMKT----AY 223 (299) T ss_pred HHHHHHhcCCCC-CCeEEEeCHHHHHHHhhchhhhcccccc--cccceeeeeeee-ecceE-EEEechh-hcCc----cc Confidence 999999999998 589999999999876532 22222221 122357888876 78884 6652111 1110 00 Q ss_pred c-cccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeec---c-- Q lcl|NC_011802. 229 T-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV---D-- 302 (430) Q Consensus 229 t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~---~-- 302 (430) . .+|... +.++-.+ + .+.+-.++.-.+.| ..-++-..|-+-+.--.+.+++---|+ . T Consensus 224 ~~~~G~~~-~~~ak~i-------n----~ii~~~~a~~~~~K-----~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk 286 (299) T protein:vir:79 224 DFTTGWKV-GAGAKQI-------F----MSLVHPSAIITPVS-----YQFSKLDEPTAVTEGKYFYFEESFEDVFILNKK 286 (299) T ss_pred eeccCccc-cCccccc-------c----eEEEcCCeeeeeEe-----eeeEEeecCCCCCccceeeeeeeeeeeeeeccc Confidence 0 011000 0000000 0 01000111111111 111111223222221111222221111 0 Q ss_pred CceeEEeeccccccccccccccccccccccccccC Q lcl|NC_011802. 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 303 a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~ 337 (430) ...| |.|+.+ |.+ T Consensus 287 ~~~i--------------------~~~~~~--a~~ 299 (299) T protein:vir:79 287 ADAI--------------------QFVVEG--AGA 299 (299) T ss_pred cCeE--------------------EEEeee--cCC Confidence 1111 222221 111 No 50 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.60 E-value=3.1e-08 Score=61.76 Aligned_cols=259 Identities=12% Similarity=0.057 Sum_probs=134.3 Q ss_pred Cccc---ccchhhhhHHHHHHHHhhh-----cccchhcccCCCchhhhhccCcEEEEecCcccccccC-CcccCCcCccc Q lcl|NC_011802. 1 MALN---EGQIVTLAVDEIIETISAI-----TPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLL 71 (430) Q Consensus 1 ma~~---~~~~~t~~~~evi~~len~-----lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g-~~~s~~~~di~ 71 (430) .|+. -|.+.-. +.....|+.. +.-..++ .++|+ ..-|++|.||+-......+- +......+++. T Consensus 30 ~~~~~~~~nt~~l~--~k~~~~LD~~~~~~~~s~~~~~--N~~~e---~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt 102 (329) T protein:vir:10 30 FANKSVEPGDTLLK--NKHVGILEKVTAANSYSAPAVI--SNDAI---FMQGRSFTVIKGDVTELKDYKRNATNEFDHPQ 102 (329) T ss_pred hcCCccCCchhHHH--HHHHHHHHHHHHhhceeeeeec--cccee---eccCcEEEEeeecccccccccCCCCccccccc Confidence 2322 2322211 3333333332 2222334 35565 35699999997744222221 11122345677 Q ss_pred cceeEEEeccccccceEecHHHhhh--HHH-HHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhH Q lcl|NC_011802. 72 ELNVAVNMGEPDNDFFQLRADDLRD--ETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 72 e~sv~v~ld~~k~V~f~~t~keL~~--~~~-~~~~i~-~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~ 147 (430) ....+++|++.+...|.+.+-|... ... ....+. .+-..++-+||......++..+....+ .+.+....|.. T Consensus 103 ~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~----~~~t~~nay~~ 178 (329) T protein:vir:10 103 IQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLT----VGSGADAQYDA 178 (329) T ss_pred cceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccc----cccCHHHHHHH Confidence 7788999999999888887433221 101 111221 233467888998877766655432222 22233457888 Q ss_pred HHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccc-hhhHHHHhccccccchhhhhhhhcCCcccccCcccc Q lcl|NC_011802. 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT 226 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t 226 (430) +..+...|++.++|. +|-++++|+.+..|..+. .+.... ...+-.++|.+| T Consensus 179 i~~a~~~Lde~~vp~--~Rvl~VtP~~~~~Lk~~~--~f~~~~~~~~~~~~~g~Vg------------------------ 230 (329) T protein:vir:10 179 VLDVSVELDEIGAGA--SRILFVTPKFYKGIKKFV--IELPQGDNRQQVLGKGVQG------------------------ 230 (329) T ss_pred HHHHHHHHHhcCCCC--CcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeee------------------------ Confidence 999999999999994 699999999997664211 111110 011111222222 Q ss_pred ccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 227 GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 227 ~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) +|.|+ .|.. T Consensus 231 -----------------------------------------------~idG~-----------------~Ii~------- 239 (329) T protein:vir:10 231 -----------------------------------------------ELDGF-----------------TIVK------- 239 (329) T ss_pred -----------------------------------------------eecCe-----------------EEEE------- Confidence 23331 1111 Q ss_pred EEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeec--cc--cCCCcchheeeEEEe Q lcl|NC_011802. 307 EITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQP--IP--ANHELFAGMKTTSFS 382 (430) Q Consensus 307 ~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~p--l~--~p~g~~~~~~~~~~~ 382 (430) .|... ...+.| ++.|++|++.+..= ++ .|..... T Consensus 240 --vps~~-----------------------~k~in~---------ii~~~~A~~~~~K~~~~~~~~p~~~~~-------- 277 (329) T protein:vir:10 240 --VPSKM-----------------------LQGVEA---------MAVIGEVMASPIQANEAKLNSNVPGMF-------- 277 (329) T ss_pred --ecCCc-----------------------ccceeE---------EEEcCCceeeeeeeeeeeeeCCCCccc-------- Confidence 01100 001222 67888888876652 11 1110000 Q ss_pred cCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 383 IPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 383 ~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) |- .++-=..||+.+++|+..||.....+| T Consensus 278 ----a~---------------~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 278 ----GT---------------LAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred ----hh---------------eeeeeeeeeeEEEccccCEEEEecccC Confidence 11 111123499999999999988877766 No 51 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.56 E-value=9e-09 Score=64.70 Aligned_cols=256 Identities=18% Similarity=0.208 Sum_probs=140.2 Q ss_pred Ccc-cccchhh--hhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc----ccccccCCcccCCcCccccc Q lcl|NC_011802. 1 MAL-NEGQIVT--LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ----ESPTQEGWDLTDKATGLLEL 73 (430) Q Consensus 1 ma~-~~~~~~t--~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~----~~~~~~g~~~s~~~~di~e~ 73 (430) ||. ...+++. ++-+-+.+++++.+++++++. .+++=+ .+.|+||+||.=. .-...+|.++. ++.+.-+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~--~d~~L~-g~~G~ti~~P~~~~igdae~~~eg~~i~--~~~lt~~ 75 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAV--TDDTLV-GQPGDTITRPKYAYIGAAEDLQEGVAMD--TTQMSMT 75 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccc--cccccC-CCCCCEEEeeeecCCCccccccCCCccc--hhhcccc Confidence 994 5544432 333566777999999988883 343332 3689999999621 11133444442 3344444 Q ss_pred eeEEEeccccccceEecHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHH Q lcl|NC_011802. 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) Q Consensus 74 sv~v~ld~~k~V~f~~t~keL--~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a 151 (430) +...++. +..-.|..++... ...+......+.....||+++|.+|+..+......+ +......++.+| T Consensus 76 ~~~a~i~-~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~---------~~~~t~~~~~dA 145 (270) T protein:vir:95 76 TTKVTVK-ETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA---------TVSADATGILDA 145 (270) T ss_pred hheeeee-hhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc---------ccccCHHHHHHH Confidence 4444442 2344577776662 346777777778888899999999997776543211 112345667788 Q ss_pred HHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccccc- Q lcl|NC_011802. 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV- 230 (430) Q Consensus 152 ~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv- 230 (430) ..+|.+..-. ...++++|..++.|..+. ++...+-....+++|.||. +.|++-++++..++. ++++.+ T Consensus 146 ~~~lgd~~~~---~~~i~vhs~~~~~Lrk~~--~~~~~~~~~~~~~~G~ig~-~~G~~Viv~s~~~~~-----~~~~l~~ 214 (270) T protein:vir:95 146 IEVFNSENDE---DYVLYVNPKDYNKLVKSL--FKVGGNVQDRAISKGDLVE-IVGVSDIVKSKRVSE-----NTAFLQR 214 (270) T ss_pred HHHhccccCC---CcEEEEcHHHHHHHHhhh--cccccccccchhcccccce-ecceeEEEeCCCCCc-----eeEEEEe Confidence 8888776433 467999999999987654 2233333555689999998 789854566554332 222322 Q ss_pred ccccccccceeeeeeccccccccceeeeEEeeccc--eeecccEEEEcceeeecccccccccCcceEEEEeeccCc--ee Q lcl|NC_011802. 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT--HV 306 (430) Q Consensus 231 ~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg--tlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~--tv 306 (430) .||-.. +++.. +. ..+. -++.=|.+... +.|+|--..... .+ T Consensus 215 ~gAi~~----------~~~~~-------~~-vEtdRd~~~~~d~i~~~----------------~~y~v~~~~~skvv~~ 260 (270) T protein:vir:95 215 YGAMEI----------VNKKK-------PE-AYTDFDILKRTHLLSTN----------------YHYSVNLKDETGVVKV 260 (270) T ss_pred ccceee----------eecCC-------ce-eeeccchhhcccEEEee----------------eEEEEEEEccceEEEE Confidence 122110 00000 00 0111 12222333333 344443222222 34 Q ss_pred EEeecccccccccc Q lcl|NC_011802. 307 EITPKPVALDDVSL 320 (430) Q Consensus 307 ~I~Pai~~~~~~~~ 320 (430) ++.|+.-+- . T Consensus 261 t~~~a~~~~----~ 270 (270) T protein:vir:95 261 TFKPSGSLE----M 270 (270) T ss_pred EecCCCCcC----C Confidence 455554321 1 No 52 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.28 E-value=6e-07 Score=54.69 Aligned_cols=287 Identities=13% Similarity=0.059 Sum_probs=130.1 Q ss_pred Cccc---ccchhhhh-HHHHHHHHhhhcccch-hcccCCCchhhhhccCcEEEEecCcccccccC-CcccCCcCccccce Q lcl|NC_011802. 1 MALN---EGQIVTLA-VDEIIETISAITPMAQ-KAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLLELN 74 (430) Q Consensus 1 ma~~---~~~~~t~~-~~evi~~len~lvmA~-~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g-~~~s~~~~di~e~s 74 (430) .|.+ -|++...- .-..|+.+....+.+. +. .-++|+- .-|++|.||+=......+- +......+++.-.. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~-~N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~ 94 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL-ISNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEE 94 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc-cCcceEe---ccCcEEEEeeecccccccccCCCCcccCCcccce Confidence 4555 23332210 0233444444444432 22 1244433 4699999986533222211 11122344677778 Q ss_pred eEEEeccccccceEecHHHhhh--HHH-HHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLRD--ETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVAD 150 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~~--~~~-~~~~i~-~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~ 150 (430) .+++|++.+...|.+.+-|... ... ....+. .+-..++-+||.......+..+.-..+ ...+....|..+.. T Consensus 95 ~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~----~~~t~~n~y~~i~~ 170 (319) T protein:vir:97 95 TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT----VGTGSDAQYDAVLD 170 (319) T ss_pred eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc----cccCHHHHHHHHHH Confidence 8899999999888887544221 111 111222 223356778888776665554332221 22233356888999 Q ss_pred HHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccccc Q lcl|NC_011802. 151 AEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV 230 (430) Q Consensus 151 a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv 230 (430) +.+.|++.+||. +|-++++|+.+..|..+- .+........+.+++|.+|+ +.||. +++..+.. +..-.+ + T Consensus 171 a~~~Lde~~VP~--~Rvl~Vtp~~~~~L~~~~-~f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~-i 240 (319) T protein:vir:97 171 VSVELDEIKAPE--NRVLFVSPTFYKGIKKFV-IALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQA-I 240 (319) T ss_pred HHHHHHhcCCCC--CcEEEeCHHHHHHHHhhh-hhhccccccccceeeeecee-ecCeE-EEEecccc----cccceE-E Confidence 999999999994 699999999998875432 22233333445578999986 88985 66543211 011111 2 Q ss_pred ccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEc----ceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT----GVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 231 ~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Tia----GV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) -|....-....+++....-.+..++.+.+ --.++-.|.|.+. |+|......+..... T Consensus 241 ~~h~~A~~~~~k~~~~~~~~p~~~~~a~~----v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~--------------- 301 (319) T protein:vir:97 241 AVVGEVLASPIQADLAKTNSNIPGMFGTL----AEQLLYTGAFVPEHLQKYIFTIGGTEVATKRD--------------- 301 (319) T ss_pred EEcCCeeeeeeeeeeeeccCCCcccccee----eeeeeeeeeEEeccccceEEEeecCCcccCCC--------------- Confidence 22221111111221111001101111000 0133444444442 222111111111000 Q ss_pred EEeeccccccccccccccccccccccccccCcee Q lcl|NC_011802. 307 EITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 307 ~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aav 340 (430) ++.+++-=.|-|...-.. T Consensus 302 ----------------~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 302 ----------------GVDAHADNVAKPSGSLEM 319 (319) T ss_pred ----------------ccccccccccCCcccccC Confidence 011111111111111111 No 53 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.28 E-value=6e-07 Score=54.69 Aligned_cols=287 Identities=13% Similarity=0.059 Sum_probs=130.1 Q ss_pred Cccc---ccchhhhh-HHHHHHHHhhhcccch-hcccCCCchhhhhccCcEEEEecCcccccccC-CcccCCcCccccce Q lcl|NC_011802. 1 MALN---EGQIVTLA-VDEIIETISAITPMAQ-KAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLLELN 74 (430) Q Consensus 1 ma~~---~~~~~t~~-~~evi~~len~lvmA~-~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g-~~~s~~~~di~e~s 74 (430) .|.+ -|++...- .-..|+.+....+.+. +. .-++|+- .-|++|.||+=......+- +......+++.-.. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~-~N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~ 94 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL-ISNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEE 94 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc-cCcceEe---ccCcEEEEeeecccccccccCCCCcccCCcccce Confidence 4555 23332210 0233444444444432 22 1244433 4699999986533222211 11122344677778 Q ss_pred eEEEeccccccceEecHHHhhh--HHH-HHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLRD--ETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVAD 150 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~~--~~~-~~~~i~-~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~ 150 (430) .+++|++.+...|.+.+-|... ... ....+. .+-..++-+||.......+..+.-..+ ...+....|..+.. T Consensus 95 ~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~----~~~t~~n~y~~i~~ 170 (319) T protein:vir:94 95 TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT----VGTGSDAQYDAVLD 170 (319) T ss_pred eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc----cccCHHHHHHHHHH Confidence 8899999999888887544221 111 111222 223356778888776665554332221 22233356888999 Q ss_pred HHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccccccc Q lcl|NC_011802. 151 AEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV 230 (430) Q Consensus 151 a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv 230 (430) +.+.|++.+||. +|-++++|+.+..|..+- .+........+.+++|.+|+ +.||. +++..+.. +..-.+ + T Consensus 171 a~~~Lde~~VP~--~Rvl~Vtp~~~~~L~~~~-~f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~-i 240 (319) T protein:vir:94 171 VSVELDEIKAPE--NRVLFVSPTFYKGIKKFV-IALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQA-I 240 (319) T ss_pred HHHHHHhcCCCC--CcEEEeCHHHHHHHHhhh-hhhccccccccceeeeecee-ecCeE-EEEecccc----cccceE-E Confidence 999999999994 699999999998875432 22233333445578999986 88985 66543211 011111 2 Q ss_pred ccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEc----ceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT----GVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 231 ~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Tia----GV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) -|....-....+++....-.+..++.+.+ --.++-.|.|.+. |+|......+..... T Consensus 241 ~~h~~A~~~~~k~~~~~~~~p~~~~~a~~----v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~--------------- 301 (319) T protein:vir:94 241 AVVGEVLASPIQADLAKTNSNIPGMFGTL----AEQLLYTGAFVPEHLQKYIFTIGGTEVATKRD--------------- 301 (319) T ss_pred EEcCCeeeeeeeeeeeeccCCCcccccee----eeeeeeeeeEEeccccceEEEeecCCcccCCC--------------- Confidence 22221111111221111001101111000 0133444444442 222111111111000 Q ss_pred EEeeccccccccccccccccccccccccccCcee Q lcl|NC_011802. 307 EITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 307 ~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aav 340 (430) ++.+++-=.|-|...-.. T Consensus 302 ----------------~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 302 ----------------GVDAHADNVAKPSGSLEM 319 (319) T ss_pred ----------------ccccccccccCCcccccC Confidence 011111111111111111 No 54 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=269 Identities=11% Similarity=0.038 Sum_probs=122.8 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc-----ccccccCCcccCCcCcccccee Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ-----ESPTQEGWDLTDKATGLLELNV 75 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~-----~~~~~~g~~~s~~~~di~e~sv 75 (430) ||+|-- ++.-.++.+.|...++.+.+.. ++++- .-|++|.||+=. .+....|+ ...++.-... T Consensus 1 Main~a---~~~~~~Ld~~~~~~~~t~~l~~--~~~~~---~ggktVkI~~i~~~gl~DY~R~~g~----~~g~v~~~~e 68 (290) T protein:vir:78 1 MAINYV---DKYGKELDQKLVFGTYTNELET--PNLLW---LDAKTFKIQTITTTGLKAHTRNKGY----NEGSASNTNK 68 (290) T ss_pred CchhHH---HHHHHHHHHHHHhhheeeeccc--cceee---ccCCEEEEeeeccCcccccccCCCc----ccCcccccee Confidence 999864 4555788888999999988873 34432 348999998632 23222222 1123444456 Q ss_pred EEEeccccccceEecHHH-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHH Q lcl|NC_011802. 76 AVNMGEPDNDFFQLRADD-------LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 76 ~v~ld~~k~V~f~~t~ke-------L~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~ 148 (430) +.+|++.+...|.+..-| ++......++ +-.+++-+||.......+..+.-....... ..+....+..+ T Consensus 69 t~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef---~~~~v~PEiDayr~skla~~a~~~~~~~~~-t~t~~n~~~~i 144 (290) T protein:vir:78 69 SYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEF---NSRHAGPEMDAYRFSKLATAAKTNSNSVAE-EITKDNVFTKL 144 (290) T ss_pred eEEeeccccceeeccccchhHHhhhhhHHHHHHHH---HHHHhhhhhhHHHHHHHHhhhhccCccccc-ccCHHHHHHHH Confidence 789999999888887222 2222223322 223567778877554443333222222111 11223457777 Q ss_pred HHHHHHHHhhcCCCCCCcEEEeChHHHHHHHH--hhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcccc Q lcl|NC_011802. 149 ADAEELMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT 226 (430) Q Consensus 149 a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~--~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t 226 (430) -.+...|++ +|. ++|-++++|+.+..|.. .+......+. ..+-..+|.+++ +.|| .+++.....++.. T Consensus 145 ~~~~~~lde--vp~-~~rvl~vtp~~~~lL~~~~~f~r~~~~~~-~~~~~i~~~V~~-idG~-~ii~vps~~r~~t---- 214 (290) T protein:vir:78 145 KAAIRKVKK--YGT-QNLVMYVSPDVMAALELSDDFVRAINVQN-IGPSSIETRITA-IDGT-RIVEVEAEDRFYD---- 214 (290) T ss_pred HHHHHHHHh--cCC-CCeEEEECHHHHHHHhhChhhhccccccc-cccccccceeee-ecCc-EEEEecccchhhh---- Confidence 778888886 787 58999999999986642 2332222221 122234777765 7777 3665332212110 Q ss_pred ccc-cccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccccc-ccCcceEEEEeec--- Q lcl|NC_011802. 227 GIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV-LAQDATFSVVRVV--- 301 (430) Q Consensus 227 ~~t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~-~~~l~~fvVta~~--- 301 (430) .+. .+|... ... +.+--...+..++.-.+.|=|.+-+ ..|-+-+. .+++=+|+.--|+ T Consensus 215 ~~~f~~G~~~--------~~~----ak~in~ii~~~~a~i~~~K~~~~~~-----~~P~~~~~~d~~~~~~r~y~d~~v~ 277 (290) T protein:vir:78 215 TFDFTDGYKP--------AAG----AKKLNFLLVNKGSVVGGAKHASIYL-----HAPGSVGQGDGWLYQYRVYHDIFVL 277 (290) T ss_pred hhhhcccccc--------cCC----ccceeEEEEcCCceeeeeeeeEEEe-----eCCCCCcCcceeeeeeeeeeeeeee Confidence 000 001000 000 0000011111111111111121111 11222221 1223333322111 Q ss_pred c--CceeEEeeccccccccccccccccccccccccccCcee Q lcl|NC_011802. 302 D--GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 302 ~--a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aav 340 (430) . ...| |. +++| T Consensus 278 ~nk~~~i--------------------~~--------~~~~ 290 (290) T protein:vir:78 278 DQQKDGV--------------------IA--------STEV 290 (290) T ss_pred ccccCee--------------------EE--------EeeC Confidence 0 0011 00 0011 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=96.82 E-value=0.00019 Score=41.01 Aligned_cols=272 Identities=12% Similarity=0.040 Sum_probs=120.3 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc---ccccccCCcccCCcCccccceeEE Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ---ESPTQEGWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~---~~~~~~g~~~s~~~~di~e~sv~v 77 (430) ||+|..+ .....+.+.|...+..+.++..-.+...++ .=|.+|.||+=+ ..+-. .+....+..++.-..-+. T Consensus 1 Main~~~---k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~-~gak~VkIp~ist~~gl~dY-~R~~g~~~g~v~~~~et~ 75 (285) T protein:vir:79 1 MTVVLDS---KDLARIDEEYKADSQVWSYLTGGNGVTQRF-RGHNEVRINKLSGFVDATAY-KRGQDNARKTISVGKETV 75 (285) T ss_pred CcchhhH---HHHHHHHHHHHHhhhhhhhcccCCcceeEe-cCCCEEEEeeeccccccccc-ccccCccccccceeeeEE Confidence 9998544 334667777777777777773221222222 337899999642 12222 122222334555556788 Q ss_pred EeccccccceEecHHHhhhHHHHHHHHHHHHH-----HHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHH Q lcl|NC_011802. 78 NMGEPDNDFFQLRADDLRDETAYRHRIQSAAR-----KLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 78 ~ld~~k~V~f~~t~keL~~~~~~~~~i~~Am~-----~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~ 152 (430) +|++.+...|.+..-|. ++...--+.-.|. ..+=+||..-...++..+. +...+ ..+...-+..+-.+. T Consensus 76 tl~~DR~~~f~iD~mDv--dEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~---~~~~~-~~T~~nv~~~i~~~~ 149 (285) T protein:vir:79 76 KLTHEDWFGYDLDQFDM--DENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA---KKATD-SITKDNALDAYDTAE 149 (285) T ss_pred Eeeccccceecccccch--hhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc---ccccc-ccCHHHHHHHHHHHH Confidence 99999998887773332 2211100111111 2333555553333322222 21112 122334677888999 Q ss_pred HHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCccc-ccCccccccccc Q lcl|NC_011802. 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPV-LTKSTATGITVS 231 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~-~t~gt~t~~tv~ 231 (430) ..|++.++|. +|-+++.|+.+..|- +...+ -......+.+..|.|-|+++.+|..+.--.+|. +-++...+=.+| T Consensus 150 ~~lde~~vp~--~rvl~vTp~~~~~Lk-~s~~~-~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~In 225 (285) T protein:vir:79 150 AYMFDNEVPG--GFVMFVSSAYYTALK-QSAAV-TRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVN 225 (285) T ss_pred HHHHHcCCCC--ceEEEEChHHHHHHH-hhhhh-heecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhcc Confidence 9999999993 699999999887553 23222 111113344666777777877763111112221 111111000000 Q ss_pred cccccccceeeeeeccccccccceeeeEEeeccc----eeecccEEEEcceeeecccccccccCcceEEEEeecc Q lcl|NC_011802. 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT----GLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg----tlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~ 302 (430) ==--+......+... ..+.-......-++-+ ..+=+|+|.+.-- .. .-|+-...+- T Consensus 226 fiiv~~~a~i~~~K~---~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk--------~~----~Iy~~~~a~~ 285 (285) T protein:vir:79 226 FILTPLSAIAPIVKY---DSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNA--------KK----GIYVAATAGV 285 (285) T ss_pred EEEecCceeccceee---eeeEeECCCCCCCcceeeeeeeeeeeeeehhhc--------cc----eeeeeecccC Confidence 000000000000000 0000000000001111 2334566666421 00 1133221111 No 56 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=94.85 E-value=0.0033 Score=34.15 Aligned_cols=288 Identities=11% Similarity=0.079 Sum_probs=121.0 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecC--cc---cccccCCcccCCcCcccccee Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVE--QE---SPTQEGWDLTDKATGLLELNV 75 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P--~~---~~~~~g~~~s~~~~di~e~sv 75 (430) ||+.. +.-+....++-+.+...++-+-+. .-+..-++ .=|++|.||+= .. +.+..| ... +..++....- T Consensus 1 Mantl-~ya~~~~~~LD~~~~~~~~s~~l~--~~~~~v~~-~ggktVkIp~i~~~gl~DY~R~~g-~~~-~~g~v~~~~e 74 (312) T protein:vir:10 1 MANTL-AYGQVLQQGLDKQATQELLTGWMD--SNAKQIKY-EGGKEVKIGKLSTDGLGDYSRGSA-NAY-VGGDVKFEYE 74 (312) T ss_pred CCcch-hHHHHHHHHHHHHHHhhhcccccc--CCCceEEE-ecCcEEEEEeeecccccccccccC-Ccc-ccccccccce Confidence 99554 444555556555666666665553 12222223 44799999863 22 222112 111 1124556677 Q ss_pred EEEeccccccceEecHHHh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC--CCC-Ccchhh Q lcl|NC_011802. 76 AVNMGEPDNDFFQLRADDL-------RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA--IGT-NTADAW 145 (430) Q Consensus 76 ~v~ld~~k~V~f~~t~keL-------~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t--~~~-~~~~~~ 145 (430) +.+|++.+...|.+..-|. ++.....++.+ ...+=+||..-....+..+....+.... ..+ +...-+ T Consensus 75 t~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r---~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~ 151 (312) T protein:vir:10 75 TKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQR---LKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTII 151 (312) T ss_pred eEEeeecccceeeccccchhhHhhHHHHHHHHHHHHH---hhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHH Confidence 8999999998888873222 22222222221 1344456665333333322222221111 111 223457 Q ss_pred hHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccc Q lcl|NC_011802. 146 NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 146 ~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~ 225 (430) ..+-.+...|++.++|. +|.+++.|+.+..| .+ ...+ ......+.+|.|-|.++.+|. +.--.||- .=.- T Consensus 152 ~~i~~~~~~lde~~vp~--~rvl~vTp~~~~lL-k~-~~~~---~~~~~~~~~~~i~~~V~~iDg-v~Ii~VPs--~r~~ 221 (312) T protein:vir:10 152 NKIKTGIKIIRENGYNG--PLVCHLTYDSMFAI-EE-KVLE---KLTAVTFAQGGIQTQVPSIDG-CALIKTPQ--NRMY 221 (312) T ss_pred HHHHHHHHHHHHccCCC--ceEEEeChHHHHHH-hh-hhhc---eecccccccceeeeeeeeecc-cEEEEchh--hhcc Confidence 77888999999999994 79999999988444 33 1222 223334455656666655554 22112221 1111 Q ss_pred ccccc-cc--ccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccccc-ccCcceEEEEeec Q lcl|NC_011802. 226 TGITV-SG--AQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV-LAQDATFSVVRVV 301 (430) Q Consensus 226 t~~tv-~g--A~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~-~~~l~~fvVta~~ 301 (430) +.+.. .| ++|.. .++.-...+- +--...+-.++.-.+.|=|.+-| .-|-+-+. .+++-+|+.--|+ T Consensus 222 t~~~f~dG~t~~~~~-gg~~~~~~ak----~INfiiv~~~a~i~~~K~~~~~i-----f~P~~~~~~d~~~~~~R~Y~D~ 291 (312) T protein:vir:10 222 SSILLNDGTTSNQTA-GGYLKGTKAL----DTNFIIAPVDVPLAITKQDKMRI-----FDPETNQTANAWSMDYRRYHDL 291 (312) T ss_pred ceeeeccCccccccc-CceeecCccc----ccceEEeCCceeeceeeeeeeee-----eCCCCCCCcceeeeeeeeeeee Confidence 11110 01 00000 1111111111 11111212222222222232222 11211111 1233333332221 Q ss_pred -----cCceeEEeeccccccccccccccccccccccccccC Q lcl|NC_011802. 302 -----DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 302 -----~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~ 337 (430) ....| |.|+..+-+-| T Consensus 292 fv~~nk~~~I--------------------yv~~k~a~~~~ 312 (312) T protein:vir:10 292 WVTDNKANSV--------------------YANFKDAKPVG 312 (312) T ss_pred eeeccccCeE--------------------EEEeecccCCC Confidence 11122 11111111111 No 57 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=94.78 E-value=0.0035 Score=34.04 Aligned_cols=312 Identities=13% Similarity=0.085 Sum_probs=129.6 Q ss_pred Ccccccchhh-hhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc-----cCCcccCCcC---cc- Q lcl|NC_011802. 1 MALNEGQIVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-----EGWDLTDKAT---GL- 70 (430) Q Consensus 1 ma~~~~~~~t-~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~-----~g~~~s~~~~---di- 70 (430) -..+-.++=| |..++.|+.-+..+|+-++.+. +|. -.+.|.||..|+...+... .|-+..++.- .+ T Consensus 16 ~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~-~pi---Pkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y 91 (401) T protein:vir:95 16 DGANSDQMQTFFWLKKAIITARKEQYFMPLASV-TNM---PKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLY 91 (401) T ss_pred cccccceeeehhhHHHHHhhhhhhhhhhhcccc-ccc---ccccCCeEEEEecccccccccchhcCCCcccccccCcccc Confidence 1111233444 5568889888888887776632 222 2588999998877555443 3443333310 00 Q ss_pred ----ccceeEE------------Eec-----------cccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_011802. 71 ----LELNVAV------------NMG-----------EPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVE----- 116 (430) Q Consensus 71 ----~e~sv~v------------~ld-----------~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id----- 116 (430) .-+++.. +.- +|.+...+||+.-+. ++....++|.-=|..=++.+. T Consensus 92 ~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~ 171 (401) T protein:vir:95 92 GSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQ 171 (401) T ss_pred ccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHH Confidence 0111111 111 244555666654321 222222223222333333333 Q ss_pred HHHHHHH-----hcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhcCCCCC----------C------cEEEeChHHH Q lcl|NC_011802. 117 LKVANMA-----AEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRELNRDM----------G------TSYFFNPQDY 175 (430) Q Consensus 117 ~dl~~~~-----~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~aP~~~----------~------R~~vl~p~~~ 175 (430) +||+.-+ ........+..+....++.-.++++-.+.++|++|-+|+-. . |-+++-|+-. T Consensus 172 ~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~ 251 (401) T protein:vir:95 172 KDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELV 251 (401) T ss_pred HHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCch Confidence 4444222 11111122222233344455788899999999999999710 0 1233333221 Q ss_pred HH------HHH--hhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeecc Q lcl|NC_011802. 176 KK------AGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDG 247 (430) Q Consensus 176 ~~------~~~--~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g 247 (430) .. +-+ .+.-.++. +..+.+-+|+||. +.+|+++...+..+---.|..++. .+++ T Consensus 252 ~di~a~~D~~~~~~fi~v~kY--a~~~~i~~gEiG~-i~~vR~i~~p~~~~w~~ag~~a~~-~~~~-------------- 313 (401) T protein:vir:95 252 PELKAMKDLFGNKAFIETQHY--ADAGTIMNGEVGS-IDKFRIIQVPEMLHWAGAGAQATG-ANPG-------------- 313 (401) T ss_pred hHHHHHHHhcCCCCceehhhc--CCccccccccccc-cCceeEEecccceeecCCcccccc-cccc-------------- Confidence 11 111 11111111 1344456666664 666665543222111000100000 0000 Q ss_pred ccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeeccccccccccccccccc Q lcl|NC_011802. 248 NKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAY 327 (430) Q Consensus 248 ~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~ 327 (430) |.+.-+..+++..+||.++ T Consensus 314 -----------------------------------------------y~~~~~~~gg~~dVyp~lV-------------- 332 (401) T protein:vir:95 314 -----------------------------------------------YRTSMVSGQEHYDVYPMLV-------------- 332 (401) T ss_pred -----------------------------------------------cccccccCCCcceeeeeeE-------------- Confidence 1111112334444555543 Q ss_pred cccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEE Q lcl|NC_011802. 328 ANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRI 407 (430) Q Consensus 328 ~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~ri 407 (430) +-++|| ++.||.-. +.+..-.....-|+-|. ....|.-...-...+ T Consensus 333 --------------------------~G~dAf--~~~~l~g~--g~~~~~~~ivk~pG~~~----ad~~DPlgQ~g~vgw 378 (401) T protein:vir:95 333 --------------------------VGDDSF--TSIGFQTD--GKSLKFTVMTKMPGKET----ADRNDPYGETGFSSI 378 (401) T ss_pred --------------------------Eccccc--eecccccC--CccccceeEeecCCcCC----CCCCCcccceehhhh Confidence 333333 23444310 00000000000010000 001233344444556 Q ss_pred EeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 408 ALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 408 DvLyG~~~v~PElagv~l~~q~~ 430 (430) =.+||+..++||+. ++|----- T Consensus 379 K~~~a~~vL~~e~m-~~ies~a~ 400 (401) T protein:vir:95 379 KWYYGILVKRPERL-ALIKTVAP 400 (401) T ss_pred hhhhhhheecccee-EEEEeecC Confidence 67889999999985 55421111 No 58 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=91.06 E-value=0.018 Score=30.19 Aligned_cols=311 Identities=6% Similarity=-0.035 Sum_probs=120.5 Q ss_pred CcccccchhhhhHHHHHHHHhhhccc-chhcccCCCchhhh-hccCcEEEEecCc---ccccccCCcccCCcCcccccee Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPM-AQKAKKYTPPAASM-QRSSNTIWMPVEQ---ESPTQEGWDLTDKATGLLELNV 75 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvm-A~~V~~~r~~~~~~-~k~GdTV~i~~P~---~~~~~~g~~~s~~~~di~e~sv 75 (430) ||+|-.+..+ .++.+.|...++. +.+. ..|...+- +.-|++|.||+=+ -.+-.+-..+.....++.-..- T Consensus 1 Mainya~~~~---~~Ld~~~~~~~lts~~l~--~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~e 75 (346) T protein:vir:10 1 MTINYAEKYQ---AAVQQAFYDGHLYSAELW--NSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWD 75 (346) T ss_pred CcchhHHHHH---HHHHHHHHhhhccchhhc--ccccccceEecCCCEEEEEEeeeecccccccccCCccccccccccee Confidence 9998765555 4444445554443 2222 12222211 1347999998753 2222221122222234555677 Q ss_pred EEEeccccccceEecHHH-------hhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcccceeeecCCCCCCCcchhhhH Q lcl|NC_011802. 76 AVNMGEPDNDFFQLRADD-------LRDETAYRHRIQSAARKLANNVELKVAN-MAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 76 ~v~ld~~k~V~f~~t~ke-------L~~~~~~~~~i~~Am~~LAn~Id~dl~~-~~~~~~~~v~~~~~t~~~~~~~~~~~ 147 (430) +.+|++-+...|.+..-| ++......++.+- +.+=+||..-.. ++........+...+...+...-+.. T Consensus 76 t~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~---~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~ 152 (346) T protein:vir:10 76 SYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLD---SKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPA 152 (346) T ss_pred EEEeeccccceecccccchHHHHHHhHHHHHHHHHHHH---hhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHH Confidence 899999999888887322 2222222222211 233455655322 22222211111001111122345777 Q ss_pred HHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccc Q lcl|NC_011802. 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~ 227 (430) +-.+...|++.++|. ++|.++++|+.+..|- +...+-........--.+|.+++ +.|| .+++-..- +. -+. T Consensus 153 i~~~~~~lde~~vp~-~~rvl~vTp~~~~lLk-~s~~f~k~~~v~~~~~i~~~V~s-iDGv-~Ii~VPs~-r~----~t~ 223 (346) T protein:vir:10 153 FDNMMLDFDEARIPS-TNRILYVTPKTNAILK-RAEAMNRALTLKDPNNIQRTVYS-LDDV-TIRVVPSD-LM----QTA 223 (346) T ss_pred HHHHHHHHHHccCCC-CCeEEEECHHHHHHHh-hchhheeccccccccccceeeee-ecCe-EEEEcchh-hc----ccc Confidence 888999999999998 5899999999997543 22221111111111123666664 6666 35442111 11 011 Q ss_pred ccc-ccccccccceeeeeeccccccccceeeeEEeecc-ceeecccEEEEcceeeeccccccccc-CcceEEEEeec--- Q lcl|NC_011802. 228 ITV-SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLA-QDATFSVVRVV--- 301 (430) Q Consensus 228 ~tv-~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~t-gtlkaGDv~TiaGV~~v~~~tk~~~~-~l~~fvVta~~--- 301 (430) +.. +|... ...+-..+ .+.+-.++. .-.|--.+-.| .|- -+.-+ ++-+|+---|+ T Consensus 224 ~~f~~G~~~--------~t~ak~IN----fiiv~~~A~ia~~K~~~~~if------~P~-~~~~g~~l~~~R~Y~D~fv~ 284 (346) T protein:vir:10 224 YDFSDGSKI--------IDTAKQIE----MFLIYNGVQIAPEKYSFVGFD------QPS-AATSGNYLYYEQSYDDVLLL 284 (346) T ss_pred hhhccCccc--------cCCcccee----EEEECCceeeeeeeeeeeEee------CCC-CCcccceeeeeeeeeeeeee Confidence 110 11100 00000000 011111111 11121112222 221 12222 23333322211 Q ss_pred ----cCceeEEeeccccccccccccccccccccccccccCceeE----EeccCC--------cccceeecc Q lcl|NC_011802. 302 ----DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVN----ILNVKD--------ARTNVFWAD 356 (430) Q Consensus 302 ----~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavT----v~~~~s--------~~~Nl~Fhr 356 (430) .+--+.+..+.-..-+.. -+ .+.|.++--|- |+-.++ -..=|+.-+ T Consensus 285 ~nk~~~Iyv~~~~a~~~~~~~~-------~~--~~kpt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (346) T protein:vir:10 285 NTKTKGIQFVVSDKPKKDQEQS-------GQ--DAKPTAESTLEEIKAYLDKNHIDYTGKTKKDELLALVK 346 (346) T ss_pred ccccceEEEeeecccccCccCc-------cc--ccCcccccchHHHHHHhcccccccccccchhhHHhhcC Confidence 111223333321110000 00 12222221111 111111 011122222 No 59 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=86.42 E-value=0.046 Score=27.93 Aligned_cols=266 Identities=11% Similarity=0.039 Sum_probs=110.1 Q ss_pred Ccccccch-hhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccceeE Q lcl|NC_011802. 1 MALNEGQI-VTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLELNVA 76 (430) Q Consensus 1 ma~~~~~~-~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~sv~ 76 (430) ...+.+.+ -+-...+||+.++...++.+++.+..- .|.++.+|+.......-.|...+. ..+..=.++. T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~ 189 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTT-------ESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELEN 189 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhccceec-------CCCceEEEEEecCCCceeeecCCccccccccceeEEE Confidence 22223333 233448899999999999998854331 244566666533221112222222 1233333444 Q ss_pred EEeccccccceEecHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhc--------ccceeeecCCCCCCCcchhhhH Q lcl|NC_011802. 77 VNMGEPDNDFFQLRADDLRDETAYRHRIQSAA-RKLANNVELKVANMAAE--------MGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 77 v~ld~~k~V~f~~t~keL~~~~~~~~~i~~Am-~~LAn~Id~dl~~~~~~--------~~~~v~~~~~t~~~~~~~~~~~ 147 (430) ++..+... .+.+|.+=|.+..+.+.+|...+ .+++..+|..++.-.-. ....+.....+........+++ T Consensus 190 ~~~~k~~~-~~~is~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~ 268 (395) T protein:vir:43 190 APVRTIAH-LFKASRQILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDR 268 (395) T ss_pred EeeeeEEE-eehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHH Confidence 44444443 24455443445556777876655 48889999888741100 0000111111111112234556 Q ss_pred HHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccc Q lcl|NC_011802. 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~ 227 (430) +..+-..|.....+. -..++||.+...+.. +- ++ -||.+ +. . + ..+ +. T Consensus 269 i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~----lk-----------d~-~G~~i------~~--~-~--~~~--~~ 316 (395) T protein:vir:43 269 IRLAILQAQLAEFPA---SGIVLNPIDWALIEL----NK-----------DA-ENRYI------IG--S-P--QNG--TT 316 (395) T ss_pred HHHHHHhhccccCCC---cEEEEcHHHHHHHHH----hh-----------cc-CCcee------cc--c-c--ccC--CC Confidence 665555555554432 358899998766531 21 11 13322 11 0 0 011 11 Q ss_pred cccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeE Q lcl|NC_011802. 228 ITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) Q Consensus 228 ~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~ 307 (430) .++.|-+- -.+..+.+|+++ | | +.+++....+..+-+|. T Consensus 317 ~~l~G~pV--------------------------v~~~~~~~~~~~-~-g-------------d~~~~~~~~~~~~~~i~ 355 (395) T protein:vir:43 317 PTLWRLPV--------------------------VETQAITQDEFL-T-G-------------AFSLGAQIFDRMDIEVL 355 (395) T ss_pred ceecceee--------------------------EEcCCCCCCcEE-E-E-------------eccceEEEEEecceEEE Confidence 12222210 011122233332 1 2 22332222222333444 Q ss_pred EeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccc Q lcl|NC_011802. 308 ITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIP 367 (430) Q Consensus 308 I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~ 367 (430) +.+-.-. .++ -|-.++.+..--.. -..+++||+..+.+-. T Consensus 356 ~~~~~~~-----------~f~------~~~~~~r~~~r~d~---~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 356 VSTENDK-----------DFE------NNMVTIRAEERLAF---AVYRPEAFVTGSLTAS 395 (395) T ss_pred Eeccccc-----------hhh------cCcEEEEEEEeecc---EEecccceEEEEeccC Confidence 4331100 000 01111111000000 3345666666655443 No 60 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=84.70 E-value=0.058 Score=27.35 Aligned_cols=263 Identities=14% Similarity=0.058 Sum_probs=115.2 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc----------ccccccCCcccCCcCcc Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ----------ESPTQEGWDLTDKATGL 70 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~----------~~~~~~g~~~s~~~~di 70 (430) ||+.. +..+....++-+.+...++-+.+. --+..-+ ..=|.+|.||+=+ .+.+..|+. .++ + T Consensus 1 Mantl-~ya~~~~~~Ld~~~~~~~~t~~l~--~~~~~v~-~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~-~g~---v 72 (302) T protein:vir:78 1 MANSL-ALAQIYQDNIDKAIAVNSKSAFLE--ANPNNVQ-YNGGNTIKIADISFGSGTTGDLKAYNRSTGFT-QGS---V 72 (302) T ss_pred CCchh-HHHHHHHHHHHHHHHhhhceeecc--cCCceEE-EecCcEEEEEEEEeeccccccccccccccCcc-ccc---e Confidence 99555 445655566777777777777665 1222223 3347899888653 244333432 222 4 Q ss_pred ccceeEEEeccccccceEecHHHh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC-CCC-Cc Q lcl|NC_011802. 71 LELNVAVNMGEPDNDFFQLRADDL-------RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA-IGT-NT 141 (430) Q Consensus 71 ~e~sv~v~ld~~k~V~f~~t~keL-------~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t-~~~-~~ 141 (430) .-..-+.+|++-++..|.+..-|. ++.....++.+ ...+=+||+.-....+..+.-.++.... .+. +. T Consensus 73 ~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r---~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~ 149 (302) T protein:vir:78 73 TLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQR---TKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASA 149 (302) T ss_pred eeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHH---hhhcchhhHHHHHHHHHhhhccCccccccccchhH Confidence 444566888898988888873221 22222222221 1233344544332222222111221111 111 12 Q ss_pred chhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccch-hhHHHHhccccccchhhhhhhhcCCccc- Q lcl|NC_011802. 142 ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPV- 219 (430) Q Consensus 142 ~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~v~~- 219 (430) ..-+..+..+...|++. ++|-+++.|+.+..|- +...+ ++. ..+.+.+|.|-|.+..+|+ +.--.||- T Consensus 150 ~nvl~~i~~~~~~~~e~-----~~~vl~vtp~~~~~Lk-~a~~~---~~~~~~~~~~~~~i~~~V~~lDg-v~Ii~VPs~ 219 (302) T protein:vir:78 150 QALMGDIATAMELVDDS-----NQLILVTSPTTLAGLL-NTALI---RESKNTQVLRRGEVDTKITFIQD-VEVLQVPSE 219 (302) T ss_pred HHHHHHHHHHHHHhhcc-----CCeEEEEChHHHHHHh-cchhh---ccceeccccccccccceeeeecc-cEEEEchhh Confidence 23466777778888885 4799999999886543 22222 222 2334566777777777775 22222331 Q ss_pred -------ccCccccccccccccccccceeeeeecc-ccccccc-----eeeeEEeecc----ceeecccEEEEcceeeec Q lcl|NC_011802. 220 -------LTKSTATGITVSGAQSFKPVAWQLDNDG-NKVNVDN-----RFATVTLSAT----TGLKRGDKISFTGVKFLG 282 (430) Q Consensus 220 -------~t~gt~t~~tv~gA~~~~~~~~~v~~~g-~~~~~d~-----~~~~~t~s~t----gtlkaGDv~TiaGV~~v~ 282 (430) .+.|... ..++-+ --+-+.-.. .-..+.- ......-++. ...+=+|+|.+.-- T Consensus 220 r~~t~~~f~~G~~~---~~~ak~---INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk---- 289 (302) T protein:vir:78 220 YLYDKVAPKVGVPD---YTGAKK---IPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQ---- 289 (302) T ss_pred hcccceeccCCccc---cCCccc---eeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccc---- Confidence 1111100 000000 000000000 0000000 0000001111 13445777777531 Q ss_pred ccccccccCcceE-EEEeecc Q lcl|NC_011802. 283 QMAKNVLAQDATF-SVVRVVD 302 (430) Q Consensus 283 ~~tk~~~~~l~~f-vVta~~~ 302 (430) .. .-| .++++++ T Consensus 290 ----~~----gI~~~~~~~~~ 302 (302) T protein:vir:78 290 ----RP----GIIKASFGTIA 302 (302) T ss_pred ----cC----eEEEeeccccC Confidence 00 011 1222223 No 61 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=84.34 E-value=0.061 Score=27.23 Aligned_cols=278 Identities=11% Similarity=-0.032 Sum_probs=101.5 Q ss_pred Ccccc---cchhh-hhHHHHHHHHhhhccc-chh--cccCCCchhhhhccCcEEEEecCcccccccCCcccCCc--Cccc Q lcl|NC_011802. 1 MALNE---GQIVT-LAVDEIIETISAITPM-AQK--AKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKA--TGLL 71 (430) Q Consensus 1 ma~~~---~~~~t-~~~~evi~~len~lvm-A~~--V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~--~di~ 71 (430) ||-.. ...|. ..--+++.+|+.++.= -++ |.+..|.+ .|+||.+|+= .+.-..+-...+.. -+-+ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a-----~G~tIt~pK~-~~tgda~dVaEGe~Iplskv 74 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLT-----NDLKIQTYKW-EVTLDQTDPGEGETIPLSKV 74 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccc-----cCCeEEeeee-eeecccccccCCcccchhhh Confidence 88741 11111 1012344444322211 000 12233333 3999999883 33211111111111 0111 Q ss_pred cc----eeEEEeccccccceEecHHHhhhHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchh Q lcl|NC_011802. 72 EL----NVAVNMGEPDNDFFQLRADDLRDETAYR---HRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADA 144 (430) Q Consensus 72 e~----sv~v~ld~~k~V~f~~t~keL~~~~~~~---~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~ 144 (430) +. ...+++.++... .|++.+...-+.. +-=+.=.+.|+++|+.|++..++...--+.. ... T Consensus 75 t~~~~~t~t~kikK~rK~---tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg---------~~l 142 (295) T protein:vir:99 75 TRTKDKDYTVKWFKKRRA---TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKG---------VGL 142 (295) T ss_pred eeeeeeeeEEEeeeeccc---ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeeh---------hhH Confidence 11 234555554442 3555542111111 1112234579999999999887554322210 011 Q ss_pred hhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccC-c Q lcl|NC_011802. 145 WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK-S 223 (430) Q Consensus 145 ~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~-g 223 (430) -.+++.+...|+...--.+...-+|+||.+.+.++++...-.-..+.-.--|.. +|.|++.++++..+|.-+. . T Consensus 143 q~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~-----nfLG~q~II~S~kv~~G~~~a 217 (295) T protein:vir:99 143 QKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLK-----NFLGMQNVIVMPSVPEGKIYS 217 (295) T ss_pred HHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhh-----hhhccceEEEcccCCCceEEE Confidence 223444444444332222234578899999999877554332211112222333 3677765778888776221 1 Q ss_pred cccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccC Q lcl|NC_011802. 224 TATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDG 303 (430) Q Consensus 224 t~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a 303 (430) |+.- -++.|.. ...+...+ ..+..-.|.-=+=||.- |+.++..+.. ...- T Consensus 218 T~~~-Ni~~ay~--------~~~~g~l~------------~~f~~~~D~tglIg~~h-~~~~~~~t~e--------t~~~ 267 (295) T protein:vir:99 218 TAVE-NLVFASL--------NVKGGDLG------------GLFADFTDETGLIAAAR-NRQLSNLTYE--------SVFF 267 (295) T ss_pred eecc-ceEEEEe--------cCCchhhh------------hhhhhccCcccceEEEe-ccccceeeeh--------hhhH Confidence 1100 0111110 00000000 00000112222223210 1111111000 0111 Q ss_pred ceeEEeeccccccccccccccccccccccccccCc Q lcl|NC_011802. 304 THVEITPKPVALDDVSLSPEQRAYANVNTSLADAM 338 (430) Q Consensus 304 ~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~a 338 (430) ..++++|-+.- ..-+..=+..+.|.-|. T Consensus 268 ~~~~lfpE~~d-------giv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 268 GANVLFAEIPE-------GVVEATIEAAAVPGIGG 295 (295) T ss_pred hHHHhcccccc-------eEEEEEEecCcCCCCCC Confidence 23445553310 00000000011111111 No 62 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=64.92 E-value=0.29 Score=23.51 Aligned_cols=275 Identities=9% Similarity=0.052 Sum_probs=105.3 Q ss_pred CcccccchhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCc-----ccccccCCcccCCcCcccccee Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ-----ESPTQEGWDLTDKATGLLELNV 75 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~-----~~~~~~g~~~s~~~~di~e~sv 75 (430) ||+|-.+.- ..++-+.|...++-+.+. -.+ .+++.-|.+|.||+=+ .+.+..|+. ..++.-..- T Consensus 8 mAlnya~~~---~~~Ld~~~~~~~~t~~l~--~~~--~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~----~g~v~~~~e 76 (311) T protein:vir:99 8 RGFNYVTKD---GNLLDQKITAGLFTAALG--TPE--VDLVNGGRSFTLKTISTSGLKDHTRGKGFN----SGTISDEKT 76 (311) T ss_pred hHHHHHHHH---HHHHHHHHHhhhccccee--cCc--hheeecCCEEEEEeeeeccccccccccCcc----ccceeeeee Confidence 665543333 355555566677666665 222 2344458899988652 233333322 234555667 Q ss_pred EEEeccccccceEecHH---H----hh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCC-------- Q lcl|NC_011802. 76 AVNMGEPDNDFFQLRAD---D----LR----DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDA-------- 136 (430) Q Consensus 76 ~v~ld~~k~V~f~~t~k---e----L~----~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t-------- 136 (430) +.+|++.+...|.+..- | ++ ..+|.+...-| +||+.-....+..+..++....+ T Consensus 77 t~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvP-------EiDayrfskla~~a~~~~~~~~~~~~~~~~~ 149 (311) T protein:vir:99 77 IYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQP-------ELDSYRFSKIATSFDNLDGTDTEGTLLAKTH 149 (311) T ss_pred EEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcc-------hhhHHHHHHHHhhhhcccccccchhhhcccc Confidence 78999999988888832 2 22 23344433333 23322222222111111111000 Q ss_pred --CCCCcc-hhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhh Q lcl|NC_011802. 137 --IGTNTA-DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLR 213 (430) Q Consensus 137 --~~~~~~-~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~ 213 (430) ....++ .-++.+-.+...|++ +|. ++|-+++.|+.+..| .+...+.. ......+..+.|-|.++.+|.+ T Consensus 150 ~~~~~lt~~nvl~~l~~~~~~~~~--v~~-~~rvl~vTp~~~~lL-k~~~~~~r--~~~~~~~~~~~i~~~V~~lDgv-- 221 (311) T protein:vir:99 150 KTEETLDETNAYSQLKTGIGKVRK--YGT-QNLVGYVSSEVMDAL-ERSKEFTR--NITNQNVGTTALESRITSIDGV-- 221 (311) T ss_pred ccccccCHHHHHHHHHHHHHHHHh--cCC-CCeEEEEChHHHHHH-hhchhhhe--eeecccccccccccccceecCe-- Confidence 001111 235556667777776 687 589999999988744 33222211 1122234455566666666542 Q ss_pred cCCcccccCc-cccccc-cccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccC Q lcl|NC_011802. 214 SPKLPVLTKS-TATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQ 291 (430) Q Consensus 214 ~~~v~~~t~g-t~t~~t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~ 291 (430) ..+.+.... .-+.+. .+|... ..++ .+--...+-.++.-.+.|=|.+-+-- =..|+.- .++ T Consensus 222 -~Ii~V~ps~r~~t~~~ft~G~~~--------~~~a----k~INfiiv~~~a~i~~~K~~~v~~f~-P~~~~~g---d~~ 284 (311) T protein:vir:99 222 -QLIEVYESNRFMTKYDFTDGAKP--------TEDA----KAINFLVVAKPAVISIVKENAVFLFA-PGQHTDG---DGY 284 (311) T ss_pred -EEEEecCchhhcchhhhcCCccc--------cCcc----cccceEEeCCCeeeeeeeeeeeeeeC-CCCCCCc---cee Confidence 111110000 000000 011100 0000 00000111111111111111111100 0011110 012 Q ss_pred cceEEEEeeccCceeEEeeccccccccccccccccccccccc Q lcl|NC_011802. 292 DATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTS 333 (430) Q Consensus 292 l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~ 333 (430) +=+|+---|+ .++ ++ ....=|.|+..+ T Consensus 285 l~~~R~Y~D~----------fv~-~n----k~~~Iyv~~k~A 311 (311) T protein:vir:99 285 LYQNRLYHDL----------FIK-KH----KRDGIFVSVKKA 311 (311) T ss_pred eeeeeeeeee----------eee-cc----ccCeEEEeeecC Confidence 2222221111 000 00 000002222111 No 63 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=63.72 E-value=0.31 Score=23.36 Aligned_cols=271 Identities=14% Similarity=0.056 Sum_probs=101.7 Q ss_pred Ccccccchhh-hhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccceeE Q lcl|NC_011802. 1 MALNEGQIVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLELNVA 76 (430) Q Consensus 1 ma~~~~~~~t-~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~sv~ 76 (430) ||++-+.++- -...|+|+.++...++.+++++. +- .+..++||+-.... .-+|...+. ..++.=.++. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~-~~------~~~~~~ip~~~~~~-~a~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK-PI------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee-ec------cCCceEEEEEecCc-ceEEecCCccccccccceeEEE Confidence 9999888773 34589999999999988888422 21 12345666532221 112333222 1232223344 Q ss_pred EEeccccccceEecHHHhh--h--HHHHHHHHHH-HHHHHHHHHHHHHHHHHh---cccc------eeeecC-C--CCCC Q lcl|NC_011802. 77 VNMGEPDNDFFQLRADDLR--D--ETAYRHRIQS-AARKLANNVELKVANMAA---EMGS------LVITSP-D--AIGT 139 (430) Q Consensus 77 v~ld~~k~V~f~~t~keL~--~--~~~~~~~i~~-Am~~LAn~Id~dl~~~~~---~~~~------~v~~~~-~--t~~~ 139 (430) ++..+... -+.+|.+-|+ . ....+++|+. -.++++..+|..++.-.- .... ...+.. . .... T Consensus 73 l~~~k~a~-~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) T protein:vir:16 73 MVPIKVEY-GARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPR 151 (298) T ss_pred EeeeeEEE-eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccc Confidence 44333222 2344433342 1 2345556654 446888899888863210 0000 000000 0 0011 Q ss_pred CcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCccc Q lcl|NC_011802. 140 NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPV 219 (430) Q Consensus 140 ~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~ 219 (430) .....+.++..+...+.....+. -..++||.+...+.. +-..+ ||.+ | +..+ T Consensus 152 ~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~----lkd~~------------G~~i----~----~~~~- 203 (298) T protein:vir:16 152 GIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK----QKDLQ------------DNAL----F----PELK- 203 (298) T ss_pred ccccHHHHHHHHHHHhhhcCCCc---cEEEEcHHHHHHHHH----hhccC------------CCee----e----cCcc- Confidence 11123445555555565555442 348899999877643 21111 2322 1 0000 Q ss_pred ccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEe Q lcl|NC_011802. 220 LTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVR 299 (430) Q Consensus 220 ~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta 299 (430) ..|. ..++.|-+- +..+..... ..+....+-.|| -.+++... T Consensus 204 -~~~~--~~~l~G~PV-------~~~~~v~~~--------~~~~~~~~~~GD--------------------fs~~~~~~ 245 (298) T protein:vir:16 204 -WGAT--PDTINGLPV-------DVNKTVSDM--------SLTQRDRAIIGD--------------------FANGFKWG 245 (298) T ss_pred -cCCC--Cceecceee-------EEecccccc--------cCCCccEEEEee--------------------ccceEEEE Confidence 0111 112222211 000000000 000001122222 11221122 Q ss_pred eccCceeEEeeccccccccc------cccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecc Q lcl|NC_011802. 300 VVDGTHVEITPKPVALDDVS------LSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPI 366 (430) Q Consensus 300 ~~~a~tv~I~Pai~~~~~~~------~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl 366 (430) ....-+++|.+-.-+ +... .-...++..-+.-. ..|++||+....-= T Consensus 246 ~~~~~~~~~~~~~~~-~~~~~~~f~~~~v~~ra~~r~d~~-------------------v~~~~a~~~l~~at 298 (298) T protein:vir:16 246 YAKEVPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGWG-------------------ILDATKFARVTEAN 298 (298) T ss_pred EecCceEEEeeccCC-cCcchhhhhcCcEEEEEEEEEccE-------------------eecccceEEEeecC Confidence 122223444332100 0000 00001111111122 22233333321110 No 64 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=61.09 E-value=0.36 Score=23.02 Aligned_cols=267 Identities=9% Similarity=0.020 Sum_probs=99.1 Q ss_pred Cccc-------ccchhhhh--------H-HHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEec--Cc-----cccc Q lcl|NC_011802. 1 MALN-------EGQIVTLA--------V-DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPV--EQ-----ESPT 57 (430) Q Consensus 1 ma~~-------~~~~~t~~--------~-~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~--P~-----~~~~ 57 (430) |-.- |++-||.. | ..+++.+ .++.++.++ +|.++ ++.+-.|.... |. ...+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~-~~~~iad~l--f~~~~---a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMM-VNQFISESL--FRNGG---ANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHH-hccchhhhh--hhccc---ccccceeEEEecccccccCcHhhc Confidence 2211 33333320 1 2333334 444555555 44433 23333333332 11 0011 Q ss_pred ccC--CcccCCcCccccceeEEEeccccccceEecHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeec Q lcl|NC_011802. 58 QEG--WDLTDKATGLLELNVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS 133 (430) Q Consensus 58 ~~g--~~~s~~~~di~e~sv~v~ld~~k~V~f~~t~keL~--~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~ 133 (430) ..| .+.+. .-.+.-.+-..+-.+..|.+|++... ..+...|.++.+.++++..+|+..+....... .-+. T Consensus 75 aEggEiP~~~----~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~--t~~~ 148 (318) T protein:vir:10 75 AEFGEIPVSA----GARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPI--VPTL 148 (318) T ss_pred cCcccccccC----CCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccc Confidence 111 01111 11111222222234556788776653 46667777788888888888888776432221 1111 Q ss_pred CCCCCCCc-chhhhHHHHHH-------HHHHhhc-----CCCC-CCcEEEeChHHHHHHHHhhhhhhcc-cch--hhHHH Q lcl|NC_011802. 134 PDAIGTNT-ADAWNFVADAE-------ELMFSRE-----LNRD-MGTSYFFNPQDYKKAGYDLTKRDIF-GRI--PEEAY 196 (430) Q Consensus 134 ~~t~~~~~-~~~~~~~a~a~-------~~L~~~~-----aP~~-~~R~~vl~p~~~~~~~~~~~~l~~~-~~~--~~~a~ 196 (430) +.+.+... ..-.+|++.|. ..++... .+.+ .--.+|++|.....+.++..-+... .+. ..... T Consensus 149 ~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~ 228 (318) T protein:vir:10 149 AVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAP 228 (318) T ss_pred cCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcc Confidence 11111000 00011222232 2222110 0000 0126899999998887654433221 111 11122 Q ss_pred H-hccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEE Q lcl|NC_011802. 197 R-DGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISF 275 (430) Q Consensus 197 r-~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~Ti 275 (430) . .|.+++.+.|++++ .+.++|.-+ .+++. .+.. +.. .|.+ -++. T Consensus 229 ~~tg~~~g~~lGl~vi-~s~~~p~~~-----alvlq-~g~v---G~~---------~d~~----------------pl~~ 273 (318) T protein:vir:10 229 DWTGNFPGSVMGLNVI-RSRTFPIDR-----VLIME-RGTV---GFY---------SDTR----------------PLQF 273 (318) T ss_pred cccccccceeeceEEe-ecCccCCCe-----eEEEe-cCCc---cee---------eccc----------------ccee Confidence 2 46666677899875 568888622 23221 1110 000 0001 0122 Q ss_pred cceeeecc---cccccccCcceEEEEee--ccC-ceeEEeeccccc Q lcl|NC_011802. 276 TGVKFLGQ---MAKNVLAQDATFSVVRV--VDG-THVEITPKPVAL 315 (430) Q Consensus 276 aGV~~v~~---~tk~~~~~l~~fvVta~--~~a-~tv~I~Pai~~~ 315 (430) .+.|.=+- ..+......+-+.+++- ... .-+.|+ -|+.| T Consensus 274 t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~it-gi~~~ 318 (318) T protein:vir:10 274 TALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLT-GIVTP 318 (318) T ss_pred eecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEe-eccCC Confidence 22211000 01111222222222221 111 112221 11111 No 65 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=57.64 E-value=0.43 Score=22.59 Aligned_cols=281 Identities=13% Similarity=0.064 Sum_probs=106.9 Q ss_pred CcccccchhhhhHHHHHHHHhhhccc-------chhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCcCccccc Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETISAITPM-------AQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLEL 73 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~len~lvm-------A~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~~di~e~ 73 (430) ||.+-=+|..+..-|+...+-.+... ++.+..--..+......||+|+||.=... +| .++++.|+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l---~G-----~~~~~~dg 72 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDL---TG-----DSEVLGNG 72 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccC---CC-----cccccCCC Confidence 99653333333334554442222211 22231011133344568999999863222 22 22222222 Q ss_pred eeEE-----Eeccccccc----eEecHHHhhhHHHHHHHHHHHHHH----HHHHHHHHHHHHHhccccee---------e Q lcl|NC_011802. 74 NVAV-----NMGEPDNDF----FQLRADDLRDETAYRHRIQSAARK----LANNVELKVANMAAEMGSLV---------I 131 (430) Q Consensus 74 sv~v-----~ld~~k~V~----f~~t~keL~~~~~~~~~i~~Am~~----LAn~Id~dl~~~~~~~~~~v---------~ 131 (430) ...+ +-.++..+- --|+..||...-...+.++...++ ++...+.+|+..+...-+.. . T Consensus 73 ~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~ 152 (330) T protein:vir:10 73 DKALETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEE 152 (330) T ss_pred ccccchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhh Confidence 1111 111111111 123334443222223333333333 33445555555544222110 0 Q ss_pred ecCCCC-CCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhh Q lcl|NC_011802. 132 TSPDAI-GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDD 210 (430) Q Consensus 132 ~~~~t~-~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~ 210 (430) +..... +....-+.+.+..|.+.|.++.- .-.-++++|..+..|..+ .-+.+.. .++ ..+.|+. +.|.. T Consensus 153 ~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~---~~~~ivmhS~v~~~L~~~-~li~~~~--~s~--~~~~i~~-~~G~~- 222 (330) T protein:vir:10 153 THVSDQSKASTGIDAGMVLDAKQLLGDSAD---QVTAIAMHSAVYTKLQKD-NLIQYIQ--PTT--ATINIPT-YLGYR- 222 (330) T ss_pred hheecccccccccCHHHHHHHHHHhccccc---cceEEEEcHHHHHHHHHh-hhhhhhc--ccc--cCccccc-ccceE- Confidence 000000 01111233557788899988863 235677999999998763 2222222 111 2456776 67865 Q ss_pred hhhcCCcccccCcccccccc-ccccccccceeeeeeccccccccceeeeEEeeccc--eeecccEEEEcceeeecccccc Q lcl|NC_011802. 211 VLRSPKLPVLTKSTATGITV-SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKN 287 (430) Q Consensus 211 ~~~~~~v~~~t~gt~t~~tv-~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg--tlkaGDv~TiaGV~~v~~~tk~ 287 (430) ++.+..+|... +..+++.+ .||-.... .......++ .+. -++.=|.+...=-|.+||.=. T Consensus 223 VivdD~~p~~~-~~yt~yl~~~GAi~~~~-----~~~~~~v~~----------EtdRd~~~g~~~l~~r~~~~~hp~G~- 285 (330) T protein:vir:10 223 VIIDDGIAPTG-DIYTSYLFRTGSIGLNT-----GNPSGLTTF----------ETSREAAKGNDMIYTRRALVMHPYGV- 285 (330) T ss_pred EEEeCCCCCCC-CceeEEEEecCceeeec-----ccCCccccc----------cccCCccccceEEEEeeEEEeeeeee- Confidence 56678887532 22333322 33321100 000000010 000 011113333333333333110 Q ss_pred cccCcceEEE------------EeeccCc--eeEEeecccccc--cccccc Q lcl|NC_011802. 288 VLAQDATFSV------------VRVVDGT--HVEITPKPVALD--DVSLSP 322 (430) Q Consensus 288 ~~~~l~~fvV------------ta~~~a~--tv~I~Pai~~~~--~~~~~~ 322 (430) .|.. ++-.+++ ...+.|+-|++- .+-.+. T Consensus 286 ------s~~~~~~~~~~~sPt~~~L~~~~NW~~v~~~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 286 ------KWTGAEVDAGNITPSNADLAKFKNWKRVYEPKNIGIIALKHKIGK 330 (330) T ss_pred ------eecccccccCcCCcChHHhcCCcCcccccChhhcceEEEEEecCC Confidence 1110 0001122 233444433322 222222 No 66 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=55.69 E-value=0.47 Score=22.36 Aligned_cols=288 Identities=16% Similarity=0.117 Sum_probs=116.2 Q ss_pred Ccccccc-------------hhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCCc Q lcl|NC_011802. 1 MALNEGQ-------------IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKA 67 (430) Q Consensus 1 ma~~~~~-------------~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~ 67 (430) ||.++.+ +-+-..+++|+.++...++.+++++ .+ -.+..+++|+-..... -.|...+.. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~-~~------~~~~~~~~p~~~~~~~-a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARK-VP------MGPTGISIPHWTGAVS-ASWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcce-ee------ccCCceEEEEEcCCcc-eeEecCCCc Confidence 6665332 2233448999999999998888732 11 1234466665533221 123222221 Q ss_pred ---CccccceeEEEeccccccceEecHHHhhh-HHHHHHHHHHH-HHHHHHHHHHHHHH----------HHhccc---ce Q lcl|NC_011802. 68 ---TGLLELNVAVNMGEPDNDFFQLRADDLRD-ETAYRHRIQSA-ARKLANNVELKVAN----------MAAEMG---SL 129 (430) Q Consensus 68 ---~di~e~sv~v~ld~~k~V~f~~t~keL~~-~~~~~~~i~~A-m~~LAn~Id~dl~~----------~~~~~~---~~ 129 (430) .++.=.++.++..+.. ..+.+|.+-|.+ ....+++|+.. .++++.++|..++. ...... .. T Consensus 73 ~~~~~~~f~~i~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~ 151 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKIT-TIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSL 151 (330) T ss_pred cccccceeeEEEEeEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccccccee Confidence 1222233334332221 223444433443 33466666554 45899999988862 000000 00 Q ss_pred eeecCCCCCCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhh Q lcl|NC_011802. 130 VITSPDAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFD 209 (430) Q Consensus 130 v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd 209 (430) ......+........+.++..+-..|.....+. ...++||.....+.. + +++ -||.+ T Consensus 152 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~----l-----------kd~-~G~~l---- 208 (330) T protein:vir:77 152 ADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW---TGTLLDNVTEPILNT----A-----------VDG-NGRPL---- 208 (330) T ss_pred ecccccccccccchhHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHH----H-----------hcc-CCcee---- Confidence 001001111112234555665555555555442 358899988866542 1 111 13322 Q ss_pred hhhhcCCcccccCcc---ccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccc Q lcl|NC_011802. 210 DVLRSPKLPVLTKST---ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAK 286 (430) Q Consensus 210 ~~~~~~~v~~~t~gt---~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk 286 (430) |. . ....+. ....++.|-+- +..+. .+ .++. .....-+-| T Consensus 209 ~~---~---~~~~~~~~~~~~~~l~G~PV-------~~~~~--~p----------~~~~---~~~~~~~~g--------- 251 (330) T protein:vir:77 209 FV---E---STYTEQVGAIREGRILGRPT-------YVADN--VV----------NGTV---GNRVVGVMG--------- 251 (330) T ss_pred ec---C---ccccccccccCCceecceee-------EEecc--cc----------CCCC---CCccEEEEE--------- Confidence 10 0 000000 00111111100 00000 00 0000 000111111 Q ss_pred ccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecc Q lcl|NC_011802. 287 NVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPI 366 (430) Q Consensus 287 ~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl 366 (430) +..+|++ .+..+-+|.++. ++. + T Consensus 252 ----d~s~~~i-~~~~~~~i~~~~----------------------------------------------e~~------~ 274 (330) T protein:vir:77 252 ----DFSQVIW-GQIGGLSFDVTD----------------------------------------------QAT------L 274 (330) T ss_pred ----ecceEEE-EEecCcEEEEee----------------------------------------------cce------e Confidence 1112211 111111111100 000 0 Q ss_pred ccCCCcchheeeEEEecCcceEEEEEEEee-ecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 367 PANHELFAGMKTTSFSIPDVGLNGIFRTQG-DISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 367 ~~p~g~~~~~~~~~~~~p~~Glslrv~~~y-d~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ....+..+.. ....+ ...++...+|.-.-+|++.++|+-. ++|-+.+| T Consensus 275 -------------~~~~~~~~~~--~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~-~~i~~~~~ 323 (330) T protein:vir:77 275 -------------DFGEEQGGVW--VPKLISLWQHNMVAVRCEAEFAFMVNDKDAF-VKLTDQVA 323 (330) T ss_pred -------------eecccccccc--cccccchhhcCcEEEEEEEEeccEEecccce-EEEEeccC Confidence 0000000000 00001 1345678888888899999999975 68888888 No 67 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=50.31 E-value=0.61 Score=21.75 Aligned_cols=281 Identities=9% Similarity=0.000 Sum_probs=103.6 Q ss_pred Cccc-ccchh--hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccce Q lcl|NC_011802. 1 MALN-EGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLELN 74 (430) Q Consensus 1 ma~~-~~~~~--t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~s 74 (430) ||+- .+.+| +-..++||+.+++..++.+++.+. +- .+..+++|+-.... .-+|...+. ..+..=.+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i-~~------~~~~~~~p~~~~~~-~a~wv~Eg~~~~~~~~~f~~ 72 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE-PQ------EFGEQQYMTLTAPP-RGEVVGEGAQKSESTATFAP 72 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee-ec------CCCceEEEEEeCCc-eeEEeecCcccccccceeeE Confidence 6655 66666 555699999999999998888532 21 22346666532211 112222222 12222234 Q ss_pred eEEEeccccccceEecHHHhh-----hHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccc-ceee-------ecCCCCCC- Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDLR-----DETAYRHRIQS-AARKLANNVELKVANMAAEMG-SLVI-------TSPDAIGT- 139 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL~-----~~~~~~~~i~~-Am~~LAn~Id~dl~~~~~~~~-~~v~-------~~~~t~~~- 139 (430) +.++..+... .+.+ ++||. .....+++|+. ..++|+..+|..++.---... .... ........ T Consensus 73 v~l~~~kl~~-~~~i-S~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~ 150 (311) T protein:vir:81 73 VTAIPRKVQV-TQRF-SQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) T ss_pred EEEeeEEEEE-eehh-hHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeec Confidence 4444333322 2333 45542 12235566644 446888888888763210000 0000 00000001 Q ss_pred --CcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccc---cccchhhhhhhhc Q lcl|NC_011802. 140 --NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI---QRQVAGFDDVLRS 214 (430) Q Consensus 140 --~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~i---gr~~~Gfd~~~~~ 214 (430) .....+.++..+-.++...+.. ....++||.+...+.. |-..+ .+--|.+... +..+.|+- +.-+ T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~---~~~~vmn~~~~~~l~~----lkd~~--G~~l~~~~~~~~~~~tl~G~P-v~~~ 220 (311) T protein:vir:81 151 TGTSATPDLAVEAAVGLVLGDNLS---PDGVALDNTFSFMLAT----QRDSQ--GRKLYPELGFGTDVASFAGLN-AAVS 220 (311) T ss_pred ccccchHHHHHHHHHHHhhhcCCC---ceEEEEcHHHHHHHHh----hhccC--CCeeecCccccCCCceeccee-EEec Confidence 1112234444444444443322 2358899999877642 21111 0000111000 00111221 1111 Q ss_pred CCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcce Q lcl|NC_011802. 215 PKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDAT 294 (430) Q Consensus 215 ~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~ 294 (430) ..+|...... .. .........++..-+.| +-.+ T Consensus 221 ~~i~~~~~~~--------~~--------------------------~~~~~~~~~~~~~~~~g-------------Dfs~ 253 (311) T protein:vir:81 221 DTVRGGPEAV--------TA--------------------------STGVYRTTNPNVKAIAG-------------DFSA 253 (311) T ss_pred cccccccccc--------cc--------------------------ccchhcccCCccEEEEE-------------eccc Confidence 1111100000 00 00000111222222233 3344 Q ss_pred EEEEeeccCceeEEeecccccc----ccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecccc Q lcl|NC_011802. 295 FSVVRVVDGTHVEITPKPVALD----DVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 295 fvVta~~~a~tv~I~Pai~~~~----~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~ 368 (430) |++.. ...-++.+.+-.-+.. -...-...+...-+... ..|++||+..+..-.. T Consensus 254 ~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~-------------------v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 254 FRWGV-QVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIG-------------------IMSTDAFAVVRDADES 311 (311) T ss_pred EEEEE-eccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccE-------------------eecccceEEEEeeccC Confidence 54433 3334555544321100 00000011111111112 2334444443322211 No 68 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=47.77 E-value=0.69 Score=21.46 Aligned_cols=275 Identities=13% Similarity=0.055 Sum_probs=98.7 Q ss_pred Ccccccchhhhh-HHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccceeE Q lcl|NC_011802. 1 MALNEGQIVTLA-VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLELNVA 76 (430) Q Consensus 1 ma~~~~~~~t~~-~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~sv~ 76 (430) ||++-+.++--- .+++++.++...++.++++..+ - .+..+++|+-.... .-+|...+. ..+..=.++. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~-~------~~~~~~~p~~~~~~-~a~~v~Eg~~~~~~~~~f~~v~ 72 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKP-I------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceee-c------cCCceEEEEEecCc-ceEEeeCCccccccccceeEEE Confidence 999999877443 4889999999999888884322 1 12345555432111 112222222 1122223344 Q ss_pred EEeccccccceEecHHHh-h----hHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccc----ce-----eee-cCCC--CC Q lcl|NC_011802. 77 VNMGEPDNDFFQLRADDL-R----DETAYRHRIQS-AARKLANNVELKVANMAAEMG----SL-----VIT-SPDA--IG 138 (430) Q Consensus 77 v~ld~~k~V~f~~t~keL-~----~~~~~~~~i~~-Am~~LAn~Id~dl~~~~~~~~----~~-----v~~-~~~t--~~ 138 (430) ++..+... -+.+ ++|| . .....+.+|+. -.++|+.++|..++.-..... .. +.. .... .. T Consensus 73 l~~~k~~~-~~~i-S~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 73 MVPIKVEY-GARI-SDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred EeeeEEEE-eeeh-hHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccc Confidence 43322221 2333 4554 2 22345556654 445888888888863210000 00 000 0000 01 Q ss_pred CCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcc Q lcl|NC_011802. 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLP 218 (430) Q Consensus 139 ~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~ 218 (430) ......+.++..+-..|....... ...++||.+...+.. |-..+ ||.+ |.. T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~----lkd~~------------G~~l--~~~-------- 201 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK----QKDLQ------------GNAL--FPE-------- 201 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHH----hhccC------------CCee--ecC-------- Confidence 111223556666666666655442 359999998877643 21111 2211 000 Q ss_pred cccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEE Q lcl|NC_011802. 219 VLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVV 298 (430) Q Consensus 219 ~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVt 298 (430) ..++|. ..++.|-+- +......... .+....+-.|| .++.... T Consensus 202 ~~~~~~--~~tl~G~PV-------~~~~~v~~~~--------~~~~~~~~~Gd--------------------fs~~~~~ 244 (298) T protein:vir:94 202 LKWGAT--PDTINGLPV-------DVNKTVSDMS--------LTQRDRAIIGD--------------------FANGFKW 244 (298) T ss_pred cccCCC--Cceecceee-------EEeccccccc--------CCCccEEEEee--------------------ccceEEE Confidence 000111 011222110 0000000000 00000111122 1111111 Q ss_pred eeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccc-eeeccceeeEEeecc Q lcl|NC_011802. 299 RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTN-VFWADDAIRIVSQPI 366 (430) Q Consensus 299 a~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~N-l~Fhr~A~aLat~pl 366 (430) .....-++++.+-.- ++. . .++. .-...|.+.. ..+.. -..|++||+....-= T Consensus 245 ~~~~~~~~~~~~~~~-~d~---~-------~~~~--f~~~~v~~r~--~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 245 GYAKEVPLEVIQYGD-PDN---S-------GLDL--KGYNQVYIRA--ELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEecCceEEEeecCC-CcC---c-------chhh--hhcCcEEEEE--EEEeccEeecccceEEEEecC Confidence 111222333322100 000 0 0000 0000000000 00000 112233333322111 No 69 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=46.18 E-value=0.74 Score=21.29 Aligned_cols=266 Identities=16% Similarity=0.145 Sum_probs=123.9 Q ss_pred Ccccccc----hhhhhHHHHHHHHhhhcc---cchhcccCCCchhhhhccCcEEEEecCcccccccC---CcccCCcCcc Q lcl|NC_011802. 1 MALNEGQ----IVTLAVDEIIETISAITP---MAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG---WDLTDKATGL 70 (430) Q Consensus 1 ma~~~~~----~~t~~~~evi~~len~lv---mA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g---~~~s~~~~di 70 (430) |-+-.|. ..++--|||+..|-..|. ++|.|.+| ..||++.||-=-....++- .+..-+ .+ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF--------~~G~~L~I~tiGs~~~~~~~E~~~~~~~--~i 70 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDF--------GSGETLHIKTIGSVTLQEAEEDTPLIYN--PI 70 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccC--------CCCCEEEecccCceeeeccccCCCeeec--cc Confidence 5443332 123333777777655442 35656433 4578888874333333321 122223 34 Q ss_pred ccceeEEEeccccccceEecHHHhh-hHHHHHHHH--H--HHHHHHHHHHHHHHHHHHhccc-----ce-eeecCC---C Q lcl|NC_011802. 71 LELNVAVNMGEPDNDFFQLRADDLR-DETAYRHRI--Q--SAARKLANNVELKVANMAAEMG-----SL-VITSPD---A 136 (430) Q Consensus 71 ~e~sv~v~ld~~k~V~f~~t~keL~-~~~~~~~~i--~--~Am~~LAn~Id~dl~~~~~~~~-----~~-v~~~~~---t 136 (430) --+-+++.|.+.|+...++++ +|+ +.+...+.. + ...|+|-..-+.|++.+...+- ++ +-+.|- + T Consensus 71 ~TGEIt~~i~~Y~G~A~~vt~-~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~ 149 (313) T protein:vir:95 71 ETGEITFQITEYKGDAWYVTD-DLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVS 149 (313) T ss_pred ccceEEEEEEeecCChhhhhh-hhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEe Confidence 456688888899998877764 454 333333333 2 2557888889999987765421 11 111111 1 Q ss_pred CCCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhc-ccchhhHHHHhcc-----ccccchhhhh Q lcl|NC_011802. 137 IGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDI-FGRIPEEAYRDGT-----IQRQVAGFDD 210 (430) Q Consensus 137 ~~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~-~~~~~~~a~r~g~-----igr~~~Gfd~ 210 (430) ..+...-+++++...+-.|++.++|. ++|-.++||..++-|-+ +....+ ...-.+=-++.|. .-|.+.|+| T Consensus 150 ~~T~~~~~~~~~~~~~~~~~~a~~P~-~G~v~IvDP~~~~~L~~-l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~D- 226 (313) T protein:vir:95 150 AETNGVFALKHLIAMRLAFDKANVPA-EGRVFIVDPVAEATLNG-LVTITHDVTDFGKMILESGMARGQRFIMNLYGWD- 226 (313) T ss_pred ccCCceehhhHHHHhhhhhhhccCCc-cceEEEEcchhhhhhhh-hheeecccccccceeeeccCCchhHHHHHHhhhh- Confidence 12233346888999999999999999 58999999998877643 332222 1111111123332 225567886 Q ss_pred hhhcCCcccccCccccccccccccccccceeeeeecc-------cc-ccccc------ee--eeEEeec-cceeecccEE Q lcl|NC_011802. 211 VLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDG-------NK-VNVDN------RF--ATVTLSA-TTGLKRGDKI 273 (430) Q Consensus 211 ~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g-------~~-~~~d~------~~--~~~t~s~-tgtlkaGDv~ 273 (430) ++-|..+. ..+-+.+..|-+| .-+---+-+...+ ++ .+... +. ..++-.- +-+|.+-|-+ T Consensus 227 i~~SN~L~-~AN~~D~~tT~~G--~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L 303 (313) T protein:vir:95 227 ILTSNRLH-VANYNDGTTTGNG--YVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTL 303 (313) T ss_pred hhhhhhhh-hccccccccccCc--eeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecce Confidence 55544432 2222222222222 1111111111111 11 00000 00 0111000 1123333322 Q ss_pred EEcceeeecccccccccCcceEEEEeeccCcee Q lcl|NC_011802. 274 SFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 274 TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv 306 (430) -.- + .+++.- T Consensus 304 ~~~----------------------~-~~A~~~ 313 (313) T protein:vir:95 304 GLL----------------------A-TSATAY 313 (313) T ss_pred eEE----------------------E-eccccC Confidence 211 1 111111 No 70 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=44.44 E-value=0.81 Score=21.10 Aligned_cols=294 Identities=15% Similarity=0.059 Sum_probs=115.7 Q ss_pred CcccccchhhhhHHHHHHHH-----hhhccc--chhcccCCCchhhh--hccCcEEEEecCcccc-----cccCCcccCC Q lcl|NC_011802. 1 MALNEGQIVTLAVDEIIETI-----SAITPM--AQKAKKYTPPAASM--QRSSNTIWMPVEQESP-----TQEGWDLTDK 66 (430) Q Consensus 1 ma~~~~~~~t~~~~evi~~l-----en~lvm--A~~V~~~r~~~~~~--~k~GdTV~i~~P~~~~-----~~~g~~~s~~ 66 (430) ||.= +|.-+.+-|+...+ .+.+-+ +..+...-.....+ +..||+|++|.-...- +.++.++. T Consensus 1 MA~T--~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~-- 76 (324) T protein:vir:59 1 MAYT--KISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV-- 76 (324) T ss_pred CCce--eeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc-- Confidence 9932 22222223443332 222111 22231111122222 3579999998764431 11122221 Q ss_pred cCcccccee-EEEeccccccceEecH--HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cceeeecCCCC Q lcl|NC_011802. 67 ATGLLELNV-AVNMGEPDNDFFQLRA--DDLRDETAYRHRIQSAARKLANNVELKVANMAAEM------GSLVITSPDAI 137 (430) Q Consensus 67 ~~di~e~sv-~v~ld~~k~V~f~~t~--keL~~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~------~~~v~~~~~t~ 137 (430) ++.+..++- -+++..-| .|+.++ .+++-.++.++.-+.=...++++++.+|+..+... .....+...+ T Consensus 77 ~~~l~t~~~~a~i~~~~k--~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~- 153 (324) T protein:vir:59 77 PQKINAGQDKAVLILRGN--AWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGT- 153 (324) T ss_pred hhhcccceeeEEEEeecC--ceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecc- Confidence 122221111 12222222 355553 33444555555444433456778888887665421 2222222111 Q ss_pred CCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCc Q lcl|NC_011802. 138 GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKL 217 (430) Q Consensus 138 ~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v 217 (430) ....-..+.+..|.++|.++. +.-.-++++|..+.+|.... -+.+.. .++ ..+.|+. +.|.. ++.++.+ T Consensus 154 -~~~~~s~~~l~~A~~~~GD~~---~~~~~ivmhS~v~~~L~~~~-li~~~~--~s~--~~~~i~~-~~G~~-VivdD~~ 222 (324) T protein:vir:59 154 -ADGIYSAETFVDASYKLGDHE---SLLTAIGMHSATMASAVKQD-LIEFVK--DSQ--SGIRFPT-YMNKR-VIVDDSM 222 (324) T ss_pred -ccceecHHHHHHHHHHhCCcc---cCcEEEEEchHHHHHHHHhh-hhhhcc--ccc--cCceeee-ecccE-EEEeCCC Confidence 112223456778888898874 23356779999999987642 222221 111 2456765 67865 5667888 Q ss_pred ccccC-ccc---ccccc-ccccccccceeeeeeccccccccceeeeEEeeccc--eeecccEEEEcceeeeccccccccc Q lcl|NC_011802. 218 PVLTK-STA---TGITV-SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKNVLA 290 (430) Q Consensus 218 ~~~t~-gt~---t~~tv-~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tg--tlkaGDv~TiaGV~~v~~~tk~~~~ 290 (430) |.... ++. +++.+ .||- .+....-..++ .+. -++.=|.+.-...|.+||.= T Consensus 223 p~~~~~~~~~~y~s~l~~~GAi-------~~~~~~~~v~v----------E~dRd~~~g~~~l~~r~~~~~~p~G----- 280 (324) T protein:vir:59 223 PVETLEDGTKVFTSYLFGAGAL-------GYAEGQPEVPT----------ETARNALGSQDILINRKHFVLHPRG----- 280 (324) T ss_pred CccccCCCCceEEEEEEecCeE-------EEeecCCCcce----------ecccCccccceEEEEeeEEEeEeee----- Confidence 76322 221 12221 2221 11100000000 000 01111334433333333321 Q ss_pred CcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccce Q lcl|NC_011802. 291 QDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDA 358 (430) Q Consensus 291 ~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A 358 (430) |.-+.... -.++|..--+ .+.+-|.+- .+--+|-++- |-+.=+| T Consensus 281 ----~s~~~~~~---~~~sPt~~~L--~~~~NW~~v--------~~~k~i~i~~-------~~~~~~~ 324 (324) T protein:vir:59 281 ----VKFTENAM---AGTTPTDEEL--ANGANWQRV--------YDPKKIRIVQ-------FKHRLQA 324 (324) T ss_pred ----EEeccccc---CCCCCChhhh--cCCcccccc--------cCccccceEE-------EEeeccC Confidence 12221111 1234432111 111112121 1222222221 2333333 No 71 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=35.44 E-value=1.2 Score=20.09 Aligned_cols=270 Identities=12% Similarity=0.059 Sum_probs=112.0 Q ss_pred Ccc----cccc-hhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCcccc Q lcl|NC_011802. 1 MAL----NEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLE 72 (430) Q Consensus 1 ma~----~~~~-~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e 72 (430) |+. .-+. +-+-...++|+.++...++.+++++. +- .+.+++||+-...... .|...+. ..+..= T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~-~~------~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~f 85 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-PM------GTTGQKIPHWVGDVSA-QWIGEGDMKPITKGNM 85 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee-ec------cCCceEEEEEeCCcce-EEecCCccccccccce Confidence 221 1122 33444489999999999999888432 21 2345666654322111 1212111 122222 Q ss_pred ceeEEEeccccccceEecHHHhhh-HHHHHHHHHH-HHHHHHHHHHHHHHHHHhccc--ce---eeecCCCCCCCcchhh Q lcl|NC_011802. 73 LNVAVNMGEPDNDFFQLRADDLRD-ETAYRHRIQS-AARKLANNVELKVANMAAEMG--SL---VITSPDAIGTNTADAW 145 (430) Q Consensus 73 ~sv~v~ld~~k~V~f~~t~keL~~-~~~~~~~i~~-Am~~LAn~Id~dl~~~~~~~~--~~---v~~~~~t~~~~~~~~~ 145 (430) .++.++..+.. +-+.+|.+-|.+ ....+++|+. -.++++.++|..++.=--... .. ......+........+ T Consensus 86 ~~i~~~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (318) T protein:vir:24 86 TSQTIAPHKIA-TIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVY 164 (318) T ss_pred eEEEEeeEEEE-EeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchH Confidence 33444433322 234444443443 3346667754 456899999998873110000 00 0000000011111111 Q ss_pred -hHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCcc Q lcl|NC_011802. 146 -NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKST 224 (430) Q Consensus 146 -~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt 224 (430) .++..+...+.... . ..-..++||.....+.. + +++ -|+.+ +. + ..+.+. T Consensus 165 ~~~~~~~~~~~~~~~--~-~~~~~v~n~~~~~~L~~----l-----------kd~-~G~~l------~~-~---~~~~~~ 215 (318) T protein:vir:24 165 DQVAVNGLSLLVNDG--K-KWTHTLLDDITEPILNG----A-----------KDQ-NGRPL------FI-E---STYGEA 215 (318) T ss_pred HHHHHHHHHhhcccc--C-CCCEEEEcHHHHHHHHH----h-----------hcc-CCcee------ec-C---ccccCc Confidence 22333333333332 2 23458899998866542 2 111 13322 11 0 000000 Q ss_pred ccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCc Q lcl|NC_011802. 225 ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) Q Consensus 225 ~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~ 304 (430) .. ++.-+ ++-|+ ..+++ T Consensus 216 ~~---------------------------------------~~~~~---~i~g~---------------pv~~~------ 232 (318) T protein:vir:24 216 AS---------------------------------------PFRSG---RIVAR---------------PTILS------ 232 (318) T ss_pred cc---------------------------------------cccCc---eEEEE---------------eeEEe------ Confidence 00 00000 11110 00010 Q ss_pred eeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeeccccCCCcchheeeEEEecC Q lcl|NC_011802. 305 HVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIP 384 (430) Q Consensus 305 tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~p~g~~~~~~~~~~~~p 384 (430) |.+ +.+..+-++|-- +-+.+..+ T Consensus 233 -----~~~----------------------~~~~~~~~~gdf----------s~~~~~~~-------------------- 255 (318) T protein:vir:24 233 -----DHV----------------------VEGTTVGFMGDF----------SQLIWGQI-------------------- 255 (318) T ss_pred -----CCC----------------------CCCccEEEEeec----------ceEEEEEe-------------------- Confidence 110 011111122200 00111111 Q ss_pred cceEEEEEEEee--------------ecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 385 DVGLNGIFRTQG--------------DISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 385 ~~Glslrv~~~y--------------d~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) + |+++++.+++ ...+++...|.-.-+|+++++|+-. +.|-+-+| T Consensus 256 ~-~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~-~~i~~~~a 313 (318) T protein:vir:24 256 G-GLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAF-VALTNVVS 313 (318) T ss_pred c-CeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccce-EEEEeecc Confidence 0 2222222221 1345688888899999999999986 67888888 No 72 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=33.17 E-value=1.4 Score=19.83 Aligned_cols=109 Identities=11% Similarity=0.048 Sum_probs=55.7 Q ss_pred ccccceEecHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhhc- Q lcl|NC_011802. 82 PDNDFFQLRADDLR-DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRE- 159 (430) Q Consensus 82 ~k~V~f~~t~keL~-~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~~- 159 (430) ...|.|.|...||. ....+++.+..+|..+++..-..+-+.+++.++|---++ .||+-|+-.. T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg---------------~ARq~i~~~~~ 65 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTG---------------AARAGLHAVAS 65 (120) T ss_pred CceEEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccch---------------hhhhhhccccc Confidence 67788999999974 455668888999999999999999999999988775421 1455554422 Q ss_pred CCCCCCcEEEeChHHHHHH-HH--hhhhhhccc---chhhHHHH---hccccccchhhhhhhh Q lcl|NC_011802. 160 LNRDMGTSYFFNPQDYKKA-GY--DLTKRDIFG---RIPEEAYR---DGTIQRQVAGFDDVLR 213 (430) Q Consensus 160 aP~~~~R~~vl~p~~~~~~-~~--~~~~l~~~~---~~~~~a~r---~g~igr~~~Gfd~~~~ 213 (430) .+......+.|....++.- ++ +-.+-+... ..-...+. ++.+|+ + + T Consensus 66 ~~~~~~~~Iylsh~veYG~~LEla~~~kyaIl~PTi~~~~~~il~g~~~ll~~-l-------~ 120 (120) T protein:vir:10 66 TPQPDRYEIVFAHTVHYGIWLEIANSGRYEIIMPTVHHEGKLMAQRLRGLLGR-L-------R 120 (120) T ss_pred cCCCceEEEEEecCeeecceEEeeCCCCcccccchHHHHhHHHHHHHHHHhhh-c-------C Confidence 2321112333433222220 00 000000000 00011111 111111 1 0 No 73 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=30.37 E-value=1.6 Score=19.49 Aligned_cols=289 Identities=12% Similarity=0.008 Sum_probs=103.8 Q ss_pred Ccccc---c-chh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCcccc Q lcl|NC_011802. 1 MALNE---G-QIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLE 72 (430) Q Consensus 1 ma~~~---~-~~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e 72 (430) ||..- + -++ +-..++||+.+++..++-+++++. +- .+..++||+-.... .-+|...+. .++..= T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i-~~------~~~~~~ip~~~~~~-~a~wv~Eg~~~~~s~~~f 72 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ-PT------IFGPVKGAVFSGVP-RAKIVGEGEVKPSASVDV 72 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee-ec------CCCceEEEEEeCCc-ceEEeeCCccccccccce Confidence 99762 2 222 223389999999999998887433 21 12345555422111 112222222 122222 Q ss_pred ceeEEEeccccccceEecHHHhh-h--H---HHHHHHHHHH-HHHHHHHHHHHHHHHHhc----cc----ceeeecCCCC Q lcl|NC_011802. 73 LNVAVNMGEPDNDFFQLRADDLR-D--E---TAYRHRIQSA-ARKLANNVELKVANMAAE----MG----SLVITSPDAI 137 (430) Q Consensus 73 ~sv~v~ld~~k~V~f~~t~keL~-~--~---~~~~~~i~~A-m~~LAn~Id~dl~~~~~~----~~----~~v~~~~~t~ 137 (430) +++.++.-+.. +.+.+ ++||. + . ...+.+|... .++|+.++|..++.=--. .. ........ . T Consensus 73 ~~v~l~~~kl~-~~~~i-S~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~-~ 149 (315) T protein:vir:80 73 SAFTAQPIKVV-TQQRV-SDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKN-I 149 (315) T ss_pred eeeEeeeeeEE-eeehh-hHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccc-e Confidence 22333322211 12333 45542 1 1 2245666544 457888888776521000 00 00000000 0 Q ss_pred CCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCc Q lcl|NC_011802. 138 GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKL 217 (430) Q Consensus 138 ~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v 217 (430) ...+...|.++..+-..+.....-. ....++||.....+.. +. ..+ |++..|. -++ +.+ T Consensus 150 ~~~~~~~~~d~~~~~~~~~~~~~~~--~~~~imn~~~~~~L~~-l~---~~~------------g~~~~g~-~~~--~~~ 208 (315) T protein:vir:80 150 VDATDSATADLVKAVGLIAGAGLQV--PNGVALDPAFSFALST-EV---YPK------------GSPLAGQ-PMY--PAA 208 (315) T ss_pred eeccccchHHHHHHHHHHhhccCcc--ceEEEEcHHHHHHHHH-Hh---hcc------------CCccccc-ccc--ccc Confidence 0111223555555554554332222 2348899988877642 21 111 1111110 000 000 Q ss_pred ccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEE Q lcl|NC_011802. 218 PVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSV 297 (430) Q Consensus 218 ~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvV 297 (430) . .| ...++.|-+-. .+- ..+ .....-...+..-|.| +-++|.+ T Consensus 209 ~---~g--~~~tl~G~PV~--------~~~-~~~----------~~~~~~~~~~~~~~~G-------------Dfs~~~~ 251 (315) T protein:vir:80 209 G---FA--GLDNWRGLNVG--------ASS-TVS----------GAPEMSPASGVKAIVG-------------DFSRVHW 251 (315) T ss_pred c---cC--CCceecceeeE--------ecC-cCC----------cccccccccccEEEEe-------------ecccEEE Confidence 0 00 01122222210 000 000 0000000111122323 2223222 Q ss_pred EeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEee---ccccCCCcc Q lcl|NC_011802. 298 VRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQ---PIPANHELF 373 (430) Q Consensus 298 ta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~---pl~~p~g~~ 373 (430) .. ...-+++|.+-.-+ +.+ .++.-.-|-.++-+.---. =-..|++||+.... |-+.|+.+. T Consensus 252 g~-~~~~~i~i~~~~~~-~~~----------~~~~~~~~~v~~r~~~r~~---~~v~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 252 GF-QRNFPIELIEYGDP-DQT----------GRDLKGHNEVMVRAEAVLY---VAIESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred EE-ecCeeEEEeccccc-cCc----------ccchhhcCcEEEEEEEEec---ceeecccceEEEeeccCCCCCCCCCC Confidence 11 22334444432100 000 0001111112222100000 13457788877553 445554433 No 74 >protein:vir:3426 Length: 117 # NCBI annotation: head-tail joining protein # Family: family:all:1908 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040589;genbank:gi:9626253;genbank:GeneID:2703484 Probab=28.96 E-value=1.3 Score=19.94 Aligned_cols=111 Identities=21% Similarity=0.259 Sum_probs=34.7 Q ss_pred eChHHHHHHHHhhhhhhcccchhhHHHH--hccccccchhhhhhhhcCCcccccCccccccccccccccccceeeeeecc Q lcl|NC_011802. 170 FNPQDYKKAGYDLTKRDIFGRIPEEAYR--DGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDG 247 (430) Q Consensus 170 l~p~~~~~~~~~~~~l~~~~~~~~~a~r--~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g 247 (430) |. .+-.|| .+|+- +..|-+.| |+- ..++.+.+...+|+|== -.. ..+...+ T Consensus 1 m~---------~~dNlf------d~a~~~aD~~i~~~f-g~~--------a~i~~~~g~~~~i~gVF--DdP-~~~~~~~ 53 (117) T protein:vir:34 1 MA---------DFDNLF------DAAIARADETIRGYM-GTS--------ATITSGEQSGAVIRGVF--DDP-ENISYAG 53 (117) T ss_pred CC---------cccchh------HHHHhhcchhhHhhc-Cee--------EEEEeCCCcceEEEEEe--cCc-cchhhcc Confidence 11 011111 11111 11221111 111 11111222222222210 000 0011111 Q ss_pred ccccccceeeeEEee--ccceeecccEEEEcceeeecccccccccCcceEEEEeec-c-CceeEEeeccccccccccccc Q lcl|NC_011802. 248 NKVNVDNRFATVTLS--ATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV-D-GTHVEITPKPVALDDVSLSPE 323 (430) Q Consensus 248 ~~~~~d~~~~~~t~s--~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~-~-a~tv~I~Pai~~~~~~~~~~~ 323 (430) .+.-+.+....+.+- --..|+++|.+||+| +.|.|+... + .++=.|+.+-- T Consensus 54 gG~~i~~s~P~L~vk~aDv~~l~r~D~v~I~G---------------~~y~V~~~~PD~~G~~~l~L~rg---------- 108 (117) T protein:vir:34 54 QGVRVEGSSPSLFVRTDEVRQLRRGDTLTIGE---------------ENFWVDRVSPDDGGSCHLWLGRG---------- 108 (117) T ss_pred CCEEeecCCcEEEeeechhhccCCCCEEEECC---------------CeeEeeecccCCCceEEEEeecC---------- Confidence 111122222222222 235799999999999 778887632 1 23333333221 Q ss_pred cccccccccccccCcee Q lcl|NC_011802. 324 QRAYANVNTSLADAMAV 340 (430) Q Consensus 324 ~~~~~nVta~~A~~aav 340 (430) ..| +++-.= T Consensus 109 ~pp--------~~~~~~ 117 (117) T protein:vir:34 109 VPP--------AVNRRR 117 (117) T ss_pred CCC--------ccccCC Confidence 111 111111 No 75 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=28.36 E-value=1.8 Score=19.24 Aligned_cols=291 Identities=12% Similarity=0.069 Sum_probs=112.2 Q ss_pred Ccc--c-ccc--------hhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccc----cC---Cc Q lcl|NC_011802. 1 MAL--N-EGQ--------IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EG---WD 62 (430) Q Consensus 1 ma~--~-~~~--------~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~----~g---~~ 62 (430) |+. + ++. +=+-+.++||+.++...++.+++++. + -.+..+.||+-...+.. .+ +. T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~-~------~~~~~~~ip~~~~~~~a~~v~~~~~~~~ 82 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI-P------ISYGETIIPTTVKRPEVGQVGVGTSNEQ 82 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhccee-e------ccCCceEEEEEecCccceeecccccccc Confidence 211 1 222 33444589999999999998888431 1 12456666664221110 00 00 Q ss_pred ccCC---cCccccceeEEEeccccccceEecHHHhhh-HHHHHHHHHH-HHHHHHHHHHHHHHHHH-----hcc---cce Q lcl|NC_011802. 63 LTDK---ATGLLELNVAVNMGEPDNDFFQLRADDLRD-ETAYRHRIQS-AARKLANNVELKVANMA-----AEM---GSL 129 (430) Q Consensus 63 ~s~~---~~di~e~sv~v~ld~~k~V~f~~t~keL~~-~~~~~~~i~~-Am~~LAn~Id~dl~~~~-----~~~---~~~ 129 (430) ..+. ..++.=.++.++.-+.. ..+.+|.+=|.+ ....+.+|+. -.++++..+|..++.=- ... ..+ T Consensus 83 ~Eg~~~~~~~~~f~~v~l~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~ 161 (338) T protein:vir:78 83 REGGTKPLSGTAWDTRSVAPIKLA-TIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTN 161 (338) T ss_pred cccccccccccceeEEEEEEEEEE-EeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccc Confidence 1111 12222233334333221 233444433343 3445667754 44588999998886310 000 000 Q ss_pred eeecCCC----CCCCcchhhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccc Q lcl|NC_011802. 130 VITSPDA----IGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQV 205 (430) Q Consensus 130 v~~~~~t----~~~~~~~~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~ 205 (430) ......+ ........+.++..+.+.+..+. .. .....+++|.....|.. +..+...+ ||.+ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~m~~~~~~~L~~-~~~l~d~~------------g~~l 226 (338) T protein:vir:78 162 NVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANT-DV-DFNGWAADPRYRARLLR-SQAYRDAN------------GNVD 226 (338) T ss_pred cccccccccccccccchhhHHHHHHHHHHhhhhc-cc-cceEEEEchHHHHHHHH-HhhhccCC------------Ccee Confidence 0000000 01111123444554544444332 22 12357889988876642 22222211 2322 Q ss_pred hhhhhhhhcCCcccccCccccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccc Q lcl|NC_011802. 206 AGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMA 285 (430) Q Consensus 206 ~Gfd~~~~~~~v~~~t~gt~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~t 285 (430) | -. ... .|-.-+|-|+ T Consensus 227 ----~-~~-----~~~-----------------------------------------------~~~~~~l~G~------- 242 (338) T protein:vir:78 227 ----P-TR-----INL-----------------------------------------------AASAGDLLGL------- 242 (338) T ss_pred ----e-cc-----ccc-----------------------------------------------CCCCceeeee------- Confidence 1 00 000 0111123331 Q ss_pred cccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeec Q lcl|NC_011802. 286 KNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQP 365 (430) Q Consensus 286 k~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~p 365 (430) ..+++... |... .+..+...+-++| .....+...+..+.+..-+ T Consensus 243 --------PV~~~~~i--------p~~~------------------~~~~~~~~~~~~g--dfs~~~~~~~~~~~i~~~~ 286 (338) T protein:vir:78 243 --------PVQFGKAV--------GGDL------------------GAATDSKVRVVGG--DFSQLKYGFADEIRVKMSD 286 (338) T ss_pred --------eEEEcccc--------Cccc------------------cccCCcccEEEEE--ecceEEEEeecccEEEEee Confidence 01111100 0000 0000000111122 1100111111111111000 Q ss_pred cccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 366 IPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 366 l~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) =. .. ....+|. -+..-..++ ++..+|.-.-+|++.++|+.. ++|-.-++ T Consensus 287 ~~-------~~--~~~~~~~--~~~~~~~~~----~~~~~r~~~r~d~~v~~~~a~-~~l~~~~~ 335 (338) T protein:vir:78 287 TA-------TL--TDNTSPT--PQTVSMWQT----NQIAILIEVTFGWLLGDKQAF-VKFVDDED 335 (338) T ss_pred cc-------cc--ccccccc--ccchhhhhc----CcEEEEEEEEeccEeecccce-EEEecccC Confidence 00 00 0000000 011112233 488888999999999999996 67776666 No 76 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=27.79 E-value=1.8 Score=19.17 Aligned_cols=281 Identities=11% Similarity=0.061 Sum_probs=108.4 Q ss_pred Cccc-----------------cc-chhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCc Q lcl|NC_011802. 1 MALN-----------------EG-QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWD 62 (430) Q Consensus 1 ma~~-----------------~~-~~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~ 62 (430) ||-. -+ -|-+....++|+.+++..++.+++++.. - .+.++++|+-..... -.|. T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~-~------~~~~~~~p~~~~~~~-a~~v 72 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVP-M------GTTGQKIPHWIGDVS-AQWI 72 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceee-c------cCCceEEEEEeCCcc-eEEe Confidence 2221 12 2334444889999999999888884321 1 245677776432211 1222 Q ss_pred ccCC---cCccccceeEEEeccccccceEecHHHhhh-HHHHHHHHHHHH-HHHHHHHHHHHHHHHhc-ccce-eeecCC Q lcl|NC_011802. 63 LTDK---ATGLLELNVAVNMGEPDNDFFQLRADDLRD-ETAYRHRIQSAA-RKLANNVELKVANMAAE-MGSL-VITSPD 135 (430) Q Consensus 63 ~s~~---~~di~e~sv~v~ld~~k~V~f~~t~keL~~-~~~~~~~i~~Am-~~LAn~Id~dl~~~~~~-~~~~-v~~~~~ 135 (430) ..+. ..++.=.++.++..+. .+.+.+|.+-|++ ....+++|+..+ ++++..+|..++.=--. .... .+...+ T Consensus 73 ~E~~~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~ 151 (320) T protein:vir:10 73 GEGDMKPITKGNMTSQNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKS 151 (320) T ss_pred cCCccccccccceeEEEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccccc Confidence 2222 1122222333333221 1224444433443 334566665444 68899999888621100 0000 000000 Q ss_pred ----CCCCCcchh---hh-HHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchh Q lcl|NC_011802. 136 ----AIGTNTADA---WN-FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAG 207 (430) Q Consensus 136 ----t~~~~~~~~---~~-~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~G 207 (430) .....+.+. +. ++..+...+.....+ .-..+++|.....+.. +-..+ |+.+ . T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~n~~~~~~L~~----lkd~~------------G~~l-~ 211 (320) T protein:vir:10 152 VSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK---WTHTLLDDIVEPILNG----AKDKN------------GRPL-F 211 (320) T ss_pred ccceecccccccccccHHHHHHHHHhhhhcccCC---CcEEEEcHHHHHHHHH----hhccC------------Ccee-e Confidence 001111111 11 222233333333322 3468899988876542 21111 3322 1 Q ss_pred hhhhhhcCCcccccCcccc---ccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccc Q lcl|NC_011802. 208 FDDVLRSPKLPVLTKSTAT---GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQM 284 (430) Q Consensus 208 fd~~~~~~~v~~~t~gt~t---~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~ 284 (430) .+... .+... +.++.|-+ +-.+..+..|+.+.+-| T Consensus 212 ~~~~~---------~~~~~~~~~~~i~g~p--------------------------v~~~~~~~~~~~~~~~g------- 249 (320) T protein:vir:10 212 IESTY---------TDENSPFRAGRIVSRP--------------------------TILSDHVADGTTVGYMG------- 249 (320) T ss_pred ccccc---------cCccccccCceeeeee--------------------------eEecCCCCCCceEEEEe------- Confidence 00000 00000 00000000 00000111122211111 Q ss_pred ccccccCcceEEEEeeccCceeEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEee Q lcl|NC_011802. 285 AKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQ 364 (430) Q Consensus 285 tk~~~~~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~ 364 (430) +..+|.+ ....+-++.++ + T Consensus 250 ------d~~~~~~-~~~~~~~i~~~------------------------------------------------------~ 268 (320) T protein:vir:10 250 ------DFRNVIW-GQVGGLSFDVT------------------------------------------------------D 268 (320) T ss_pred ------ecceEEE-EEecCeEEEEe------------------------------------------------------e Confidence 1111111 00000000000 0 Q ss_pred ccccCCCcchheeeEEEecCcceEEEEEEEeeecccceeEEEEEeeccceecCcceeEEecCCCCC Q lcl|NC_011802. 365 PIPANHELFAGMKTTSFSIPDVGLNGIFRTQGDISTLFRLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 365 pl~~p~g~~~~~~~~~~~~p~~Glslrv~~~yd~~~~~~~~riDvLyG~~~v~PElagv~l~~q~~ 430 (430) ..- . ...+++. +. ......+++..+|.-.-+|++.++|+.. ++|.+-+| T Consensus 269 ~~~---------~-~~~~~~~-~~-----~~~~f~~~~~~~r~~~~~d~~v~~~~a~-~~l~~~~a 317 (320) T protein:vir:10 269 QAT---------L-NLGTPTE-PN-----FVSLWQHNLVAVRVEAEYAFHNNDKDAF-VKLTNVVT 317 (320) T ss_pred cce---------e-eeccccc-cc-----cchhhhcCcEEEEEEEeeccEEecccce-EEEEeccC Confidence 000 0 0000000 00 0011234577777778889999999986 78888888 No 77 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=27.76 E-value=1.8 Score=19.17 Aligned_cols=274 Identities=11% Similarity=0.088 Sum_probs=93.2 Q ss_pred Cccc----ccc-hh-hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCccc--CC---cCc Q lcl|NC_011802. 1 MALN----EGQ-IV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLT--DK---ATG 69 (430) Q Consensus 1 ma~~----~~~-~~-t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s--~~---~~d 69 (430) +|.+ ++. ++ +-+..+|++.++...++.+++++.. .+..+++|+............. +. ..+ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~--------~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~ 212 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVK--------TKENIKYPVLVKKAEAQGHKNERTNNEMPETD 212 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceec--------cCCceEEEEEecCCcccceecccccccccccc Confidence 2222 222 22 3334779999999999988885432 1224666665332222221111 11 112 Q ss_pred cccceeEEEeccccccceEecHHHh-hhHH-HHHHHHHH-HHHHHHHHHHHHHHHHH-hcc-cceeeecC-CCCCCCcch Q lcl|NC_011802. 70 LLELNVAVNMGEPDNDFFQLRADDL-RDET-AYRHRIQS-AARKLANNVELKVANMA-AEM-GSLVITSP-DAIGTNTAD 143 (430) Q Consensus 70 i~e~sv~v~ld~~k~V~f~~t~keL-~~~~-~~~~~i~~-Am~~LAn~Id~dl~~~~-~~~-~~~v~~~~-~t~~~~~~~ 143 (430) ..=.++.++..+... .+.+ ++|| .+.. ..+.+|+. -..+|+..+|..++.=- ... ........ .+..+.... T Consensus 213 ~~f~~v~~~~~k~~~-~~~i-S~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~ 290 (434) T protein:vir:62 213 IEFDEIELSPTEFDA-LATV-TKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKN 290 (434) T ss_pred cceeeEEeeheeeEe-ehhh-HHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccc Confidence 222234443333222 2233 5554 4433 35667754 44588899998887210 000 00001100 111112223 Q ss_pred hhhHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCc Q lcl|NC_011802. 144 AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKS 223 (430) Q Consensus 144 ~~~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~g 223 (430) .|+++......|.....+ +=..++||.+...+.. + -..+ ||.+ |. |..... T Consensus 291 ~~d~l~~l~~~l~~~~~~---~a~~v~n~~~~~~L~~-l---kd~~------------G~~l----~~------~~~~~~ 341 (434) T protein:vir:62 291 LYDALVKMKNTPVKEVRK---KARWVLNTAALTKIET-M---KTDD------------GFPL----LR------PFNQAE 341 (434) T ss_pred hhhHHHHHHhhcchhhhc---CCEEEEcHHHHHHHHH-h---hccC------------CCEe----ec------cCCCcc Confidence 466665554444443222 2246899988866542 2 1111 3322 10 000000 Q ss_pred cccccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccc-cCcce-EEEEeec Q lcl|NC_011802. 224 TATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVL-AQDAT-FSVVRVV 301 (430) Q Consensus 224 t~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~-~~l~~-fvVta~~ 301 (430) .+...++.|-+-. ++ +.-..+..+....+. -|-++.+=++-.-|...+-.. .+.. ...+. |++..-. T Consensus 342 ~g~~~tl~G~pV~------~~-~~~~~~~~~~~~~i~---~Gdfs~~~i~~~~g~~~i~~~-~~~~~~~~~v~~~~~~r~ 410 (434) T protein:vir:62 342 GGIGYTLLGFPVE------EE-DAIDIPDSPDTPVFY---FGDFSKFYIQDVIGSLEVQKL-VELFSRTNRVGFRIWNLL 410 (434) T ss_pred CCCCceecceeeE------Ee-cCccCccCCCceEEE---EeeccceEEEEeeceeEEEee-hhhhcccCceEEEEEeee Confidence 1122233333210 00 000000000000000 000000000000010000000 0000 11222 5554433 Q ss_pred cCceeEEe-eccccccccccccccccccccccccccCceeEEeccCCccc Q lcl|NC_011802. 302 DGTHVEIT-PKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDART 350 (430) Q Consensus 302 ~a~tv~I~-Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~ 350 (430) ++- .|+ |- +-+.+.+.+...+.. T Consensus 411 Dgk--~i~~~~------------------------~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 411 DAQ--LIHSPF------------------------EVPVYKYVLKAPTGA 434 (434) T ss_pred cce--eecCcc------------------------cceEEEEEeccCCCC Confidence 322 122 21 111222222221111 No 78 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=25.80 E-value=2 Score=18.91 Aligned_cols=239 Identities=13% Similarity=0.086 Sum_probs=88.0 Q ss_pred Ccccccchh--hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC--c--Cccccce Q lcl|NC_011802. 1 MALNEGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK--A--TGLLELN 74 (430) Q Consensus 1 ma~~~~~~~--t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~--~--~di~e~s 74 (430) .-..++..+ +-...+|++.+++..++.++++... . .+.+..+|+.......-+|...+. + .+..=.+ T Consensus 131 ~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~-~------~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 203 (394) T protein:vir:97 131 IKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ-A------KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) T ss_pred cccccccccChHHHHHHHHHHhhhhhhhhhhceeee-c------cCcceEEEEEecCCCccceeccccccccccccccee Confidence 011122222 3334678888888888877774322 1 122345554422211112222221 1 1122234 Q ss_pred eEEEeccccccceEecHHHh-hhHH-HHHHHHHHH-HHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDL-RDET-AYRHRIQSA-ARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL-~~~~-~~~~~i~~A-m~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a 151 (430) +.++..+... .+.+ ++|| .+.. ..+.+|+.. ..+|+..+|..++.-. ++..+.+...++++..+ T Consensus 204 v~l~~~k~~~-~i~i-s~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~-----------~~~~~~~~~~~~~~~~~ 270 (394) T protein:vir:97 204 VAWNIDTYRG-AIPL-SQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL-----------KSFTTKTVKNLDEIKAL 270 (394) T ss_pred EEeehhheee-ehhh-HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------ccccccccccHHHHHHH Confidence 4444433332 2333 4444 3322 255666543 3467777777665311 11222233445555433 Q ss_pred HHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccc Q lcl|NC_011802. 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) Q Consensus 152 ~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~ 231 (430) -. ...-|. .+-..++||.+...+.. +-..+ ||.+ | . .. .+.| ++.++. T Consensus 271 ~~---~~~~~~-~~a~~v~n~~~~~~l~~----lkd~~------------G~~i----~--~-~~---~~~~--~~~~l~ 318 (394) T protein:vir:97 271 LN---GGFDPA-YNVSLIVSQSFYQTLDT----LKDGN------------GRYL----L--Q-DD---ITAV--SGKVLL 318 (394) T ss_pred HH---hhhhhh-hCCEEEEcHHHHHHHHH----hhccC------------CCee----e--e-cC---cCCC--CCceec Confidence 32 222232 34568899998766532 21111 3322 1 0 00 0111 112233 Q ss_pred cccccccceeeeeeccccccccceeeeEEeeccceeecccE---EEEc---c--eeeecccccccccCcceEEEEeeccC Q lcl|NC_011802. 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDK---ISFT---G--VKFLGQMAKNVLAQDATFSVVRVVDG 303 (430) Q Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv---~Tia---G--V~~v~~~tk~~~~~l~~fvVta~~~a 303 (430) |-+-. ..+... .+.+.+.-||. +.|. | +.+.. .....+.|++..-.+. T Consensus 319 G~pv~-----~~~~~~--------------~~~~~~~~gd~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~r~d~ 374 (394) T protein:vir:97 319 GKPVF-----VLSDEV--------------LGANKAFIGDFKRGVLFADRKDLGLRWAD-----NEIYGQYLQAVLRFGV 374 (394) T ss_pred cceeE-----Eecccc--------------cCCccEEEeeccccEEEEEecceEEEEec-----ccccceeEEEEEEEcc Confidence 32210 000000 00111112220 0000 0 00000 0111123444332221 Q ss_pred --------ceeEEeeccccc Q lcl|NC_011802. 304 --------THVEITPKPVAL 315 (430) Q Consensus 304 --------~tv~I~Pai~~~ 315 (430) -.++++|+..|+ T Consensus 375 ~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 375 SKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EEecccceEEEEecccccCC Confidence 135666655543 No 79 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=23.75 E-value=2.3 Score=18.64 Aligned_cols=266 Identities=8% Similarity=0.026 Sum_probs=96.6 Q ss_pred Cccc----ccchhhh-hH-----HHHHHHHhhhcccchh---cccCCCchhhhhccCcEEEEecCcccccccCCcccCCc Q lcl|NC_011802. 1 MALN----EGQIVTL-AV-----DEIIETISAITPMAQK---AKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKA 67 (430) Q Consensus 1 ma~~----~~~~~t~-~~-----~evi~~len~lvmA~~---V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~~ 67 (430) |-.. |.||.+. .| -++..+|+.++.==.. |.+..|.+ .|+||.+.+...+.-..|-...+.+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla-----~GstIkt~k~~~y~gda~dVaEGe~ 75 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVS-----EGMTLKTYAGYDVTLAEGNVPEGEV 75 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhccccccc-----CCCEEeeccceeeeeccccccCCcc Confidence 5554 5554432 01 1334443333221111 12233433 3999866543333222221122221 Q ss_pred --Cccccc----eeEEEeccccccceEecHHHhhhHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCC Q lcl|NC_011802. 68 --TGLLEL----NVAVNMGEPDNDFFQLRADDLRDETAYR---HRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIG 138 (430) Q Consensus 68 --~di~e~----sv~v~ld~~k~V~f~~t~keL~~~~~~~---~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~ 138 (430) -+-++. ...+++.++... .|++.+...-+.. +-=+.=.+.|+++|+.|++...+....-+ .. T Consensus 76 Iplskvt~~~~~t~t~~ikK~rK~---tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~---~~--- 146 (296) T protein:vir:98 76 IPLSKVERKIHSEKKIELKKYRKA---TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ---DA--- 146 (296) T ss_pred cchhhheeeecceEEEEeeccccc---cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccccee---ee--- Confidence 011111 234555554443 2555542111111 11122345799999999988776553211 10 Q ss_pred CCcchhhhH-----HHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhh Q lcl|NC_011802. 139 TNTADAWNF-----VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLR 213 (430) Q Consensus 139 ~~~~~~~~~-----~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~ 213 (430) ..+.+.. +..+...+++.. +...-+|+||.+.+..+++.. + -.+..-.--|.++ |.|- .+++ T Consensus 147 --t~~~lQ~Ala~~~~~l~~~feded---~~~~V~FVnP~D~a~ylg~a~-i-t~qt~fG~tyl~n-----fLG~-~II~ 213 (296) T protein:vir:98 147 --LGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAG-I-TTQTAFGLTYLVD-----FTGT-VIIS 213 (296) T ss_pred --chhhHHHHHHHHhhhhhhhccccC---CCceEEEEehHHHHHHhcCCc-c-chhheechhhhhh-----cccc-EEEE Confidence 1122221 222334555543 234678899999988776441 2 1222222233333 4553 4677 Q ss_pred cCCcccccCccccccc---cccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeeccccccccc Q lcl|NC_011802. 214 SPKLPVLTKSTATGIT---VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLA 290 (430) Q Consensus 214 ~~~v~~~t~gt~t~~t---v~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~ 290 (430) |..+|. |+.-... ++.+.. +..+...+ ..+..-+|.-=+=||.- ++.++..+. T Consensus 214 S~kV~~---G~~~~T~~~Ni~~ay~--------~~~~~~l~------------~~f~~~~d~tglIGv~h-~~~~~~~t~ 269 (296) T protein:vir:98 214 TNDVTK---GEIWATVPENIIFAYI--------NPNNSELA------------KEFNLYGDPTGYIGMNH-FQENTTLTI 269 (296) T ss_pred cCcCCC---ceEEEeeecceEEEee--------cccccchh------------hhhccccccccceEEEe-ccccceeee Confidence 777774 3211111 122211 00000000 00111112222223210 111111100 Q ss_pred CcceEEEEeeccCceeEEeecccccccccccccccccccccccc Q lcl|NC_011802. 291 QDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSL 334 (430) Q Consensus 291 ~l~~fvVta~~~a~tv~I~Pai~~~~~~~~~~~~~~~~nVta~~ 334 (430) . ...-..++++|-+.- - .-+ .+.+++. T Consensus 270 e--------T~~~~~~~lfpE~~d--g-----iv~--~tI~~~~ 296 (296) T protein:vir:98 270 Q--------TLLVSGMLMYPERID--G-----IVK--VTLTPGV 296 (296) T ss_pred h--------hHhHhHHHhcccccc--e-----EEE--EEecCCC Confidence 0 011122334442210 0 000 0000000 No 80 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=23.64 E-value=2.3 Score=18.62 Aligned_cols=266 Identities=14% Similarity=0.065 Sum_probs=99.3 Q ss_pred Cccc---ccc-hhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccc Q lcl|NC_011802. 1 MALN---EGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLEL 73 (430) Q Consensus 1 ma~~---~~~-~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~ 73 (430) |... .+. +.+...+++|+.+....++-+++...+ . .|..+.+|+-......-.|...+. ..+..=. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-~------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 177 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGR-T------SSNALEYVREEVFTNNADVVAEKALKPESDITFS 177 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceec-c------cCcceEEEEEecCCcceeeeccCcccccccccee Confidence 2222 222 334445788888888888888874322 1 123444443221111111111111 1232233 Q ss_pred eeEEEeccccccceEecHHHh-hhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcc---cce---eeecCCCCCCCcchhh Q lcl|NC_011802. 74 NVAVNMGEPDNDFFQLRADDL-RDETAYRHRIQSAA-RKLANNVELKVANMAAEM---GSL---VITSPDAIGTNTADAW 145 (430) Q Consensus 74 sv~v~ld~~k~V~f~~t~keL-~~~~~~~~~i~~Am-~~LAn~Id~dl~~~~~~~---~~~---v~~~~~t~~~~~~~~~ 145 (430) ++.++..+... .+.+| +|| .+....+++|+..+ .+++..+|..++.=--.. .+. ......+........+ T Consensus 178 ~~~~~~~k~~~-~~~is-~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 255 (385) T protein:vir:18 178 KQTANVKTIAH-WVQAS-RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRA 255 (385) T ss_pred EEEEeeeeEEE-eehhh-HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchH Confidence 34444444332 24455 454 44555677776544 588999998886311000 000 0000001111112345 Q ss_pred hHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccc Q lcl|NC_011802. 146 NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 146 ~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~ 225 (430) +++..+-..|.....+. -..+++|.+...+.. +-..+ ||.+ +. . + ..| T Consensus 256 d~i~~~~~~l~~~~~~~---~~~~~~~~~~~~l~~----lkd~~------------G~~l------~~-~--~--~~~-- 303 (385) T protein:vir:18 256 DIIAHAIYQVTESEFSA---SGIVLNPRDWHNIAL----LKDNE------------GRYI------FG-G--P--QAF-- 303 (385) T ss_pred HHHHHHHHhhccccCCC---CEEEEcHHHHHHHHH----hhcCC------------Ccee------cc-C--c--ccC-- Confidence 66666666666555443 358899998876542 11111 2322 10 0 0 011 Q ss_pred cccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCce Q lcl|NC_011802. 226 TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 226 t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~t 305 (430) +..++.|-+- -.+..+.+|+++ | | +.+++.+..+..+-+ T Consensus 304 ~~~~l~G~pV--------------------------~~~~~~p~~~~~-~-g-------------d~~~~~~~~~~~~~~ 342 (385) T protein:vir:18 304 TSNIMWGLPV--------------------------VPTKAQAAGTFT-V-G-------------GFDMASQVWDRMDAT 342 (385) T ss_pred CCceecceee--------------------------EEcCcCCCCcEE-E-e-------------ecccEEEEEEecceE Confidence 1112222210 001112233322 1 1 112222222222222 Q ss_pred eEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecccc Q lcl|NC_011802. 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 306 v~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~ 368 (430) |.++...-. +-.......+.+.-+.-. ..+++||+..+..-.. T Consensus 343 v~~~~~~~~-~~~~~~~~~~~~~r~~~~-------------------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 343 VEVSREDRD-NFVKNMLTILCEERLALA-------------------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccccc-hhhcCcEEEEEEEeeccE-------------------EecccceEEEEeccCC Confidence 322221100 000000001111111111 2333343333332221 No 81 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=23.64 E-value=2.3 Score=18.62 Aligned_cols=266 Identities=14% Similarity=0.065 Sum_probs=99.3 Q ss_pred Cccc---ccc-hhhhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccCC---cCccccc Q lcl|NC_011802. 1 MALN---EGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK---ATGLLEL 73 (430) Q Consensus 1 ma~~---~~~-~~t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~~---~~di~e~ 73 (430) |... .+. +.+...+++|+.+....++-+++...+ . .|..+.+|+-......-.|...+. ..+..=. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-~------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 177 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGR-T------SSNALEYVREEVFTNNADVVAEKALKPESDITFS 177 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceec-c------cCcceEEEEEecCCcceeeeccCcccccccccee Confidence 2222 222 334445788888888888888874322 1 123444443221111111111111 1232233 Q ss_pred eeEEEeccccccceEecHHHh-hhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcc---cce---eeecCCCCCCCcchhh Q lcl|NC_011802. 74 NVAVNMGEPDNDFFQLRADDL-RDETAYRHRIQSAA-RKLANNVELKVANMAAEM---GSL---VITSPDAIGTNTADAW 145 (430) Q Consensus 74 sv~v~ld~~k~V~f~~t~keL-~~~~~~~~~i~~Am-~~LAn~Id~dl~~~~~~~---~~~---v~~~~~t~~~~~~~~~ 145 (430) ++.++..+... .+.+| +|| .+....+++|+..+ .+++..+|..++.=--.. .+. ......+........+ T Consensus 178 ~~~~~~~k~~~-~~~is-~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 255 (385) T protein:vir:19 178 KQTANVKTIAH-WVQAS-RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRA 255 (385) T ss_pred EEEEeeeeEEE-eehhh-HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchH Confidence 34444444332 24455 454 44555677776544 588999998886311000 000 0000001111112345 Q ss_pred hHHHHHHHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccc Q lcl|NC_011802. 146 NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 146 ~~~a~a~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~ 225 (430) +++..+-..|.....+. -..+++|.+...+.. +-..+ ||.+ +. . + ..| T Consensus 256 d~i~~~~~~l~~~~~~~---~~~~~~~~~~~~l~~----lkd~~------------G~~l------~~-~--~--~~~-- 303 (385) T protein:vir:19 256 DIIAHAIYQVTESEFSA---SGIVLNPRDWHNIAL----LKDNE------------GRYI------FG-G--P--QAF-- 303 (385) T ss_pred HHHHHHHHhhccccCCC---CEEEEcHHHHHHHHH----hhcCC------------Ccee------cc-C--c--ccC-- Confidence 66666666666555443 358899998876542 11111 2322 10 0 0 011 Q ss_pred cccccccccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCce Q lcl|NC_011802. 226 TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 226 t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~t 305 (430) +..++.|-+- -.+..+.+|+++ | | +.+++.+..+..+-+ T Consensus 304 ~~~~l~G~pV--------------------------~~~~~~p~~~~~-~-g-------------d~~~~~~~~~~~~~~ 342 (385) T protein:vir:19 304 TSNIMWGLPV--------------------------VPTKAQAAGTFT-V-G-------------GFDMASQVWDRMDAT 342 (385) T ss_pred CCceecceee--------------------------EEcCcCCCCcEE-E-e-------------ecccEEEEEEecceE Confidence 1112222210 001112233322 1 1 112222222222222 Q ss_pred eEEeeccccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecccc Q lcl|NC_011802. 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 306 v~I~Pai~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~ 368 (430) |.++...-. +-.......+.+.-+.-. ..+++||+..+..-.. T Consensus 343 v~~~~~~~~-~~~~~~~~~~~~~r~~~~-------------------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 343 VEVSREDRD-NFVKNMLTILCEERLALA-------------------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccccc-hhhcCcEEEEEEEeeccE-------------------EecccceEEEEeccCC Confidence 322221100 000000001111111111 2333343333332221 No 82 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=21.99 E-value=2.5 Score=18.39 Aligned_cols=255 Identities=11% Similarity=0.011 Sum_probs=103.2 Q ss_pred Ccccccchh--hhhHHHHHHHHhhhcccchhcccCCCchhhhhccCcEEEEecCcccccccCCcccC-CcC---ccccce Q lcl|NC_011802. 1 MALNEGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTD-KAT---GLLELN 74 (430) Q Consensus 1 ma~~~~~~~--t~~~~evi~~len~lvmA~~V~~~r~~~~~~~k~GdTV~i~~P~~~~~~~g~~~s~-~~~---di~e~s 74 (430) +...++..+ +-...+||+.++...++.++++... -.+.++.+|++......-+|...+ ... +..=.+ T Consensus 137 ~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 209 (400) T protein:vir:38 137 VKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQ-------ASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKP 209 (400) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEe-------ccCcceEEEEEecCCCcccccccccccccccccccee Confidence 222233322 2234678888888887777773211 123456667664332221222111 111 112223 Q ss_pred eEEEeccccccceEecHHHh-hhHH-HHHHHHHHH-HHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHH Q lcl|NC_011802. 75 VAVNMGEPDNDFFQLRADDL-RDET-AYRHRIQSA-ARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) Q Consensus 75 v~v~ld~~k~V~f~~t~keL-~~~~-~~~~~i~~A-m~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a 151 (430) +.++..+... -+.+ ++|| .+.. ..+.+|... ..+|+..+|..++.-. ++..+++...++++..+ T Consensus 210 i~~~~~k~~~-~~~i-s~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~-----------~~~~~~~~~~~~~~~~~ 276 (400) T protein:vir:38 210 VNWSVETYRQ-ALPV-SQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLL-----------KGFTAKTISSVDDLKHI 276 (400) T ss_pred eEeehhheee-ehhh-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcc-----------ccccccccccHHHHHHH Confidence 4444333331 2333 4454 3322 355566443 3466666666664211 11222233455655544 Q ss_pred HHHHHhhcCCCCCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhccccccchhhhhhhhcCCcccccCccccccccc Q lcl|NC_011802. 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) Q Consensus 152 ~~~L~~~~aP~~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~v~~~t~gt~t~~tv~ 231 (430) -... .-|. .+-..++||.....+.. +- ++ -|+.+ +. +. .+.| ++.++. T Consensus 277 ~~~~---~~~~-~~a~~v~~~~~~~~l~~----lk-----------d~-~G~~i------~~-~~---~~~~--~~~~l~ 324 (400) T protein:vir:38 277 NNVD---LDPA-YSRVIIASQSFYNFLDT----VK-----------DG-NGRYL------LQ-DS---ILTP--SGKSVL 324 (400) T ss_pred HHhh---hhhh-hCcEEEEcHHHHHHHHH----hh-----------cc-CCCee------ee-cC---cCCC--Cccccc Confidence 3222 1222 34568899988766432 21 11 14422 11 11 1112 122334 Q ss_pred cccccccceeeeeeccccccccceeeeEEeeccceeecccEEEEcceeeecccccccccCcceEEEEeeccCceeEEeec Q lcl|NC_011802. 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPK 311 (430) Q Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~~~~t~s~tgtlkaGDv~TiaGV~~v~~~tk~~~~~l~~fvVta~~~a~tv~I~Pa 311 (430) |-+- ... .......+||.+.|-| +.++|+...+-.+-++..+.- T Consensus 325 G~pv--------~~~---------------~~~~~~~~g~~~~~~g-------------d~s~~~~~~~~~~~~~~~~~~ 368 (400) T protein:vir:38 325 GMPI--------AVV---------------SDDTLGAAGEAHAFLG-------------DIKRAILFANRADFMVRWVDD 368 (400) T ss_pred ccee--------EEe---------------cccccCCCCceEEEEE-------------eccccEEEEeecceEEEEecc Confidence 3331 000 0011122566655555 455554444333444444321 Q ss_pred cccccccccccccccccccccccccCceeEEeccCCcccceeeccceeeEEeecccc Q lcl|NC_011802. 312 PVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 312 i~~~~~~~~~~~~~~~~nVta~~A~~aavTv~~~~s~~~Nl~Fhr~A~aLat~pl~~ 368 (430) .. .....+.|..+ + =-..|++||+..+..... T Consensus 369 ~~------~~~~~~~~~r~-------------d------~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 369 QI------YGQFLQAGMRF-------------G------VSVADEKAGYFLTYTPKA 400 (400) T ss_pred cc------cceeEEEEEEe-------------c------cEEecccceEEEEeecCC Confidence 10 00011222111 1 134466666666554332 No 83 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=21.02 E-value=2.7 Score=18.24 Aligned_cols=100 Identities=11% Similarity=0.043 Sum_probs=57.8 Q ss_pred ccccceEecHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeecCCCCCCCcchhhhHHHHHHHHHHhh-- Q lcl|NC_011802. 82 PDNDFFQLRADDLR-DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSR-- 158 (430) Q Consensus 82 ~k~V~f~~t~keL~-~~~~~~~~i~~Am~~LAn~Id~dl~~~~~~~~~~v~~~~~t~~~~~~~~~~~~a~a~~~L~~~-- 158 (430) ...|.|.|...+|+ ....+++.+..+|..+++..-..|-+.+++.++|.--+.. ||+-|..- T Consensus 1 ~~~~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~---------------ARqgl~~~~~ 65 (123) T protein:vir:74 1 MAKVTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGA---------------ARSGLLAVAN 65 (123) T ss_pred CceeEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchh---------------hhhhhccccc Confidence 67788999999985 4556688889999999999999999999999988754322 23322110 Q ss_pred --c-----------CC----C---CCCcEEEeChHHHHHHHHhhhhhhcccchhhHHHHhcc Q lcl|NC_011802. 159 --E-----------LN----R---DMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGT 200 (430) Q Consensus 159 --~-----------aP----~---~~~R~~vl~p~~~~~~~~~~~~l~~~~~~~~~a~r~g~ 200 (430) | +. . .+.+-.+|-|+...-...-+.++.+ .-..+|+++ T Consensus 66 ~~g~~~~~Iylsh~veYG~~LEla~~~kyaIi~Ptv~~~~~~im~g~~~----ll~~l~~~~ 123 (123) T protein:vir:74 66 KLGPGSHELIMSYSVHYGIWLEIANSGQYAVIGPFLPVMGRKLMHDLEH----LIDRLERAQ 123 (123) T ss_pred cCCCceEEEEEecCeeecceeeecCCCCceeecchHHHHhHHHHHHHHH----HHHHhhccC Confidence 0 00 0 0234455555544332222222211 222333333 Done!