Query lcl|NC_019501.1_cdsid_YP_007004324.1 [gene=5] [protein=coat protein] [protein_id=YP_007004324.1] [location=24368..25663] Match_columns 431 No_of_seqs 53 out of 64 Neff 5.4 Searched_HMMs 1612 Date Thu Nov 7 17:14:22 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_20 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_20_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:2106 Length: 430 # 100.0 4E-172 2E-175 960.3 35.8 430 1-431 1-430 (430) 2 protein:vir:9265 Length: 430 # 100.0 7E-171 4E-174 953.4 35.3 430 1-431 1-430 (430) 3 protein:vir:100939 Length: 430 100.0 7E-171 4E-174 953.4 35.3 430 1-431 1-430 (430) 4 protein:vir:108303 Length: 418 100.0 1E-120 8E-124 678.0 32.5 400 1-431 1-417 (418) 5 protein:vir:105522 Length: 423 100.0 6E-115 3E-118 647.0 32.2 399 1-430 1-423 (423) 6 protein:vir:105374 Length: 423 100.0 3E-113 2E-116 637.7 30.7 397 1-430 1-423 (423) 7 protein:vir:174 Length: 423 # 100.0 3E-112 2E-115 632.4 31.0 402 1-430 1-423 (423) 8 protein:vir:3525 Length: 423 # 100.0 1E-111 7E-115 628.8 29.9 398 1-430 1-423 (423) 9 protein:vir:99075 Length: 392 100.0 3.5E-40 2.2E-43 236.9 24.9 373 1-411 1-392 (392) 10 protein:vir:105822 Length: 273 100.0 1E-32 6.4E-36 196.0 18.9 266 1-430 1-273 (273) 11 protein:vir:102605 Length: 273 100.0 1E-32 6.4E-36 196.0 18.9 266 1-430 1-273 (273) 12 protein:vir:7990 Length: 273 # 100.0 1.2E-31 7.5E-35 190.1 18.7 266 1-431 1-272 (273) 13 protein:vir:94622 Length: 341 100.0 4.3E-31 2.7E-34 187.0 15.7 315 1-431 1-340 (341) 14 protein:vir:80180 Length: 381 99.9 1.4E-25 8.5E-29 156.9 15.8 348 1-431 1-381 (381) 15 protein:vir:1541 Length: 347 # 99.8 5.5E-21 3.4E-24 131.6 16.0 300 1-431 1-345 (347) 16 protein:vir:3364 Length: 347 # 99.7 1.9E-20 1.2E-23 128.7 14.5 300 1-431 1-345 (347) 17 protein:vir:94711 Length: 347 99.7 7.8E-21 4.8E-24 130.8 11.9 304 1-431 1-346 (347) 18 protein:vir:78739 Length: 332 99.7 6.5E-20 4E-23 125.8 12.6 291 1-428 19-332 (332) 19 protein:vir:10450 Length: 344 99.7 1.6E-19 1E-22 123.6 12.1 299 1-430 1-344 (344) 20 protein:vir:94576 Length: 347 99.6 3.6E-18 2.2E-21 116.2 13.3 303 1-431 1-347 (347) 21 protein:vir:8885 Length: 347 # 99.6 1.1E-17 6.6E-21 113.6 13.3 303 1-431 1-347 (347) 22 protein:vir:93742 Length: 274 99.6 5.4E-17 3.4E-20 109.8 16.1 258 1-431 1-270 (274) 23 protein:vir:80930 Length: 278 99.6 5.7E-17 3.5E-20 109.7 14.7 265 1-431 1-277 (278) 24 protein:vir:2201 Length: 345 # 99.6 6.4E-17 4E-20 109.4 13.6 300 1-430 1-345 (345) 25 protein:vir:1239 Length: 274 # 99.6 2.1E-16 1.3E-19 106.5 16.4 258 1-431 1-270 (274) 26 protein:vir:100057 Length: 375 99.6 3E-16 1.8E-19 105.7 16.8 323 1-431 9-371 (375) 27 protein:vir:96123 Length: 274 99.5 4.5E-16 2.8E-19 104.7 17.2 258 1-431 1-270 (274) 28 protein:vir:95898 Length: 274 99.5 6.2E-16 3.9E-19 104.0 16.3 256 1-431 1-268 (274) 29 protein:vir:96262 Length: 274 99.5 6.2E-16 3.9E-19 104.0 16.3 256 1-431 1-268 (274) 30 protein:vir:97433 Length: 274 99.5 1E-15 6.2E-19 102.8 16.4 258 1-431 1-270 (274) 31 protein:vir:94494 Length: 274 99.5 1E-15 6.2E-19 102.8 16.4 258 1-431 1-270 (274) 32 protein:vir:3613 Length: 272 # 99.5 5.4E-16 3.3E-19 104.3 14.9 260 1-314 1-272 (272) 33 protein:vir:103323 Length: 364 99.5 6.9E-15 4.3E-18 98.2 17.3 298 1-431 15-340 (364) 34 protein:vir:3136 Length: 322 # 99.4 2E-15 1.2E-18 101.2 13.5 284 1-343 1-322 (322) 35 protein:vir:99675 Length: 324 99.4 1.5E-15 9.2E-19 101.9 11.3 273 26-431 1-297 (324) 36 protein:vir:96833 Length: 275 99.4 1.7E-14 1.1E-17 96.0 15.9 258 1-431 1-271 (275) 37 protein:vir:9820 Length: 272 # 99.3 1.7E-13 1.1E-16 90.6 17.8 256 1-430 1-272 (272) 38 protein:vir:3033 Length: 272 # 99.3 1.7E-13 1.1E-16 90.6 17.8 256 1-430 1-272 (272) 39 protein:vir:105334 Length: 276 99.3 6.5E-13 4.1E-16 87.4 16.1 259 1-431 1-271 (276) 40 protein:vir:80213 Length: 334 99.2 3.1E-13 1.9E-16 89.1 11.9 290 1-431 1-333 (334) 41 protein:vir:78935 Length: 335 99.2 1.1E-12 6.9E-16 86.1 14.2 288 1-431 1-330 (335) 42 protein:vir:739 Length: 231 # 99.1 1.5E-12 9.2E-16 85.4 10.6 225 38-314 1-231 (231) 43 protein:vir:97031 Length: 402 99.1 1.6E-12 9.9E-16 85.3 9.9 292 1-431 15-334 (402) 44 protein:vir:7019 Length: 401 # 98.9 2.6E-11 1.6E-14 78.6 10.2 291 1-431 1-334 (401) 45 protein:vir:1781 Length: 221 # 98.8 4.4E-11 2.8E-14 77.3 9.8 203 79-388 1-221 (221) 46 protein:vir:102655 Length: 322 98.8 7.3E-10 4.5E-13 70.7 16.3 287 1-431 1-321 (322) 47 protein:vir:105645 Length: 400 98.8 8.1E-11 5E-14 75.9 9.4 292 1-431 1-334 (400) 48 protein:vir:6324 Length: 335 # 98.7 2.2E-09 1.4E-12 68.0 16.1 289 1-333 15-335 (335) 49 protein:vir:107120 Length: 329 98.7 1.8E-08 1.1E-11 63.1 19.2 266 1-431 33-306 (329) 50 protein:vir:79008 Length: 299 98.6 3.4E-08 2.1E-11 61.6 18.1 288 1-336 1-299 (299) 51 protein:vir:95107 Length: 270 98.5 1.6E-08 1E-11 63.3 13.7 256 1-321 1-270 (270) 52 protein:vir:97331 Length: 319 98.3 3.1E-07 1.9E-10 56.3 17.7 282 1-343 19-319 (319) 53 protein:vir:94800 Length: 319 98.3 3.1E-07 1.9E-10 56.3 17.7 282 1-343 19-319 (319) 54 protein:vir:78920 Length: 290 97.3 9.6E-05 5.9E-08 42.6 15.9 276 1-333 1-290 (290) 55 protein:vir:102335 Length: 312 96.7 0.00043 2.7E-07 39.0 16.7 288 1-338 1-312 (312) 56 protein:vir:79712 Length: 285 96.2 0.00068 4.2E-07 38.0 13.1 265 1-303 1-285 (285) 57 protein:vir:105464 Length: 346 93.4 0.0075 4.7E-06 32.2 16.2 313 1-357 1-346 (346) 58 protein:vir:95875 Length: 401 89.1 0.028 1.8E-05 29.1 13.1 311 1-431 16-400 (401) 59 protein:vir:6212 Length: 434 # 83.6 0.067 4.1E-05 27.0 15.1 277 1-388 141-434 (434) 60 protein:vir:102944 Length: 330 83.3 0.07 4.3E-05 26.9 12.7 286 1-323 1-330 (330) 61 protein:vir:78090 Length: 302 83.2 0.07 4.4E-05 26.9 14.9 286 1-336 1-302 (302) 62 protein:vir:3870 Length: 400 # 78.1 0.12 7.3E-05 25.7 13.2 255 1-369 137-400 (400) 63 protein:vir:4339 Length: 395 # 74.7 0.16 9.6E-05 25.0 17.9 267 1-368 117-395 (395) 64 protein:vir:9927 Length: 295 # 74.5 0.16 9.8E-05 25.0 10.4 258 1-316 1-295 (295) 65 protein:vir:2430 Length: 318 # 65.4 0.28 0.00018 23.6 17.2 270 1-431 14-313 (318) 66 protein:vir:5974 Length: 324 # 60.4 0.37 0.00023 22.9 13.7 292 1-359 1-324 (324) 67 protein:vir:104085 Length: 320 57.6 0.43 0.00027 22.6 17.9 283 1-431 1-317 (320) 68 protein:vir:94771 Length: 298 57.1 0.44 0.00027 22.5 15.6 275 1-337 1-298 (298) 69 protein:vir:8187 Length: 311 # 44.7 0.8 0.00049 21.1 16.8 287 1-369 2-311 (311) 70 protein:vir:8420 Length: 477 # 37.8 1.1 0.00068 20.4 10.9 275 1-308 160-477 (477) 71 protein:vir:1638 Length: 298 # 35.9 1.2 0.00075 20.1 17.1 271 1-367 1-298 (298) 72 protein:vir:10364 Length: 390 35.5 1.2 0.00076 20.1 16.5 260 1-334 117-390 (390) 73 protein:vir:4830 Length: 397 # 34.1 1.3 0.00082 19.9 16.9 274 1-379 109-397 (397) 74 protein:vir:9704 Length: 394 # 33.7 1.3 0.00083 19.9 16.5 239 1-316 131-394 (394) 75 protein:vir:7771 Length: 330 # 33.5 1.3 0.00084 19.9 16.1 284 1-326 1-330 (330) 76 protein:vir:3991 Length: 404 # 28.7 1.7 0.0011 19.3 13.9 273 1-379 114-404 (404) 77 protein:vir:108211 Length: 318 28.5 1.7 0.0011 19.3 12.8 263 1-316 20-318 (318) 78 protein:vir:3426 Length: 117 # 24.3 1.4 0.00089 19.7 4.1 113 167-341 1-117 (117) 79 protein:vir:4856 Length: 293 # 22.4 2.4 0.0015 18.5 14.3 277 1-379 5-293 (293) 80 protein:vir:80491 Length: 467 22.4 2.4 0.0015 18.5 11.1 360 1-431 34-458 (467) 81 protein:vir:63741 Length: 468 22.1 2.5 0.0015 18.4 10.9 363 1-431 1-459 (468) 82 protein:vir:395 Length: 117 # 21.0 2.4 0.0015 18.5 4.6 110 167-341 1-117 (117) 83 protein:vir:96223 Length: 324 20.0 2.8 0.0018 18.1 12.9 266 1-307 30-324 (324) No 1 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=100.00 E-value=4e-172 Score=960.27 Aligned_cols=430 Identities=72% Similarity=1.144 Sum_probs=420.3 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccCccccccceeEEEec Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGNATGILELSVKCNMG 80 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~~di~e~~V~v~ld 80 (431) ||+|+|++||++++|+|+.|||+||||++|++||||+.||+|+|||||||+|+++++++|++.+++++|++|++||++|| T Consensus 1 Ma~~~~~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t~~~~~~~e~~v~~~~~ 80 (430) T protein:vir:21 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccccchhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccccccccCCCccceeeeEeEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEeeHHHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHHHHhhc Q lcl|NC_019501. 81 DPDNDFFELRADDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRE 160 (431) Q Consensus 81 ~~k~v~f~lt~keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~ 160 (431) +||+|+|+|++|||++++++||+|+|||++|||+||+||++++.++++||.+...+++ +.+.++|+|+++++++|++++ T Consensus 81 ~~~~V~~~~~~kEl~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~-~~~~~~~~~~A~a~~~L~~~~ 159 (430) T protein:vir:21 81 EPDNDFFQLRADDLRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIG-TNTADAWNFVADAEEIMFSRE 159 (430) T ss_pred eeccceEEeehhHhcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCC-CCCCcchhhHHHHHHHHHHhc Confidence 9999999999999999999999999999999999999999999999999988655543 334478999999999999999 Q ss_pred CCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccce Q lcl|NC_019501. 161 LNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQ 240 (431) Q Consensus 161 vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~ 240 (431) ||++++|++|+||++++++.+++++++++++.++++||+|+|+|+++||||+|+++++|+|++|+++++||+||+|+|++ T Consensus 160 vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~ 239 (430) T protein:vir:21 160 LNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPV 239 (430) T ss_pred CCCCCCcEEEeChHHHHHHhhhhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCceeccccccccc Confidence 99998999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccccccccc Q lcl|NC_019501. 241 AYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDAS 320 (431) Q Consensus 241 ~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t 320 (431) +++|+++|+++++|++..++++|++|+||+||+|||+|||+|||+|||++|++|||+|++++++++|+|||+|+|+++++ T Consensus 240 ~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~ 319 (430) T protein:vir:21 240 AWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVS 319 (430) T ss_pred cceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEeccccc Q lcl|NC_019501. 321 LTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDIN 400 (431) Q Consensus 321 ~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~ 400 (431) ++.+++|||||+++||||++|||+|++++++||+||||||+|+|||||+|+|+.+++.++++++|++|||+|++||||++ T Consensus 320 ~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~ 399 (430) T protein:vir:21 320 LSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIS 399 (430) T ss_pred ccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 401 TLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 401 ~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++++||||+|||+++|||||+||+|+||+| T Consensus 400 ~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 400 TLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 9999999999999999999999999999999 No 2 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=100.00 E-value=7.2e-171 Score=953.37 Aligned_cols=430 Identities=72% Similarity=1.142 Sum_probs=420.3 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccCccccccceeEEEec Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGNATGILELSVKCNMG 80 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~~di~e~~V~v~ld 80 (431) ||+|++++|+++++|+|+.|||+|||+++|++||||+.+|+|+|||||||+|+++++++|++.+.+++|++|++||++|| T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEeeHHHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHHHHhhc Q lcl|NC_019501. 81 DPDNDFFELRADDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRE 160 (431) Q Consensus 81 ~~k~v~f~lt~keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~ 160 (431) +||+|+|+|++|||++++++||+|+|||++|||+||+||++++.++++||.+....+++ .+.++|+|+++++++|++++ T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~-~~~~~~~~~A~a~~~L~~~~ 159 (430) T protein:vir:92 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGT-NTADAWNFVADAEELMFSRE 159 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCC-cCCcchhhHHHHHHHHHHhc Confidence 99999999999999999999999999999999999999999999999999886555433 33578999999999999999 Q ss_pred CCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccce Q lcl|NC_019501. 161 LNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQ 240 (431) Q Consensus 161 vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~ 240 (431) ||++++|++||||++++++.+++++++++++.++++||+|+|+|+++||||+|+++++|+|++|+++++|||||+|+|++ T Consensus 160 vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~ 239 (430) T protein:vir:92 160 LNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPV 239 (430) T ss_pred CCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceeccccccccc Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccccccccc Q lcl|NC_019501. 241 AYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDAS 320 (431) Q Consensus 241 ~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t 320 (431) +++|+++|+++++|++.+++++|++|+||+||+|||+|||+|||+|||+++++|||+||+++++++|+|||+|+|+++++ T Consensus 240 ~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~ 319 (430) T protein:vir:92 240 AWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVS 319 (430) T ss_pred cceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEecccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEeccccc Q lcl|NC_019501. 321 LTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDIN 400 (431) Q Consensus 321 ~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~ 400 (431) ++.+++|||||+++||+|++|||+|++++++||+||||||+|+|||||+|+|+..++.+.++++|++||++|+++|||++ T Consensus 320 ~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~ 399 (430) T protein:vir:92 320 LSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIS 399 (430) T ss_pred ccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 401 TLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 401 ~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++++||||+|||+++|||||+||+|+||+| T Consensus 400 ~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 400 TLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9999999999999999999999999999999 No 3 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=100.00 E-value=7.2e-171 Score=953.37 Aligned_cols=430 Identities=72% Similarity=1.142 Sum_probs=420.3 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccCccccccceeEEEec Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGNATGILELSVKCNMG 80 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~~di~e~~V~v~ld 80 (431) ||+|++++|+++++|+|+.|||+|||+++|++||||+.+|+|+|||||||+|+++++++|++.+.+++|++|++||++|| T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEeeHHHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHHHHhhc Q lcl|NC_019501. 81 DPDNDFFELRADDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRE 160 (431) Q Consensus 81 ~~k~v~f~lt~keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~ 160 (431) +||+|+|+|++|||++++++||+|+|||++|||+||+||++++.++++||.+....+++ .+.++|+|+++++++|++++ T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~-~~~~~~~~~A~a~~~L~~~~ 159 (430) T protein:vir:10 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGT-NTADAWNFVADAEELMFSRE 159 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCC-cCCcchhhHHHHHHHHHHhc Confidence 99999999999999999999999999999999999999999999999999886555433 33578999999999999999 Q ss_pred CCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccce Q lcl|NC_019501. 161 LNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQ 240 (431) Q Consensus 161 vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~ 240 (431) ||++++|++||||++++++.+++++++++++.++++||+|+|+|+++||||+|+++++|+|++|+++++|||||+|+|++ T Consensus 160 vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~ 239 (430) T protein:vir:10 160 LNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPV 239 (430) T ss_pred CCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceeccccccccc Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccccccccc Q lcl|NC_019501. 241 AYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDAS 320 (431) Q Consensus 241 ~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t 320 (431) +++|+++|+++++|++.+++++|++|+||+||+|||+|||+|||+|||+++++|||+||+++++++|+|||+|+|+++++ T Consensus 240 ~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~ 319 (430) T protein:vir:10 240 AWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVS 319 (430) T ss_pred cceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEecccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEeccccc Q lcl|NC_019501. 321 LTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDIN 400 (431) Q Consensus 321 ~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~ 400 (431) ++.+++|||||+++||+|++|||+|++++++||+||||||+|+|||||+|+|+..++.+.++++|++||++|+++|||++ T Consensus 320 ~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~ 399 (430) T protein:vir:10 320 LSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIS 399 (430) T ss_pred ccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 401 TLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 401 ~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++++||||+|||+++|||||+||+|+||+| T Consensus 400 ~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 400 TLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9999999999999999999999999999999 No 4 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.2e-120 Score=677.98 Aligned_cols=400 Identities=20% Similarity=0.216 Sum_probs=350.8 Q ss_pred CccccccchhHH--HHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccCccccccceeEEE Q lcl|NC_019501. 1 MALNEGQLVTYA--LDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGNATGILELSVKCN 78 (431) Q Consensus 1 ~~~~~~~~lt~~--~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~~di~e~~V~v~ 78 (431) ||.++|+|||+- .+|+|+.||++|||+++| ||+|+.||.|+||||+||+|..+++++|... ++++++|++++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv--~r~y~~e~~~~GDTV~I~vp~~~~v~dg~~~--~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCV--YRNYEKTFGKVGDTIRLKLPYRVKSASGRTL--VKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhh--cCCCchHHhhCCCEEEEeeCCceeecccCCc--cccccccceEEEE Confidence 999999999954 599999999999999999 9999999999999999999999999999765 5678999999999 Q ss_pred eccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHHH Q lcl|NC_019501. 79 MGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLM 156 (431) Q Consensus 79 ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L 156 (431) ||++|++.|.++++| +++.+|++++++||+++||++||.+|+++++..++.+++.+..+ +.|+++++++++| T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~------~~~~~i~~a~~~L 150 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRP------GAFIDFANAGAKQ 150 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCc------chHHHHHHHHHHH Confidence 999999999999888 78999999999999999999999999999999998887644332 4589999999999 Q ss_pred HhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccc-cceecccc Q lcl|NC_019501. 157 FSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT-GVTVSGAQ 235 (431) Q Consensus 157 ~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~-~~tv~gA~ 235 (431) ++++||++++|.+|++|+.++.+.+++... ..+...+++||+|+||| ++||+ ||+++++|.|++|+++ +.+|+|+. T Consensus 151 d~~~VP~~G~R~lVv~P~~~~~L~~~~~~~-~~~~~~~~~lr~G~IG~-i~GF~-V~~S~nip~~tag~~~~t~~v~ga~ 227 (418) T protein:vir:10 151 TTYAVPQDGMRHAVLDPFTCASLSDEVTKL-FKESMVEQAYKMGYRGN-VAAYE-VYESQNLPKHTVGDHGGTPLVNGTV 227 (418) T ss_pred HhcCCCCCCceEEEeCHHHHHHHhhhcccc-ccccccchhhheeeeee-eeceE-EEEecCCCcccccccccceeeeccc Confidence 999999987788999999999998877654 44556678999999997 88997 7999999999999744 58899997 Q ss_pred cccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeec-----CceeEEe Q lcl|NC_019501. 236 KFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVID-----STHIEIT 310 (431) Q Consensus 236 q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~-----a~ti~I~ 310 (431) +.+ .++..++. +.+.+|+|++||+|||+||++|||+||++++.+|||+|++++. +++|+|| T Consensus 228 ~~~---~~~~~~~~-----------t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~ 293 (418) T protein:vir:10 228 VNG---DTVGFDGG-----------TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKIS 293 (418) T ss_pred ccc---eeEEEeec-----------ceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEec Confidence 743 22222221 3466789999999999999999999999999999999999863 4689999 Q ss_pred eccccccccccccc-----ccccceeecccccCceeEEeccCCc--ceeeeeccceeEEEeecccCCCCCcceeeEEEee Q lcl|NC_019501. 311 PKPIALDDASLTKE-----EKAYANVNTSLADNTPVNVLNVATT--TANVFWADDSIRLLSQPIPVTHELFAGMKTSSFS 383 (431) Q Consensus 311 Paii~~~~~t~~~~-----~~~y~nVsa~~A~~aavTv~g~~s~--~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~ 383 (431) |+|++...+....+ ..+|+||+++||+|++|||+|++++ ++||+||||||+|+||||++|+|.+...++ . T Consensus 294 p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~---~ 370 (418) T protein:vir:10 294 PSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRA---A 370 (418) T ss_pred cccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEE---E Confidence 99987766553322 3589999999999999999997554 689999999999999999999997777653 2 Q ss_pred cCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 384 IPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 384 ~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) .+..||++|+.++||+++++.+||||+|||+++|||||+ |+|.||-+ T Consensus 371 ~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~-~~~~g~~~ 417 (418) T protein:vir:10 371 DPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELA-LRLWGAAS 417 (418) T ss_pred eccCCeEEEEEEcccccccceEEEEEeecCceeecccce-EEEEeecC Confidence 344699999999999999999999999999999999995 89999999 No 5 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=5.6e-115 Score=646.96 Aligned_cols=399 Identities=15% Similarity=0.184 Sum_probs=334.1 Q ss_pred Ccccccc-chhHHHHHHHHHHHhhcccchhhccCCCchHHH--hhcccEEEeeccccccccccccc--cc-Cccccccce Q lcl|NC_019501. 1 MALNEGQ-LVTYALDEIIETVQNLTPMASKVTKYTPPAESM--QRSSNTVWMPVEQEAPTQTGWNL--TG-NATGILELS 74 (431) Q Consensus 1 ~~~~~~~-~lt~~~~evi~~len~lvma~~V~~~r~y~~e~--~k~GdTv~ip~p~~~~~~~g~~~--s~-~~~di~e~~ 74 (431) ||++... .-++..+|+|+.||++|||+++| ||+|+.|| +|+||||+||+|..++.+++... +. .+++++|.+ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV--~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~ 78 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTV--DRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAK 78 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhh--ccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccce Confidence 9977754 34555699999999999999999 99999996 67999999999999998886543 33 356899999 Q ss_pred eEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHH-HhhccccceeeccCCCCCCccchhhhhhHHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIA-KQATEMGSLVVHDTRAIGPSTGLSGWDFVSD 151 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~-~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~ 151 (431) |+++||++|++.|+|+++|+ ++.+| +|+|+|||++||++||++|+ .++...++++++++.+ .+.|+++++ T Consensus 79 v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~------~~a~~~~a~ 151 (423) T protein:vir:10 79 ATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTP------IKKWSDVAQ 151 (423) T ss_pred EEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc------cccHHHHHH Confidence 99999999999999999884 57888 89999999999999999997 5666677766653333 356999999 Q ss_pred HHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccc-cce Q lcl|NC_019501. 152 AERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT-GVT 230 (431) Q Consensus 152 a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~-~~t 230 (431) ++++|+++++|+++ |.+|++|+.++.+.+++..+.+.++..+++||+|+|.++++||| ||+|+++|.||+|+.+ ..+ T Consensus 152 a~~~L~~~~vP~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFd-i~~Sn~vp~~T~g~~~ga~~ 229 (423) T protein:vir:10 152 TASFLKDLGINSGE-NYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIR-ALMSNGLASRTQGAFGGKLT 229 (423) T ss_pred HHHHHhhccCCcCC-CEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceE-EEEecCCcccccccccceee Confidence 99999999999987 66999999999999999999999999999999999843588998 7999999999999865 566 Q ss_pred ecccccccceeeeeecccccc--cccceeeEEEeeccceeeeccEEEEcceeeecccccc-----ccCCCceEEEEEee- Q lcl|NC_019501. 231 VSGAQKFKPQAYTLDTDGNKE--NVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKN-----VLTDDATFSITRVI- 302 (431) Q Consensus 231 v~gA~q~~~~~~~v~~~g~~~--~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~-----~~~~lq~fvVta~~- 302 (431) ++|+.. +..++.+. ..+.....++.+.+|+||+||+|||+|||+|||+||| ++|.+|||+|++++ T Consensus 230 ~~~~~~-------vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~ 302 (423) T protein:vir:10 230 VKGTPE-------VNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDAN 302 (423) T ss_pred eeeeeE-------EEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEeccc Confidence 776543 22222222 2233344566677899999999999999999999999 57999999999875 Q ss_pred ----cCceeEEeecccccccccccccccccceeecccccCceeEEeccCCc--ceeeeeccceeEEEeecccCCCCCcce Q lcl|NC_019501. 303 ----DSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATT--TANVFWADDSIRLLSQPIPVTHELFAG 376 (431) Q Consensus 303 ----~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~--~~Nl~fhr~A~aLat~pl~~p~g~~~a 376 (431) .+++|+|||+|+++. .+.+|+||+++||+|++|||+|++++ ++||+|||+||+|+|+||++|.+ T Consensus 303 ~~a~~~~tv~i~p~~~~~~------~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~---- 372 (423) T protein:vir:10 303 AHSSGDVTVKISGVPIFDA------GYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHS---- 372 (423) T ss_pred ccccCceEEEecccccccc------CcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCc---- Confidence 245899999998642 36789999999999999999998765 59999999999999999997754 Q ss_pred eeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 377 MKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 377 ~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ++.++.++. |||+|+.++||+++++.++|||+|||+++|||||++ +|.|-. T Consensus 373 ~~~~~~~~~--g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:10 373 IDSAVATYE--GFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGG-QFFGNP 423 (423) T ss_pred cceeecccc--cceEEEEEeeeccccceEEEEEeecceeeeccceEE-EEEecC Confidence 444555533 899999999999999999999999999999999985 666655 No 6 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=2.7e-113 Score=637.73 Aligned_cols=397 Identities=16% Similarity=0.197 Sum_probs=326.0 Q ss_pred Cccccccchh----HHHHHHHHHHHhhcccchhhccCCCchHHH--hhcccEEEeeccccccccccc--cc-ccCccccc Q lcl|NC_019501. 1 MALNEGQLVT----YALDEIIETVQNLTPMASKVTKYTPPAESM--QRSSNTVWMPVEQEAPTQTGW--NL-TGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt----~~~~evi~~len~lvma~~V~~~r~y~~e~--~k~GdTv~ip~p~~~~~~~g~--~~-s~~~~di~ 71 (431) ||+ +||| +.++|+|+.||++|||+++| ||+|+.|| +|+||||+||+|..++..+.. +. +.+++|++ T Consensus 1 MaN---~llT~~p~iia~~aL~~l~~~lV~~~lV--nr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~ 75 (423) T protein:vir:10 1 MPN---NLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Ccc---chhhhhHHHHHHHHHHHHHhhcccchhh--cccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccc Confidence 994 5554 35699999999999999999 99999997 579999999999988877754 33 35789999 Q ss_pred cceeEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) |++|+++||++|++.|+|+++|+ ++.+| +|+|+|||++||++||.+|++++...++++.+..++ ..+.|+++ T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t-----~~~a~~~i 149 (423) T protein:vir:10 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNT-----PITKWSDV 149 (423) T ss_pred cceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCc-----ccchHHHH Confidence 99999999999999999999885 56777 899999999999999999999988765544432222 23569999 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccc- Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG- 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~- 228 (431) ++++++|++++||+++ |.+|++|+.++.+.++...+...++..+++||+|+|.++++||| ||+|+++|.||+|++++ T Consensus 150 ~~a~~~Ld~~~vP~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-v~~Snnip~~T~gt~~~t 227 (423) T protein:vir:10 150 AQTASFLKDLGVNEGE-NYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIR-ALMSNGLASRTQGAFGGT 227 (423) T ss_pred HHHHHHHHhccCCcCC-CEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceE-EEEeCCCccccccccccc Confidence 9999999999999987 66999999999998888888888888999999999843589997 79999999999998653 Q ss_pred ceecccccccceeeeeeccccccccccee--eEEEeeccceeeeccEEEEcceeeecccccc-----ccCCCceEEEEEe Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRV--ATVTVSSTTGFKRGDKISFTGVKFLSQMAKN-----VLTDDATFSITRV 301 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~--~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~-----~~~~lq~fvVta~ 301 (431) .++..++ ++...+.+.+.+... +..+++.+++||+||+|||+|||+|||+||+ +++.+|||+|+++ T Consensus 228 ~~~~~~~-------~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~ 300 (423) T protein:vir:10 228 LTVKTQP-------TVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTAD 300 (423) T ss_pred eeeeecc-------eeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEee Confidence 3332222 121111111111111 1234566789999999999999999999999 6699999999998 Q ss_pred ec-----CceeEEeecccccccccccccccccceeecccccCceeEEeccCCc--ceeeeeccceeEEEeecccCCCCCc Q lcl|NC_019501. 302 ID-----STHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATT--TANVFWADDSIRLLSQPIPVTHELF 374 (431) Q Consensus 302 ~~-----a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~--~~Nl~fhr~A~aLat~pl~~p~g~~ 374 (431) +. +++|+|+|+||++ ..+.+|+||+++||+|++|||+|++++ ++||+||||||+|+|+||++|.+ T Consensus 301 ~~~~~~g~~tv~i~p~~i~~------~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~-- 372 (423) T protein:vir:10 301 ANSDSGGDVTVTLSGVPIYD------TTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHS-- 372 (423) T ss_pred eeeccCCceeeeccCccccc------cCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCc-- Confidence 73 3589999999986 246789999999999999999998766 59999999999999999997753 Q ss_pred ceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 375 AGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 375 ~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ++.++.++. |||+|+.++||+++++.++|||+|||+++|||||++ +|.|-. T Consensus 373 --~~~~~~~~~--g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:10 373 --IDSAVATYE--GFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGG-QFFGNP 423 (423) T ss_pred --cceeecccc--CceEEEEEeeeccccceEEEEEeecceeeeccceEE-EEEecC Confidence 444555533 899999999999999999999999999999999984 666666 No 7 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=2.6e-112 Score=632.36 Aligned_cols=402 Identities=15% Similarity=0.177 Sum_probs=326.1 Q ss_pred Cccccccc-hhHHHHHHHHHHHhhcccchhhccCCCchHHH--hhcccEEEeecccccccccc--cccc-cCccccccce Q lcl|NC_019501. 1 MALNEGQL-VTYALDEIIETVQNLTPMASKVTKYTPPAESM--QRSSNTVWMPVEQEAPTQTG--WNLT-GNATGILELS 74 (431) Q Consensus 1 ~~~~~~~~-lt~~~~evi~~len~lvma~~V~~~r~y~~e~--~k~GdTv~ip~p~~~~~~~g--~~~s-~~~~di~e~~ 74 (431) ||+|.-++ -.+...|+|+.||++|||+++| ||+|+.|+ .|+||||+||+|..+...+. ++.+ .++++++|++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lV--nr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~ 78 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhh--cccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccce Confidence 99553222 2445689999999999999999 99999997 57999999999988876664 3433 4688999999 Q ss_pred eEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDA 152 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a 152 (431) |+++||++|++.|+|+++|+ ++.+| +|+|+|||++||++||++|++++....+++.+..++ ..+.|++++++ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t-----~~~a~~~i~~a 152 (423) T protein:vir:17 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNT-----PITKWSDVAQT 152 (423) T ss_pred eEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCc-----ccccHHHHHHH Confidence 99999999999999999885 57777 899999999999999999999976654443322222 23569999999 Q ss_pred HHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccc-ccee Q lcl|NC_019501. 153 ERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT-GVTV 231 (431) Q Consensus 153 ~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~-~~tv 231 (431) +++|++++||+++ |.+|++|+.++.+.++...+...++..+++||+|+|.++++||| ||+|+++|.||+|+++ +.++ T Consensus 153 ~~~Ld~~~vP~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-vy~Snnip~~T~gt~~~t~~~ 230 (423) T protein:vir:17 153 ASFLKDLGVNEGE-NYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIR-ALMSNGLASRTQGAFGGTLTV 230 (423) T ss_pred HHHHHhccCCcCC-CEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceE-EEEeCCCccccccceeceeee Confidence 9999999999987 66999999999998888888888888999999999944599997 7999999999999854 3444 Q ss_pred cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeecccccc-----ccCCCceEEEEEeec--- Q lcl|NC_019501. 232 SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKN-----VLTDDATFSITRVID--- 303 (431) Q Consensus 232 ~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~-----~~~~lq~fvVta~~~--- 303 (431) ..+++. .+..+ .+......+ .+..+.+.+++|++||+|||+|||+|||+||+ +++++|||+|+++++ T Consensus 231 ~~~~~v--~~~a~--~~~~~~~~~-~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a 305 (423) T protein:vir:17 231 KTQPTV--TYNAV--KDSYQFTVT-LTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDS 305 (423) T ss_pred cccccc--ccccc--ccccceeee-eeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccc Confidence 433321 11111 111111111 23345567899999999999999999999999 668999999998773 Q ss_pred --CceeEEeecccccccccccccccccceeecccccCceeEEeccCCc--ceeeeeccceeEEEeecccCCCCCcceeeE Q lcl|NC_019501. 304 --STHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATT--TANVFWADDSIRLLSQPIPVTHELFAGMKT 379 (431) Q Consensus 304 --a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~--~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~ 379 (431) +++|+|||+||++. .+.+|+||+++||+|++|||+|++++ ++||+||||||+|+|+||++|.+ ++. T Consensus 306 ~~~~tv~i~p~~i~~~------~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~----~~~ 375 (423) T protein:vir:17 306 SGDVTVTLSGVPIYDT------TNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHS----IDS 375 (423) T ss_pred cCceEEEecCcccccc------CCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCc----cce Confidence 45899999999862 46789999999999999999998766 59999999999999999997743 444 Q ss_pred EEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 380 SSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 380 ~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ++.++. |||+|+.++||+++++.+||||+|||+++|||||++ +|.|-. T Consensus 376 ~~~~~~--g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~-~~~g~~ 423 (423) T protein:vir:17 376 AVATYE--GFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGG-QFFGNP 423 (423) T ss_pred eecccC--CcEEEEEEecccccceeEEEEEeecceeeeccceEE-EEEecC Confidence 555543 899999999999999999999999999999999985 666655 No 8 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=1.1e-111 Score=628.81 Aligned_cols=398 Identities=13% Similarity=0.170 Sum_probs=330.9 Q ss_pred CccccccchhH----HHHHHHHHHHhhcccchhhccCCCchHHH--hhcccEEEeecccccccccccc---cccCccccc Q lcl|NC_019501. 1 MALNEGQLVTY----ALDEIIETVQNLTPMASKVTKYTPPAESM--QRSSNTVWMPVEQEAPTQTGWN---LTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~----~~~evi~~len~lvma~~V~~~r~y~~e~--~k~GdTv~ip~p~~~~~~~g~~---~s~~~~di~ 71 (431) || |+|||. .+.|+|+.||++|||+++| ||+|+.|+ +|+||||+||+|.++++.++.. ...++++++ T Consensus 1 MA---N~llT~iP~iia~~al~~l~~~lV~~~lV--~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MA---NNLESNISQIVLKKFLPGFMSDIVLCKTV--DRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLF 75 (423) T ss_pred Cc---cchhhhhHHHHHHHHHHHHHhhcccchhc--ccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccc Confidence 99 556553 3689999999999999999 99999987 6899999999999998888643 345788999 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHH-hhccccceeeccCCCCCCccchhhhhh Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAK-QATEMGSLVVHDTRAIGPSTGLSGWDF 148 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~-~~~~~~~~v~t~~~t~~~~~~~~~~~d 148 (431) |.+|+++||++|++.|+++++| |++.+| +|+|+|+|++||++||.+|++ ++...++++++++. +.+.|++ T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t------~~~~~~~ 148 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGSPNT------AIKKWAD 148 (423) T ss_pred cceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC------CcchHHH Confidence 9999999999999999999888 467888 799999999999999999997 66667777665333 2356999 Q ss_pred HHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccc-c Q lcl|NC_019501. 149 VSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTA-T 227 (431) Q Consensus 149 ~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~-~ 227 (431) +++++++|++++||+++ |.+|++|+.++.+.++...+...++..+++||+|+|.++++||| ||+|+++|.||+|++ + T Consensus 149 i~~a~~~Ld~~~vP~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-v~~Snnvp~~T~gt~~~ 226 (423) T protein:vir:35 149 VAQTASFIKDIGIKTGE-NYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIR-ALMSNGLASRKQGDFDG 226 (423) T ss_pred HHHHHHHHHHhcCCcCC-CEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceE-EEEcCCCcccccccccc Confidence 99999999999999987 66999999999988888888888888999999999833489997 799999999999985 4 Q ss_pred cceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccc-----cCCCceEEEEEee Q lcl|NC_019501. 228 GVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNV-----LTDDATFSITRVI 302 (431) Q Consensus 228 ~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~-----~~~lq~fvVta~~ 302 (431) +.+++++++.. ...+ .+......+ .+..+++.+++|++||+|||+||++|||+||++ ++++|||+|+++. T Consensus 227 ~~~v~~a~~v~--~~a~--~~~~~~~~~-~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~ 301 (423) T protein:vir:35 227 AITVKTAPNVD--YLSV--KDSYQFTVA-LTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEET 301 (423) T ss_pred ceeeccccccc--cccc--cccccceee-eeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccc Confidence 45566655321 1111 111111111 223466778899999999999999999999995 7999999999876 Q ss_pred -----cCceeEEeecccccccccccccccccceeecccccCceeEEeccCCc--ceeeeeccceeEEEeecccCCCCCcc Q lcl|NC_019501. 303 -----DSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATT--TANVFWADDSIRLLSQPIPVTHELFA 375 (431) Q Consensus 303 -----~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~--~~Nl~fhr~A~aLat~pl~~p~g~~~ 375 (431) .+++|+|+|+|+++. ++.+|+||+++||+|++|||+|++++ ++||+||||||+|+|+|||+|.+ T Consensus 302 ~~~a~g~~~v~i~p~~~~~~------~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~--- 372 (423) T protein:vir:35 302 NSTASGDVTVKLSGVPIYDE------KNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHS--- 372 (423) T ss_pred cccccCceeEEccccccccC------CCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccCCc--- Confidence 256899999998863 46789999999999999999998766 49999999999999999998754 Q ss_pred eeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 376 GMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 376 a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ++.++.++. |||+|+.++||+++++++||||+|||+++|||||+ |+|.|-. T Consensus 373 -~~~~~~~~~--g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~-~~~~g~~ 423 (423) T protein:vir:35 373 -LDSAVATYE--GFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMG-GQFFGNP 423 (423) T ss_pred -cceeecccc--CceEEEEEeeccccCceEEEEEeecceeeecccce-EEEEecC Confidence 333444433 89999999999999999999999999999999997 5666666 No 9 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=3.5e-40 Score=236.92 Aligned_cols=373 Identities=14% Similarity=0.093 Sum_probs=218.4 Q ss_pred CccccccchhHH--HHHHHHHHHhhcccchhhccCCCchHHHh-hcccEEEeeccccccccc------ccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYA--LDEIIETVQNLTPMASKVTKYTPPAESMQ-RSSNTVWMPVEQEAPTQT------GWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~--~~evi~~len~lvma~~V~~~r~y~~e~~-k~GdTv~ip~p~~~~~~~------g~~~s~~~~di~ 71 (431) ||+ ++|++= .+|+|+.|+++|||+++| ||+|+.||. |+||||+||+|..+...+ ++....+++++. T Consensus 1 Ma~---~~~~p~~~a~~~l~~l~~~lv~~~lv--~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) T protein:vir:99 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLV--WLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFT 75 (392) T ss_pred Ccc---ccccHHHHHHHHHHHHHhhccchhhh--ccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccc Confidence 994 556643 378999999999999999 999999985 689999999998776543 334456788999 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) +.+++++||++|.+.|.++++| +.+.++.+++++|++++||++||.+|++++...+..+.+..... ++...++++ T Consensus 76 ~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~---~~~~~~~~i 152 (392) T protein:vir:99 76 EDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEV---APDEFFKGV 152 (392) T ss_pred cceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc---ChhhhHHHH Confidence 9999999999999999999887 46899999999999999999999999999988776655433222 233457889 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccc-c Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKST-A 226 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt-~ 226 (431) .++++.|++.++|. + |.++++|+.++.+..+ +....+.+....+++|+|+||+ +.||+ ||+++++|.++.-. + T Consensus 153 ~~a~~~L~~~~vP~-~-R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~-i~G~~-v~~s~~~~~~t~~a~~ 228 (392) T protein:vir:99 153 NGARRALNELYIPQ-G-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYE-IVESTLIPHGDAYLYH 228 (392) T ss_pred HHHHHHHhhcCCCC-C-CEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeee-eeeeE-EEeecccccccceeee Confidence 99999999999997 5 6699999999987654 3333444445567899999997 88997 79999999876421 1 Q ss_pred -ccce-ecccccccceeeeeecccccccccce-eeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEe-- Q lcl|NC_019501. 227 -TGVT-VSGAQKFKPQAYTLDTDGNKENVDNR-VATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRV-- 301 (431) Q Consensus 227 -~~~t-v~gA~q~~~~~~~v~~~g~~~~~d~~-~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~-- 301 (431) +..+ ..+++. .+.+..- +.....+.. ....+.+..++ .+.|.+++..+....-.++..... |..... T Consensus 229 ~~a~~~at~a~v-~~~~~~~---~~s~s~~~~v~~~~~~~~~~t-~~s~~~~v~~~~g~~~v~~~~~~~---~~~~~~~~ 300 (392) T protein:vir:99 229 PTAFIMATRAPA-PPMGAVR---STAISGDQRIAMRWLVDYDST-ITSNRSLIDTYFGLKVVEDPNGVG---FVRARKIH 300 (392) T ss_pred cccccccccccc-ccccccc---eeEEecccceecceeecccce-eeccccccceeEEEEEEeeccccc---eeeeeeee Confidence 1111 111110 0000000 000000000 00011111111 223444444333332222221111 111111 Q ss_pred ecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEE Q lcl|NC_019501. 302 IDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSS 381 (431) Q Consensus 302 ~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t 381 (431) ....++++.|..+.....++......-...+..|.++.. .+.++-|.-+-=..|++.- .|.-.++.. T Consensus 301 ~~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~--------~~~~vtw~Ssn~~vAtV~~---~G~Vt~v~~-- 367 (392) T protein:vir:99 301 LIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDD--------VTALCDFESSATDKATVAA---GGLVTGVAA-- 367 (392) T ss_pred eecceeeeeeeecccceeEeeeccceeEEEEEEecCCcc--------ccceEEEEEcCCeeEEEcC---CceEEEEec-- Confidence 112334444432222111111111111233333333222 1234556555556666653 232222221 Q ss_pred eecCcceEEEEEEecccccccceEEEEEee Q lcl|NC_019501. 382 FSIPGIGVNGIFATQGDINTLSGKCRIAVW 411 (431) Q Consensus 382 ~~~~g~glslrv~~~yd~~~~~~~~r~dvl 411 (431) | --.+.+.+...-......|.+.|+ T Consensus 368 ----G-~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 368 ----G-TSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred ----c-eEEEEEEEEcCCCcEEEEEEEEeC Confidence 1 112233221111235666777777 No 10 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1e-32 Score=195.95 Aligned_cols=266 Identities=17% Similarity=0.138 Sum_probs=190.7 Q ss_pred Cccccccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc--ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT--GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~--g~~~s~~~~di~e~~V~v 77 (431) ||++ .++ +..-+++++.|++.++|+++| +|+|+.++ +.||||.||+|......+ +......++++.+..+++ T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 9994 444 444589999999999999999 89999887 679999999998877654 233445677899999999 Q ss_pred EeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHH Q lcl|NC_019501. 78 NMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERL 155 (431) Q Consensus 78 ~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~ 155 (431) +||+++.+.|.+++.| +.+.++ +.++++++.+||++||.++++++.....-+. ...+.+..+.++.+.++++. T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~----~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT----GSAPTDADDAFDLIAKALKE 150 (273) T ss_pred EEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccchhHHHHHHHHHHHH Confidence 9999999999999755 456665 6689999999999999999998877543322 12233444668889999999 Q ss_pred HHhhcCCccCCcEEEeChHHHhhhhhhh--hhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecc Q lcl|NC_019501. 156 MFSRELNRDMGISYFLNPDDYRKAGRNL--VDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSG 233 (431) Q Consensus 156 L~~~~vP~~~~r~~v~np~~~a~~~~~~--~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~g 233 (431) |++..||.++ |.++++|..++.+...- ....... .....+|+|.||| +.||+ ||+++++|.+.. . T Consensus 151 ld~~~vP~~~-R~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~l~~G~ig~-i~G~~-v~~s~~lp~~~~-----~---- 217 (273) T protein:vir:10 151 LTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGN-LLGAR-IVESNNLRDTDD-----E---- 217 (273) T ss_pred hhhcCCCcCC-CEEEECHHHHHHHhcchhhhhhhhcc-ccccceeeeeeeE-EeceE-EEEecccccCCc-----c---- Confidence 9999999976 77999999999875532 2221111 1345799999998 89997 688888874210 0 Q ss_pred cccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeecc Q lcl|NC_019501. 234 AQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKP 313 (431) Q Consensus 234 A~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Pai 313 (431) + T Consensus 218 -----------------------------------------------------------------~-------------- 218 (273) T protein:vir:10 218 -----------------------------------------------------------------Q-------------- 218 (273) T ss_pred -----------------------------------------------------------------E-------------- Confidence 0 Q ss_pred cccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEE Q lcl|NC_019501. 314 IALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIF 393 (431) Q Consensus 314 i~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv 393 (431) + ++||++||.++.+-... +...++ T Consensus 219 -----------------------------~---------~~~~~~A~~~a~q~~~~----------e~~r~~-------- 242 (273) T protein:vir:10 219 -----------------------------F---------VAFHPSAAAYVSQIDTV----------EALRDQ-------- 242 (273) T ss_pred -----------------------------E---------EEEeccceeeeeeeehh----------hcccCC-------- Confidence 0 45888999887764322 111111 Q ss_pred EecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 394 ATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 394 ~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) +......+-+..||++.+|||-..+.=+.=. T Consensus 243 ------~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 243 ------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ------CcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1122234456789999999996533221212 No 11 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1e-32 Score=195.95 Aligned_cols=266 Identities=17% Similarity=0.138 Sum_probs=190.7 Q ss_pred Cccccccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc--ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT--GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~--g~~~s~~~~di~e~~V~v 77 (431) ||++ .++ +..-+++++.|++.++|+++| +|+|+.++ +.||||.||+|......+ +......++++.+..+++ T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 9994 444 444589999999999999999 89999887 679999999998877654 233445677899999999 Q ss_pred EeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHH Q lcl|NC_019501. 78 NMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERL 155 (431) Q Consensus 78 ~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~ 155 (431) +||+++.+.|.+++.| +.+.++ +.++++++.+||++||.++++++.....-+. ...+.+..+.++.+.++++. T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~----~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT----GSAPTDADDAFDLIAKALKE 150 (273) T ss_pred EEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccchhHHHHHHHHHHHH Confidence 9999999999999755 456665 6689999999999999999998877543322 12233444668889999999 Q ss_pred HHhhcCCccCCcEEEeChHHHhhhhhhh--hhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecc Q lcl|NC_019501. 156 MFSRELNRDMGISYFLNPDDYRKAGRNL--VDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSG 233 (431) Q Consensus 156 L~~~~vP~~~~r~~v~np~~~a~~~~~~--~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~g 233 (431) |++..||.++ |.++++|..++.+...- ....... .....+|+|.||| +.||+ ||+++++|.+.. . T Consensus 151 ld~~~vP~~~-R~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~l~~G~ig~-i~G~~-v~~s~~lp~~~~-----~---- 217 (273) T protein:vir:10 151 LTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGN-LLGAR-IVESNNLRDTDD-----E---- 217 (273) T ss_pred hhhcCCCcCC-CEEEECHHHHHHHhcchhhhhhhhcc-ccccceeeeeeeE-EeceE-EEEecccccCCc-----c---- Confidence 9999999976 77999999999875532 2221111 1345799999998 89997 688888874210 0 Q ss_pred cccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeecc Q lcl|NC_019501. 234 AQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKP 313 (431) Q Consensus 234 A~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Pai 313 (431) + T Consensus 218 -----------------------------------------------------------------~-------------- 218 (273) T protein:vir:10 218 -----------------------------------------------------------------Q-------------- 218 (273) T ss_pred -----------------------------------------------------------------E-------------- Confidence 0 Q ss_pred cccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEE Q lcl|NC_019501. 314 IALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIF 393 (431) Q Consensus 314 i~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv 393 (431) + ++||++||.++.+-... +...++ T Consensus 219 -----------------------------~---------~~~~~~A~~~a~q~~~~----------e~~r~~-------- 242 (273) T protein:vir:10 219 -----------------------------F---------VAFHPSAAAYVSQIDTV----------EALRDQ-------- 242 (273) T ss_pred -----------------------------E---------EEEeccceeeeeeeehh----------hcccCC-------- Confidence 0 45888999887764322 111111 Q ss_pred EecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 394 ATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 394 ~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) +......+-+..||++.+|||-..+.=+.=. T Consensus 243 ------~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 243 ------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ------CcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1122234456789999999996533221212 No 12 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.96 E-value=1.2e-31 Score=190.07 Aligned_cols=266 Identities=18% Similarity=0.141 Sum_probs=188.5 Q ss_pred Cccccccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc--ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT--GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~--g~~~s~~~~di~e~~V~v 77 (431) ||++ +++ +.+-+++++.|+++++|+++| +|+|+.++ ++||||.||++......+ +.......+++.+..+++ T Consensus 1 MA~~--~~~pei~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:79 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLV--NREYEGIA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhccchhhh--hccccccc-cCCcEEEEeecCcccccccccCCCccCccccccceEEE Confidence 9995 344 444589999999999999999 89998764 789999999997766544 223345677889999999 Q ss_pred EeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHH Q lcl|NC_019501. 78 NMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERL 155 (431) Q Consensus 78 ~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~ 155 (431) +||+++.+.|.+++.| +.+.++ ++++++++.+||+++|.++++++........ ...+.++.+.++.+.++++. T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~----~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:79 76 LIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT----GSAPSDADDAFDLIASALKE 150 (273) T ss_pred EEeeecccceeeccHHHHhhcccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cccccchhhHHHHHHHHHHH Confidence 9999999999999755 456666 5689999999999999999999877543221 11223334557889999999 Q ss_pred HHhhcCCccCCcEEEeChHHHhhhhhhhhhhhcccc-chhhhHhhccccccchhhhhhHhcCCCccccccccccceeccc Q lcl|NC_019501. 156 MFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGR-VTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGA 234 (431) Q Consensus 156 L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~-~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA 234 (431) |++..||.++ |.++++|..++.+...-........ .....+|+|.||| +.||+ |++++++|.++ T Consensus 151 ld~~~vP~~~-R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~-i~~s~~lp~~~------------ 215 (273) T protein:vir:79 151 LTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGAR-IVESNNLRDTD------------ 215 (273) T ss_pred hhhccCCccC-cEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeE-EeceE-EEecccccccC------------ Confidence 9999999987 6689999998876543211111111 1345799999998 88997 68888887421 Q ss_pred ccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccc Q lcl|NC_019501. 235 QKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPI 314 (431) Q Consensus 235 ~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii 314 (431) | T Consensus 216 -------------------------------------------~------------------------------------ 216 (273) T protein:vir:79 216 -------------------------------------------D------------------------------------ 216 (273) T ss_pred -------------------------------------------c------------------------------------ Confidence 0 Q ss_pred ccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEE Q lcl|NC_019501. 315 ALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFA 394 (431) Q Consensus 315 ~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~ 394 (431) .+ =++||++||+++.+-.... ... T Consensus 217 --------------------------~~---------~~a~~~~A~~~a~~~~~~e----------~~r----------- 240 (273) T protein:vir:79 217 --------------------------EQ---------FVAFHPSAAAYVSQIDTVE----------ALR----------- 240 (273) T ss_pred --------------------------eE---------EEEEeccceeeeeehhhhh----------ccc----------- Confidence 00 0457888988887654321 111 Q ss_pred ecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 395 TQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 395 ~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) |.+......+-+..||++.+|||-.. +|. -++ T Consensus 241 ---~~~~~~~~v~~~~~yg~~v~~p~~vv-~~~-~~g 272 (273) T protein:vir:79 241 ---DQDSFSDRIRALHVYGGKVVRPTGVV-VFN-KTG 272 (273) T ss_pred ---CcccceeeeeeeeeeeeEEecCceEE-EEe-ccC Confidence 11222334456788999999999543 332 122 No 13 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.95 E-value=4.3e-31 Score=187.04 Aligned_cols=315 Identities=12% Similarity=0.059 Sum_probs=195.2 Q ss_pred CccccccchhH----------H-----HHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccc-ccc Q lcl|NC_019501. 1 MALNEGQLVTY----------A-----LDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGW-NLT 64 (431) Q Consensus 1 ~~~~~~~~lt~----------~-----~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~-~~s 64 (431) |++- +-||- + -+|+++.|+.++++++++ |+|+-++ +.||||+||.+.+....+-. ... T Consensus 1 ~~~~--~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~---~d~~~~~-~~Gdtv~ip~~g~~~~~d~~~~~~ 74 (341) T protein:vir:94 1 MALG--NTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV---KTWGAQV-KKGDTFHVPRISELGVEDKATDVP 74 (341) T ss_pred Ccch--hhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc---ccccccc-cCCceEEEeccCcceeeeecCCCc Confidence 5542 22332 3 488999999999999988 7887766 45999999999877665532 223 Q ss_pred cCccccccceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccC----CCCC Q lcl|NC_019501. 65 GNATGILELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDT----RAIG 138 (431) Q Consensus 65 ~~~~di~e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~----~t~~ 138 (431) ..++++.+..++++||+++...|.+++.| ....++.+++++++.++||+++|.++++++........... .... T Consensus 75 i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~ 154 (341) T protein:vir:94 75 VGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAI 154 (341) T ss_pred cccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccc Confidence 45668889999999999999999999755 45788999999999999999999999998876443221111 1111 Q ss_pred Cc-cchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCC Q lcl|NC_019501. 139 PS-TGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPK 217 (431) Q Consensus 139 ~~-~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~ 217 (431) .+ .....|..+.++++.|++.+||.++ |.++++|..++.+... ..+...+...+..+|+|.||+ +.||+ ||++++ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~g-R~lvv~P~~~~~Ll~~-~~~~~~~~~g~~~l~~G~ig~-i~G~~-V~~Sn~ 230 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEK-IVLLISPGQESALFTI-PQFISKDFINNAPIAQGQIGS-LMGVR-VIRTSL 230 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccC-CEEEeCHHHHHHHhhc-hhhhhhhccccchhheeeeee-EeceE-EEEecc Confidence 11 1112467888999999999999977 6689999999998553 222233333445799999987 88997 799999 Q ss_pred CccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEE Q lcl|NC_019501. 218 LPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFS 297 (431) Q Consensus 218 v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fv 297 (431) +|.+.... . +.|+.- ....+..-.|.|+.. T Consensus 231 lp~~~~~~---~-~~~~~~------------------------------~~~~~~~~~i~~~~~---------------- 260 (341) T protein:vir:94 231 IGNNSATG---W-RNGAPT------------------------------IAPAEATPGFTGSRY---------------- 260 (341) T ss_pred cccccccc---c-cccccc------------------------------eeccccccccccccc---------------- Confidence 99653211 0 111100 000111111222000 Q ss_pred EEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEE-eecccCCCCCcce Q lcl|NC_019501. 298 ITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLL-SQPIPVTHELFAG 376 (431) Q Consensus 298 Vta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLa-t~pl~~p~g~~~a 376 (431) + +.| ++ -.+...-|+|||+|+..+ ...++.- + T Consensus 261 ------------~---------------~~~--------~~-------~~~~~~gl~~~~~av~~~k~~~~~~~-----~ 293 (341) T protein:vir:94 261 ------------L---------------PKQ--------DS-------FTSLPATFTGNSRPVHTAVMCHMDWA-----A 293 (341) T ss_pred ------------c---------------ccc--------cc-------ccccEEEEEEecccccceeeecchhh-----h Confidence 0 000 00 011223599999997665 2222210 0 Q ss_pred eeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecC-CCCC Q lcl|NC_019501. 377 MKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLP-NQTA 431 (431) Q Consensus 377 ~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~-~q~~ 431 (431) +.... -+++...|++...-..++=..+||++.+|||.+ |-|. +-.. T Consensus 294 ~~~~~--------~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~-v~~~~~~~~ 340 (341) T protein:vir:94 294 AVVSK--------APRVTQSFENREQVWLMVGRQAYGARLYRPLHA-VNIHTTGDT 340 (341) T ss_pred ccccc--------cccccccchhhhhhhhhhhhhhhcccccCccee-EEEecCcCC Confidence 00000 122233333333333445567999999999996 4442 2211 No 14 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.89 E-value=1.4e-25 Score=156.90 Aligned_cols=348 Identities=10% Similarity=-0.010 Sum_probs=203.8 Q ss_pred Ccc--------------c-cccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccc-cc Q lcl|NC_019501. 1 MAL--------------N-EGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGW-NL 63 (431) Q Consensus 1 ~~~--------------~-~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~-~~ 63 (431) ||. . -++++ +..-+|+++.|+++++|.+++. +++|+- +.||||+||.+......+-. .. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~-~~~~~~---~~GdTV~ip~~g~~~a~d~~~g~ 76 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATK-KIPFEG---KKGDLIHIPNISRAAVYDKQPQT 76 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccc-ccccee---ecCceEEeeccCcceeeeecCCC Confidence 332 1 13344 4455899999999999999984 456754 45999999988765543321 12 Q ss_pred ccCccccccceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeec----cCCC- Q lcl|NC_019501. 64 TGNATGILELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVH----DTRA- 136 (431) Q Consensus 64 s~~~~di~e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t----~~~t- 136 (431) ....+++.+..++++||+.+...+.+++.| ...-++.+++++++..+||+++|.+++.++......... .... T Consensus 77 ~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i 156 (381) T protein:vir:80 77 PVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTL 156 (381) T ss_pred cccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccc Confidence 335667889999999999999999999755 457789999999999999999999999887654432111 0000 Q ss_pred --------CCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchh Q lcl|NC_019501. 137 --------IGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAG 208 (431) Q Consensus 137 --------~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~G 208 (431) .........+..+.++++.|++..||.++ |.++++|..+..+.... .+...+......+|+|.|++ +.| T Consensus 157 ~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~eg-R~lvv~P~~~~~Ll~~~-~~~~ad~~~~~~l~~G~Ig~-i~G 233 (381) T protein:vir:80 157 GDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEG-RIVMVSPAQYIDLLSIN-QFISVDFSQVKPVTSGVVGT-ILG 233 (381) T ss_pred cccccccccccchhhHHHHHHHHHHHHHhhcCCCcCC-cEEEeCHHHHHHHhhch-hhhhhhhccchhhhceeeeE-Ecc Confidence 00111222467789999999999999876 66899999999876542 22223334556899999997 899 Q ss_pred hhhhHhcCCCcccccccccccee-cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccc Q lcl|NC_019501. 209 FDEILRSPKLPAVTKSTATGVTV-SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAK 287 (431) Q Consensus 209 fd~~~~s~~v~~~t~gt~~~~tv-~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk 287 (431) |+ |+++.++|.... +.+.. .|++.. ....+ .+.. +..... -..++.+++|.-|.-+......++-.++ T Consensus 234 ~~-Vv~Sn~lp~~~~---t~~~~~agap~~--~~~~~--~~~~--~~g~~s-~~a~av~~~k~yd~~~~~~~~~~~~~~g 302 (381) T protein:vir:80 234 ME-VIVTTQIGINSL---TGYVNGQGAPTQ--PTPGV--LGSP--YLPDQA-GTANVVNTGSASDLAVSLSYFGLPVFSG 302 (381) T ss_pred eE-EEeecccccccc---cceeeecccccc--ccccc--cccc--cccccc-cceeeeeeeeeeceeeeeeeccceeeec Confidence 97 789999987422 22111 122211 00011 1111 111111 1123345566666655544333322222 Q ss_pred cccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecc Q lcl|NC_019501. 288 NVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPI 367 (431) Q Consensus 288 ~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl 367 (431) .. .+.+...+++.++++ |.|.+-++++-|= T Consensus 303 ~~--------~~~~~~~~~~~~~~~------------------------------------------~~~~~~~~~~~~~ 332 (381) T protein:vir:80 303 AG--------ATAADGGQTLGSFGG------------------------------------------ANRWATAVVCHPD 332 (381) T ss_pred ce--------eeecCCCceeeeehh------------------------------------------hhhhhhhcccccc Confidence 11 111122223333221 4444433332221 Q ss_pred cCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 368 PVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 368 ~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) = +..++++. +.+ +-+..+.||+| ....| +.||++.+||.++ |-|----. T Consensus 333 ~--~~~~~~~~--~~~----~~~~~~~~~~~----~~~~~--~~~~~~~~~~~~~-~~~~~~~~ 381 (381) T protein:vir:80 333 W--LAVGVQQN--VKS----ESSRETMYLAD----AFVTS--CVYGAKVFRPDHC-VLLHTSGI 381 (381) T ss_pred c--ccccceeE--eec----ccchhheeehh----hhhhh--hhhccccccchhh-hhhhhcCC Confidence 0 00111222 111 35678888885 33333 6899999999984 55532212 No 15 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.78 E-value=5.5e-21 Score=131.63 Aligned_cols=300 Identities=12% Similarity=0.084 Sum_probs=185.0 Q ss_pred Cc-------------------cccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc-- Q lcl|NC_019501. 1 MA-------------------LNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT-- 59 (431) Q Consensus 1 ~~-------------------~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~-- 59 (431) || ..++.+|+.+-.||+..|+...+|..++.. |+ .+.|+++.||.=-+.+..+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~-~~-----~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-RS-----IASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhcccc-cc-----ccccceeEeeeccceeeeeec Confidence 32 223456666679999999999999999943 22 4569999998765555443 Q ss_pred -ccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhcccccee------ Q lcl|NC_019501. 60 -GWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLV------ 130 (431) Q Consensus 60 -g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v------ 130 (431) |.....+++++......++||+++-..|.+.+ +.....++...+.+.+..+||.++|+.++.+......+- T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 44444566677788888999999988877763 223455677788889999999999999997654322110 Q ss_pred ----eccC----CCCCCccc-------hhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhh Q lcl|NC_019501. 131 ----VHDT----RAIGPSTG-------LSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTED 195 (431) Q Consensus 131 ----~t~~----~t~~~~~~-------~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~ 195 (431) +..+ ..+.++.. ...+..+-++++.|++..||.++ |-++++|..+..+....- +...+...+. T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~g-R~~vv~P~~y~~LL~~~~-~~~~d~~~~~ 232 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAAD-RTFYTTPDNYSAILAALM-PNAANYQALI 232 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccC-CEEEeCHHHHHHHhcccc-cccccccccc Confidence 0000 01111111 11256677889999999999987 669999999988766432 2222223446 Q ss_pred hHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEE Q lcl|NC_019501. 196 AYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKIS 275 (431) Q Consensus 196 a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~T 275 (431) .+++|.++. +.||+ ||++.++|....+..... . T Consensus 233 ~~~~G~Vg~-i~G~~-V~~Sn~lp~~~~t~~~~~---------------------------------------------~ 265 (347) T protein:vir:15 233 DHERGTIRN-VMGFE-VVEVPHLTAGGAGDTRED---------------------------------------------A 265 (347) T ss_pred cccceEEEE-EeceE-EEeccccccccccccccc---------------------------------------------c Confidence 799999985 88997 799999985321110000 0 Q ss_pred EcceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeee Q lcl|NC_019501. 276 FTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFW 355 (431) Q Consensus 276 iaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~f 355 (431) ++| +.|...++ .+.++. ..+ +-.+=|+| T Consensus 266 ~~g---------------~~~~~~~~---~~~~~~------------------------------~~f----~~~~~l~~ 293 (347) T protein:vir:15 266 PAD---------------QKHAFPAT---SSTTVK------------------------------VAL----DNVVGLFQ 293 (347) T ss_pred ccc---------------cccccccc---ccceee------------------------------ecc----ccceeeee Confidence 011 01111110 011110 000 00124999 Q ss_pred ccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 356 ADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 356 hr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ||+|...+..-. + .+...+|.+..-...+--..||++.+|||.+ |.|.=+|. T Consensus 294 h~~A~g~v~~~~---------------------~--~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~a-v~~~~~~~ 345 (347) T protein:vir:15 294 HRSAVGTVKLKD---------------------L--ALERARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred ccceeeeeEeec---------------------e--eeeecccchhhhhhhehhhhcCCceeccccE-EEEecCCC Confidence 999887665221 1 1111223333444556677999999999997 56666666 No 16 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.75 E-value=1.9e-20 Score=128.67 Aligned_cols=300 Identities=12% Similarity=0.091 Sum_probs=182.1 Q ss_pred Ccc-ccc------------------cchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc-- Q lcl|NC_019501. 1 MAL-NEG------------------QLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT-- 59 (431) Q Consensus 1 ~~~-~~~------------------~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~-- 59 (431) ||+ +-+ -+|+.+-.||+..|+...+|+++|.. |+ ++.|++|.|+.=-+.+..+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~-r~-----~~~G~sv~i~~iG~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-RS-----IASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhcc-cc-----ccccceeEeeeccceeeeeec Confidence 552 222 25666668999999999999999953 33 3559999999665554433 Q ss_pred -ccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhcccccee------ Q lcl|NC_019501. 60 -GWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLV------ 130 (431) Q Consensus 60 -g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v------ 130 (431) |.....++.++......+++|+++-..|.+.+ +.....++...+.+.+..+||.++|+.++.+........ T Consensus 75 ~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 55544566677788888999999987776664 334455666778889999999999999985532211100 Q ss_pred ----e----ccCCCCCCccch-------hhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhh Q lcl|NC_019501. 131 ----V----HDTRAIGPSTGL-------SGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTED 195 (431) Q Consensus 131 ----~----t~~~t~~~~~~~-------~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~ 195 (431) + .....+.+++.. ..+..+-++++.|++..||.++ |-++++|..+..+....- +....-...+ T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~g-R~~vv~P~~y~~Ll~~~~-~~~~d~~~~~ 232 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAAD-RTFYTTPDNYSAILAALM-PNAANYQALL 232 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccC-cEEEeCHHHHHHHhcccc-cccccccccc Confidence 0 000111112221 2356677899999999999977 669999999988765321 1122222345 Q ss_pred hHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEE Q lcl|NC_019501. 196 AYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKIS 275 (431) Q Consensus 196 a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~T 275 (431) .+++|.+++ +.||+ ||++.++|...........+.|+. -. T Consensus 233 ~~~~G~V~~-i~G~~-V~~Sn~lp~~~~~~~~~~~~ag~~--------------------------------------~~ 272 (347) T protein:vir:33 233 DPERGTIRN-VMGFE-VVEVPHLTAGGAGDTREDAPADQK--------------------------------------HA 272 (347) T ss_pred ccccceeEE-Eecee-EEEecccccCcccccccccccccc--------------------------------------cc Confidence 789999986 88997 789999987421111111111110 00 Q ss_pred EcceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeee Q lcl|NC_019501. 276 FTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFW 355 (431) Q Consensus 276 iaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~f 355 (431) +. ++ .+.++.++ .+-..-|+| T Consensus 273 ~~--------------------~~-----~~~~~~~a----------------------------------~~~~~gl~~ 293 (347) T protein:vir:33 273 FP--------------------AT-----SSTTVKVA----------------------------------LDNVVGLFQ 293 (347) T ss_pred cc--------------------CC-----cccceecc----------------------------------ccceeeeee Confidence 00 00 00001000 011124999 Q ss_pred ccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 356 ADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 356 hr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ||+|+..+-.- .+.+...||.+..-..++--+.||++++|||.+ |.|.=+|. T Consensus 294 h~~A~g~v~~~-----------------------~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~a-v~i~~~~~ 345 (347) T protein:vir:33 294 HRSAVGTVKLK-----------------------DLALERARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred cchhheeeeee-----------------------ceeeeeccchhhhhHhhhhhhhcCCceecccce-EEEecCCC Confidence 99987544311 111222234444445666778999999999997 56666666 No 17 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.74 E-value=7.8e-21 Score=130.82 Aligned_cols=304 Identities=14% Similarity=0.110 Sum_probs=184.2 Q ss_pred Cccccc------------------cchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccc---c Q lcl|NC_019501. 1 MALNEG------------------QLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQ---T 59 (431) Q Consensus 1 ~~~~~~------------------~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~---~ 59 (431) ||.--. -+|+-+.-||...|+...++.+++..+ -.+.|++|.||.=-+.... . T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r------~i~~G~sv~i~~iG~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVR------TIQNGKSAQFPVMGRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc------cccccceEEEecccceeeeeecC Confidence 433222 234555688888899999999888422 2467999999866555443 3 Q ss_pred ccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCC Q lcl|NC_019501. 60 GWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAI 137 (431) Q Consensus 60 g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~ 137 (431) |..+..+++++....+.|++|+++-..|.+.+ +.....++...+.+.+..+||..+|+.++.++..+.++...+...+ T Consensus 75 G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~ 154 (347) T protein:vir:94 75 GERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENI 154 (347) T ss_pred CCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 55665677778888999999999977776664 4455666777888999999999999999987765444333221111 Q ss_pred CC-------------------ccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHh Q lcl|NC_019501. 138 GP-------------------STGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYR 198 (431) Q Consensus 138 ~~-------------------~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r 198 (431) .+ ......+..+-++++.|++..||.++ |-++++|..+..+.... ............++ T Consensus 155 ~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~-R~~vv~P~~~~~Ll~~~-~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 155 AGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGD-RYFYTTPDNYSAILAAL-MPNAANYAALIDPE 232 (347) T ss_pred CCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCC-cEEEeCHHHHHHHhccc-hhhhhhcccccccc Confidence 10 01123356677899999999999987 66889999998875432 22222223344689 Q ss_pred hccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcc Q lcl|NC_019501. 199 NGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTG 278 (431) Q Consensus 199 ~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaG 278 (431) +|.+++ +.||+ ||+|.++|....+. . +. .+..-+.+| T Consensus 233 ~G~Vg~-i~G~~-V~~Sn~lp~~~~t~---~-~~-------------------------------------~~~~~~~aG 269 (347) T protein:vir:94 233 TGNIRN-VMGFV-VVEVPHLVQGGAGE---T-RG-------------------------------------DDGITIASG 269 (347) T ss_pred ccceEE-EeceE-EEecCccccccccc---c-cc-------------------------------------cCcceecCc Confidence 999986 88997 79999999532110 0 00 111111222 Q ss_pred eeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccc Q lcl|NC_019501. 279 VKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADD 358 (431) Q Consensus 279 V~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~ 358 (431) - ..-|.=+ .++ .+ -+..+.+.=|+|||+ T Consensus 270 ~-------------~~~~~~~---~~~--~~----------------------------------~~~~~~~~~l~~h~~ 297 (347) T protein:vir:94 270 Q-------------KHAFPAT---ASS--DV----------------------------------KVTMDNVVGLFSHRS 297 (347) T ss_pred c-------------ccccccc---chh--hh----------------------------------cccccceeEEEeehh Confidence 0 0000000 000 00 001111234999999 Q ss_pred eeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 359 SIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 359 A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) |...+-.-. +.++ .+||.+......+==+.||++.+|||.+++.-+. .| T Consensus 298 A~~~v~~~~---------------------~~~e--~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 298 AVGTVKLRD---------------------LALE--RDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred hhhhhhccc---------------------cccc--chhchhhHHHHhhhhhhhcCcccccceeEEEEec-CC Confidence 987443211 1111 2333333332323346799999999999877666 66 No 18 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.71 E-value=6.5e-20 Score=125.77 Aligned_cols=291 Identities=12% Similarity=0.086 Sum_probs=179.1 Q ss_pred Cccc---cccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---ccccccCccccccce Q lcl|NC_019501. 1 MALN---EGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GWNLTGNATGILELS 74 (431) Q Consensus 1 ~~~~---~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~~~s~~~~di~e~~ 74 (431) =+-. ..-+|+..--||++.|++..+|++++.. |+. +.|+||.||.=.+.+..+ |.....+ .++.... T Consensus 19 ~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~-r~i-----~~G~tv~i~~ig~~~~~~~~~g~~l~~~-~~~~~~~ 91 (332) T protein:vir:78 19 NADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YDL-----RGGKSKQFMFTGKLSAGYHTPGTPIVGD-AGIKANE 91 (332) T ss_pred ccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc-ccc-----cccceEEEEeccceeEeeecCCCCCCCC-CCCCCce Confidence 1122 2356777778999999999999999952 443 569999999776665544 3333222 2566677 Q ss_pred eEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeecc----C------CCCCCccc Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHD----T------RAIGPSTG 142 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~----~------~t~~~~~~ 142 (431) +.+++|+.+-..|.+.+ +.....++...+.+.+..+||.++|..++.++.......... + +.....++ T Consensus 92 ~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~~~ 171 (332) T protein:vir:78 92 KTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDA 171 (332) T ss_pred EEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccccccCCccccCH Confidence 88999999987777753 223455677788889999999999999999887643321100 0 11112233 Q ss_pred hhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhcc-c-cchhhhHhhcc-ccccchhhhhhHhcCCCc Q lcl|NC_019501. 143 LSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIF-G-RVTEDAYRNGT-IQRQIAGFDEILRSPKLP 219 (431) Q Consensus 143 ~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~-~-~~~~~a~r~g~-i~r~~~Gfd~~~~s~~v~ 219 (431) ...+..+-++++.|++..||.++ |-++++|..+..+.........+ + ......+++|. |+ +++||+ ||++.++| T Consensus 172 ~~~~~~i~~a~~~Lde~~VP~~g-R~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~-~i~G~~-V~~Sn~lp 248 (332) T protein:vir:78 172 QAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY-SIAGIR-ILKSNNLA 248 (332) T ss_pred HHHHHHHHHHHHHHhhcCCCccC-CEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeee-EEeeeE-EEecCccc Confidence 34567788999999999999887 66899999998886532222211 1 11234678886 65 699997 89999998 Q ss_pred cccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEE Q lcl|NC_019501. 220 AVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSIT 299 (431) Q Consensus 220 ~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVt 299 (431) ..++...... + ++|++ .-|.+ T Consensus 249 ~~~g~~~~~~----~-----------------------------------------~~~~~-------------n~~~~- 269 (332) T protein:vir:78 249 GLYGQDLSSA----A-----------------------------------------VTGEN-------------NDYQV- 269 (332) T ss_pred cCcccccccc----c-----------------------------------------ccccc-------------ccccc- Confidence 5321100000 0 00100 00111 Q ss_pred EeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEee-cccCCCCCcceee Q lcl|NC_019501. 300 RVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQ-PIPVTHELFAGMK 378 (431) Q Consensus 300 a~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~-pl~~p~g~~~a~~ 378 (431) ++ +.+.=|+|||+|.+++.. ++.+ T Consensus 270 ------------------------------------------~~----~~~~~~~~h~~a~~~v~~~~~~~--------- 294 (332) T protein:vir:78 270 ------------------------------------------DA----SALAGLIFHREAAGCIQSVAPTI--------- 294 (332) T ss_pred ------------------------------------------cc----ccceEEeecccceeeeeeeccch--------- Confidence 01 112249999998766531 2211 Q ss_pred EEEeecCcceEEEEEEe-cccccccceEEEEEeeccceecccceeEEecCC Q lcl|NC_019501. 379 TSSFSIPGIGVNGIFAT-QGDINTLSGKCRIAVWYSACAVRPEAIGVGLPN 428 (431) Q Consensus 379 ~~t~~~~g~glslrv~~-~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~ 428 (431) ++.. .|+.+......+=-+.||++.+|||.+++.-+- T Consensus 295 -------------~~t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 295 -------------QTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred -------------hhhhcccchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 1111 122222222334456899999999999877766 No 19 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.69 E-value=1.6e-19 Score=123.60 Aligned_cols=299 Identities=14% Similarity=0.111 Sum_probs=179.8 Q ss_pred Ccccc----------cc----------chhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccc--- Q lcl|NC_019501. 1 MALNE----------GQ----------LVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPT--- 57 (431) Q Consensus 1 ~~~~~----------~~----------~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~--- 57 (431) ||++- .+ +|+.+--||+..|+..++|.+++.. |. .+.|+++.+|.=-+.+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~-r~-----i~~g~s~~~~~iG~~~~~~~ 74 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-RS-----ISSGKSAQFPVLGRTQAAYL 74 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccccee-ee-----ecccceEEEEeeceeEEEee Confidence 77441 11 5666668999999999999999953 32 45599999886533332 Q ss_pred ccccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhcccccee----- Q lcl|NC_019501. 58 QTGWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLV----- 130 (431) Q Consensus 58 ~~g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v----- 130 (431) ..|..+....+|+.-..+.|++|+.+-..|.+.+ +-....++...+.+.+..+||..+|+.++.+......+. T Consensus 75 ~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 75 APGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred ecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 2355554445567778888999999987777774 224556677778889999999999999986654322110 Q ss_pred ----------e-c-cCCC---CCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhh Q lcl|NC_019501. 131 ----------V-H-DTRA---IGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTED 195 (431) Q Consensus 131 ----------~-t-~~~t---~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~ 195 (431) . . ..+. .+.......+..+-++++.|++..||.++ |-++++|..+..+...- .+......... T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~g-R~~vv~P~~y~~Ll~~~-~~~~~~~~~~~ 232 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSD-RVFYCDPDSYSAILAAL-MPNAANYAALI 232 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccC-CEEEeChHHHHHHhhcc-ccccccccccc Confidence 0 0 0000 11111123466778899999999999887 66889999998874432 11112222345 Q ss_pred hHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEE Q lcl|NC_019501. 196 AYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKIS 275 (431) Q Consensus 196 a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~T 275 (431) .+++|.++. ++||. +|++.++|....++.... T Consensus 233 ~~~~G~V~~-v~G~~-V~~Sn~lp~~~~~~~~~~---------------------------------------------- 264 (344) T protein:vir:10 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTSREG---------------------------------------------- 264 (344) T ss_pred ceeeeEEEE-EeceE-EEeccccccccCCccccc---------------------------------------------- Confidence 689999986 88996 899999985211111000 Q ss_pred EcceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeee Q lcl|NC_019501. 276 FTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFW 355 (431) Q Consensus 276 iaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~f 355 (431) ++|-. | .+|+. ++....+ ..+.+.-|+| T Consensus 265 ~tg~~---------------~------------~~~~~-----------------------~~~~~~~--~~s~~~~l~~ 292 (344) T protein:vir:10 265 TTGQK---------------H------------AFPAT-----------------------KSGNDKV--AKDNVIGLFM 292 (344) T ss_pred ccCcc---------------c------------cccCC-----------------------cccceee--ecceeEEEee Confidence 11100 0 00000 0001111 1111234899 Q ss_pred ccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 356 ADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 356 hr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ||+|...+-.-. +.++. +||.+......+==+.||.+.+|||.+|++..=+| T Consensus 293 h~~A~~~v~~~~---------------------~~~e~--~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 293 HRSAVGTVKLRD---------------------LALER--ARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred chhhhhhhhhcc---------------------ceeec--ccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 999985443111 11111 11222122121223579999999999999999899 No 20 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.63 E-value=3.6e-18 Score=116.25 Aligned_cols=303 Identities=15% Similarity=0.131 Sum_probs=175.5 Q ss_pred Cccc-ccc------------------chhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccc--- Q lcl|NC_019501. 1 MALN-EGQ------------------LVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQ--- 58 (431) Q Consensus 1 ~~~~-~~~------------------~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~--- 58 (431) ||++ -++ +|+.+--||++.|+..++|.+++.. |. .+.|+++.+|.=-+.+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-rt-----i~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLV-RS-----IQSGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhh-ee-----ccccceEEeeeccceeEeeee Confidence 5521 111 4555558999999999999999943 22 466999999866555433 Q ss_pred cccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCC- Q lcl|NC_019501. 59 TGWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTR- 135 (431) Q Consensus 59 ~g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~- 135 (431) .|..+-..-.|+.-..+.|++|+.+-..|.+.+ +-....|+...+.+.+..+||..+|+-++.++....++...+.. T Consensus 75 ~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:94 75 PGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNEN 154 (347) T ss_pred cCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 243332222355566778999998887776664 22445566667888999999999999998765543332111000 Q ss_pred -------------------CCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhh Q lcl|NC_019501. 136 -------------------AIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDA 196 (431) Q Consensus 136 -------------------t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a 196 (431) .+....+...+..+-+++..|++..||.++ |-+|++|..+..+....-.. ......... T Consensus 155 ~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~-R~~vv~P~~y~~LLk~~~~~-~~~~~~~~~ 232 (347) T protein:vir:94 155 IAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSD-RVFYTTPDNYSAILAALMPN-AANYQALID 232 (347) T ss_pred cccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCC-CEEEeChHHHHHHHHhhccc-ccccccccc Confidence 000111223467788999999999999887 66999999998876532222 122223446 Q ss_pred HhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEE Q lcl|NC_019501. 197 YRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISF 276 (431) Q Consensus 197 ~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Ti 276 (431) +++|.|+. +.||. +|++.++|....+. ++.+ +|. ++ T Consensus 233 ~~~G~V~~-v~G~~-V~~Sn~~p~~~~~~------~~~~----------------------------------~~~--~~ 268 (347) T protein:vir:94 233 PSTGSIRN-VMGFE-VIEVPHLTAGGAGD------NRAE----------------------------------EGV--AP 268 (347) T ss_pred cccceeEE-eeceE-EEEcCccccccCcc------cccc----------------------------------ccc--cc Confidence 88999986 88996 89999998632111 0000 111 22 Q ss_pred cceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeec Q lcl|NC_019501. 277 TGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWA 356 (431) Q Consensus 277 aGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fh 356 (431) ++ +.+-|.... +..|. +.+ +-+.+|+|| T Consensus 269 ~~-------------~~~~~~~~~------------------------~~~y~-----------~d~----~~~~~l~~~ 296 (347) T protein:vir:94 269 TN-------------QKHAFPDTA------------------------SGDTR-----------VAL----DNVVGLFNH 296 (347) T ss_pred cc-------------ccccccccc------------------------ccccc-----------ccc----cceEEEEec Confidence 22 000000000 00010 001 113479999 Q ss_pred cceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 357 DDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 357 r~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) |+|...+ -+-- +.++. +||.......+.==..||...+|||.+++.+.. +| T Consensus 297 ~~A~~tv--~~~~-------------------~~~e~--~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~-~a 347 (347) T protein:vir:94 297 RSAVGTV--KLKD-------------------MALER--ARRANFQADQIIAKYAMGHGGLRPEACGALVFK-KA 347 (347) T ss_pred hhhhhhh--hhcc-------------------cceee--eechhhhhhhhhhhhhhcCcccccceeEEEEec-CC Confidence 9976533 2211 11111 233332222222235789999999998655544 45 No 21 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.61 E-value=1.1e-17 Score=113.64 Aligned_cols=303 Identities=12% Similarity=0.099 Sum_probs=177.6 Q ss_pred Cccc-------------------cccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccc--- Q lcl|NC_019501. 1 MALN-------------------EGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQ--- 58 (431) Q Consensus 1 ~~~~-------------------~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~--- 58 (431) ||++ ..-+|+.+-.||+..|+...+|.+++.. |+ .+.|+++.+|.=-+.+.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-r~-----i~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-RT-----IQNGKSASFPVMGRTKGYYLA 74 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccc-cc-----ccCcceEEEeeecceeeeeec Confidence 5522 1224565568999999999999999943 22 467999999865444432 Q ss_pred cccccccCccccccceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhcccccee------ Q lcl|NC_019501. 59 TGWNLTGNATGILELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLV------ 130 (431) Q Consensus 59 ~g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v------ 130 (431) .|..+...-.|+.-..+.|+||+.+-..|.+.+-| ....|....+.+.+..+||..+|+-++.......+.. T Consensus 75 ~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:88 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) T ss_pred cccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 23332222235556788999999998777777533 3455677788899999999999999886544322111 Q ss_pred ------------eccCCCCC-CccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhH Q lcl|NC_019501. 131 ------------VHDTRAIG-PSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAY 197 (431) Q Consensus 131 ------------~t~~~t~~-~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~ 197 (431) ++.+.... .......+..+-++++.|+++.||.++ |.++++|..+..+....- ...........+ T Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~g-R~~vv~P~~y~~Ll~~~~-~~~~~~~~~~~~ 232 (347) T protein:vir:88 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD-RRFYCAPEDYSAILSALM-PNAANYAALIDP 232 (347) T ss_pred cCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCC-CEEEeCHHHHHHHhcchh-hhhhhhccccch Confidence 00000000 011112366778899999999999987 669999988877644321 112222233468 Q ss_pred hhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc Q lcl|NC_019501. 198 RNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT 277 (431) Q Consensus 198 r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia 277 (431) ++|.++. +.||+ |+++.++|....+... .|+.+... T Consensus 233 ~~G~vg~-i~G~~-V~~s~nlp~~~~~~~~------------------------------------------~~~~~~~t 268 (347) T protein:vir:88 233 ETGNIRN-VMGFE-VIEVPHLTVGGAGDNN------------------------------------------PADGVAPT 268 (347) T ss_pred hcceeee-eccce-EEEeeccccccccccc------------------------------------------cccccccc Confidence 9999986 88997 8999999842211100 11111111 Q ss_pred ceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeecc Q lcl|NC_019501. 278 GVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWAD 357 (431) Q Consensus 278 GV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr 357 (431) + ....| ..+..+-| . +..+-+..|+||+ T Consensus 269 ~-------------~~~~~------~~~~~~~~-------------------------------~--~d~~~~~~l~~~~ 296 (347) T protein:vir:88 269 N-------------QKHIF------PATATGDD-------------------------------R--VAQNNVVGLFNHR 296 (347) T ss_pred c-------------ccccc------cccccccc-------------------------------c--cccCcEEEEEech Confidence 1 00000 00000000 0 0112234699999 Q ss_pred ceeEEE-eecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 358 DSIRLL-SQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 358 ~A~aLa-t~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +|+..+ ..+|.. +. +||.+......+==+.||++.+|||.+++.-..-.| T Consensus 297 ~a~g~v~~~d~~~----------e~--------------~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 297 SAVGTVKLKDMAL----------ER--------------ARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhhheeccccee----------ee--------------eechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 998766 333321 11 123222222222235799999999999888777777 No 22 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.59 E-value=5.4e-17 Score=109.76 Aligned_cols=258 Identities=12% Similarity=0.067 Sum_probs=168.0 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc-c---cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA-P---TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~-~---~~~g~~~s~~~~di~ 71 (431) ||++.=++-...+ +.+++++++.+++++++ .++++-++ +.|+||+||+-... . ..+|.+ ..++.+. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~--~~~~~l~g-~~G~tv~ip~~~~~g~~~~~~eg~~--i~~~~it 75 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEK--IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccc--cccccccC-CCCCEEEEEeeccCCCcccccCCCc--ccccccc Confidence 9998755554333 66778899999999999 56665554 57999999985322 1 122333 3455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ..+..+++++. .--|.+++-+ ++..+...+..+++.+.+|+++|.+++..+......+.+ . ...++.+ T Consensus 76 ~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~--------~-~~~~d~i 145 (274) T protein:vir:93 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------D-ITKLNGL 145 (274) T ss_pred cceeEEEeeee-cccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--------c-ccCHHHH Confidence 78888888663 3457777644 456788888889999999999999999888765433211 1 1235778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++.+. ..|.++++|..++.+.++....+..... ....+++|.||+ +.||+ ++.+.++|. T Consensus 146 ~dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~-------- 212 (274) T protein:vir:93 146 QSAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA-------- 212 (274) T ss_pred HHHHHHhhhccC---CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccce-ecCee-EEEcCCCCc-------- Confidence 889999998764 3477999999998876543222211111 123456666665 55664 444433330 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:93 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ -++||+.||.+...+. T Consensus 213 ---------------------------------------~t--~~l~~~gai~~~~~~~--------------------- 230 (274) T protein:vir:93 213 ---------------------------------------GT--AILAKKGAVKLILKRD--------------------- 230 (274) T ss_pred ---------------------------------------ce--EEEEeCCeEEEEecCC--------------------- Confidence 00 1566777777654332 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ++ +..++|.+......+.+..||++.++|+-. |.|.-..| T Consensus 231 ~~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~-v~~t~~~~ 270 (274) T protein:vir:93 231 FF--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred cc--cccccchhhcccEEEEEEEEEEEEEcCCce-EEEeeCcc Confidence 11 122346666777888899999999999985 66665555 No 23 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.58 E-value=5.7e-17 Score=109.66 Aligned_cols=265 Identities=14% Similarity=0.095 Sum_probs=168.9 Q ss_pred Ccccc---ccchhH--HHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccc----cccccccccCccccc Q lcl|NC_019501. 1 MALNE---GQLVTY--ALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAP----TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~---~~~lt~--~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~----~~~g~~~s~~~~di~ 71 (431) ||..- .+++-+ +-+.++++|++.+++++++. ++++-++ +.||||+||.-...- ..+|.. ..++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~--~~~~l~g-~~G~tv~ip~~~~~g~a~~~~~g~~--i~~~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAP--IDNSLEG-QPGSEITVPKYKYIGDAQDVAEGAA--IDYSALE 75 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccce--ecccccC-CCCCEEEEeeeccCCcceeecCCCc--Ccccccc Confidence 98643 333332 12667888999999999993 4444343 579999999853221 122322 2455777 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ..+..+++++.+. -|.+++-+ ++..++.++..+++...++.++|.++++.++.....+. +++......+.+..+ T Consensus 76 ~~~~~~~i~~~~~-a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~---~~~t~~~~~~~~~~~ 151 (278) T protein:vir:80 76 TESVKHGIKKAGK-GVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVK---GAINIGLIDKIENTF 151 (278) T ss_pred cceeeEeeehhhc-cccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---cccccchhhhHHHHH Confidence 8888888877543 57777644 35678889999999999999999999998887554432 222233333456778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhc-cccchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDI-FGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~-~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+++..|++..+|.. +.++++|..++.|.++...... ........+++|.||+ +.||+ ++.+.++|. T Consensus 152 ~da~~~l~~~~~~~~--~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~-------- 219 (278) T protein:vir:80 152 TDAPDAIEDESITTT--GVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGE-LLGWE-IVRTKKLAD-------- 219 (278) T ss_pred HHHHHhhcccCCCcc--cEEEECHHHHHHHHhhhhhhccccccccccceeecccee-eccee-EEEcCCCCc-------- Confidence 889999999999974 3488899998887554222221 1111223466777765 66775 455444430 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) | T Consensus 220 -------------------------------------------------~------------------------------ 220 (278) T protein:vir:80 220 -------------------------------------------------G------------------------------ 220 (278) T ss_pred -------------------------------------------------c------------------------------ Confidence 0 Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .-++||++||.+.... + T Consensus 221 ------------------------------------------t~~l~~~gAi~~~~~~---------------------~ 237 (278) T protein:vir:80 221 ------------------------------------------NALAVKAGALKTFLKR---------------------N 237 (278) T ss_pred ------------------------------------------eEEEEeccceeeeecC---------------------C Confidence 0145677777653211 1 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ++++ .++|.+......+.+..||++.++||-. |.|.=+-+ T Consensus 238 ~~vE--~~Rd~~~~~d~i~~~~~yg~~v~~~~~~-v~it~~a~ 277 (278) T protein:vir:80 238 LLAE--SGRDMDHKLTKFNADQHYAVALVDETKA-VKVVPVAG 277 (278) T ss_pred cccc--cccchhhccceeeeeeEEEEEEEcCcce-EEEeeccC Confidence 2222 2335566677778889999999999986 55544333 No 24 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.56 E-value=6.4e-17 Score=109.37 Aligned_cols=300 Identities=15% Similarity=0.127 Sum_probs=175.1 Q ss_pred Ccccc--------------------ccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc- Q lcl|NC_019501. 1 MALNE--------------------GQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT- 59 (431) Q Consensus 1 ~~~~~--------------------~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~- 59 (431) ||.+- .-+|+.+--||++.|+..+++.+++.. |+. +-|+++.+|.=-+.+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r~i-----~~gks~~~~~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-RSI-----SSGKSAQFPVLGRTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-eec-----cccceEEEeeecceEEEee Confidence 33322 234555557899999999999999943 333 349999888654444332 Q ss_pred --ccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeec--- Q lcl|NC_019501. 60 --GWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVH--- 132 (431) Q Consensus 60 --g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t--- 132 (431) |..+..++.|+.-....|++|+.+-..|.+.+ +-....|+...+.+.+..+||..+|+-++.+......+..+ T Consensus 75 ~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~ 154 (345) T protein:vir:22 75 APGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNE 154 (345) T ss_pred ecCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 33333333344445566999999987777774 22455667777889999999999999888655432221110 Q ss_pred --------------cCCC---CCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhh Q lcl|NC_019501. 133 --------------DTRA---IGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTED 195 (431) Q Consensus 133 --------------~~~t---~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~ 195 (431) ..+. .+..+....|..+-++++.|++..||.++ |-++++|..+..+....- +.......+. T Consensus 155 ~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~-R~~vv~P~~y~~Ll~~~~-~~~~~~~~~~ 232 (345) T protein:vir:22 155 NIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAAD-RVFYCDPDSYSAILAALM-PNAANYAALI 232 (345) T ss_pred cccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccC-CEEEeChHHHHHHhcccc-cccccccccc Confidence 0011 11122234578888999999999999987 669999999988744321 1112222345 Q ss_pred hHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEE Q lcl|NC_019501. 196 AYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKIS 275 (431) Q Consensus 196 a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~T 275 (431) .+++|.++. ++||. +|++.++|....++.... + .++... T Consensus 233 ~~~~G~V~~-i~G~~-V~~sn~lp~~~~~~~~~~---~------------------------------------~~~~~~ 271 (345) T protein:vir:22 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTAREG---T------------------------------------TGQKHV 271 (345) T ss_pred ccccceEEE-EeceE-EEecccccccccCccccC---c------------------------------------cccccc Confidence 689999986 89996 999999885221111100 0 111111 Q ss_pred EcceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeee Q lcl|NC_019501. 276 FTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFW 355 (431) Q Consensus 276 iaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~f 355 (431) +.. . .++ ...++ ..+.++=|+| T Consensus 272 ~~~-----------------------~-~g~--------------------------------~~~~~--~~~~~~~l~~ 293 (345) T protein:vir:22 272 FPA-----------------------N-KGE--------------------------------GNVKV--AKDNVIGLFM 293 (345) T ss_pred ccc-----------------------c-ccc--------------------------------eeeee--ccCceEEEEE Confidence 111 0 000 00000 0011235999 Q ss_pred ccceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCC Q lcl|NC_019501. 356 ADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQT 430 (431) Q Consensus 356 hr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~ 430 (431) ||+|...+-.-. ..++... ...++.| ..+==+.||++.+|||.+++...--+ T Consensus 294 h~~A~~~v~~~~---------~~~e~~r--------~~~~~~d------~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 294 HRSAVGTVKLRD---------LALERAR--------RANFQAD------QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ehhheeeeeeec---------ceeeeee--------chhHHHH------HHHHHHhcCCcccccceeEEEEEeeC Confidence 999776443111 0111111 1112222 11123579999999999999888777 No 25 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.56 E-value=2.1e-16 Score=106.53 Aligned_cols=258 Identities=13% Similarity=0.087 Sum_probs=168.9 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc---c-cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA---P-TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~---~-~~~g~~~s~~~~di~ 71 (431) ||+..=+|-...+ +.+++++++.+++++++ .++++=++ +.||||+||.-... . ..+|.+. .++.+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~--~~d~~l~g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccc--eecccccC-CCCCEEEEeeecCCCccccccCCCcc--chhhcc Confidence 9998766555443 56778899999999999 67766554 57999999974321 1 2223333 344566 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ..+..+++.+ ..--|.+++-+ +...+...+.++++...||+++|.+++.........+.. . ...++.+ T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~-------~--a~~~d~i 145 (274) T protein:vir:12 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-------D--ITKLNGL 145 (274) T ss_pred cceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------c--ccCHHHH Confidence 7777777755 34457777644 346788888889999999999999999887764433211 1 1236778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhcccc-chhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGR-VTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~-~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++... ..|.++++|..++.+.++....+.... .....+|+|.||+ +.||+ ++.+.++|.. T Consensus 146 ~dA~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~~------- 213 (274) T protein:vir:12 146 QSAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRSNKLEAG------- 213 (274) T ss_pred HHHHHHhccccc---cccEEEeCHHHHHHHHhhhhhhccccccccccceeccccee-ecCee-EEEeCCCCcc------- Confidence 889999988653 457899999999888665332222221 1234567777775 66775 5554444310 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 214 -------------------------------------------------------------------------------- 213 (274) T protein:vir:12 214 -------------------------------------------------------------------------------- 213 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) + -++|++.||.+.... + T Consensus 214 ----------------------------------------t--~~l~~~gA~~~~~~~---------------------~ 230 (274) T protein:vir:12 214 ----------------------------------------T--AILAKKGAVKLILKR---------------------D 230 (274) T ss_pred ----------------------------------------e--EEEEeccceeeeecC---------------------C Confidence 0 145555555543322 1 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ++ +..++|++......+-+..||++.++|+-. |.|.-..| T Consensus 231 ~~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:12 231 FF--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred ce--eccccchhhcccEEEeeeEEEEEEEcCCce-EEEEcCCc Confidence 22 223346666777888889999999999985 77876666 No 26 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.55 E-value=3e-16 Score=105.71 Aligned_cols=323 Identities=16% Similarity=0.080 Sum_probs=176.7 Q ss_pred Cc-cc-------------cccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---cccc Q lcl|NC_019501. 1 MA-LN-------------EGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GWNL 63 (431) Q Consensus 1 ~~-~~-------------~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~~~ 63 (431) |. +| ..-+|+.+--||+..|+..+++..++.+ |.. +-|+++.++.=-+.+... |... T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~-rti-----~~Gksv~f~~iG~~t~~~~t~G~~i 82 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTK-RTL-----KNGKSLQFIYTGRMTSSFHTPGTPI 82 (375) T ss_pred cCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccc-ccc-----ccCceEEEEeeeeeEEeeecCCcCc Confidence 22 11 1234555557899999999999999953 333 449999888665554443 3222 Q ss_pred ccCcc-ccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhcccccee--------ec Q lcl|NC_019501. 64 TGNAT-GILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLV--------VH 132 (431) Q Consensus 64 s~~~~-di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v--------~t 132 (431) .+++. |+--....|++|+.+-..|.+.+ +.....++...+.+.+..+||..+|+.++.++.....+. +. T Consensus 83 ~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~ 162 (375) T protein:vir:10 83 LGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATNFVE 162 (375) T ss_pred CCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 22221 22223345999999887777774 334566777788899999999999999998776432111 00 Q ss_pred cCCC----------CCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhh--hhccccchhhhHhhc Q lcl|NC_019501. 133 DTRA----------IGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVD--GDIFGRVTEDAYRNG 200 (431) Q Consensus 133 ~~~t----------~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~--~~~~~~~~~~a~r~g 200 (431) ++++ ....++...+..+-++++.|++..||..+ |-++++|..+..+..+.-. +.+.+-.....+.+| T Consensus 163 ~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~-R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~~~g 241 (375) T protein:vir:10 163 PGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQG-RCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQSGN 241 (375) T ss_pred cCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCC-CEEEeChHHHHHHHhcCCccceeeecccccceeccc Confidence 0110 00112334578888999999999999877 6699999998877543211 111122233456788 Q ss_pred cccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEccee Q lcl|NC_019501. 201 TIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVK 280 (431) Q Consensus 201 ~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~ 280 (431) .+++ ++||. +|++.++|..+... .. .|+.- + . +..-+.++.+. T Consensus 242 ~v~~-i~Gv~-V~~Sn~lP~~~~~~---~~-~g~~~------------~------------~--~a~~~~~~~~~----- 284 (375) T protein:vir:10 242 GVIE-IAGIH-IYKSMNIPFLGKYG---VK-YGGTT------------G------------E--TSPGNLGSHIG----- 284 (375) T ss_pred eEEE-EeceE-EEEecccccccccc---cc-ccccc------------c------------c--cchhhhhcccc----- Confidence 8875 88997 99999999763211 00 01100 0 0 00001111111 Q ss_pred eeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeecccee Q lcl|NC_019501. 281 FLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSI 360 (431) Q Consensus 281 ~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~ 360 (431) |.+. ++ ++++...+.=...+..- +-+.-|+|||+|. T Consensus 285 ------------------------------~~~~---~~----------~~~~g~~~~y~~d~~~~-~~~~~~~~~~~A~ 320 (375) T protein:vir:10 285 ------------------------------PTPE---NA----------NATGGVNNDYGTNAELG-AKSCGLIFQKEAA 320 (375) T ss_pred ------------------------------ccCC---cc----------eeecccccccccccccc-CceEEEEEchhhe Confidence 1100 00 00000000000111000 1133599999987 Q ss_pred EEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 361 RLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 361 aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) .-+ -+ + +.+... + +=..++.||.| .+.=-+.||...+|||.|+..=.+-+| T Consensus 321 g~v--~~-~----~~~~~~--~-----~~~~~~~~q~~------~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 321 GVV--EA-I----GPQVQV--T-----NGDVSVIYQGD------VILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred eee--ee-e----cccccc--c-----cchhhheeeee------eeeeeeeeccCccCceeEEEEecCcCc Confidence 644 11 0 001111 0 00234444443 222346799999999998766566555 No 27 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.55 E-value=4.5e-16 Score=104.73 Aligned_cols=258 Identities=13% Similarity=0.067 Sum_probs=168.0 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccc-cccc---cccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQE-APTQ---TGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~-~~~~---~g~~~s~~~~di~ 71 (431) ||...=++-...+ +-+++.+++.+++++++ -++++-++ +.||||+||.-.. .... +|.+ ..++.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~--~~~~~l~g-~~G~tv~ip~~~~~g~~~~~~~g~~--i~~~~it 75 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFA--DIDSTLVG-QPGDTLTFPAFTYSGDAQVIAEGEK--IPVDQIG 75 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccc--cccccccC-CCCCEEEEEeeccCCCccccCCCCc--Cchhhcc Confidence 9986655554443 45566799999999998 45554343 5799999997532 1222 2322 2455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) .....+++++ ..--|.+++-+ +...+...+..+++...||+++|.+++.........+ .... ..++.+ T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~--------~~~~-~~~d~i 145 (274) T protein:vir:96 76 TSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV--------EADI-TKLDGL 145 (274) T ss_pred cceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc--------Cccc-ccHHHH Confidence 8888888866 44457777544 4567888899999999999999999998876533211 1111 235778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++... ..|.++++|..++.+.++....+..... ....+|+|.||+ +.||+ ++.+.++|.. T Consensus 146 ~dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~-~~G~~-Vi~s~~~p~~------- 213 (274) T protein:vir:96 146 QTAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAV-IVRSNKLNKG------- 213 (274) T ss_pred HHHHHHhcccCC---CceEEEeCHHHHHHHHhcccccccccccccccceeecccce-ecCee-EEEcCCCCcc------- Confidence 899999998765 3477999999988876653222221111 123566777765 66765 5555444410 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 214 -------------------------------------------------------------------------------- 213 (274) T protein:vir:96 214 -------------------------------------------------------------------------------- 213 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) ..++||+.||.++...- T Consensus 214 ------------------------------------------t~~l~~~gA~~~~~~~~--------------------- 230 (274) T protein:vir:96 214 ------------------------------------------EALLAKKGAVKLITKRD--------------------- 230 (274) T ss_pred ------------------------------------------eEEEEeCcceeeeecCC--------------------- Confidence 02566777777654431 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++ ..++|.+......+.+..||++.++|+=. |.|.-..| T Consensus 231 ~~v--E~~Rd~~~~~d~i~~~~~yg~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:96 231 FFL--EKDRDASRKSTALYSDKHYVAYLYDESKV-VKITKGAG 270 (274) T ss_pred ccc--ccccchhhcccEEEEeeEEEEEEEcCccE-EEEEcCcc Confidence 111 12345666677778888999999999985 78877777 No 28 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.53 E-value=6.2e-16 Score=103.95 Aligned_cols=256 Identities=13% Similarity=0.081 Sum_probs=165.8 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc-c---cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA-P---TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~-~---~~~g~~~s~~~~di~ 71 (431) ||...=+|-...+ +.+++.++..+++++++..-+.|+ + ++||||+||+.... . ..+|.+. .++.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~--g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV--G-QPGDTLTFPAFIYSGDAKVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc--C-CCCCEEEeeeecCCCccccccCCCcc--chhhcc Confidence 9996655554443 667777999999999984344454 3 57999999976432 1 2223222 455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ..+..+++++ ..--|.+++.+ +...++..+.++++...||+++|.+++..++.....+.+ .. ..++.+ T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-------~~--~~~d~i 145 (274) T protein:vir:95 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-------DI--TKLTGL 145 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------cc--cCHHHH Confidence 8888888865 44457777644 346788999999999999999999999988875443221 11 136778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccc-cchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFG-RVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~-~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++... ..|.++++|..++.+.++....+... ......+|+|.||+ +.||+ ++++.++|. T Consensus 146 ~~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~~~-------- 212 (274) T protein:vir:95 146 QTAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAV-IVRSNKLEA-------- 212 (274) T ss_pred HHHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeE-EEEeCCCCC-------- Confidence 889999987653 45789999999988866532222221 11234567777775 66775 454433220 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:95 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ .++|++.||.+..... T Consensus 213 ---------------------------------------~t--~~l~~~gA~~~~~~~~--------------------- 230 (274) T protein:vir:95 213 ---------------------------------------GT--AILAKKGAVKLITKRD--------------------- 230 (274) T ss_pred ---------------------------------------ce--EEEEeccceeeeecCC--------------------- Confidence 00 1455666666543321 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++ ..++|++......+.+..||++.++|+=. |+|. |. T Consensus 231 ~~v--E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~-v~~t--k~ 268 (274) T protein:vir:95 231 FFL--ETDRDPSTKTTALYSDKHYVAYLYDESKA-VKIT--KG 268 (274) T ss_pred ccc--ccccccccccCEEEEeEEEEEEEEcCCcE-EEEE--cC Confidence 121 22336667777888899999999999985 5554 44 No 29 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.53 E-value=6.2e-16 Score=103.95 Aligned_cols=256 Identities=13% Similarity=0.081 Sum_probs=165.8 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc-c---cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA-P---TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~-~---~~~g~~~s~~~~di~ 71 (431) ||...=+|-...+ +.+++.++..+++++++..-+.|+ + ++||||+||+.... . ..+|.+. .++.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~--g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV--G-QPGDTLTFPAFIYSGDAKVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc--C-CCCCEEEeeeecCCCccccccCCCcc--chhhcc Confidence 9996655554443 667777999999999984344454 3 57999999976432 1 2223222 455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ..+..+++++ ..--|.+++.+ +...++..+.++++...||+++|.+++..++.....+.+ .. ..++.+ T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-------~~--~~~d~i 145 (274) T protein:vir:96 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-------DI--TKLTGL 145 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------cc--cCHHHH Confidence 8888888865 44457777644 346788999999999999999999999988875443221 11 136778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccc-cchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFG-RVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~-~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++... ..|.++++|..++.+.++....+... ......+|+|.||+ +.||+ ++++.++|. T Consensus 146 ~~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~~~-------- 212 (274) T protein:vir:96 146 QTAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAV-IVRSNKLEA-------- 212 (274) T ss_pred HHHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeE-EEEeCCCCC-------- Confidence 889999987653 45789999999988866532222221 11234567777775 66775 454433220 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:96 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ .++|++.||.+..... T Consensus 213 ---------------------------------------~t--~~l~~~gA~~~~~~~~--------------------- 230 (274) T protein:vir:96 213 ---------------------------------------GT--AILAKKGAVKLITKRD--------------------- 230 (274) T ss_pred ---------------------------------------ce--EEEEeccceeeeecCC--------------------- Confidence 00 1455666666543321 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +++ ..++|++......+.+..||++.++|+=. |+|. |. T Consensus 231 ~~v--E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~-v~~t--k~ 268 (274) T protein:vir:96 231 FFL--ETDRDPSTKTTALYSDKHYVAYLYDESKA-VKIT--KG 268 (274) T ss_pred ccc--ccccccccccCEEEEeEEEEEEEEcCCcE-EEEE--cC Confidence 121 22336667777888899999999999985 5554 44 No 30 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.51 E-value=1e-15 Score=102.82 Aligned_cols=258 Identities=12% Similarity=0.071 Sum_probs=165.4 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccc---cc-cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQE---AP-TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~---~~-~~~g~~~s~~~~di~ 71 (431) ||+..=+|-...+ +.+++++++.+++++++ -++++-++ +.|+||+||.-.. .. ..+|.+. .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~--~~d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccc--eecccccC-CCCCEEEEeeecCCCccccccCCCcc--cccccc Confidence 9997655544443 66777899999999999 56655554 5799999997321 11 2224333 455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) .....+++++.. --|.+++.+ +...+...+.++++...||+++|.+++..+......+.+ .. ..++.+ T Consensus 76 ~~~~~~~i~~~~-~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--------~~-~~~d~i 145 (274) T protein:vir:97 76 TKKREAKIRKIA-KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------DI-TKLNGL 145 (274) T ss_pred cceeEEEeeeec-ceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--------cc-cCHHHH Confidence 777788886643 347777644 456788888899999999999999999988765433211 11 135678 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++.+.. .|.++++|..++.+.++....+..... ....+++|.||+ +.||+ ++++.++|. T Consensus 146 ~dA~~~l~d~~~~---~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~-------- 212 (274) T protein:vir:97 146 QSAIDKFNDEDLE---PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA-------- 212 (274) T ss_pred HHHHHHhhccCCC---ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCc-------- Confidence 8899999987653 477999999998886543222222111 223456666665 55664 455444431 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:97 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ -++||+.||.+..... T Consensus 213 ---------------------------------------~t--~~l~~~gA~~~~~~~~--------------------- 230 (274) T protein:vir:97 213 ---------------------------------------GT--AILAKKGAVKLILKRD--------------------- 230 (274) T ss_pred ---------------------------------------ce--EEEEeCcceEeeecCC--------------------- Confidence 00 1455566665543221 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +. +..++|++......+.+..||++.++|+-. |.|.-..| T Consensus 231 ~~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:97 231 FF--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred ce--eccccchhhcccEEEEEEEEEEEEEcCCce-EEEecCcc Confidence 12 122336666677778888899999999985 67776666 No 31 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.51 E-value=1e-15 Score=102.82 Aligned_cols=258 Identities=12% Similarity=0.071 Sum_probs=165.4 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccc---cc-cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQE---AP-TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~---~~-~~~g~~~s~~~~di~ 71 (431) ||+..=+|-...+ +.+++++++.+++++++ -++++-++ +.|+||+||.-.. .. ..+|.+. .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~--~~d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFA--EVDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccc--eecccccC-CCCCEEEEeeecCCCccccccCCCcc--cccccc Confidence 9997655544443 66777899999999999 56655554 5799999997321 11 2224333 455677 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) .....+++++.. --|.+++.+ +...+...+.++++...||+++|.+++..+......+.+ .. ..++.+ T Consensus 76 ~~~~~~~i~~~~-~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--------~~-~~~d~i 145 (274) T protein:vir:94 76 TKKREAKIRKIA-KGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--------DI-TKLNGL 145 (274) T ss_pred cceeEEEeeeec-ceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--------cc-cCHHHH Confidence 777788886643 347777644 456788888899999999999999999988765433211 11 135678 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|.+.|++.+.. .|.++++|..++.+.++....+..... ....+++|.||+ +.||+ ++++.++|. T Consensus 146 ~dA~~~l~d~~~~---~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~-------- 212 (274) T protein:vir:94 146 QSAIDKFNDEDLE---PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA-------- 212 (274) T ss_pred HHHHHHhhccCCC---ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCc-------- Confidence 8899999987653 477999999998886543222222111 223456666665 55664 455444431 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:94 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ -++||+.||.+..... T Consensus 213 ---------------------------------------~t--~~l~~~gA~~~~~~~~--------------------- 230 (274) T protein:vir:94 213 ---------------------------------------GT--AILAKKGAVKLILKRD--------------------- 230 (274) T ss_pred ---------------------------------------ce--EEEEeCcceEeeecCC--------------------- Confidence 00 1455566665543221 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +. +..++|++......+.+..||++.++|+-. |.|.-..| T Consensus 231 ~~--vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:94 231 FF--LEVARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred ce--eccccchhhcccEEEEEEEEEEEEEcCCce-EEEecCcc Confidence 12 122336666677778888899999999985 67776666 No 32 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.51 E-value=5.4e-16 Score=104.29 Aligned_cols=260 Identities=17% Similarity=0.127 Sum_probs=151.2 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc---c-cccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA---P-TQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~---~-~~~g~~~s~~~~di~ 71 (431) ||+.+=++-...+ +-++++++..+++++++...+.++ + +.|+||+||+-... . ..+|.+. .++.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~--g-~~G~ti~iP~~~~~gda~~~~eg~~i--~~~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQ--G-QPGNTLKFPAFTYIGDAADVAEGGEI--SLDKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccc--c-CCCCEEEEeeeccCccccccCCCCcc--ChhhcC Confidence 9998766666543 566778999999999995444443 3 57999999974222 1 2334333 444566 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) .....+++.+ ..-.|.+++-+ +...+...+..+++...||+++|.+|+..+.....-+ +....++++ T Consensus 76 ~~~~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~----------~~~~~~d~i 144 (272) T protein:vir:36 76 TTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTV----------STKANVDGV 144 (272) T ss_pred CcceeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----------cccccHHHH Confidence 6666777744 33357777533 4567788888889999999999999998876533211 112346788 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGV 229 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~ 229 (431) .+|+..|.+...+ .|.+++||..++.+.++.-............+++|.||+ +.||+ ++.|.++|.-++ ....+ T Consensus 145 ~~A~~~lgd~~~~---~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~-~~G~~-Vv~s~~~p~~~~-~~~~~ 218 (272) T protein:vir:36 145 QAALDIFNDEDAQ---AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD-VLGAQ-IVRSKKLAEGSA-LMFKI 218 (272) T ss_pred HHHHHHhhhcCCC---ceEEEEcHHHHHHHhcccccccccccccccceeeeccce-ecCee-EEEeCCCCCCce-eEEEE Confidence 8999999998877 367999999999986654333333344456889999997 88997 789999985322 11111 Q ss_pred -eecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 230 -TVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 230 -tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) ...||-. +-. ..+.. -...|. -.+..|.|.- ++.|.+--.-...-++ T Consensus 219 ~~~~gA~~-----~~~-~~~~~-vE~~R~---------~~~~~d~i~~----------------~~~y~~~v~~~~~vv~ 266 (272) T protein:vir:36 219 VSNSPALK-----LVL-KRGVQ-VETDRD---------IVTKTTVITA----------------DEHYAAYLYDLTKVVN 266 (272) T ss_pred Eeccccee-----eee-cCCcc-cccccc---------hhhcCcEEEE----------------EEEEEEEEEcCccEEE Confidence 0122211 000 00000 000010 0112222221 1222221111122222 Q ss_pred Eeeccc Q lcl|NC_019501. 309 ITPKPI 314 (431) Q Consensus 309 I~Paii 314 (431) ++=+.+ T Consensus 267 ~t~~g~ 272 (272) T protein:vir:36 267 ITFTGV 272 (272) T ss_pred EeecCC Confidence 322333 No 33 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.46 E-value=6.9e-15 Score=98.23 Aligned_cols=298 Identities=15% Similarity=0.064 Sum_probs=169.0 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~~~s~~~~di~e~~V~v 77 (431) -+-...-+|+.+--||...|+..+++..++.. |. .+.|+++.+|.=-+..... |.. .+++.+.-..+.| T Consensus 15 ~~~~~al~le~f~geV~taf~~~s~~~~~~~~-rt-----i~~gkS~q~~~iG~~~~~~~~~G~~--ld~~~~~~~k~~i 86 (364) T protein:vir:10 15 SGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-QE-----VVGTNSVSNKYIGETELQVLSPGKS--PDASPTEFDKNRL 86 (364) T ss_pred ccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEeeeeeeeEEeeeccCcc--cCCCCcccCcEEE Confidence 12223334566668999999999999988853 33 5679999998764443322 322 2455666778889 Q ss_pred EeccccccceEeeH-HH-hhHHH-HHHhhhhHHHHHHHHHHHHHHHHhhccc--cceeecc-------C-------CCC- Q lcl|NC_019501. 78 NMGDPDNDFFELRA-DD-LRDER-SYRRRIQASAKKLANNIESAIAKQATEM--GSLVVHD-------T-------RAI- 137 (431) Q Consensus 78 ~ld~~k~v~f~lt~-ke-L~i~~-~s~~~L~~Am~~LAn~Id~dl~~~~~~~--~~~v~t~-------~-------~t~- 137 (431) ++|.-+-..+.+.+ +| +..-+ -...+-+.+..+||..+|+-++.+++.. .+..... + +++ T Consensus 87 tID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~i~~~~~a~ 166 (364) T protein:vir:10 87 VVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFSIHIVGLAS 166 (364) T ss_pred EecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcceeeecccCc Confidence 99887754444443 12 33333 2234446778999999999998766432 1111110 0 000 Q ss_pred -CCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhccccccchhhhhhHh Q lcl|NC_019501. 138 -GPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGTIQRQIAGFDEILR 214 (431) Q Consensus 138 -~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~ 214 (431) ....+......+-++.+.|++..||.++ |-++++|..+..+... +...++... .+..+++|.++. +.||. |++ T Consensus 167 ~~~~~~~~l~~ai~~a~~~LdEkdVP~~~-R~~vv~P~~y~~Ll~~~~lvn~d~~~~-~~~~~~~G~v~~-v~Gv~-Vv~ 242 (364) T protein:vir:10 167 SFLTSPQYMMAAIEMAMEQQTEQEVDTSE-LCGLMPWTAFNCLRDADRIVDKSYTIA-ASDNTVDGFVLK-SWNTP-IVP 242 (364) T ss_pred chhhhHHHHHHHHHHHHHHHhhcCCCccc-cEEEeChHHHHHHhcCCcccccccccc-CCCccccceeEE-EeceE-EEe Confidence 0111122334456789999999999988 6699999999887663 333333322 245689999985 99996 999 Q ss_pred cCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCc Q lcl|NC_019501. 215 SPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDA 294 (431) Q Consensus 215 s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq 294 (431) |.++|.... +++. .|... + =-++-+| ..+ T Consensus 243 Sn~lP~~~~-~~~~---t~~~t----~------------------------------h~ls~~~-------------~g~ 271 (364) T protein:vir:10 243 SNRFPKLSD-NTEG---TGNTK----H------------------------------HKLSNAG-------------NGN 271 (364) T ss_pred ccccccccc-cccc---ccccc----c------------------------------ccccccc-------------CCc Confidence 999986411 1000 00000 0 0000011 001 Q ss_pred eEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCc Q lcl|NC_019501. 295 TFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELF 374 (431) Q Consensus 295 ~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~ 374 (431) .|-|+++ -+.+.=++|||+|.. +.-+- T Consensus 272 ~y~v~~d---------------------------------------------~~~~~~~~f~~~Al~--tv~~~------ 298 (364) T protein:vir:10 272 RYDVTAG---------------------------------------------QTSAQAVLFTQDALL--VGRTI------ 298 (364) T ss_pred ccccccc---------------------------------------------cceeEEEEEecceEE--EEEEe------ Confidence 1222110 011224789998654 33331 Q ss_pred ceeeEEEeecCcceEEEEEEecccccccceEEEEE--eeccceecccceeEEecCCCCC Q lcl|NC_019501. 375 AGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIA--VWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 375 ~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~d--vlyG~~~v~PElagv~i~~q~~ 431 (431) ++.+++ +||... ..+-+| ..||...+|||.++++.+++.+ T Consensus 299 -------------~~t~e~--~~~~~~--~~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 299 -------------SITGDI--FYEKKE--KTWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred -------------cceeee--eeccce--eeeeeeeehcccCcccCccceEEEEecCCC Confidence 111111 122111 111222 3499999999999999999988 No 34 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.45 E-value=2e-15 Score=101.17 Aligned_cols=284 Identities=12% Similarity=0.004 Sum_probs=154.7 Q ss_pred Cccccccchh-------HHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccc-ccccCcccccc Q lcl|NC_019501. 1 MALNEGQLVT-------YALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGW-NLTGNATGILE 72 (431) Q Consensus 1 ~~~~~~~~lt-------~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~-~~s~~~~di~e 72 (431) |++-.|+-=| +--+++++.|++.|+..++++ +.+. +.||||.|+-..+.++++=. ......+++.. T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~--~~d~----g~GDtV~InsIg~~tV~dY~~~~~i~~d~ltt 74 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIAR--VVDF----PDGDKLTIPSVGTPVVRSRPEQGDFTFDNLDT 74 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhc--cccc----CCCCeEEeccccccccccccCCCCcccccCCC Confidence 9887776444 234899999999999888774 2222 46999999988877776631 22224556778 Q ss_pred ceeEEEeccccccceEeeHHHhh--HHHHHHhhhhHHHHHHHHHHHHHHHHhhccccc--------eeec-cC---CCCC Q lcl|NC_019501. 73 LSVKCNMGDPDNDFFELRADDLR--DERSYRRRIQASAKKLANNIESAIAKQATEMGS--------LVVH-DT---RAIG 138 (431) Q Consensus 73 ~~V~v~ld~~k~v~f~lt~keL~--i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~--------~v~t-~~---~t~~ 138 (431) ..+.++||+.|--.|.+++ |.. -.++...+.+.|..+||..+|..+.++++..+. ++.. .. -+++ T Consensus 75 ~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~g 153 (322) T protein:vir:31 75 GEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTG 153 (322) T ss_pred ceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccC Confidence 8899999999998899988 653 344555555677789999999999886664321 1111 00 1122 Q ss_pred CccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhh--hhhhhhcc-------ccchhhhHhhccccccchhh Q lcl|NC_019501. 139 PSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGR--NLVDGDIF-------GRVTEDAYRNGTIQRQIAGF 209 (431) Q Consensus 139 ~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~--~~~~~~~~-------~~~~~~a~r~g~i~r~~~Gf 209 (431) . .....|+.+-..+..|++..||+.+ |-+|++|.-+..+.+ ..+..... .+...+-+| -+|+ ++|| T Consensus 154 t-~~~~ay~~lv~l~~kLdkanVP~~g-R~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~-~~GF 228 (322) T protein:vir:31 154 T-DQTMDVTDFSRVNYVMTQSKMPMGG-MIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRS-VYGI 228 (322) T ss_pred C-CchhhHHHHHHHHHHhccccCCCCC-eEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHH-Hhce Confidence 2 2235789999999999999999987 668899988775522 11111111 111111222 2676 8899 Q ss_pred hhhHhcCCCcc--ccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccc Q lcl|NC_019501. 210 DEILRSPKLPA--VTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAK 287 (431) Q Consensus 210 d~~~~s~~v~~--~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk 287 (431) + +|.|.+++. ++-=.+....++++++..- .+.+.-.+ +-.++-+--++-| T Consensus 229 ~-V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~-----------------f~~~~~~~----------~~~~~~~~~~l~~ 280 (322) T protein:vir:31 229 D-LFVSNLLADANETINAGGDARSTTAGKCNM-----------------FMNVSDMG----------LLPFVVAWKEMPT 280 (322) T ss_pred e-eeeeccccccccccccCcccccccceeecc-----------------cccccchh----------hhhhhhHhhhhhh Confidence 7 899998852 1111111222223322110 00000000 0001101111111 Q ss_pred cccCCCce-----EEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEE Q lcl|NC_019501. 288 NVLTDDAT-----FSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNV 343 (431) Q Consensus 288 ~~~~~lq~-----fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv 343 (431) +.-..++. +++++...++-+ +|- ++-...|+.+.+|+ T Consensus 281 ~e~~r~~~~~~d~~~~~~~~g~g~~--r~e-----------------~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 281 TKSFIDDYNDDLNTATTARWGNGLV--RDE-----------------NLVCVLANADKVTF 322 (322) T ss_pred hhcccCccccccceeeeeeecceee--ccc-----------------ceEEEEeccccccC Confidence 11111111 333332222211 111 11222344455666 No 35 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.42 E-value=1.5e-15 Score=101.87 Aligned_cols=273 Identities=14% Similarity=0.120 Sum_probs=156.0 Q ss_pred cchhhccCCCchHHHhhcccEEEeecccccc--cc-cccccccCccccccceeEEEeccccccceEeeH--HHhhHHHHH Q lcl|NC_019501. 26 MASKVTKYTPPAESMQRSSNTVWMPVEQEAP--TQ-TGWNLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSY 100 (431) Q Consensus 26 ma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~--~~-~g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s 100 (431) |-|-+ +.|+++.+|.=-+.+ .. .|..+-.+++++......|++|+.+-..|.+.+ +-...-++. T Consensus 1 ~vr~i-----------~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr 69 (324) T protein:vir:99 1 MTRTI-----------TSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVR 69 (324) T ss_pred Ceeee-----------ecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccch Confidence 33333 579999988653332 22 255554566777788888999999988877775 335566788 Q ss_pred HhhhhHHHHHHHHHHHHHHHHhhccccc--------eeeccCC---------CCC-CccchhhhhhHHHHHHHHHhhcCC Q lcl|NC_019501. 101 RRRIQASAKKLANNIESAIAKQATEMGS--------LVVHDTR---------AIG-PSTGLSGWDFVSDAERLMFSRELN 162 (431) Q Consensus 101 ~~~L~~Am~~LAn~Id~dl~~~~~~~~~--------~v~t~~~---------t~~-~~~~~~~~~d~a~a~~~L~~~~vP 162 (431) ..+.+.+..+||..+|+.++.++..... ++...++ ... ...+...+..+-++++.|+++.|| T Consensus 70 ~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP 149 (324) T protein:vir:99 70 SEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIP 149 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCC Confidence 8888999999999999999877543321 1111111 100 111223466777899999999999 Q ss_pred ccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccceee Q lcl|NC_019501. 163 RDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAY 242 (431) Q Consensus 163 ~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~ 242 (431) .++ |-++++|..+..+... -............+++|.|+. ++||. +|+|.++|... ++... ++. T Consensus 150 ~~g-R~~vv~P~~y~~Ll~~-~~~~~~~~~~~~~~~~G~V~~-i~Gf~-V~~Sn~lp~~~-~t~~~----~a~------- 213 (324) T protein:vir:99 150 AGD-RTFYTDPDTYSAILAA-LMPNAANYAALIDPETGNIRN-VMGFE-VVETPHMTAQM-VTNPT----DAF------- 213 (324) T ss_pred CCC-CEEEeChHHHHHHhhc-ccccccccccccceecceEEE-EeceE-EEecCCccccc-ccccc----ccc------- Confidence 887 6699999999876432 222222222345699999986 89996 99999999632 11100 000 Q ss_pred eeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccccccccccc Q lcl|NC_019501. 243 TLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLT 322 (431) Q Consensus 243 ~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~ 322 (431) ..+ |=.++++| +. ..+. T Consensus 214 --------------------~~~-----~~~~~~~~----------------------~~-----~~~~----------- 230 (324) T protein:vir:99 214 --------------------DGT-----GHIFPATG----------------------DS-----TTTG----------- 230 (324) T ss_pred --------------------ccc-----cccccccc----------------------cc-----cccc----------- Confidence 000 00111111 00 0000 Q ss_pred ccccccceeecccccCceeEEeccCCcceeeeecccee-EEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccc Q lcl|NC_019501. 323 KEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSI-RLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINT 401 (431) Q Consensus 323 ~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~-aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~ 401 (431) .| -+-++.+.=|+||+++. +.-..++.+ +.+. ...++.| T Consensus 231 ----ky---------------~~d~~~~~gl~~~~~a~~tv~~~~~~~----------e~~~--------~~~~~~d--- 270 (324) T protein:vir:99 231 ----KM---------------TVGADNVVGLFVHRSAVATLKLKDMAL----------ERAR--------RPEYQAD--- 270 (324) T ss_pred ----cc---------------ccccCceeEEEEehhheEEEeeeccee----------ccee--------chhhHHH--- Confidence 00 00011223489999965 222222211 1111 1122332 Q ss_pred cceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 402 LSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 402 ~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ..+==..||++.+|||.++++-..-.+ T Consensus 271 ---~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 271 ---QIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred ---hhhhhhhhcCcccccceEEEEEEccCc Confidence 111125689999999999876643333 No 36 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.41 E-value=1.7e-14 Score=96.03 Aligned_cols=258 Identities=14% Similarity=0.074 Sum_probs=157.7 Q ss_pred Ccc-ccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccc----cccccccccCcccc Q lcl|NC_019501. 1 MAL-NEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAP----TQTGWNLTGNATGI 70 (431) Q Consensus 1 ~~~-~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~----~~~g~~~s~~~~di 70 (431) ||. ++=+|-...+ +-++++++..+++++++..-+.++ + ++|+||+||+....- ..+|.+. .++.+ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~--g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~l 75 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLV--G-QPGNTITFPAFVYSGDAKVVPEGEEI--PIDLI 75 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceeccccc--C-CCCCEEEeeeeccCCccccccCCCCc--chhhc Confidence 544 2333333332 567788999999999984344444 3 569999999764321 2233333 34456 Q ss_pred ccceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhh Q lcl|NC_019501. 71 LELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDF 148 (431) Q Consensus 71 ~e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d 148 (431) ..+...+++.+ ..--|.+++-+ +...+...+.++++...||+++|.+++........-+. ... ..++. T Consensus 76 t~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~--------~~~-~~~d~ 145 (275) T protein:vir:96 76 ETKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVE--------ADI-TKLAG 145 (275) T ss_pred ccceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------ccc-cCHHH Confidence 66666777744 44446677644 34578889999999999999999999988776433221 111 23677 Q ss_pred HHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhc-cccchhhhHhhccccccchhhhhhHhcCCCccccccccc Q lcl|NC_019501. 149 VSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDI-FGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT 227 (431) Q Consensus 149 ~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~-~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~ 227 (431) +.+|.+.|.+... ..|.++++|..++.+.++....+. ........+++|.||+ +.||+ ++.|.++|. T Consensus 146 i~dA~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~------- 213 (275) T protein:vir:96 146 LQTAIDKFNDEDL---EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGE-ALGAI-IVRSNKIKE------- 213 (275) T ss_pred HHHHHHHhccccC---CccEEEeCHHHHHHHHhcccccccccccccccceeccccce-ecCee-EEEeCCCCc------- Confidence 8889999987643 357799999999887554322211 1111223456666665 55664 444333320 Q ss_pred cceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCcee Q lcl|NC_019501. 228 GVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHI 307 (431) Q Consensus 228 ~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti 307 (431) T Consensus 214 -------------------------------------------------------------------------------- 213 (275) T protein:vir:96 214 -------------------------------------------------------------------------------- 213 (275) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcc Q lcl|NC_019501. 308 EITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGI 387 (431) Q Consensus 308 ~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~ 387 (431) .+ -++||+.||.+...+- T Consensus 214 ----------------------------------------~t--~~i~~~gA~~~~~~~~-------------------- 231 (275) T protein:vir:96 214 ----------------------------------------GE--AILAKRGAVKLITKRD-------------------- 231 (275) T ss_pred ----------------------------------------ce--EEEEeccceeeeecCC-------------------- Confidence 00 1556666776654331 Q ss_pred eEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 388 GVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 388 glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ++++ .+.|++......+-+.-||++.++|+-. |.|.=..+ T Consensus 232 -~~vE--~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 271 (275) T protein:vir:96 232 -FFLE--TERHASHKSTALFSDKHYVAYLYDESKV-VKITKSAS 271 (275) T ss_pred -cccc--cccchhhcCcEEEEeEEEEEEEEcCccE-EEEEeccc Confidence 1212 2336666777888999999999999975 45533333 No 37 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.34 E-value=1.7e-13 Score=90.59 Aligned_cols=256 Identities=14% Similarity=0.117 Sum_probs=152.0 Q ss_pred Ccccc---ccchhHH--HHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc----ccccccccccCccccc Q lcl|NC_019501. 1 MALNE---GQLVTYA--LDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA----PTQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~---~~~lt~~--~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~----~~~~g~~~s~~~~di~ 71 (431) ||+.. +.++.+- -+.+++.|++.+++.+++ -+++.-++ +.|++|.||+-... .+.+|... .++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~--~~~~~~~g-~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLA--EVDTTLEG-QPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccc--cccccccC-CCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 99544 4444332 266778899999999999 34443333 57999999974221 12334322 345677 Q ss_pred cceeEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ...+.+++.+.. .-|.+++.+. +..++..++.++..+.+++++|.++++.+......+. ....++++ T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~----------~~~t~d~i 144 (272) T protein:vir:98 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE----------ATATVDGV 144 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc----------cccCHHHH Confidence 778888886644 3478886553 4677888888999999999999999987765433211 11235778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|...|.+.+.+ .+.+++||..++.+............. ....+++|.+|+ +.||. ++.+.++| T Consensus 145 ~da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~-i~G~~-Vi~s~~~p--------- 210 (272) T protein:vir:98 145 SKALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGE-VLGVQ-IVRSRKCP--------- 210 (272) T ss_pred HHHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccchh-hcCee-EEEcCCCC--------- Confidence 8899999877644 367999999887763321111000000 001122222222 22221 11211111 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 211 -------------------------------------------------------------------------------- 210 (272) T protein:vir:98 211 -------------------------------------------------------------------------------- 210 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) . . ..++|++.||.++.+. + T Consensus 211 ---~-----------------------------------~--t~~~~~~~a~~~~~~~---------------------~ 229 (272) T protein:vir:98 211 ---K-----------------------------------G--TAYMVRKGALRIMLKR---------------------N 229 (272) T ss_pred ---c-----------------------------------c--eEEEEcCCeEEEEecC---------------------C Confidence 0 0 0266778877776532 1 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEec----CCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGL----PNQT 430 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i----~~q~ 430 (431) ++++ .+.|.+++....+...-||++.++|+-. |.+ +||| T Consensus 230 ~~ve--~~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a~~~ 272 (272) T protein:vir:98 230 TMVE--TDRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDAAKK 272 (272) T ss_pred ceee--eccccccceeEEEEEEEEEEEEEcCCce-EEEEecccccC Confidence 2222 2335566677778888899999999953 555 4444 No 38 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.34 E-value=1.7e-13 Score=90.59 Aligned_cols=256 Identities=14% Similarity=0.117 Sum_probs=152.0 Q ss_pred Ccccc---ccchhHH--HHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc----ccccccccccCccccc Q lcl|NC_019501. 1 MALNE---GQLVTYA--LDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA----PTQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~---~~~lt~~--~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~----~~~~g~~~s~~~~di~ 71 (431) ||+.. +.++.+- -+.+++.|++.+++.+++ -+++.-++ +.|++|.||+-... .+.+|... .++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~--~~~~~~~g-~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLA--EVDTTLEG-QPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccc--cccccccC-CCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 99544 4444332 266778899999999999 34443333 57999999974221 12334322 345677 Q ss_pred cceeEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ...+.+++.+.. .-|.+++.+. +..++..++.++..+.+++++|.++++.+......+. ....++++ T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~----------~~~t~d~i 144 (272) T protein:vir:30 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE----------ATATVDGV 144 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc----------cccCHHHH Confidence 778888886644 3478886553 4677888888999999999999999987765433211 11235778 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc-hhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV-TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~-~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|...|.+.+.+ .+.+++||..++.+............. ....+++|.+|+ +.||. ++.+.++| T Consensus 145 ~da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~-i~G~~-Vi~s~~~p--------- 210 (272) T protein:vir:30 145 SKALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGE-VLGVQ-IVRSRKCP--------- 210 (272) T ss_pred HHHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccchh-hcCee-EEEcCCCC--------- Confidence 8899999877644 367999999887763321111000000 001122222222 22221 11211111 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 211 -------------------------------------------------------------------------------- 210 (272) T protein:vir:30 211 -------------------------------------------------------------------------------- 210 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) . . ..++|++.||.++.+. + T Consensus 211 ---~-----------------------------------~--t~~~~~~~a~~~~~~~---------------------~ 229 (272) T protein:vir:30 211 ---K-----------------------------------G--TAYMVRKGALRIMLKR---------------------N 229 (272) T ss_pred ---c-----------------------------------c--eEEEEcCCeEEEEecC---------------------C Confidence 0 0 0266778877776532 1 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEec----CCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGL----PNQT 430 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i----~~q~ 430 (431) ++++ .+.|.+++....+...-||++.++|+-. |.+ +||| T Consensus 230 ~~ve--~~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a~~~ 272 (272) T protein:vir:30 230 TMVE--TDRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDAAKK 272 (272) T ss_pred ceee--eccccccceeEEEEEEEEEEEEEcCCce-EEEEecccccC Confidence 2222 2335566677778888899999999953 555 4444 No 39 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.25 E-value=6.5e-13 Score=87.38 Aligned_cols=259 Identities=12% Similarity=0.067 Sum_probs=155.4 Q ss_pred CccccccchhHHH-----HHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccc----ccccccccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYAL-----DEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEA----PTQTGWNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~-----~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~----~~~~g~~~s~~~~di~ 71 (431) ||...=+|-...+ +-++++++..+++++++ -.+.+=++ +.|+||+||.-... ...+|.+. .++.+. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~--~~~~~l~g-~~G~ti~iP~~~~igda~~~~eg~~i--~~~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFA--DIDSTLVG-QPGDTLTFPAFVYSGDATVVPEGQKI--PVDKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccc--eecccccC-CCCCEEEeeeecCCCccccccCCCcc--Cccccc Confidence 9987655554443 55667799999999999 45544343 57999999964222 13344333 344566 Q ss_pred cceeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) .++..+++.+ ..--|.+++-+ +...+...+.+++....||+++|.+++...+.....+.+ .. ..+..+ T Consensus 76 ~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~-------~~--~t~d~i 145 (276) T protein:vir:10 76 TNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSA-------DI--GTLAGL 145 (276) T ss_pred cceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------cc--cCHHHH Confidence 6666667633 44446666544 346788888999999999999999999888764433211 11 125678 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhcc-ccchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIF-GRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~-~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+|...|.+... ..+.++++|..++.+.++....+.. .......+++|+||. +.||+ ++.+.++|. T Consensus 146 ~~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~-------- 212 (276) T protein:vir:10 146 EAAIDTFDDEDL---EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGE-ALGAV-IVRSKKLDE-------- 212 (276) T ss_pred HHHHHHhccccC---cccEEEEcHHHHHHHHHhccccccccccccccceeccccce-eccee-EEEcCCCCc-------- Confidence 889999988754 3477899999998875432111111 111111233444433 33432 222221110 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) T Consensus 213 -------------------------------------------------------------------------------- 212 (276) T protein:vir:10 213 -------------------------------------------------------------------------------- 212 (276) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcce Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIG 388 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~g 388 (431) .+ -++|++.||.+..... T Consensus 213 ---------------------------------------~t--~~l~~~gAi~~~~~~~--------------------- 230 (276) T protein:vir:10 213 ---------------------------------------GE--AILAKRGAVKLITKRD--------------------- 230 (276) T ss_pred ---------------------------------------ce--EEEEeccceeeeecCC--------------------- Confidence 00 2567777777654321 Q ss_pred EEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 389 VNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 lslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +. +..+.|++......+-+.-||++.++|+-..++=.+-.. T Consensus 231 ~~--vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 231 FF--LETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred ce--eecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 22 223447777788888999999999999975333222222 No 40 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.20 E-value=3.1e-13 Score=89.13 Aligned_cols=290 Identities=13% Similarity=0.096 Sum_probs=165.3 Q ss_pred Cccccc----------------cchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---cc Q lcl|NC_019501. 1 MALNEG----------------QLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GW 61 (431) Q Consensus 1 ~~~~~~----------------~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~ 61 (431) |+.--+ -+|+.+--||...|+..+++..++.. |. .+.|+|+.||.=-+.+... |. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~-r~-----i~~G~s~~~~~iG~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNV-RS-----LRGTNQLRVDRVGASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcccee-ee-----ccccceEEEeeecceeeeeecCCC Confidence 443322 23344447888889999999999953 43 4779999999654444322 33 Q ss_pred ccccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeec------- Q lcl|NC_019501. 62 NLTGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVH------- 132 (431) Q Consensus 62 ~~s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t------- 132 (431) .. ..+.+.-..+.|++|+.+-..|.+.+ +-+..-|+...+-+.+..+||...|+.++.+...-..+.-- T Consensus 75 ~l--~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~ 152 (334) T protein:vir:80 75 EL--VVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAF 152 (334) T ss_pred CC--CCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 33 33456678889999998887777764 22445566667778889999999999887554432221100 Q ss_pred -cC---CCCCCccch-------hhhhhHHHHHHHHHhhcCCcc--CCcEEEeChHHHhhhhhh--hhhhhccccchhhhH Q lcl|NC_019501. 133 -DT---RAIGPSTGL-------SGWDFVSDAERLMFSRELNRD--MGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAY 197 (431) Q Consensus 133 -~~---~t~~~~~~~-------~~~~d~a~a~~~L~~~~vP~~--~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~ 197 (431) .+ ..+.+++.. -....+-.|++.|++..+|.. ..|-++++|..+..+... +...++.......-+ T Consensus 153 ~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~ 232 (334) T protein:vir:80 153 HDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSF 232 (334) T ss_pred cCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccc Confidence 00 011111111 112344579999999999952 247799999999887654 333333222223458 Q ss_pred hhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc Q lcl|NC_019501. 198 RNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT 277 (431) Q Consensus 198 r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia 277 (431) ..|.+++ ++||. ||++.++|... .+... . | T Consensus 233 ~~g~i~~-v~G~~-V~~Sn~~P~~~-~t~~~------------------------------------~-----g------ 262 (334) T protein:vir:80 233 VGGRIAM-LNGVR-VVETPRFPQSA-ITANA------------------------------------L-----G------ 262 (334) T ss_pred cceeEEE-EeceE-EEeecCCCCcc-ccccc------------------------------------c-----c------ Confidence 8999986 88996 89999988421 00000 0 0 Q ss_pred ceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeecc Q lcl|NC_019501. 278 GVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWAD 357 (431) Q Consensus 278 GV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr 357 (431) |. |-+++ . -.+.++=++||+ T Consensus 263 ~~----------------~~~~a---------------------------------g-----------d~t~~~~~~~~~ 282 (334) T protein:vir:80 263 AD----------------FNVTD---------------------------------A-----------EVRRKMITFIPS 282 (334) T ss_pred cc----------------ccccc---------------------------------c-----------cccceEEEEEeC Confidence 00 00000 0 001123489999 Q ss_pred ceeEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 358 DSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 358 ~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) +|+.-+-..- + ..+... ...++.|.-.- =..||.+.+|||-++|+-.--+- T Consensus 283 ~Al~t~~~~~-~--------~~e~~~--------~~~~~~d~i~~------~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 283 MALISAQVHP-V--------SAQFWE--------EKKDFGHYLDT------FQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred ceEEEEEEee-c--------ceeeee--------chhhHHHHHHH------HHHcCCceeccceEEEEEEeeec Confidence 9876443211 0 011111 11122221111 15799999999998876432222 No 41 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.18 E-value=1.1e-12 Score=86.11 Aligned_cols=288 Identities=13% Similarity=0.116 Sum_probs=166.6 Q ss_pred Cc--------------cccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---cccc Q lcl|NC_019501. 1 MA--------------LNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GWNL 63 (431) Q Consensus 1 ~~--------------~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~~~ 63 (431) |- .-..-+|+.+--||+..|+..++++.++.. |. .|.|+++.+|.=-+..... |..+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-rt-----i~~g~s~~~~~iG~~~~~~~~pG~~l 74 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-RD-----LRGSNVVRLDRLGNVEAKGRRAGEEL 74 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccce-ee-----eccceeEEEeeeeeeeecccccCccc Confidence 21 112234555557999999999999999953 32 5789999999765554332 4344 Q ss_pred ccCccccccceeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceee--c------c Q lcl|NC_019501. 64 TGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVV--H------D 133 (431) Q Consensus 64 s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~--t------~ 133 (431) .+++ +......|++|.-+-..+.+.+ +-+...|....+-+....+||...|+-++.+...-..+.- + . T Consensus 75 ~~~~--~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 75 ERSR--VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred CCCC--cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 3443 5667888999987766665553 2244445556677888999999999988744333221100 0 0 Q ss_pred C--------CCCCCccchhhhhhHHHHHHHHHhhcCCccC--CcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhcc Q lcl|NC_019501. 134 T--------RAIGPSTGLSGWDFVSDAERLMFSRELNRDM--GISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGT 201 (431) Q Consensus 134 ~--------~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~--~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~ 201 (431) + +......+......+..+.+.|++.-+|..+ .|-++++|..+..+... +...++.......-+++|. T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~ 232 (335) T protein:vir:78 153 GVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSR 232 (335) T ss_pred CcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccce Confidence 0 0111112223456677789999999999643 47799999999887554 3333222222234588999 Q ss_pred ccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceee Q lcl|NC_019501. 202 IQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKF 281 (431) Q Consensus 202 i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~ 281 (431) +++ ++||. ++++.++|... +++ +.. |..+ T Consensus 233 v~~-v~Gv~-V~~Sn~lP~~~-~t~----------------------~~l-------------------g~a~------- 261 (335) T protein:vir:78 233 VAI-LNGVK-VLETPRFATKA-ISA----------------------HPL-------------------GRHF------- 261 (335) T ss_pred eEE-eeceE-EEeeccCCCCC-Ccc----------------------ccc-------------------cccC------- Confidence 875 88996 89999998421 110 000 0000 Q ss_pred eccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeE Q lcl|NC_019501. 282 LSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIR 361 (431) Q Consensus 282 vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~a 361 (431) |+++- -.+.++=++||++|.. T Consensus 262 ------------------------------------------------n~~~~-----------d~~~~~~~~~~~~Al~ 282 (335) T protein:vir:78 262 ------------------------------------------------NVSAE-----------EAERQIALFLPSKTLI 282 (335) T ss_pred ------------------------------------------------Ccccc-----------cccceEEEEEecceEE Confidence 00000 0011234899999876 Q ss_pred EEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEE--eeccceecccceeEEe-cCCCCC Q lcl|NC_019501. 362 LLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIA--VWYSACAVRPEAIGVG-LPNQTA 431 (431) Q Consensus 362 Lat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~d--vlyG~~~v~PElagv~-i~~q~~ 431 (431) -+-.-- +..++ +||.... .+-|| ..||...+|||.+++. +-|--| T Consensus 283 t~~~~~---------------------~~~e~--~~~~~~~--~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:78 283 TAQVAP---------------------VQAKL--WEDHDQF--SWVLDTFQMYNIGARRPDTAGAIELKGIEA 330 (335) T ss_pred EEEEEe---------------------cccce--eeccchh--hHhhhHHHHcCCcccCcceEEEEEecCCCc Confidence 552110 11111 2222211 11232 3599999999998643 445444 No 42 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.08 E-value=1.5e-12 Score=85.43 Aligned_cols=225 Identities=19% Similarity=0.157 Sum_probs=125.5 Q ss_pred HHHhhcccEEEeecccccc---cccccccccCccccccceeEEEeccccccceEeeHHH-hh-HHHHHHhhhhHHHHHHH Q lcl|NC_019501. 38 ESMQRSSNTVWMPVEQEAP---TQTGWNLTGNATGILELSVKCNMGDPDNDFFELRADD-LR-DERSYRRRIQASAKKLA 112 (431) Q Consensus 38 ~e~~k~GdTv~ip~p~~~~---~~~g~~~s~~~~di~e~~V~v~ld~~k~v~f~lt~ke-L~-i~~~s~~~L~~Am~~LA 112 (431) .-+.+.||||.+|+= ... ..+|... .+..+.-.+...++.+ ..-.|++++.+ |+ ..++....-++....|| T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i--~~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEI--SLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcC--ChhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 457889999999952 111 2334333 2334555556666633 34458888766 33 56777777788889999 Q ss_pred HHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccc Q lcl|NC_019501. 113 NNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRV 192 (431) Q Consensus 113 n~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~ 192 (431) ++||.|++...+...-- . .+. ..++.+.+|...|.+... ..+-++++|-+.+.|.+........... T Consensus 77 ~kvD~di~~~~~~a~l~-~--------~~~-~t~d~i~~A~~~fgde~~---~~~vivv~p~~~~~Lrk~~~~~~~~~~~ 143 (231) T protein:vir:73 77 NKVDDDLLKAAKTTSQT-V--------STK-ANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) T ss_pred HhhhHHHHHhhcccccc-c--------ccc-ccHHHHHHHHHHhccccc---cceEEEEcchHHHhhhhccchhhhhhhh Confidence 99999999877753211 0 111 235678888899987653 3467889999999886644332223333 Q ss_pred hhhhHhhccccccchhhhhhHhcCCCccccccccccc-eecccccccceeeeeecccccccccceeeEEEeeccceeeec Q lcl|NC_019501. 193 TEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGV-TVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRG 271 (431) Q Consensus 193 ~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~-tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaG 271 (431) ....+++|.||+ +.|++ ++.|.++|.-++ ....+ ...||-... ..-+... ...| --.++- T Consensus 144 g~~i~~~G~iG~-i~G~~-Vi~S~~~~~~~~-~~~~~i~~~gAl~~~------~k~~~~v-EtdR---------d~~~k~ 204 (231) T protein:vir:73 144 GANALINGTYAD-VLGAQ-IVRSKKLAEGSA-LMFKIVSNSPALKLV------LKRGVQV-ETDR---------DIVTKT 204 (231) T ss_pred ccceeeecccce-EcceE-EEEcCCCCCCce-eeeeEEeeccceeee------eccccee-eccc---------cccccc Confidence 456789999997 88997 899999884221 11111 112332100 0000000 0000 011222 Q ss_pred cEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeeccc Q lcl|NC_019501. 272 DKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPI 314 (431) Q Consensus 272 Dv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii 314 (431) |.++. ++.|.|--.-...-+.|+=+++ T Consensus 205 ~~i~~----------------~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 205 TVITA----------------DEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cEEEE----------------eEEEEEEEEcCccEEEEEeecC Confidence 33332 2334443322334444444444 No 43 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.06 E-value=1.6e-12 Score=85.26 Aligned_cols=292 Identities=14% Similarity=0.013 Sum_probs=163.2 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccc--cc-ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPT--QT-GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~--~~-g~~~s~~~~di~e~~V~v 77 (431) -+-...-+|+.+--||...|+..+++.+++.. |. .+.|+++.+|.=-+.+. .+ |... +++.+.-..+.| T Consensus 15 s~~~~al~le~f~geV~taF~~~si~~~~~~v-rt-----i~~GkS~qf~~iG~~~a~y~~~G~~l--dg~~~~~~k~~I 86 (402) T protein:vir:97 15 SGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP--NATPTQADKNQL 86 (402) T ss_pred ccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEEEEEeeeEEeeecccccc--CCCCcccccEEE Confidence 12223334566668999999999999988843 32 56799999887633332 22 3222 344555667778 Q ss_pred EeccccccceEeeH-HH-hhHHHH-HHhhhhHHHHHHHHHHHHHHHHhhccccc-----------eeeccCCCCC----- Q lcl|NC_019501. 78 NMGDPDNDFFELRA-DD-LRDERS-YRRRIQASAKKLANNIESAIAKQATEMGS-----------LVVHDTRAIG----- 138 (431) Q Consensus 78 ~ld~~k~v~f~lt~-ke-L~i~~~-s~~~L~~Am~~LAn~Id~dl~~~~~~~~~-----------~v~t~~~t~~----- 138 (431) ++|.-.-..+.+.+ +| +..-|. -..+-+...++||...|+-++.+.+.... .++..+.... T Consensus 87 tID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~ 166 (402) T protein:vir:97 87 VIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTES 166 (402) T ss_pred EeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccccccccccccc Confidence 88876543333332 11 333332 22344677899999999999876643221 1111000000 Q ss_pred --CccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhccccccchhhhhhHh Q lcl|NC_019501. 139 --PSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGTIQRQIAGFDEILR 214 (431) Q Consensus 139 --~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~ 214 (431) ...+......+-++.+.|++..||.++ |.++++|..+..+... |...++... ....+++|.++. +.||. +|+ T Consensus 167 ~a~~~~~~l~~ai~~a~~~LdEkdVP~~d-Rv~vv~P~~y~~Ll~~~rl~n~d~~~~-~~g~~~~G~v~~-v~Gv~-Vv~ 242 (402) T protein:vir:97 167 EALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDADRIVDKTYTIS-QSGATINGFVLS-SYNCP-VIP 242 (402) T ss_pred hhhcCHHHHHHHHHHHHHHHHhcCCCccc-cEEEeChHHHHHHhhcccccchhhccc-cCCccccceeEE-EeceE-EEe Confidence 112222345566789999999999988 6688999999887653 443444322 234689999875 89996 899 Q ss_pred cCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCc Q lcl|NC_019501. 215 SPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDA 294 (431) Q Consensus 215 s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq 294 (431) +.++|... +.+ .+. .+ +-+| ..+ T Consensus 243 SnnlP~~a----~~i--t~~--------~l------------------------------s~a~-------------~G~ 265 (402) T protein:vir:97 243 SNRFPTFA----QDQ--AHH--------LL------------------------------SNED-------------NGY 265 (402) T ss_pred cCcccccc----ccc--ccc--------cc------------------------------ccCC-------------CCc Confidence 99999521 000 000 00 0000 001 Q ss_pred eEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCc Q lcl|NC_019501. 295 TFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELF 374 (431) Q Consensus 295 ~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~ 374 (431) -|-|++ --+.+.=++|||+|.. +.-+ T Consensus 266 ~y~~t~---------------------------------------------d~t~~~~~~f~~~Av~--tvk~------- 291 (402) T protein:vir:97 266 RYDPIA---------------------------------------------EMNGAVAVLFTSDALL--VGRT------- 291 (402) T ss_pred cCCcCc---------------------------------------------ccceeEEEEEecceEE--EEEe------- Confidence 111111 0011234889997432 3222 Q ss_pred ceeeEEEeecCcceEEEEEEecccccccceEEEEE--eeccceecccceeEEecCCCCC Q lcl|NC_019501. 375 AGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIA--VWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 375 ~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~d--vlyG~~~v~PElagv~i~~q~~ 431 (431) ..|..-.+||.... .+-|| ..||...+|||-+||+..-.-+ T Consensus 292 --------------~~vT~~~~~d~r~~--~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (402) T protein:vir:97 292 --------------IEVTGDIFYEKKEK--TYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) T ss_pred --------------eccccchhhchhHH--HHHHHHHHHhCCcccCccceEEEEEeccc Confidence 11122234443321 11244 4689999999999999554422 No 44 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.88 E-value=2.6e-11 Score=78.65 Aligned_cols=291 Identities=14% Similarity=0.019 Sum_probs=157.2 Q ss_pred Cc--------------cccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccc---ccccc Q lcl|NC_019501. 1 MA--------------LNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQ---TGWNL 63 (431) Q Consensus 1 ~~--------------~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~---~g~~~ 63 (431) |. --..-+|+.+--||...|+..++|..++.. |. .+.|+++.+|.=-+.+.. .|..+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~v-Rt-----i~~gkS~qf~~~G~s~~~~~~pG~~l 74 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhccccee-ee-----ecccceEEEEEeeeeEeeeecCCCCc Confidence 22 122234555557888889999998888852 32 677999988866232222 23333 Q ss_pred ccCccccccceeEEEeccccccceEeeH--HHhhHHHH-HHhhhhHHHHHHHHHHHHHHHHhhcccc-----------ce Q lcl|NC_019501. 64 TGNATGILELSVKCNMGDPDNDFFELRA--DDLRDERS-YRRRIQASAKKLANNIESAIAKQATEMG-----------SL 129 (431) Q Consensus 64 s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~~-s~~~L~~Am~~LAn~Id~dl~~~~~~~~-----------~~ 129 (431) ..+.+.-.++.|++|.-.-..+.+.+ +-++.-|. -..+-+...++||...|+-++.+.+... .. T Consensus 75 --d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~ 152 (401) T protein:vir:70 75 --AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRV 152 (401) T ss_pred --CCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 33345566777888877655444442 11222221 1233456678999999998877664311 00 Q ss_pred eecc-----CC--CCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEe-ChHHHhhhhhh--hhhhhccccchhhhHhh Q lcl|NC_019501. 130 VVHD-----TR--AIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFL-NPDDYRKAGRN--LVDGDIFGRVTEDAYRN 199 (431) Q Consensus 130 v~t~-----~~--t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~-np~~~a~~~~~--~~~~~~~~~~~~~a~r~ 199 (431) .+.. ++ ......+......+-++...|++..||. +++ +++ +|+-|+.+... +...++... .+..+.+ T Consensus 153 ~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~-~r~-vvl~pp~~Ys~Ll~~d~L~nrd~~~s-~~g~~~~ 229 (401) T protein:vir:70 153 KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDI-SDV-AILMPWRYFNVLRDADRIVDKTYTIS-QSGATIQ 229 (401) T ss_pred CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCc-cce-EEEcCHHHHHHHHhcCcccchhhccc-cCCcccc Confidence 0000 00 1111122224455678999999999995 444 555 55555555332 333333322 2346889 Q ss_pred ccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcce Q lcl|NC_019501. 200 GTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGV 279 (431) Q Consensus 200 g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV 279 (431) |.+.. ++||. ++++.++|.... ++. |..++-+| T Consensus 230 G~v~~-vaGv~-Vv~SnnlP~~a~------~it--------------------------------------~~~ls~a~- 262 (401) T protein:vir:70 230 GFTLS-SYNCP-VIPSNRFPKYSQ------GQT--------------------------------------HHLLSNED- 262 (401) T ss_pred ceEEE-EeceE-EEeecccccccc------ccc--------------------------------------cccccccC- Confidence 99875 89996 899999985210 000 00011111 Q ss_pred eeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccce Q lcl|NC_019501. 280 KFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDS 359 (431) Q Consensus 280 ~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A 359 (431) ...-|-+++ .-+.+.=++|||+| T Consensus 263 ------------~G~~y~~~~---------------------------------------------d~s~~~~v~f~~~A 285 (401) T protein:vir:70 263 ------------NGYRYDPLP---------------------------------------------AMNGAIAVLFTADA 285 (401) T ss_pred ------------CCccCCCCc---------------------------------------------cccceeEEEEehhh Confidence 001111111 11112348999995 Q ss_pred eEEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEe--eccceecccceeEEecCCCCC Q lcl|NC_019501. 360 IRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAV--WYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 360 ~aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dv--lyG~~~v~PElagv~i~~q~~ 431 (431) ..- .-+ +.|....+||.+ ...+-||. .||...+|||.++|+...-+. T Consensus 286 v~t--vk~---------------------~~lt~~~~~d~r--~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 286 LLV--GRS---------------------IDVTGDIFYEKK--EKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred eEE--EEe---------------------eccccchhhhhh--hhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 322 111 122233355543 22334554 689999999999998777664 No 45 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=98.83 E-value=4.4e-11 Score=77.34 Aligned_cols=203 Identities=15% Similarity=0.072 Sum_probs=110.0 Q ss_pred eccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeecc-------CCCC---CCccchhhh Q lcl|NC_019501. 79 MGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHD-------TRAI---GPSTGLSGW 146 (431) Q Consensus 79 ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~-------~~t~---~~~~~~~~~ 146 (431) +|....-.|.+.+ +....-++...+.+.+..+||.++|+.++.++.......... .... ..+++...+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 5555554555553 335677777888889999999999999999888654332110 1111 112233344 Q ss_pred hhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhh-hhccccchh-hhHhhc-cccccchhhhhhHhcCCCccccc Q lcl|NC_019501. 147 DFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVD-GDIFGRVTE-DAYRNG-TIQRQIAGFDEILRSPKLPAVTK 223 (431) Q Consensus 147 ~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~-~~~~~~~~~-~a~r~g-~i~r~~~Gfd~~~~s~~v~~~t~ 223 (431) +.+-++++.|++..||.++ |-++++|..+..+....-. ..+.+...+ .-+++| .+++ ++||+ +|+|.++|..+ T Consensus 81 dai~~a~~~LdekdVP~~g-R~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~-v~G~~-V~~SnnlP~~~- 156 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDG-RVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYV-NAGIR-IYKSNVLASLY- 156 (221) T ss_pred HHHHHHHHHHhhcCCCCCC-CEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeee-ecCcE-EEEeccCCccc- Confidence 7788899999999999977 5588999877776542111 111111112 237888 4765 88996 99999999632 Q ss_pred cccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeec Q lcl|NC_019501. 224 STATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVID 303 (431) Q Consensus 224 gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~ 303 (431) |+ .+. ..+|+ |.+.+ T Consensus 157 gt--~~~-------------------------------------~~ag~-~~~~~------------------------- 171 (221) T protein:vir:17 157 GT--NLV-------------------------------------TDPGD-ATTSG------------------------- 171 (221) T ss_pred cc--ccc-------------------------------------cCCcc-ccccc------------------------- Confidence 11 000 00111 11111 Q ss_pred CceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEE--eecccCCCCCcceee-EE Q lcl|NC_019501. 304 STHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLL--SQPIPVTHELFAGMK-TS 380 (431) Q Consensus 304 a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLa--t~pl~~p~g~~~a~~-~~ 380 (431) +. ...| .. ..+.+.-|+|||+|..-+ |.|+..|+= +. +. T Consensus 172 -~~------------------~~~y-----------r~----~fs~~~glv~~~~Avgtvkl~~~~~~~~~----~~~~~ 213 (221) T protein:vir:17 172 -EN------------------NGSY-----------RP----AITDRAGLVFHKEAADTVEVLLPPSRPPL----VISMF 213 (221) T ss_pred -cc------------------cccc-----------cc----cccceEEEEEcchheeeeeeecCCCCCce----eeeee Confidence 00 0001 00 011122499999986532 444444431 21 11 Q ss_pred EeecCcce Q lcl|NC_019501. 381 SFSIPGIG 388 (431) Q Consensus 381 t~~~~g~g 388 (431) +...|..- T Consensus 214 ~~~~~~~~ 221 (221) T protein:vir:17 214 SIRRPDRR 221 (221) T ss_pred eccCCCCC Confidence 11212100 No 46 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.83 E-value=7.3e-10 Score=70.68 Aligned_cols=287 Identities=13% Similarity=0.087 Sum_probs=154.4 Q ss_pred CccccccchhHH------H-HHHHHHHHhhcccc--hhhccCCCchH--HHhhcccEEEeeccccccccc-ccc------ Q lcl|NC_019501. 1 MALNEGQLVTYA------L-DEIIETVQNLTPMA--SKVTKYTPPAE--SMQRSSNTVWMPVEQEAPTQT-GWN------ 62 (431) Q Consensus 1 ~~~~~~~~lt~~------~-~evi~~len~lvma--~~V~~~r~y~~--e~~k~GdTv~ip~p~~~~~~~-g~~------ 62 (431) |+| |.+++-+ | +--|+.|..+..|+ +.=++.|++-+ ...+.++++..+....+.+.. +.. T Consensus 1 ~~~--~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKL--NAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred Ccc--cceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccC Confidence 443 2222210 1 23345555555554 11134455422 134556777766654443221 111 Q ss_pred cc-cCcc-ccccceeEEEeccccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCC-- Q lcl|NC_019501. 63 LT-GNAT-GILELSVKCNMGDPDNDFFELRADDL--RDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRA-- 136 (431) Q Consensus 63 ~s-~~~~-di~e~~V~v~ld~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t-- 136 (431) .. .-|. ++-.....+.++.+ ...+.+.+-|+ ...|.-..|.+.+..+|+.+.|.-|+..+....+ .++.++. T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~~gt~v~ 156 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKGTGQPVE 156 (322) T ss_pred cccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccccccccc Confidence 11 0111 22245555655444 55677776554 2566777788899999999999988876655442 2222111 Q ss_pred -------CCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhh--hhhhhhccccchhhhH-hhccccccc Q lcl|NC_019501. 137 -------IGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGR--NLVDGDIFGRVTEDAY-RNGTIQRQI 206 (431) Q Consensus 137 -------~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~--~~~~~~~~~~~~~~a~-r~g~i~r~~ 206 (431) +..+++ ..|..+-.|++.|+++.||.++.|-++++|..+..+.. .++..++. ..+++ ++|.+++ + T Consensus 157 ~~ss~~i~~g~~g-~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~---~~~~l~~~G~ig~-~ 231 (322) T protein:vir:10 157 FLATQEIGDGTKP-ISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYT---SAMDLQSKGIITN-W 231 (322) T ss_pred cCCCcccccCccc-hhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcc---cchhhhhcCeeee-e Confidence 111122 23678889999999999998887889999998888654 33333333 34565 7899986 8 Q ss_pred hhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeecccc Q lcl|NC_019501. 207 AGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMA 286 (431) Q Consensus 207 ~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~t 286 (431) +||. |+.+.++|... . +.+. + -+++ T Consensus 232 lGf~-~i~s~~lp~~~-----------~-----t~~~-------------------~-----------~~~~-------- 256 (322) T protein:vir:10 232 MGYT-WIVSTRLDKFD-----------P-----TQWG-------------------M-----------AAED-------- 256 (322) T ss_pred eeEE-EEEeccCCccc-----------c-----cccc-------------------c-----------cccC-------- Confidence 9997 57888888421 0 0000 0 0000 Q ss_pred ccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeec Q lcl|NC_019501. 287 KNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQP 366 (431) Q Consensus 287 k~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~p 366 (431) . + ....+.=++|||+|+.++..- T Consensus 257 ---------------~---------~---------------------------------~~~~~~~~a~~k~Av~~a~~~ 279 (322) T protein:vir:10 257 ---------------G---------P---------------------------------QGDEIWCIAMTDMALGYHSCK 279 (322) T ss_pred ---------------C---------C---------------------------------CccceeEEEEecCceeEEEee Confidence 0 0 000111389999999998642 Q ss_pred ccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 367 IPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 367 l~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) = +.+....+|+...... ++--..||++.++||-. |.|-=-.+ T Consensus 280 d---------v~~~i~~~~~~~~a~~-------------I~~~~~~Ga~ri~~~gV-v~i~~~e~ 321 (322) T protein:vir:10 280 D---------IWTKVAEDPSASFAWR-------------IYSAFTADCVRVEDEHI-FKLRLKNS 321 (322) T ss_pred e---------eeEEeeccCCcchhhh-------------hhhhhhhCceEeccCcE-EEEEEecc Confidence 1 1111122222211111 12345678888888875 55544444 No 47 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.77 E-value=8.1e-11 Score=75.90 Aligned_cols=292 Identities=13% Similarity=0.033 Sum_probs=153.4 Q ss_pred Cc--------------cccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccc--c-cccccc Q lcl|NC_019501. 1 MA--------------LNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAP--T-QTGWNL 63 (431) Q Consensus 1 ~~--------------~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~--~-~~g~~~ 63 (431) |. --..-+|+.+--||...|+..++|..++.. |. .+.|+++.+|.=-+.+ . ..|..+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~v-Rt-----I~~gkS~qf~~lG~s~a~y~~pG~~l 74 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhccccee-ee-----ecccceEEEEEeeeeEEeeecCCCCc Confidence 22 112224555457888889999998888842 32 6779999988662222 2 224333 Q ss_pred ccCccccccceeEEEeccccccceEeeH--HHhhHHH-HHHhhhhHHHHHHHHHHHHHHHHhhcccc-----------ce Q lcl|NC_019501. 64 TGNATGILELSVKCNMGDPDNDFFELRA--DDLRDER-SYRRRIQASAKKLANNIESAIAKQATEMG-----------SL 129 (431) Q Consensus 64 s~~~~di~e~~V~v~ld~~k~v~f~lt~--keL~i~~-~s~~~L~~Am~~LAn~Id~dl~~~~~~~~-----------~~ 129 (431) .++.+.-.++.|++|.-.-....+.+ +-++.-| --..+-+.-..+||...|+-++.+.+.-. .. T Consensus 75 --dg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 75 --AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred --CCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 33345566777888776544443332 2222222 12233345568999999998876553321 01 Q ss_pred eeccCCCCC----Cc---cchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhc Q lcl|NC_019501. 130 VVHDTRAIG----PS---TGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNG 200 (431) Q Consensus 130 v~t~~~t~~----~~---~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g 200 (431) +....+... .. .+...-..+-+|...|++..||.. ++.++.+|+-|..+... +..+++... .+..+..| T Consensus 153 ~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~-d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s-~~g~~~~g 230 (400) T protein:vir:10 153 KGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADRIVDKSYTIS-QSGATIQG 230 (400) T ss_pred cccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhCCcccchhcccc-CCCccccc Confidence 110011100 00 111122345568888999999954 45344455555454332 333333222 22457888 Q ss_pred cccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEccee Q lcl|NC_019501. 201 TIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVK 280 (431) Q Consensus 201 ~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~ 280 (431) .+. .+.||. ++++.++|... ++ + .+..++-+| T Consensus 231 ~v~-~v~Gv~-Iv~Sn~lP~~a----~~--~--------------------------------------~~~~lS~a~-- 262 (400) T protein:vir:10 231 FVL-SSYNCP-VIPSNRFPKYS----QG--Q--------------------------------------KHHLLSNED-- 262 (400) T ss_pred eEE-EEeceE-EEeeCcCCccc----Cc--c--------------------------------------cccccccCC-- Confidence 886 488995 89998888410 00 0 011111111 Q ss_pred eeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeecccee Q lcl|NC_019501. 281 FLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSI 360 (431) Q Consensus 281 ~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~ 360 (431) ....|-++++ -+.+.=++|||+|. T Consensus 263 -----------~G~~y~~t~d---------------------------------------------~s~~~av~F~~sAv 286 (400) T protein:vir:10 263 -----------NGYRYDPIAE---------------------------------------------MNGAIAVLFTADAL 286 (400) T ss_pred -----------CCccCCcccc---------------------------------------------ccceeEEEEehhhe Confidence 0011112110 01123488999953 Q ss_pred EEEeecccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEE--eeccceecccceeEEecCCCCC Q lcl|NC_019501. 361 RLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIA--VWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 361 aLat~pl~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~d--vlyG~~~v~PElagv~i~~q~~ 431 (431) .- .-+ +.|....+||... ..+-|| ..||...+|||.++|+..+-++ T Consensus 287 ~t--vk~---------------------~~lt~~~~~d~r~--~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 287 LV--GRS---------------------IDVIGDIFYEKKE--KTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred EE--EEe---------------------eccccccccchhh--HHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 22 111 1223334555442 223455 4689999999999999999888 No 48 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.73 E-value=2.2e-09 Score=68.02 Aligned_cols=289 Identities=10% Similarity=0.058 Sum_probs=145.8 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc---ccccccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT---GWNLTGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~---g~~~s~~~~di~e~~V~v 77 (431) -+.-..-+|+.+--||...|+..+++..++.. |. .|.|+++.+|.=-+..... |..+.+++ +......| T Consensus 15 s~~d~al~le~f~geV~~af~~~s~~~~~~~~-rt-----i~~g~s~~~~~iG~~~~~~~~pG~~l~~~~--~~~~k~~i 86 (335) T protein:vir:63 15 KNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-RD-----LRGSNVVRLDRLGNVEAKGRRAGEELERSR--VVNDKWNL 86 (335) T ss_pred ccchhheehhhhhhhHHHHHHhhhhhccccce-ee-----eccceeEEEeeeeeeeeecccCCcCcCCCC--ccccceEE Confidence 11222334555557899999999999998853 32 4779999998765554332 43343343 45567788 Q ss_pred EeccccccceEeeH-HH-hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceee---cc-----C--------CCCCC Q lcl|NC_019501. 78 NMGDPDNDFFELRA-DD-LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVV---HD-----T--------RAIGP 139 (431) Q Consensus 78 ~ld~~k~v~f~lt~-ke-L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~---t~-----~--------~t~~~ 139 (431) ++|.-.-..+.+.+ +| +.--|....+-+....+||...|+.++.+...-..+.- .. + +.... T Consensus 87 tVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~ 166 (335) T protein:vir:63 87 TVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAK 166 (335) T ss_pred EecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcceeeeeccCccc Confidence 88887655555553 22 34445555577888999999999988743333222110 00 0 00000 Q ss_pred ccchhhhhhHHHHHHHHHhhcCCccC--CcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhccccccchhhhhhHhc Q lcl|NC_019501. 140 STGLSGWDFVSDAERLMFSRELNRDM--GISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRS 215 (431) Q Consensus 140 ~~~~~~~~d~a~a~~~L~~~~vP~~~--~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s 215 (431) ..+......+-.+.+.|++..||..+ .|-++++|..+..+... +...++........+.+|.+++ ++||. ++++ T Consensus 167 ~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~-v~Gv~-V~~s 244 (335) T protein:vir:63 167 QAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAI-LNGVK-VLET 244 (335) T ss_pred ccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEE-eeceE-EEee Confidence 01111224455788999999999632 47799999999887654 3333332222234689999976 88996 9999 Q ss_pred CCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceee--ec-----ccccc Q lcl|NC_019501. 216 PKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKF--LS-----QMAKN 288 (431) Q Consensus 216 ~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~--vn-----~~tk~ 288 (431) .++|... +++. ++..+. |.-..|+....+++--...|-.+-..-++.=.| .. -.+|. T Consensus 245 n~lP~~~-~t~~--~lg~a~-------------n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~ 308 (335) T protein:vir:63 245 PRFATKA-IAAH--PLGRHF-------------NVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQ 308 (335) T ss_pred ccCCCCC-cccc--cccccC-------------CccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHH Confidence 9998632 2211 121111 111112222211211111121222221111000 00 01122 Q ss_pred ccCCCceEEEEEeecCceeEEeecccccccccccccccccceeec Q lcl|NC_019501. 289 VLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNT 333 (431) Q Consensus 289 ~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa 333 (431) ..|+.. .....+..|+ -++++.-+.| + T Consensus 309 a~G~g~----lRPe~a~~i~--~tg~~~~~~~------------~ 335 (335) T protein:vir:63 309 MYNIGA----RRPDTAGAIE--LKGIGAFDIT------------A 335 (335) T ss_pred HcCCcc----cccceEEEEE--EcCCCceeec------------C Confidence 222211 0000111122 1232221111 0 No 49 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.66 E-value=1.8e-08 Score=63.05 Aligned_cols=266 Identities=11% Similarity=0.018 Sum_probs=138.7 Q ss_pred CccccccchhHH-HHHHHHH-HHhh-cccchhhccCCCchHHHhhcccEEEeecccccccccc-cccccCccccccceeE Q lcl|NC_019501. 1 MALNEGQLVTYA-LDEIIET-VQNL-TPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTG-WNLTGNATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt~~-~~evi~~-len~-lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g-~~~s~~~~di~e~~V~ 76 (431) -.+..|++.-.- +.-.|++ +..+ +.-..++ -++|+ ..-|++|.||.-......|- +....+..++...... T Consensus 33 ~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~--N~~~e---~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t 107 (329) T protein:vir:10 33 KSVEPGDTLLKNKHVGILEKVTAANSYSAPAVI--SNDAI---FMQGRSFTVIKGDVTELKDYKRNATNEFDHPQIQETT 107 (329) T ss_pred CccCCchhHHHHHHHHHHHHHHHhhceeeeeec--cccee---eccCcEEEEeeecccccccccCCCCccccccccceeE Confidence 233345443321 3333443 3333 3333445 36665 34699999997744333321 2233345567788888 Q ss_pred EEeccccccceEeeHHHhhHHHH---HHhhh-hHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHHH Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLRDERS---YRRRI-QASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSDA 152 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~i~~~---s~~~L-~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a 152 (431) ++|++.+...|.+.+-|.....+ ....+ +.+-..++-+||+...+.+.... ++. .....+..+.|..+..+ T Consensus 108 ~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a---~~~--~~~~~t~~nay~~i~~a 182 (329) T protein:vir:10 108 YFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNK---AKH--LTVGSGADAQYDAVLDV 182 (329) T ss_pred EEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhc---ccc--cccccCHHHHHHHHHHH Confidence 99999999777776544322111 11112 22445778899999887665533 221 12233455678899999 Q ss_pred HHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceec Q lcl|NC_019501. 153 ERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVS 232 (431) Q Consensus 153 ~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~ 232 (431) ...|++.++|. +|-++++|.-+..+-.+........ ...+-.++|.+ T Consensus 183 ~~~Lde~~vp~--~Rvl~VtP~~~~~Lk~~~~f~~~~~-~~~~~~~~g~V------------------------------ 229 (329) T protein:vir:10 183 SVELDEIGAGA--SRILFVTPKFYKGIKKFVIELPQGD-NRQQVLGKGVQ------------------------------ 229 (329) T ss_pred HHHHHhcCCCC--CcEEEeCHHHHHHHHhhhhhhcccc-ccccceeeeee------------------------------ Confidence 99999999994 4778999988876622110000000 00011122222 Q ss_pred ccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeec Q lcl|NC_019501. 233 GAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPK 312 (431) Q Consensus 233 gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Pa 312 (431) =+|.|+ .|.. .|. T Consensus 230 -----------------------------------------g~idG~-----------------~Ii~---------vps 242 (329) T protein:vir:10 230 -----------------------------------------GELDGF-----------------TIVK---------VPS 242 (329) T ss_pred -----------------------------------------eeecCe-----------------EEEE---------ecC Confidence 123331 1111 011 Q ss_pred ccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeecCcceEEEE Q lcl|NC_019501. 313 PIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGI 392 (431) Q Consensus 313 ii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~~g~glslr 392 (431) ... ..+.| ++.|++|++.+..=-.+ +..+ +.++. T Consensus 243 ~~~-----------------------k~in~---------ii~~~~A~~~~~K~~~~--------~~~~-p~~~~----- 276 (329) T protein:vir:10 243 KML-----------------------QGVEA---------MAVIGEVMASPIQANEA--------KLNS-NVPGM----- 276 (329) T ss_pred Ccc-----------------------cceeE---------EEEcCCceeeeeeeeee--------eeeC-CCCcc----- Confidence 000 01222 67788888776552211 0000 00111 Q ss_pred EEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 393 FATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 393 v~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) ..| .++--..||+.+++|+..||.....++ T Consensus 277 --~a~-------~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 277 --FGT-------LAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred --chh-------eeeeeeeeeeEEEccccCEEEEecccC Confidence 011 233445799999999999988877766 No 50 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.56 E-value=3.4e-08 Score=61.55 Aligned_cols=288 Identities=9% Similarity=-0.076 Sum_probs=140.1 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHH-hhcccEEEeecccccccccc-ccc-ccCccccccceeEE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESM-QRSSNTVWMPVEQEAPTQTG-WNL-TGNATGILELSVKC 77 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~-~k~GdTv~ip~p~~~~~~~g-~~~-s~~~~di~e~~V~v 77 (431) ||. .| +.|..-.++.+.|+.+++.+.+. -++++.+- +.-|++|.||.=......|- +.. .....++.-....+ T Consensus 1 MA~-~n-~a~~~~~~Ld~~~~~~l~~~~L~--~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~ 76 (299) T protein:vir:79 1 MAA-LN-YAKEYSNVLAQAYPYTLNFGDLY--ATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPK 76 (299) T ss_pred Ccc-ch-hHHHHHHHHHHHHHhhceeeeec--cCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEE Confidence 993 33 34666678888899999999888 56765442 23479999996644333321 111 22333566677889 Q ss_pred EeccccccceEeeHHHhh-------HHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 78 NMGDPDNDFFELRADDLR-------DERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 78 ~ld~~k~v~f~lt~keL~-------i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a 150 (431) +|++.+...|.+.+-|.+ ......++.+ .+++-+||+...+....-+..+++.+.. ...+..+.+..+- T Consensus 77 ~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~---~~v~pEiDay~~skl~~~a~~~g~~~~~-~~~T~~n~y~~i~ 152 (299) T protein:vir:79 77 VLTNQRKWSTLVHPADINQTNYVASIGNITKVYNE---EQKFPEMDAYCISKIYADWTALGNTADT-TVLTTTNVLEVFD 152 (299) T ss_pred EeeccccceeccchhhHHHHhhhhHHHHHHHHHHH---HHhhhHhhHHHHHHHHHhhhhcCCcccc-cccCHHHHHHHHH Confidence 999999988888743322 2222333322 3567788888776443334444442222 2234456678888 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccce Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVT 230 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~t 230 (431) .+.+.|++.++|..+ |-++++|.-+..+..+-..........+.-.++|.+++ +.||. +++...- +.. +.+. T Consensus 153 ~~~~~lde~~vP~~~-rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~-idG~~-Ii~Vps~-r~~----t~~~ 224 (299) T protein:vir:79 153 KLMEKMTEARVPENG-RILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTD-IDTVK-IIKVPSN-LMK----TAYD 224 (299) T ss_pred HHHHHHHhcCCCCCC-eEEEeCHHHHHHHhhchhhhcccccccccceeeeeeee-ecceE-EEEechh-hcC----ccce Confidence 999999999999876 77899999988875432111111111222467888876 88995 7762221 111 1110 Q ss_pred -ecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEE Q lcl|NC_019501. 231 -VSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEI 309 (431) Q Consensus 231 -v~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I 309 (431) .+|... ..++ .+--...+..++.-.+.|=|.+-| ..|-+-+.--.+.+++.--|+ -| T Consensus 225 ~~~G~~~--------~~~a----k~in~ii~~~~a~~~~~K~~~~~~-----~~P~~~~~~~~~~~~r~y~d~-----~v 282 (299) T protein:vir:79 225 FTTGWKV--------GAGA----KQIFMSLVHPSAIITPVSYQFSKL-----DEPTAVTEGKYFYFEESFEDV-----FI 282 (299) T ss_pred eccCccc--------cCcc----cccceEEEcCCeeeeeEeeeeEEe-----ecCCCCCccceeeeeeeeeee-----ee Confidence 011100 0000 000001111111111111111111 122222211111222211111 00 Q ss_pred eecccccccccccccccccceeecccc Q lcl|NC_019501. 310 TPKPIALDDASLTKEEKAYANVNTSLA 336 (431) Q Consensus 310 ~Paii~~~~~t~~~~~~~y~nVsa~~A 336 (431) ..- ...+-|.|+.++-+ T Consensus 283 ~~n----------k~~~i~~~~~~a~~ 299 (299) T protein:vir:79 283 LNK----------KADAIQFVVEGAGA 299 (299) T ss_pred ecc----------ccCeEEEEeeecCC Confidence 000 00111222222222 No 51 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.45 E-value=1.6e-08 Score=63.30 Aligned_cols=256 Identities=18% Similarity=0.206 Sum_probs=137.2 Q ss_pred Ccc-ccccchhH--HHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccc----cccccccccccCccccccc Q lcl|NC_019501. 1 MAL-NEGQLVTY--ALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQE----APTQTGWNLTGNATGILEL 73 (431) Q Consensus 1 ~~~-~~~~~lt~--~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~----~~~~~g~~~s~~~~di~e~ 73 (431) ||. ....++-+ +-+-+.+++++.+++++++ -.+++=++ +.|+||+||.=.. -...+|.+.. +..+.-+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~--~~d~~L~g-~~G~ti~~P~~~~igdae~~~eg~~i~--~~~lt~~ 75 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYA--VTDDTLVG-QPGDTITRPKYAYIGAAEDLQEGVAMD--TTQMSMT 75 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhcccc--ccccccCC-CCCCEEEeeeecCCCccccccCCCccc--hhhcccc Confidence 996 55555444 1256677899999999988 45554443 5899999996211 1133343332 2233333 Q ss_pred eeEEEeccccccceEeeHHH--hhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHH Q lcl|NC_019501. 74 SVKCNMGDPDNDFFELRADD--LRDERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSD 151 (431) Q Consensus 74 ~V~v~ld~~k~v~f~lt~ke--L~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~ 151 (431) +...++ ++..-.|..++-+ +...+......++....+|+++|.+|++.++.....+ + ......+|.+ T Consensus 76 ~~~a~i-~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~---~-------~~~t~~~~~d 144 (270) T protein:vir:95 76 TTKVTV-KETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA---T-------VSADATGILD 144 (270) T ss_pred hheeee-ehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc---c-------cccCHHHHHH Confidence 333444 2223346666544 3346778888888889999999999998888643211 0 1112466778 Q ss_pred HHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccccee Q lcl|NC_019501. 152 AERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTV 231 (431) Q Consensus 152 a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv 231 (431) |...|.+..-. ...+++||..++.+.++... .........+++|.||. +.|++-|.++..++. + +++.+ T Consensus 145 A~~~lgd~~~~---~~~i~vhs~~~~~Lrk~~~~--~~~~~~~~~~~~G~ig~-~~G~~Viv~s~~~~~---~--~~~l~ 213 (270) T protein:vir:95 145 AIEVFNSENDE---DYVLYVNPKDYNKLVKSLFK--VGGNVQDRAISKGDLVE-IVGVSDIVKSKRVSE---N--TAFLQ 213 (270) T ss_pred HHHHhccccCC---CcEEEEcHHHHHHHHhhhcc--cccccccchhcccccce-ecceeEEEeCCCCCc---e--eEEEE Confidence 88888665322 35688999999998766422 22222344689999997 889964455544332 1 22221 Q ss_pred -cccccccceeeeeecccccccccceeeEEEeeccc--eeeeccEEEEcceeeeccccccccCCCceEEEEEeecCc--e Q lcl|NC_019501. 232 -SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTT--GFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDST--H 306 (431) Q Consensus 232 -~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg--~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~--t 306 (431) .||- ++. ++.. +. .-+. -++.=|.++. .+.|+|--..... . T Consensus 214 ~~gAi-----~~~-----~~~~-------~~-vEtdRd~~~~~d~i~~----------------~~~y~v~~~~~skvv~ 259 (270) T protein:vir:95 214 RYGAM-----EIV-----NKKK-------PE-AYTDFDILKRTHLLST----------------NYHYSVNLKDETGVVK 259 (270) T ss_pred eccce-----eee-----ecCC-------ce-eeeccchhhcccEEEe----------------eeEEEEEEEccceEEE Confidence 1111 110 0000 00 0010 1122233332 2445443322222 3 Q ss_pred eEEeecccccccccc Q lcl|NC_019501. 307 IEITPKPIALDDASL 321 (431) Q Consensus 307 i~I~Paii~~~~~t~ 321 (431) +++.|++=.. + T Consensus 260 ~t~~~a~~~~----~ 270 (270) T protein:vir:95 260 VTFKPSGSLE----M 270 (270) T ss_pred EEecCCCCcC----C Confidence 4455554221 1 No 52 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.31 E-value=3.1e-07 Score=56.27 Aligned_cols=282 Identities=11% Similarity=0.040 Sum_probs=132.1 Q ss_pred Cccc---cccchhHH-HHHHHHHHHhhcccch-hhccCCCchHHHhhcccEEEeecccccccccc-cccccCccccccce Q lcl|NC_019501. 1 MALN---EGQLVTYA-LDEIIETVQNLTPMAS-KVTKYTPPAESMQRSSNTVWMPVEQEAPTQTG-WNLTGNATGILELS 74 (431) Q Consensus 1 ~~~~---~~~~lt~~-~~evi~~len~lvma~-~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g-~~~s~~~~di~e~~ 74 (431) .|++ .|++...- +...|+.+....+.+. +.. =++|+- .-|++|.||.=......|- +....+..++.-.. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~-N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~ 94 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI-SNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEE 94 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhccc-CcceEe---ccCcEEEEeeecccccccccCCCCcccCCcccce Confidence 3333 44444321 3445555544444432 221 144433 3599999996543322221 22334455677788 Q ss_pred eEEEeccccccceEeeHHHhhHHHH---HHhhh-hHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDLRDERS---YRRRI-QASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL~i~~~---s~~~L-~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a 150 (431) ..++|++.+...|.+.+-|...-.+ ....+ +.+-..++-+||....+.+..-. ++.. +...+..+.|..+. T Consensus 95 ~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a---~~~~--~~~~t~~n~y~~i~ 169 (319) T protein:vir:97 95 TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK---AKHL--TVGTGSDAQYDAVL 169 (319) T ss_pred eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhc---cccc--ccccCHHHHHHHHH Confidence 8899999999777777544322111 11112 22334677788988776655422 2211 12234456788899 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccce Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVT 230 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~t 230 (431) .+.+.|++.+|| .+ |-++++|.-+..+..+-. +.......++.+++|.+++ +.||. +++..+.. +..... T Consensus 170 ~a~~~Lde~~VP-~~-Rvl~Vtp~~~~~L~~~~~-f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~- 239 (319) T protein:vir:97 170 DVSVELDEIKAP-EN-RVLFVSPTFYKGIKKFVI-ALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQA- 239 (319) T ss_pred HHHHHHHhcCCC-CC-cEEEeCHHHHHHHHhhhh-hhccccccccceeeeecee-ecCeE-EEEecccc----cccceE- Confidence 999999999999 44 778999999888744322 2222222345678999986 88996 77643321 111111 Q ss_pred ecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc----ceeeeccccccccCCCceEEEEEeecCce Q lcl|NC_019501. 231 VSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT----GVKFLSQMAKNVLTDDATFSITRVIDSTH 306 (431) Q Consensus 231 v~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia----GV~~vn~~tk~~~~~lq~fvVta~~~a~t 306 (431) +-|.+..-....+++....-.+..++.+. .--+++-.|.|.+. |||....-.+... . T Consensus 240 i~~h~~A~~~~~k~~~~~~~~p~~~~~a~----~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~-~-------------- 300 (319) T protein:vir:97 240 IAVVGEVLASPIQADLAKTNSNIPGMFGT----LAEQLLYTGAFVPEHLQKYIFTIGGTEVATK-R-------------- 300 (319) T ss_pred EEEcCCeeeeeeeeeeeeccCCCccccce----eeeeeeeeeeEEeccccceEEEeecCCcccC-C-------------- Confidence 22222111111222211111110111000 00133444444442 2221111000000 0 Q ss_pred eEEeecccccccccccccccccceeecccccCce-----eEE Q lcl|NC_019501. 307 IEITPKPIALDDASLTKEEKAYANVNTSLADNTP-----VNV 343 (431) Q Consensus 307 i~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aa-----vTv 343 (431) .-+..+||..| +-. T Consensus 301 -----------------------~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 301 -----------------------DGVDAHADNVAKPSGSLEM 319 (319) T ss_pred -----------------------CccccccccccCCcccccC Confidence 00111111110 001 No 53 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.31 E-value=3.1e-07 Score=56.27 Aligned_cols=282 Identities=11% Similarity=0.040 Sum_probs=132.1 Q ss_pred Cccc---cccchhHH-HHHHHHHHHhhcccch-hhccCCCchHHHhhcccEEEeecccccccccc-cccccCccccccce Q lcl|NC_019501. 1 MALN---EGQLVTYA-LDEIIETVQNLTPMAS-KVTKYTPPAESMQRSSNTVWMPVEQEAPTQTG-WNLTGNATGILELS 74 (431) Q Consensus 1 ~~~~---~~~~lt~~-~~evi~~len~lvma~-~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g-~~~s~~~~di~e~~ 74 (431) .|++ .|++...- +...|+.+....+.+. +.. =++|+- .-|++|.||.=......|- +....+..++.-.. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~-N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~ 94 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI-SNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEE 94 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhccc-CcceEe---ccCcEEEEeeecccccccccCCCCcccCCcccce Confidence 3333 44444321 3445555544444432 221 144433 3599999996543322221 22334455677788 Q ss_pred eEEEeccccccceEeeHHHhhHHHH---HHhhh-hHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDLRDERS---YRRRI-QASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL~i~~~---s~~~L-~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a 150 (431) ..++|++.+...|.+.+-|...-.+ ....+ +.+-..++-+||....+.+..-. ++.. +...+..+.|..+. T Consensus 95 ~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a---~~~~--~~~~t~~n~y~~i~ 169 (319) T protein:vir:94 95 TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK---AKHL--TVGTGSDAQYDAVL 169 (319) T ss_pred eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhc---cccc--ccccCHHHHHHHHH Confidence 8899999999777777544322111 11112 22334677788988776655422 2211 12234456788899 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccce Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVT 230 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~t 230 (431) .+.+.|++.+|| .+ |-++++|.-+..+..+-. +.......++.+++|.+++ +.||. +++..+.. +..... T Consensus 170 ~a~~~Lde~~VP-~~-Rvl~Vtp~~~~~L~~~~~-f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~- 239 (319) T protein:vir:94 170 DVSVELDEIKAP-EN-RVLFVSPTFYKGIKKFVI-ALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQA- 239 (319) T ss_pred HHHHHHHhcCCC-CC-cEEEeCHHHHHHHHhhhh-hhccccccccceeeeecee-ecCeE-EEEecccc----cccceE- Confidence 999999999999 44 778999999888744322 2222222345678999986 88996 77643321 111111 Q ss_pred ecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc----ceeeeccccccccCCCceEEEEEeecCce Q lcl|NC_019501. 231 VSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT----GVKFLSQMAKNVLTDDATFSITRVIDSTH 306 (431) Q Consensus 231 v~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia----GV~~vn~~tk~~~~~lq~fvVta~~~a~t 306 (431) +-|.+..-....+++....-.+..++.+. .--+++-.|.|.+. |||....-.+... . T Consensus 240 i~~h~~A~~~~~k~~~~~~~~p~~~~~a~----~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~-~-------------- 300 (319) T protein:vir:94 240 IAVVGEVLASPIQADLAKTNSNIPGMFGT----LAEQLLYTGAFVPEHLQKYIFTIGGTEVATK-R-------------- 300 (319) T ss_pred EEEcCCeeeeeeeeeeeeccCCCccccce----eeeeeeeeeeEEeccccceEEEeecCCcccC-C-------------- Confidence 22222111111222211111110111000 00133444444442 2221111000000 0 Q ss_pred eEEeecccccccccccccccccceeecccccCce-----eEE Q lcl|NC_019501. 307 IEITPKPIALDDASLTKEEKAYANVNTSLADNTP-----VNV 343 (431) Q Consensus 307 i~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aa-----vTv 343 (431) .-+..+||..| +-. T Consensus 301 -----------------------~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 301 -----------------------DGVDAHADNVAKPSGSLEM 319 (319) T ss_pred -----------------------CccccccccccCCcccccC Confidence 00111111110 001 No 54 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=97.26 E-value=9.6e-05 Score=42.62 Aligned_cols=276 Identities=11% Similarity=-0.002 Sum_probs=123.0 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccc--cccccccccccCccccccceeEEE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQE--APTQTGWNLTGNATGILELSVKCN 78 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~--~~~~~g~~~s~~~~di~e~~V~v~ 78 (431) ||++.-++-. .++.+.|...++.+.+.. ++++=+ -|++|.||.=.. .+..+ +...-...++.-..-+.+ T Consensus 1 Main~a~~~~---~~Ld~~~~~~~~t~~l~~--~~~~~~---ggktVkI~~i~~~gl~DY~-R~~g~~~g~v~~~~et~t 71 (290) T protein:vir:78 1 MAINYVDKYG---KELDQKLVFGTYTNELET--PNLLWL---DAKTFKIQTITTTGLKAHT-RNKGYNEGSASNTNKSYT 71 (290) T ss_pred CchhHHHHHH---HHHHHHHHhhheeeeccc--cceeec---cCCEEEEeeeccCcccccc-cCCCcccCccccceeeEE Confidence 9998755444 577777888999888873 344322 389999996422 22222 111111223444555688 Q ss_pred eccccccceEeeHHHhh-------HHHHHHhhhhHHHHHHHHHHHHHHHH-hhccccceeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 79 MGDPDNDFFELRADDLR-------DERSYRRRIQASAKKLANNIESAIAK-QATEMGSLVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 79 ld~~k~v~f~lt~keL~-------i~~~s~~~L~~Am~~LAn~Id~dl~~-~~~~~~~~v~t~~~t~~~~~~~~~~~d~a 150 (431) |++.+...|.+..-|.+ ....+.++ +-.+++-+||+...+ ++.... ..++.... ..+..+.+..+- T Consensus 72 l~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef---~~~~v~PEiDayr~skla~~a~-~~~~~~~~--t~t~~n~~~~i~ 145 (290) T protein:vir:78 72 IDFDRDVEFFVDVMDVDETGQALSAANVTKEF---NSRHAGPEMDAYRFSKLATAAK-TNSNSVAE--EITKDNVFTKLK 145 (290) T ss_pred eeccccceeeccccchhHHhhhhhHHHHHHHH---HHHHhhhhhhHHHHHHHHhhhh-ccCccccc--ccCHHHHHHHHH Confidence 99999988888743332 23333333 334667888888665 333322 11221111 123334566667 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhh--hhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRN--LVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~--~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) .+...|++ +|.. +|-++.+|.-+..+..+ +....... ..++-..+|.+++ +.||. +++.....++.. . T Consensus 146 ~~~~~lde--vp~~-~rvl~vtp~~~~lL~~~~~f~r~~~~~-~~~~~~i~~~V~~-idG~~-ii~vps~~r~~t----~ 215 (290) T protein:vir:78 146 AAIRKVKK--YGTQ-NLVMYVSPDVMAALELSDDFVRAINVQ-NIGPSSIETRITA-IDGTR-IVEVEAEDRFYD----T 215 (290) T ss_pred HHHHHHHh--cCCC-CeEEEECHHHHHHHhhChhhhcccccc-ccccccccceeee-ecCcE-EEEecccchhhh----h Confidence 78888876 8865 58799999999877432 22211111 1122234777765 77884 676433222210 1 Q ss_pred cee-cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccc-cCCCceEEEEEeecCce Q lcl|NC_019501. 229 VTV-SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNV-LTDDATFSITRVIDSTH 306 (431) Q Consensus 229 ~tv-~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~-~~~lq~fvVta~~~a~t 306 (431) +.. +|.. ....+ .+--...+..++.-.+.|=|.+-| ..|-+-+. .+++-+|+.--|+= T Consensus 216 ~~f~~G~~--------~~~~a----k~in~ii~~~~a~i~~~K~~~~~~-----~~P~~~~~~d~~~~~~r~y~d~~--- 275 (290) T protein:vir:78 216 FDFTDGYK--------PAAGA----KKLNFLLVNKGSVVGGAKHASIYL-----HAPGSVGQGDGWLYQYRVYHDIF--- 275 (290) T ss_pred hhhccccc--------ccCCc----cceeEEEEcCCceeeeeeeeEEEe-----eCCCCCcCcceeeeeeeeeeeee--- Confidence 100 1100 00000 000011111111111212222111 11222111 11222222211110 Q ss_pred eEEeecccccccccccccccccceeec Q lcl|NC_019501. 307 IEITPKPIALDDASLTKEEKAYANVNT 333 (431) Q Consensus 307 i~I~Paii~~~~~t~~~~~~~y~nVsa 333 (431) |..- ...+-|.|+.. T Consensus 276 --v~~n----------k~~~i~~~~~~ 290 (290) T protein:vir:78 276 --VLDQ----------QKDGVIASTEV 290 (290) T ss_pred --eecc----------ccCeeEEEeeC Confidence 0000 00001111111 No 55 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=96.66 E-value=0.00043 Score=39.04 Aligned_cols=288 Identities=11% Similarity=0.049 Sum_probs=120.2 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecc--cccccccccccc--cCccccccceeE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVE--QEAPTQTGWNLT--GNATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p--~~~~~~~g~~~s--~~~~di~e~~V~ 76 (431) ||+.. ++-|.+-.++-+.+...++-+-+- .-+..-++ .=|++|.||.= ...+..+ ++.. -+..++....-+ T Consensus 1 Mantl-~ya~~~~~~LD~~~~~~~~s~~l~--~~~~~v~~-~ggktVkIp~i~~~gl~DY~-R~~g~~~~~g~v~~~~et 75 (312) T protein:vir:10 1 MANTL-AYGQVLQQGLDKQATQELLTGWMD--SNAKQIKY-EGGKEVKIGKLSTDGLGDYS-RGSANAYVGGDVKFEYET 75 (312) T ss_pred CCcch-hHHHHHHHHHHHHHHhhhcccccc--CCCceEEE-ecCcEEEEEeeecccccccc-cccCCcccccccccccee Confidence 99544 333433334444455555555443 11111122 34799999963 3333222 1111 122345566677 Q ss_pred EEeccccccceEeeHHHhhH-------HHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCC--CCCCccchhhhh Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLRD-------ERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTR--AIGPSTGLSGWD 147 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~i-------~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~--t~~~~~~~~~~~ 147 (431) .+|++.+...|.+...|.++ ...+.+|. -...+=+||+.-.+....-+...++.+. .+..-+..+-+. T Consensus 76 ~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~---r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~ 152 (312) T protein:vir:10 76 KTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQ---RLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIIN 152 (312) T ss_pred EEeeecccceeeccccchhhHhhHHHHHHHHHHHH---HhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHH Confidence 88999998778777434332 22222222 2244556777744333322222222111 122224455677 Q ss_pred hHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccc Q lcl|NC_019501. 148 FVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT 227 (431) Q Consensus 148 d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~ 227 (431) .+-.+...|++.++| . .|-++++|.-+..+-... ........+.+|.|.|.++.+|.+ +.+.+...=.-+ T Consensus 153 ~i~~~~~~lde~~vp-~-~rvl~vTp~~~~lLk~~~-----~~~~~~~~~~~~~i~~~V~~iDgv---~Ii~VPs~r~~t 222 (312) T protein:vir:10 153 KIKTGIKIIRENGYN-G-PLVCHLTYDSMFAIEEKV-----LEKLTAVTFAQGGIQTQVPSIDGC---ALIKTPQNRMYS 222 (312) T ss_pred HHHHHHHHHHHccCC-C-ceEEEeChHHHHHHhhhh-----hceecccccccceeeeeeeeeccc---EEEEchhhhccc Confidence 788899999999999 3 577889998886664322 122222344556566656555431 222221111111 Q ss_pred ccee-----cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEE-cceeeeccccccccCCCceEEEEEe Q lcl|NC_019501. 228 GVTV-----SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISF-TGVKFLSQMAKNVLTDDATFSITRV 301 (431) Q Consensus 228 ~~tv-----~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Ti-aGV~~vn~~tk~~~~~lq~fvVta~ 301 (431) .+.. +|+.+ .|+.-+..+- +--...+..++.-.+.|=|.+-| += ..||.. .+++-+|+.--| T Consensus 223 ~~~f~dG~t~~~~~---gg~~~~~~ak----~INfiiv~~~a~i~~~K~~~~~if~P--~~~~~~---d~~~~~~R~Y~D 290 (312) T protein:vir:10 223 SILLNDGTTSNQTA---GGYLKGTKAL----DTNFIIAPVDVPLAITKQDKMRIFDP--ETNQTA---NAWSMDYRRYHD 290 (312) T ss_pred eeeeccCccccccc---CceeecCccc----ccceEEeCCceeeceeeeeeeeeeCC--CCCCCc---ceeeeeeeeeee Confidence 1111 11111 1121111111 11112222222222222232222 10 112111 112222222111 Q ss_pred e-----cCceeEEeecccccccccccccccccceeecccccC Q lcl|NC_019501. 302 I-----DSTHIEITPKPIALDDASLTKEEKAYANVNTSLADN 338 (431) Q Consensus 302 ~-----~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~ 338 (431) + ....| |.|+..+-+-| T Consensus 291 ~fv~~nk~~~I--------------------yv~~k~a~~~~ 312 (312) T protein:vir:10 291 LWVTDNKANSV--------------------YANFKDAKPVG 312 (312) T ss_pred eeeeccccCeE--------------------EEEeecccCCC Confidence 1 01111 22222221111 No 56 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=96.15 E-value=0.00068 Score=37.96 Aligned_cols=265 Identities=14% Similarity=0.081 Sum_probs=121.0 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHH-hhcccEEEeeccc---ccccccccccccCccccccceeE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESM-QRSSNTVWMPVEQ---EAPTQTGWNLTGNATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~-~k~GdTv~ip~p~---~~~~~~g~~~s~~~~di~e~~V~ 76 (431) ||++..+.-. ..+.+++...+..+.++. .+.+... ..=|.+|.||+=+ ..+..+ +....+..++.-..-+ T Consensus 1 Main~~~k~~---~~ld~~~~~~~~~~~l~~--~~n~~~~~~~gak~VkIp~ist~~gl~dY~-R~~g~~~g~v~~~~et 74 (285) T protein:vir:79 1 MTVVLDSKDL---ARIDEEYKADSQVWSYLT--GGNGVTQRFRGHNEVRINKLSGFVDATAYK-RGQDNARKTISVGKET 74 (285) T ss_pred CcchhhHHHH---HHHHHHHHHhhhhhhhcc--cCCcceeEecCCCEEEEeeecccccccccc-cccCccccccceeeeE Confidence 9998766555 466666777777777762 2221111 1236899999642 233222 3333344455566667 Q ss_pred EEeccccccceEeeHHHhh------HHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLR------DERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~------i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a 150 (431) .+|++.+...|.+...|.+ ......+|.+ ...+=+||+.-++.+..-. ++...+ +.+..+-+..+- T Consensus 75 ~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~---~~vvPEiDayrfskla~~a---~~~~~~--~~T~~nv~~~i~ 146 (285) T protein:vir:79 75 VKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNK---MITIPHRDKVAVQKLFDSA---AKKATD--SITKDNALDAYD 146 (285) T ss_pred EEeeccccceecccccchhhhhhhhHHHHHHHHHh---hhhcchhhHHHHHHHHhhc---cccccc--ccCHHHHHHHHH Confidence 8899999877777743321 1111222111 1233466666443333222 221111 223455677788 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccce Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVT 230 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~t 230 (431) .+...|++.++| . +|-++.+|.-+..+-.+-. ....-...+.+..|.|-|.+..+|... ..+.+...-.-+ T Consensus 147 ~~~~~lde~~vp-~-~rvl~vTp~~~~~Lk~s~~--~~r~~~~~~~~~~~~i~~~V~~lDg~v--~ii~Vps~r~kt--- 217 (285) T protein:vir:79 147 TAEAYMFDNEVP-G-GFVMFVSSAYYTALKQSAA--VTRTFSTDGTMVINGIDRRVAQLDGGV--PIVRVSSDRLKG--- 217 (285) T ss_pred HHHHHHHHcCCC-C-ceEEEEChHHHHHHHhhhh--hheecccccceeccceeeeecccccee--EEEEcchhhccC--- Confidence 899999999999 3 5778899988887643211 111111223455566667777776211 111111100000 Q ss_pred ecccccccceeeeeecccccccccceee--EEE----eeccc----eeeeccEEEEcceeeeccccccccCCCceEEEEE Q lcl|NC_019501. 231 VSGAQKFKPQAYTLDTDGNKENVDNRVA--TVT----VSSTT----GFKRGDKISFTGVKFLSQMAKNVLTDDATFSITR 300 (431) Q Consensus 231 v~gA~q~~~~~~~v~~~g~~~~~d~~~~--~i~----~s~tg----~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta 300 (431) ..++.+ --+.|.......+...... ... -++-+ ..+=+|+|.+.- |.. .-|+-.. T Consensus 218 ~~~~k~---Infiiv~~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~n--------k~~----~Iy~~~~ 282 (285) T protein:vir:79 218 LGITNH---VNFILTPLSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDN--------AKK----GIYVAAT 282 (285) T ss_pred cCcchh---ccEEEecCceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhh--------ccc----eeeeeec Confidence 000000 0111111000000000000 000 01112 234567776642 110 1133322 Q ss_pred eec Q lcl|NC_019501. 301 VID 303 (431) Q Consensus 301 ~~~ 303 (431) .+- T Consensus 283 a~~ 285 (285) T protein:vir:79 283 AGV 285 (285) T ss_pred ccC Confidence 111 No 57 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=313 Identities=7% Similarity=-0.045 Sum_probs=122.5 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHH-hhcccEEEeeccc---ccccccccccccCccccccceeE Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESM-QRSSNTVWMPVEQ---EAPTQTGWNLTGNATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~-~k~GdTv~ip~p~---~~~~~~g~~~s~~~~di~e~~V~ 76 (431) |+++..+.....+|+ ++...++...++- -.|..... +.-|++|.||+=+ -.+..+-..+.....++.-..-+ T Consensus 1 Mainya~~~~~~Ld~---~~~~~~lts~~l~-~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et 76 (346) T protein:vir:10 1 MTINYAEKYQAAVQQ---AFYDGHLYSAELW-NSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDS 76 (346) T ss_pred CcchhHHHHHHHHHH---HHHhhhccchhhc-ccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeE Confidence 999887766654444 4444444422220 12222210 1237999999753 23433322232222345566777 Q ss_pred EEeccccccceEeeHHHhh-------HHHHHHhhhhHHHHHHHHHHHHHHHH-hhccccceeeccCCCCCCccchhhhhh Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLR-------DERSYRRRIQASAKKLANNIESAIAK-QATEMGSLVVHDTRAIGPSTGLSGWDF 148 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~-------i~~~s~~~L~~Am~~LAn~Id~dl~~-~~~~~~~~v~t~~~t~~~~~~~~~~~d 148 (431) .+|++.+...|.+..-|.+ +.....+|.+ .+.+=+||+.-++ +|......-++ ......-+..+.+.. T Consensus 77 ~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r---~~vvPEiDayrfskLa~~a~~~~~~-~~~~~a~T~~ni~~~ 152 (346) T protein:vir:10 77 YELKNERYWSTLVDPSDIDETNMVVSLANITKQFNL---DSKMPEKDRYMFSHLYSGKEAAHDG-GITTNTLDEKNILPA 152 (346) T ss_pred EEeeccccceecccccchHHHHHHhHHHHHHHHHHH---HhhcchhhHHHHHHHHHhhhhhccc-cccccccCHHHHHHH Confidence 8999999977888743322 2222222222 1234466766433 33332211111 111111233455677 Q ss_pred HHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 149 VSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 149 ~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) +-.+...|++.++|..+ |-++.+|.-+..+-.+-.......-..... -+|.+++ +.||. +++...- +.. +. T Consensus 153 i~~~~~~lde~~vp~~~-rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~-i~~~V~s-iDGv~-Ii~VPs~-r~~----t~ 223 (346) T protein:vir:10 153 FDNMMLDFDEARIPSTN-RILYVTPKTNAILKRAEAMNRALTLKDPNN-IQRTVYS-LDDVT-IRVVPSD-LMQ----TA 223 (346) T ss_pred HHHHHHHHHHccCCCCC-eEEEECHHHHHHHhhchhheeccccccccc-cceeeee-ecCeE-EEEcchh-hcc----cc Confidence 78899999999999876 778999999887633211111111001111 3666654 66774 5552211 110 11 Q ss_pred cee-cccccccceeeeeecccccccccceeeEEEeecccee-eeccEEEEcceeeeccccccccCCCceEEEEEee---- Q lcl|NC_019501. 229 VTV-SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGF-KRGDKISFTGVKFLSQMAKNVLTDDATFSITRVI---- 302 (431) Q Consensus 229 ~tv-~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~l-kaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~---- 302 (431) +.. +|.. ...++- +--...+..++.-.+ |--.+-.|+= .+++.-. ++-+|+.--|+ T Consensus 224 ~~f~~G~~--------~~t~ak----~INfiiv~~~A~ia~~K~~~~~if~P----~~~~~g~--~l~~~R~Y~D~fv~~ 285 (346) T protein:vir:10 224 YDFSDGSK--------IIDTAK----QIEMFLIYNGVQIAPEKYSFVGFDQP----SAATSGN--YLYYEQSYDDVLLLN 285 (346) T ss_pred hhhccCcc--------ccCCcc----ceeEEEECCceeeeeeeeeeeEeeCC----CCCcccc--eeeeeeeeeeeeeec Confidence 110 1110 000000 000111111111111 2222222221 1111111 12222221111 Q ss_pred -cCcee--EEeecccccccccccccccccceeecccccCceeE----EeccCCc--------ceeeeecc Q lcl|NC_019501. 303 -DSTHI--EITPKPIALDDASLTKEEKAYANVNTSLADNTPVN----VLNVATT--------TANVFWAD 357 (431) Q Consensus 303 -~a~ti--~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavT----v~g~~s~--------~~Nl~fhr 357 (431) ....| .+..+.-..-++. +. .+.|.++--|- |+-+++- ..=|+.-+ T Consensus 286 nk~~~Iyv~~~~a~~~~~~~~-~~--------~~kpt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (346) T protein:vir:10 286 TKTKGIQFVVSDKPKKDQEQS-GQ--------DAKPTAESTLEEIKAYLDKNHIDYTGKTKKDELLALVK 346 (346) T ss_pred cccceEEEeeecccccCccCc-cc--------ccCcccccchHHHHHHhcccccccccccchhhHHhhcC Confidence 11122 2222211110000 00 11122211111 1111110 01122222 No 58 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=89.09 E-value=0.028 Score=29.07 Aligned_cols=311 Identities=15% Similarity=0.145 Sum_probs=125.2 Q ss_pred Cccccccchh-HHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccc-----ccccccCc------- Q lcl|NC_019501. 1 MALNEGQLVT-YALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQT-----GWNLTGNA------- 67 (431) Q Consensus 1 ~~~~~~~~lt-~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~-----g~~~s~~~------- 67 (431) -+-.-.+.=| |+.+..|+..+..+++-++.. .+|- -.+.|.||..|....+.... |-+..++. T Consensus 16 ~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~-~~pi---Pkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y 91 (401) T protein:vir:95 16 DGANSDQMQTFFWLKKAIITARKEQYFMPLAS-VTNM---PKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLY 91 (401) T ss_pred cccccceeeehhhHHHHHhhhhhhhhhhhccc-cccc---ccccCCeEEEEecccccccccchhcCCCcccccccCcccc Confidence 1222233444 345788888777777666652 2222 24679999987665554322 22222110 Q ss_pred ---cc----------cccceeEEEec-----------cccccceEeeHHHh--hHHHHHHhhhhHHHHHHHHHH-----H Q lcl|NC_019501. 68 ---TG----------ILELSVKCNMG-----------DPDNDFFELRADDL--RDERSYRRRIQASAKKLANNI-----E 116 (431) Q Consensus 68 ---~d----------i~e~~V~v~ld-----------~~k~v~f~lt~keL--~i~~~s~~~L~~Am~~LAn~I-----d 116 (431) -| +.|..-.|+.- +|.+...+||+.-+ .++.-..+.|..=|..=++.| - T Consensus 92 ~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~ 171 (401) T protein:vir:95 92 GSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQ 171 (401) T ss_pred ccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHH Confidence 01 11111111111 24444555554322 223322332222233333333 4 Q ss_pred HHHHH-----hhccccceeeccCCCCCCccchhhhhhHHHHHHHHHhhcCCc----------cCCc------EEE----e Q lcl|NC_019501. 117 SAIAK-----QATEMGSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRELNR----------DMGI------SYF----L 171 (431) Q Consensus 117 ~dl~~-----~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~----------~~~r------~~v----~ 171 (431) +||+. .|........+.++....++..+ ..++-.+.++|+++-+|+ -+.+ -++ | T Consensus 172 ~dll~ag~~viyAg~ats~At~~~~~~~~t~vt-~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L 250 (401) T protein:vir:95 172 KDLLAAAGTVLYAGAATSDATITGEGSTPSVVS-YKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSEL 250 (401) T ss_pred HHHHhhcCeeecCCccceeeeccccccccceec-hhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCc Confidence 44552 22222222333233333333333 578899999999999997 1111 111 1 Q ss_pred ChHHH--hhhhh--hhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecc Q lcl|NC_019501. 172 NPDDY--RKAGR--NLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTD 247 (431) Q Consensus 172 np~~~--a~~~~--~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~ 247 (431) .|+.+ +++-+ .+...... +..+.+-+|+||. +.+|+++.. ...-+.....+ ++ T Consensus 251 ~~di~a~~D~~~~~~fi~v~kY--a~~~~i~~gEiG~-i~~vR~i~~-p~~~~w~~ag~--------~a----------- 307 (401) T protein:vir:95 251 VPELKAMKDLFGNKAFIETQHY--ADAGTIMNGEVGS-IDKFRIIQV-PEMLHWAGAGA--------QA----------- 307 (401) T ss_pred hhHHHHHHHhcCCCCceehhhc--CCccccccccccc-cCceeEEec-ccceeecCCcc--------cc----------- Confidence 12222 11111 11110000 0112344455543 444443322 11111110000 00 Q ss_pred cccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEeecccccccccccccccc Q lcl|NC_019501. 248 GNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKA 327 (431) Q Consensus 248 g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~ 327 (431) .+...-|.+.-+..++...+||. T Consensus 308 ------------------------------------------~~~~~~y~~~~~~~gg~~dVyp~--------------- 330 (401) T protein:vir:95 308 ------------------------------------------TGANPGYRTSMVSGQEHYDVYPM--------------- 330 (401) T ss_pred ------------------------------------------cccccccccccccCCCcceeeee--------------- Confidence 00000000011112233334444 Q ss_pred cceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCccee-eEEEeecCcceEEEEEEecccccccceEE Q lcl|NC_019501. 328 YANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGM-KTSSFSIPGIGVNGIFATQGDINTLSGKC 406 (431) Q Consensus 328 y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~-~~~t~~~~g~glslrv~~~yd~~~~~~~~ 406 (431) |.+-++|| ++.||+ |++... .......||.|. ....|.=.+.... T Consensus 331 -------------------------lV~G~dAf--~~~~l~---g~g~~~~~~~ivk~pG~~~----ad~~DPlgQ~g~v 376 (401) T protein:vir:95 331 -------------------------LVVGDDSF--TSIGFQ---TDGKSLKFTVMTKMPGKET----ADRNDPYGETGFS 376 (401) T ss_pred -------------------------eEEccccc--eecccc---cCCccccceeEeecCCcCC----CCCCCcccceehh Confidence 34444444 355553 222221 112223333221 1113444455556 Q ss_pred EEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 407 RIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 407 r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) .+=.+||...++||+- ++|----- T Consensus 377 gwK~~~a~~vL~~e~m-~~ies~a~ 400 (401) T protein:vir:95 377 SIKWYYGILVKRPERL-ALIKTVAP 400 (401) T ss_pred hhhhhhhhheecccee-EEEEeecC Confidence 6788999999999995 55521111 No 59 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=83.64 E-value=0.067 Score=27.02 Aligned_cols=277 Identities=14% Similarity=0.117 Sum_probs=99.7 Q ss_pred Cccc----ccc-ch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC-----ccc Q lcl|NC_019501. 1 MALN----EGQ-LV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN-----ATG 69 (431) Q Consensus 1 ~~~~----~~~-~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~-----~~d 69 (431) +|+. ++- ++ +.+..+||+.++...++.+++++.. .+..+.+|+............... .++ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~--------~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~ 212 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVK--------TKENIKYPVLVKKAEAQGHKNERTNNEMPETD 212 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceec--------cCCceEEEEEecCCcccceecccccccccccc Confidence 2221 122 22 3334679999999999988884322 123466766533332222111111 112 Q ss_pred cccceeEEEeccccccceEeeHHHh-hHHHH-HHhhhhH-HHHHHHHHHHHHHHH-hh-ccccceeeccCCCCCCccchh Q lcl|NC_019501. 70 ILELSVKCNMGDPDNDFFELRADDL-RDERS-YRRRIQA-SAKKLANNIESAIAK-QA-TEMGSLVVHDTRAIGPSTGLS 144 (431) Q Consensus 70 i~e~~V~v~ld~~k~v~f~lt~keL-~i~~~-s~~~L~~-Am~~LAn~Id~dl~~-~~-~~~~~~v~t~~~t~~~~~~~~ 144 (431) ..=..+.++..+-.. .+.+ ++|| .+..+ .+.+|+. -..+|+..+|..++. -. ...+..+.+..+.+....... T Consensus 213 ~~f~~v~~~~~k~~~-~~~i-S~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~ 290 (434) T protein:vir:62 213 IEFDEIELSPTEFDA-LATV-TKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKN 290 (434) T ss_pred cceeeEEeeheeeEe-ehhh-HHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccc Confidence 111223333222111 1222 4554 33332 4666754 456889999998882 00 011112222223333333334 Q ss_pred hhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccc Q lcl|NC_019501. 145 GWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKS 224 (431) Q Consensus 145 ~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~g 224 (431) .|.++......|.....+ +-..++||.+.+.+.+ ++++. ||++ |. .+. ... T Consensus 291 ~~d~l~~l~~~l~~~~~~---~a~~v~n~~~~~~L~~---------------lkd~~-G~~l----~~-~~~-----~~~ 341 (434) T protein:vir:62 291 LYDALVKMKNTPVKEVRK---KARWVLNTAALTKIET---------------MKTDD-GFPL----LR-PFN-----QAE 341 (434) T ss_pred hhhHHHHHHhhcchhhhc---CCEEEEcHHHHHHHHH---------------hhccC-CCEe----ec-cCC-----Ccc Confidence 566666544444332222 2235789988866522 12221 3433 10 000 011 Q ss_pred ccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc-ceeeeccccccccCCCceEEEEEeec Q lcl|NC_019501. 225 TATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT-GVKFLSQMAKNVLTDDATFSITRVID 303 (431) Q Consensus 225 t~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia-GV~~vn~~tk~~~~~lq~fvVta~~~ 303 (431) .+...++.|-+- .++ +.. ....+|+...|. | +..+|++..... T Consensus 342 ~g~~~tl~G~pV------~~~-~~~----------------~~~~~~~~~~i~~G-------------dfs~~~i~~~~g 385 (434) T protein:vir:62 342 GGIGYTLLGFPV------EEE-DAI----------------DIPDSPDTPVFYFG-------------DFSKFYIQDVIG 385 (434) T ss_pred CCCCceecceee------EEe-cCc----------------cCccCCCceEEEEe-------------eccceEEEEeec Confidence 122233444331 000 000 000122222221 2 333343322111 Q ss_pred CceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEee Q lcl|NC_019501. 304 STHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFS 383 (431) Q Consensus 304 a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~ 383 (431) .-++...... | +....|.+ .+|+|...-|..+|.+. ++...... T Consensus 386 ~~~i~~~~~~--------------~-------~~~~~v~~---------~~~~r~Dgk~i~~~~~~------~~~~~~~~ 429 (434) T protein:vir:62 386 SLEVQKLVEL--------------F-------SRTNRVGF---------RIWNLLDAQLIHSPFEV------PVYKYVLK 429 (434) T ss_pred eeEEEeehhh--------------h-------cccCceEE---------EEEeeecceeecCcccc------eEEEEEec Confidence 2222221110 0 11112222 22222222222222211 01000000 Q ss_pred cCcce Q lcl|NC_019501. 384 IPGIG 388 (431) Q Consensus 384 ~~g~g 388 (431) .+.-| T Consensus 430 ~~~~~ 434 (434) T protein:vir:62 430 APTGA 434 (434) T ss_pred cCCCC Confidence 00001 No 60 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=83.26 E-value=0.07 Score=26.92 Aligned_cols=286 Identities=12% Similarity=0.075 Sum_probs=114.5 Q ss_pred CccccccchhHHHHHHHHHHHhhcc-----c--chhhccCCC-chHHHhhcccEEEeecccccccccccccccCcccccc Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTP-----M--ASKVTKYTP-PAESMQRSSNTVWMPVEQEAPTQTGWNLTGNATGILE 72 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lv-----m--a~~V~~~r~-y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~~di~e 72 (431) ||.+-=+|...+.-|+...+-.+.. | ++.+. -.+ ........|++|+||.= +.-+| .++++.| T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~-~~~~i~~~~~~~G~~i~~P~~---~~l~G-----~~~~~~d 71 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAV-SDERVSKNITSGGLLVNMPFW---NDLTG-----DSEVLGN 71 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhccccc-ccHHHHHHhhcCCCEEEeccc---ccCCC-----cccccCC Confidence 9975444443333455544211111 1 22220 011 23334457999999963 22233 2222222 Q ss_pred ceeEE-----Eeccccccc----eEeeHHHhhHHHHHHhhhhHHHHHHH----HHHHHHHHHhhccccceeecc------ Q lcl|NC_019501. 73 LSVKC-----NMGDPDNDF----FELRADDLRDERSYRRRIQASAKKLA----NNIESAIAKQATEMGSLVVHD------ 133 (431) Q Consensus 73 ~~V~v-----~ld~~k~v~----f~lt~keL~i~~~s~~~L~~Am~~LA----n~Id~dl~~~~~~~~~~v~t~------ 133 (431) +...| +-.++..+- ..|...||...-...+.+....++|+ .+.+.+|++.++.+.+..... T Consensus 72 g~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~ 151 (330) T protein:vir:10 72 GDKALETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALE 151 (330) T ss_pred CccccchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhh Confidence 21111 111122111 22445555433344444444444443 455666676666543321000 Q ss_pred ----CCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhh Q lcl|NC_019501. 134 ----TRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGF 209 (431) Q Consensus 134 ----~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gf 209 (431) ...+...... ....+..|.+.|.++. +....++++|..+..|..+-. ..+... ++ -.+.|+. +.|. T Consensus 152 ~~~~~~~~~~~a~~-s~~~l~~A~~~~GD~~---~~~~~ivmhS~v~~~L~~~~l-i~~~~~--s~--~~~~i~~-~~G~ 221 (330) T protein:vir:10 152 ETHVSDQSKASTGI-DAGMVLDAKQLLGDSA---DQVTAIAMHSAVYTKLQKDNL-IQYIQP--TT--ATINIPT-YLGY 221 (330) T ss_pred hhheeccccccccc-CHHHHHHHHHHhcccc---ccceEEEEcHHHHHHHHHhhh-hhhhcc--cc--cCccccc-ccce Confidence 0001111111 2356777888887764 345667799999999876421 122111 11 1467876 7788 Q ss_pred hhhHhcCCCcccccccccccee-cccccccceeeeeecccccccccceeeEEEeeccce-eeeccEEEEcceeeeccccc Q lcl|NC_019501. 210 DEILRSPKLPAVTKSTATGVTV-SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTG-FKRGDKISFTGVKFLSQMAK 287 (431) Q Consensus 210 d~~~~s~~v~~~t~gt~~~~tv-~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~-lkaGDv~TiaGV~~vn~~tk 287 (431) . +..+..+|.. .+..+++.+ .||-. +.....+...++.. .-- ++.=|.+...=-|.+||.-. T Consensus 222 ~-VivdD~~p~~-~~~yt~yl~~~GAi~-----~~~~~~~~~v~~Et---------dRd~~~g~~~l~~r~~~~~hp~G~ 285 (330) T protein:vir:10 222 R-VIIDDGIAPT-GDIYTSYLFRTGSIG-----LNTGNPSGLTTFET---------SREAAKGNDMIYTRRALVMHPYGV 285 (330) T ss_pred E-EEEeCCCCCC-CCceeEEEEecCcee-----eecccCCccccccc---------cCCccccceEEEEeeEEEeeeeee Confidence 6 7788888853 222333322 22211 11000000000000 000 11113333333344444221 Q ss_pred cccCC-------CceEEEEEeecCc--eeEEeeccccc--ccccccc Q lcl|NC_019501. 288 NVLTD-------DATFSITRVIDST--HIEITPKPIAL--DDASLTK 323 (431) Q Consensus 288 ~~~~~-------lq~fvVta~~~a~--ti~I~Paii~~--~~~t~~~ 323 (431) .-... ..+.. +-.+++ +..+.|+-||. ...-++. T Consensus 286 s~~~~~~~~~~~sPt~~--~L~~~~NW~~v~~~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 286 KWTGAEVDAGNITPSNA--DLAKFKNWKRVYEPKNIGIIALKHKIGK 330 (330) T ss_pred eecccccccCcCCcChH--HhcCCcCcccccChhhcceEEEEEecCC Confidence 11100 00000 001111 24445554443 2333343 No 61 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=83.22 E-value=0.07 Score=26.91 Aligned_cols=286 Identities=10% Similarity=0.012 Sum_probs=115.2 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccc-------ccccccccccccCccccccc Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQ-------EAPTQTGWNLTGNATGILEL 73 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~-------~~~~~~g~~~s~~~~di~e~ 73 (431) ||+.. +..+.+-.++.+.+...++.+.|. -.+..-+ +.=|.+|.||.=+ -.+..+ ++..-+..++.-. T Consensus 1 Mantl-~ya~~~~~~Ld~~~~~~~~t~~l~--~~~~~v~-~~Gak~vkIp~is~~~~~TsGl~dy~-R~~g~~~g~v~~~ 75 (302) T protein:vir:78 1 MANSL-ALAQIYQDNIDKAIAVNSKSAFLE--ANPNNVQ-YNGGNTIKIADISFGSGTTGDLKAYN-RSTGFTQGSVTLA 75 (302) T ss_pred CCchh-HHHHHHHHHHHHHHHhhhceeecc--cCCceEE-EecCcEEEEEEEEeeccccccccccc-cccCccccceeee Confidence 99554 333444445555566667666664 2222222 2337899988653 111111 1111112234344 Q ss_pred eeEEEeccccccceEeeHHHhh-------HHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccCC-CCCCccchhh Q lcl|NC_019501. 74 SVKCNMGDPDNDFFELRADDLR-------DERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDTR-AIGPSTGLSG 145 (431) Q Consensus 74 ~V~v~ld~~k~v~f~lt~keL~-------i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~~-t~~~~~~~~~ 145 (431) .-+.+|++.+...|.+...|.+ +...+.+|.+ .+.+=+||+.-.+....-+...++... +....+..+. T Consensus 76 ~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r---~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nv 152 (302) T protein:vir:78 76 WSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQR---TKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQAL 152 (302) T ss_pred eeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHH---hhhcchhhHHHHHHHHHhhhccCccccccccchhHHHH Confidence 5558888888877877743322 2222222222 233345666644322221211222111 2222445566 Q ss_pred hhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccc Q lcl|NC_019501. 146 WDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKST 225 (431) Q Consensus 146 ~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt 225 (431) +..+..+...|++. ++|-++.+|.-+..+-.. ... ....+.+.+.+|.|.|.+..+|.+ ..+.+...=. T Consensus 153 l~~i~~~~~~~~e~-----~~~vl~vtp~~~~~Lk~a-~~~--~~~~~~~~~~~~~i~~~V~~lDgv---~Ii~VPs~r~ 221 (302) T protein:vir:78 153 MGDIATAMELVDDS-----NQLILVTSPTTLAGLLNT-ALI--RESKNTQVLRRGEVDTKITFIQDV---EVLQVPSEYL 221 (302) T ss_pred HHHHHHHHHHhhcc-----CCeEEEEChHHHHHHhcc-hhh--ccceeccccccccccceeeeeccc---EEEEchhhhc Confidence 77778888888885 468889999888776332 111 111122344566677777766642 2222211111 Q ss_pred cccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccc-cCCCceEEEEEeecC Q lcl|NC_019501. 226 ATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNV-LTDDATFSITRVIDS 304 (431) Q Consensus 226 ~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~-~~~lq~fvVta~~~a 304 (431) -+.+.-. .|+.....+ .+--...+..++.-.+.|=|.+-| ..|-+-+. .+++-+|+.--|+ T Consensus 222 ~t~~~f~-------~G~~~~~~a----k~INfiiv~~~a~ia~~K~~~~~i-----f~P~~~~~gd~~l~~~R~Y~D~-- 283 (302) T protein:vir:78 222 YDKVAPK-------VGVPDYTGA----KKIPYMIFKRDAPTGIVKTDKVRV-----FEPDTNQSADAYKVDLRLYHDL-- 283 (302) T ss_pred ccceecc-------CCccccCCc----cceeEEEECCCeeeeeeeeeeeEe-----eCCCCCCCcceeeeeeeeEeee-- Confidence 1111100 000000000 000011111111111112121111 11222111 1122222221111 Q ss_pred ceeEEeecccccccccccccccccceeecccc Q lcl|NC_019501. 305 THIEITPKPIALDDASLTKEEKAYANVNTSLA 336 (431) Q Consensus 305 ~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A 336 (431) -|.. ....+-|.|+++..| T Consensus 284 ---fV~~----------nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 284 ---IVPK----------NQRPGIIKASFGTIA 302 (302) T ss_pred ---eeec----------cccCeEEEeeccccC Confidence 0000 001223444455554 No 62 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=78.12 E-value=0.12 Score=25.68 Aligned_cols=255 Identities=12% Similarity=0.054 Sum_probs=101.9 Q ss_pred Cccccccch--hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccc-cCcccc---ccce Q lcl|NC_019501. 1 MALNEGQLV--TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLT-GNATGI---LELS 74 (431) Q Consensus 1 ~~~~~~~~l--t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s-~~~~di---~e~~ 74 (431) +....+..| +.+..++|+.++...++.++++...= .+.++.+|++.......+|... +...+. .=.. T Consensus 137 ~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 209 (400) T protein:vir:38 137 VKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQA-------STQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKP 209 (400) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEec-------cCcceEEEEEecCCCcccccccccccccccccccee Confidence 233333333 23357788888888887777732110 1346677776433222222221 111111 1122 Q ss_pred eEEEeccccccceEeeHHHh-hHHH-HHHhhhhH-HHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDL-RDER-SYRRRIQA-SAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSD 151 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL-~i~~-~s~~~L~~-Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~ 151 (431) +.++..+.. .-+.+ ++|| ++.. ..+.+|.. -..+|+..+|..++.- +.++++.+ ...|.++.. T Consensus 210 i~~~~~k~~-~~~~i-s~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~---------~~~~~~~~---~~~~~~~~~ 275 (400) T protein:vir:38 210 VNWSVETYR-QALPV-SQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATL---------LKGFTAKT---ISSVDDLKH 275 (400) T ss_pred eEeehhhee-eehhh-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhc---------cccccccc---cccHHHHHH Confidence 333322221 11222 4454 3322 25555644 3356667777666521 11222222 233555554 Q ss_pred HHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccccee Q lcl|NC_019501. 152 AERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTV 231 (431) Q Consensus 152 a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv 231 (431) +-... ++...+-.+++||.++..+.. ++++ .|+++ +. |..+.+ +..++ T Consensus 276 ~~~~~----~~~~~~a~~v~~~~~~~~l~~---------------lkd~-~G~~i------~~----~~~~~~--~~~~l 323 (400) T protein:vir:38 276 INNVD----LDPAYSRVIIASQSFYNFLDT---------------VKDG-NGRYL------LQ----DSILTP--SGKSV 323 (400) T ss_pred HHHhh----hhhhhCcEEEEcHHHHHHHHH---------------hhcc-CCCee------ee----cCcCCC--Ccccc Confidence 33222 222234567899988766521 1222 24433 11 111111 11233 Q ss_pred cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEEee Q lcl|NC_019501. 232 SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEITP 311 (431) Q Consensus 232 ~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I~P 311 (431) .|-+- ... +......+||.+.|-| +.++|+...+-..-++..+. T Consensus 324 ~G~pv-----~~~------------------~~~~~~~~g~~~~~~g-------------d~s~~~~~~~~~~~~~~~~~ 367 (400) T protein:vir:38 324 LGMPI-----AVV------------------SDDTLGAAGEAHAFLG-------------DIKRAILFANRADFMVRWVD 367 (400) T ss_pred cccee-----EEe------------------cccccCCCCceEEEEE-------------eccccEEEEeecceEEEEec Confidence 33331 000 0011122566655544 44555444433333444332 Q ss_pred cccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccC Q lcl|NC_019501. 312 KPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPV 369 (431) Q Consensus 312 aii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~ 369 (431) -.. . ...-+.|. -++ =-..|++||+..+...+. T Consensus 368 ~~~--~----~~~~~~~~-------------r~d------~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 368 DQI--Y----GQFLQAGM-------------RFG------VSVADEKAGYFLTYTPKA 400 (400) T ss_pred ccc--c----ceeEEEEE-------------Eec------cEEecccceEEEEeecCC Confidence 100 0 00001111 111 134566666666554433 No 63 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=74.68 E-value=0.16 Score=25.02 Aligned_cols=267 Identities=13% Similarity=0.035 Sum_probs=107.0 Q ss_pred Cccccccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC---ccccccceeE Q lcl|NC_019501. 1 MALNEGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN---ATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~---~~di~e~~V~ 76 (431) ...+.+.++ +....++|+.++...++.+++....- .|.++.+|+.........|...+. ..+..=..+. T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~ 189 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTT-------ESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELEN 189 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhccceec-------CCCceEEEEEecCCCceeeecCCccccccccceeEEE Confidence 223333333 33458899999999999998843321 245677776543322222322221 1122223333 Q ss_pred EEeccccccceEeeHHHhhHHHHHHhhhhHHH-HHHHHHHHHHHHHh---hcc---ccceeec-cCCCCCCccchhhhhh Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLRDERSYRRRIQASA-KKLANNIESAIAKQ---ATE---MGSLVVH-DTRAIGPSTGLSGWDF 148 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~i~~~s~~~L~~Am-~~LAn~Id~dl~~~---~~~---~~~~v~t-~~~t~~~~~~~~~~~d 148 (431) ++..+... .+.+|..-|.+..+.+.+|...+ .+++..+|..++.- ... +-+..+. .............+.+ T Consensus 190 ~~~~k~~~-~~~is~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~ 268 (395) T protein:vir:43 190 APVRTIAH-LFKASRQILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDR 268 (395) T ss_pred EeeeeEEE-eehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHH Confidence 33333221 23444433445556778886644 58899999988831 011 0000000 0011111122234555 Q ss_pred HHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccc Q lcl|NC_019501. 149 VSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATG 228 (431) Q Consensus 149 ~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~ 228 (431) +..+-..|.....+. -.+++||.++..+.. ++++ -||.+ +.+ + ..+ +. T Consensus 269 i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~---------------lkd~-~G~~i------~~~---~--~~~--~~ 316 (395) T protein:vir:43 269 IRLAILQAQLAEFPA---SGIVLNPIDWALIEL---------------NKDA-ENRYI------IGS---P--QNG--TT 316 (395) T ss_pred HHHHHHhhccccCCC---cEEEEcHHHHHHHHH---------------hhcc-CCcee------ccc---c--ccC--CC Confidence 555555555444332 358899988766521 1122 13422 110 0 001 11 Q ss_pred ceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeE Q lcl|NC_019501. 229 VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIE 308 (431) Q Consensus 229 ~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~ 308 (431) .++.|-+- + .+..+.+|+++- | +..++....+-.+-+|. T Consensus 317 ~~l~G~pV-------v-------------------~~~~~~~~~~~~--g-------------d~~~~~~~~~~~~~~i~ 355 (395) T protein:vir:43 317 PTLWRLPV-------V-------------------ETQAITQDEFLT--G-------------AFSLGAQIFDRMDIEVL 355 (395) T ss_pred ceecceee-------E-------------------EcCCCCCCcEEE--E-------------eccceEEEEEecceEEE Confidence 12222210 0 011122333221 1 22222222222233344 Q ss_pred EeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeeccc Q lcl|NC_019501. 309 ITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIP 368 (431) Q Consensus 309 I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~ 368 (431) +.+-.-. .++ -|-.++.+..--. =-..+++||+..+.+-+ T Consensus 356 ~~~~~~~-----------~f~------~~~~~~r~~~r~d---~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 356 VSTENDK-----------DFE------NNMVTIRAEERLA---FAVYRPEAFVTGSLTAS 395 (395) T ss_pred Eeccccc-----------hhh------cCcEEEEEEEeec---cEEecccceEEEEeccC Confidence 4321000 000 0101111100000 12345566655555544 No 64 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=74.46 E-value=0.16 Score=24.98 Aligned_cols=258 Identities=13% Similarity=0.054 Sum_probs=96.5 Q ss_pred CccccccchhHH------HHHHHHH-------HHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccCc Q lcl|NC_019501. 1 MALNEGQLVTYA------LDEIIET-------VQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGNA 67 (431) Q Consensus 1 ~~~~~~~~lt~~------~~evi~~-------len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~~ 67 (431) ||-. +|-+.. -=+.+.+ |...|-+=| ..|.+. |+||.+|+= .+.-.++-...+.. T Consensus 1 mAe~--nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r----~~p~a~-----G~tIt~pK~-~~tgda~dVaEGe~ 68 (295) T protein:vir:99 1 MAEK--NLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTR----RETLTN-----DLKIQTYKW-EVTLDQTDPGEGET 68 (295) T ss_pred CCCc--ccccHhhccCceeehhhHHhhhhHHHHHHHhcccc----cccccc-----CCeEEeeee-eeecccccccCCcc Confidence 7752 222110 0122222 333332222 223333 999999983 33222221111111 Q ss_pred c--cccc----ceeEEEeccccccceEeeHHHh--h-----HHHHHHhhhhHHHHHHHHHHHHHHHHhhccccceeeccC Q lcl|NC_019501. 68 T--GILE----LSVKCNMGDPDNDFFELRADDL--R-----DERSYRRRIQASAKKLANNIESAIAKQATEMGSLVVHDT 134 (431) Q Consensus 68 ~--di~e----~~V~v~ld~~k~v~f~lt~keL--~-----i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~~v~t~~ 134 (431) = +-+. ....+++.++.. ..|+..+ + +.+.-+ +=.+.|+++|+.|++...+.-..-+ T Consensus 69 Iplskvt~~~~~t~t~kikK~rK---~tTdEAIqlsGygdpvgead~----qL~~~ia~kId~D~~~~lktat~t~---- 137 (295) T protein:vir:99 69 IPLSKVTRTKDKDYTVKWFKKRR---ATTAEAIARHGAARAITEADK----RIMRELQNGIKDAFFTFLKTKPTKV---- 137 (295) T ss_pred cchhhheeeeeeeeEEEeeeecc---cccHHHHHhcCCCchhHHHHH----HHHHHHHHhhhHHHHHHhccCceee---- Confidence 0 1111 123455555433 2255543 1 222222 3357899999999998776422211 Q ss_pred CCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHh Q lcl|NC_019501. 135 RAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILR 214 (431) Q Consensus 135 ~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~ 214 (431) +.......++.+...|+...--.+.+.-+|.||.|.+.+.++......... .+=--+|. +|.|++.+.+ T Consensus 138 ------tg~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~----~fG~~~L~-nfLG~q~II~ 206 (295) T protein:vir:99 138 ------KGVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASN----VFGMTLLK-NFLGMQNVIV 206 (295) T ss_pred ------ehhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhh----hhhhhhhh-hhhccceEEE Confidence 111223345555555544322223345677999999997666543211111 12112222 4778866788 Q ss_pred cCCCccccc-cccccceecccccccceeeeeecccccccccceeeEEEeecccee----------eeccEEEEcceeeec Q lcl|NC_019501. 215 SPKLPAVTK-STATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGF----------KRGDKISFTGVKFLS 283 (431) Q Consensus 215 s~~v~~~t~-gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~l----------kaGDv~TiaGV~~vn 283 (431) +..+|.-+. .|+.- -++.| | ++..+...+.-+ ..+...||-+ .-.|-+.+.|+. T Consensus 207 S~kv~~G~~~aT~~~-Ni~~a-------y-~~~~~g~l~~~f---~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~--- 271 (295) T protein:vir:99 207 MPSVPEGKIYSTAVE-NLVFA-------S-LNVKGGDLGGLF---ADFTDETGLIAAARNRQLSNLTYESVFFGANV--- 271 (295) T ss_pred cccCCCceEEEeecc-ceEEE-------E-ecCCchhhhhhh---hhccCcccceEEEeccccceeeehhhhHhHHH--- Confidence 888885332 11110 01111 0 000100000000 0000101100 001111111110 Q ss_pred cccccccCCCceEEEEEeecCceeEEeeccccc Q lcl|NC_019501. 284 QMAKNVLTDDATFSITRVIDSTHIEITPKPIAL 316 (431) Q Consensus 284 ~~tk~~~~~lq~fvVta~~~a~ti~I~Paii~~ 316 (431) -.+.-...+|...-.+.. .|.+ -. T Consensus 272 -----lfpE~~dgiv~~tI~~~~---~~~~-~~ 295 (295) T protein:vir:99 272 -----LFAEIPEGVVEATIEAAA---VPGI-GG 295 (295) T ss_pred -----hcccccceEEEEEEecCc---CCCC-CC Confidence 001111233333221100 0000 00 No 65 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=65.37 E-value=0.28 Score=23.58 Aligned_cols=270 Identities=13% Similarity=0.063 Sum_probs=109.0 Q ss_pred Cc----ccc-ccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC---cccccc Q lcl|NC_019501. 1 MA----LNE-GQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN---ATGILE 72 (431) Q Consensus 1 ~~----~~~-~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~---~~di~e 72 (431) |+ -.- +-+-+....++|+.++...++.++++.. +- .+.+++||+-..... ..|...+. .++..= T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~-~~------~~~~~~ip~~~~~~~-a~~v~Eg~~~~~~~~~f 85 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-PM------GTTGQKIPHWVGDVS-AQWIGEGDMKPITKGNM 85 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee-ec------cCCceEEEEEeCCcc-eEEecCCccccccccce Confidence 11 111 1133444589999999999998888422 21 245677775433221 12322221 112212 Q ss_pred ceeEEEeccccccceEeeHHHh-hH-HHHHHhhhh-HHHHHHHHHHHHHHHHhh-ccccceeecc----CCCCCCccchh Q lcl|NC_019501. 73 LSVKCNMGDPDNDFFELRADDL-RD-ERSYRRRIQ-ASAKKLANNIESAIAKQA-TEMGSLVVHD----TRAIGPSTGLS 144 (431) Q Consensus 73 ~~V~v~ld~~k~v~f~lt~keL-~i-~~~s~~~L~-~Am~~LAn~Id~dl~~~~-~~~~~~v~t~----~~t~~~~~~~~ 144 (431) .++.++..+-. ..+.+ ++|| ++ ....+++|+ .-.++++.++|..++.=- ...+...... ..+........ T Consensus 86 ~~i~~~~~k~~-~~~~i-S~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (318) T protein:vir:24 86 TSQTIAPHKIA-TIFVA-SAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTV 163 (318) T ss_pred eEEEEeeEEEE-Eeehh-hHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccch Confidence 22222222211 12333 4454 43 233566664 455789999999998311 0000000000 00001111111 Q ss_pred hhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccc Q lcl|NC_019501. 145 GWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKS 224 (431) Q Consensus 145 ~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~g 224 (431) ...++..+...+.....+ .-.+++||..+..+.. ++++ -||.+ +. +..+.+ T Consensus 164 ~~~~~~~~~~~~~~~~~~---~~~~v~n~~~~~~L~~---------------lkd~-~G~~l------~~----~~~~~~ 214 (318) T protein:vir:24 164 YDQVAVNGLSLLVNDGKK---WTHTLLDDITEPILNG---------------AKDQ-NGRPL------FI----ESTYGE 214 (318) T ss_pred HHHHHHHHHHhhccccCC---CCEEEEcHHHHHHHHH---------------hhcc-CCcee------ec----CccccC Confidence 112223333333322221 2347899988766521 2222 14433 11 000000 Q ss_pred ccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecC Q lcl|NC_019501. 225 TATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDS 304 (431) Q Consensus 225 t~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a 304 (431) ... ++.-+ ++-|+ T Consensus 215 ~~~---------------------------------------~~~~~---~i~g~------------------------- 227 (318) T protein:vir:24 215 AAS---------------------------------------PFRSG---RIVAR------------------------- 227 (318) T ss_pred ccc---------------------------------------cccCc---eEEEE------------------------- Confidence 000 00000 11110 Q ss_pred ceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEeec Q lcl|NC_019501. 305 THIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSI 384 (431) Q Consensus 305 ~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~~ 384 (431) .+.+++.+ +.+..+-++|--+ -+.+..+ T Consensus 228 -pv~~~~~~----------------------~~~~~~~~~gdfs----------~~~~~~~------------------- 255 (318) T protein:vir:24 228 -PTILSDHV----------------------VEGTTVGFMGDFS----------QLIWGQI------------------- 255 (318) T ss_pred -eeEEeCCC----------------------CCCccEEEEeecc----------eEEEEEe------------------- Confidence 00011110 1111111222100 0111110 Q ss_pred CcceEEEEEEecc--------------cccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 385 PGIGVNGIFATQG--------------DINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 385 ~g~glslrv~~~y--------------d~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) .|+++.+.+++ .-.+++...|.-.-+|+++++|+-. +.|-+-+| T Consensus 256 --~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~-~~i~~~~a 313 (318) T protein:vir:24 256 --GGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAF-VALTNVVS 313 (318) T ss_pred --cCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccce-EEEEeecc Confidence 01111111111 1234678889999999999999986 57888888 No 66 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=60.36 E-value=0.37 Score=22.93 Aligned_cols=292 Identities=11% Similarity=0.054 Sum_probs=115.4 Q ss_pred CccccccchhHH-HHHHHHHH-----Hhhccc--chhhccCCCc---hHHH--hhcccEEEeecccccc-----cccccc Q lcl|NC_019501. 1 MALNEGQLVTYA-LDEIIETV-----QNLTPM--ASKVTKYTPP---AESM--QRSSNTVWMPVEQEAP-----TQTGWN 62 (431) Q Consensus 1 ~~~~~~~~lt~~-~~evi~~l-----en~lvm--a~~V~~~r~y---~~e~--~k~GdTv~ip~p~~~~-----~~~g~~ 62 (431) ||- +.|.-+ .-||...+ .+.+-+ +..+ .+. ..-+ +..|++|++|.-...- +.++.+ T Consensus 1 MA~---T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i---~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~ 74 (324) T protein:vir:59 1 MAY---TKISDVIVPELFNPYVINTTTQLSAFFQSGIA---ATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDD 74 (324) T ss_pred CCc---eeeeceechhHHHHHHHhhhHHHHHHhhcccc---cccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcc Confidence 992 222211 23444432 222111 2223 221 1222 3479999999664431 111111 Q ss_pred cccCccccccc-eeEEEeccccccceEeeH--HHhhHHHHHHhhhhHHHHHHHHHHHHHHHHhhccccc------eeecc Q lcl|NC_019501. 63 LTGNATGILEL-SVKCNMGDPDNDFFELRA--DDLRDERSYRRRIQASAKKLANNIESAIAKQATEMGS------LVVHD 133 (431) Q Consensus 63 ~s~~~~di~e~-~V~v~ld~~k~v~f~lt~--keL~i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~~~------~v~t~ 133 (431) .+ +..+... .+-+.+-.-|. +..++ .++.-.++.++.-++=...++++++.+|++..+.+.+ +..+- T Consensus 75 i~--~~~l~t~~~~a~i~~~~k~--~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dv 150 (324) T protein:vir:59 75 LV--PQKINAGQDKAVLILRGNA--WSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDI 150 (324) T ss_pred cc--hhhcccceeeEEEEeecCc--eeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeee Confidence 11 1111111 11122222222 33332 3334455555554555556677888888876654321 11111 Q ss_pred CCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhH Q lcl|NC_019501. 134 TRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEIL 213 (431) Q Consensus 134 ~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~ 213 (431) +++. ....+ ...+.+|.+.|.+.. +....++++|..+..|..+-. ..+... ++ ..+.|+. +.|.. +. T Consensus 151 sa~~--~~~~s-~~~l~~A~~~~GD~~---~~~~~ivmhS~v~~~L~~~~l-i~~~~~--s~--~~~~i~~-~~G~~-Vi 217 (324) T protein:vir:59 151 SGTA--DGIYS-AETFVDASYKLGDHE---SLLTAIGMHSATMASAVKQDL-IEFVKD--SQ--SGIRFPT-YMNKR-VI 217 (324) T ss_pred eccc--cceec-HHHHHHHHHHhCCcc---cCcEEEEEchHHHHHHHHhhh-hhhccc--cc--cCceeee-ecccE-EE Confidence 1111 11112 356777888887753 344567799999999876422 122111 11 1456765 77876 77 Q ss_pred hcCCCccccc-ccc---cccee-cccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeecccccc Q lcl|NC_019501. 214 RSPKLPAVTK-STA---TGVTV-SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKN 288 (431) Q Consensus 214 ~s~~v~~~t~-gt~---~~~tv-~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~ 288 (431) .++.+|.... ++. +++.+ .||- ++....... + +-.. -.-++.=|.+.-...|.+||.- T Consensus 218 vdD~~p~~~~~~~~~~y~s~l~~~GAi-----~~~~~~~~v--~-------vE~d-Rd~~~g~~~l~~r~~~~~~p~G-- 280 (324) T protein:vir:59 218 VDDSMPVETLEDGTKVFTSYLFGAGAL-----GYAEGQPEV--P-------TETA-RNALGSQDILINRKHFVLHPRG-- 280 (324) T ss_pred EeCCCCccccCCCCceEEEEEEecCeE-----EEeecCCCc--c-------eecc-cCccccceEEEEeeEEEeEeee-- Confidence 8888886432 221 12211 2221 111000000 0 0000 0001111344444444444422 Q ss_pred ccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccce Q lcl|NC_019501. 289 VLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDS 359 (431) Q Consensus 289 ~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A 359 (431) |.-+... .-.++|..-.+ ++.+.|.+.| +--+|-++- |-+.=+| T Consensus 281 -------~s~~~~~---~~~~sPt~~~L--~~~~NW~~v~--------~~k~i~i~~-------~~~~~~~ 324 (324) T protein:vir:59 281 -------VKFTENA---MAGTTPTDEEL--ANGANWQRVY--------DPKKIRIVQ-------FKHRLQA 324 (324) T ss_pred -------EEecccc---cCCCCCChhhh--cCCccccccc--------CccccceEE-------EEeeccC Confidence 1112111 11234432111 1112222211 111222211 2333344 No 67 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=57.60 E-value=0.43 Score=22.59 Aligned_cols=283 Identities=11% Similarity=0.042 Sum_probs=106.8 Q ss_pred Cccc-----------------cc-cchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccc Q lcl|NC_019501. 1 MALN-----------------EG-QLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWN 62 (431) Q Consensus 1 ~~~~-----------------~~-~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~ 62 (431) ||-. -+ .+-+...+++|+.+++..++.++++... - .|.++++|+-..... ..|. T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~-~------~~~~~~~p~~~~~~~-a~~v 72 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVP-M------GTTGQKIPHWIGDVS-AQWI 72 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceee-c------cCCceEEEEEeCCcc-eEEe Confidence 2111 12 2333345899999999999888874221 1 256677776533221 2233 Q ss_pred cccC---ccccccceeEEEecccc-ccceEeeHHHh-hH-HHHHHhhhh-HHHHHHHHHHHHHHHH-hhcc-------cc Q lcl|NC_019501. 63 LTGN---ATGILELSVKCNMGDPD-NDFFELRADDL-RD-ERSYRRRIQ-ASAKKLANNIESAIAK-QATE-------MG 127 (431) Q Consensus 63 ~s~~---~~di~e~~V~v~ld~~k-~v~f~lt~keL-~i-~~~s~~~L~-~Am~~LAn~Id~dl~~-~~~~-------~~ 127 (431) ..+. .++..=. ++++.-.| ...+.+ ++|| +. ....+.+|+ .-.++++.++|..++. .-.. .. T Consensus 73 ~E~~~~~~~~~~f~--~v~~~~~k~~~~~~i-s~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~ 149 (320) T protein:vir:10 73 GEGDMKPITKGNMT--SQNIAPHKIATIFVA-SAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTT 149 (320) T ss_pred cCCcccccccccee--EEEEeeEEEEEeehh-hHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccc Confidence 2221 1122122 23332222 112333 4554 42 233555664 4457899999999873 1110 00 Q ss_pred ceeec-cCCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccc Q lcl|NC_019501. 128 SLVVH-DTRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQI 206 (431) Q Consensus 128 ~~v~t-~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~ 206 (431) ..+.. ..+............++..+-..+.....+ .-.+++||.+...+.. ++++. |+.+ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~n~~~~~~L~~---------------lkd~~-G~~l 210 (320) T protein:vir:10 150 KSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK---WTHTLLDDIVEPILNG---------------AKDKN-GRPL 210 (320) T ss_pred ccccceecccccccccccHHHHHHHHHhhhhcccCC---CcEEEEcHHHHHHHHH---------------hhccC-Ccee Confidence 10000 000111111111111222222333332222 3457889988866621 12221 3322 Q ss_pred hhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeecccc Q lcl|NC_019501. 207 AGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMA 286 (431) Q Consensus 207 ~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~t 286 (431) ..+.... ..+. ...+.++.|-+ +-.+..+..|+.+.+-| T Consensus 211 -~~~~~~~--~~~~----~~~~~~i~g~p--------------------------v~~~~~~~~~~~~~~~g-------- 249 (320) T protein:vir:10 211 -FIESTYT--DENS----PFRAGRIVSRP--------------------------TILSDHVADGTTVGYMG-------- 249 (320) T ss_pred -ecccccc--Cccc----cccCceeeeee--------------------------eEecCCCCCCceEEEEe-------- Confidence 1000000 0000 00000000000 00000111122111111 Q ss_pred ccccCCCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeec Q lcl|NC_019501. 287 KNVLTDDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQP 366 (431) Q Consensus 287 k~~~~~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~p 366 (431) +..+|++ .....-++.+ ++. T Consensus 250 -----d~~~~~~-~~~~~~~i~~------------------------------------------------------~~~ 269 (320) T protein:vir:10 250 -----DFRNVIW-GQVGGLSFDV------------------------------------------------------TDQ 269 (320) T ss_pred -----ecceEEE-EEecCeEEEE------------------------------------------------------eec Confidence 1111111 0000001110 000 Q ss_pred ccCCCCCcceeeEEEeecCcceEEEEEEecccccccceEEEEEeeccceecccceeEEecCCCCC Q lcl|NC_019501. 367 IPVTHELFAGMKTSSFSIPGIGVNGIFATQGDINTLSGKCRIAVWYSACAVRPEAIGVGLPNQTA 431 (431) Q Consensus 367 l~~p~g~~~a~~~~t~~~~g~glslrv~~~yd~~~~~~~~r~dvlyG~~~v~PElagv~i~~q~~ 431 (431) . ... ....++ + ...- .-.+++..+|.-.-+|++.++|+.. ++|.+..| T Consensus 270 ~--------~~~--~~~~~~-~-~~~~----~f~~~~~~~r~~~~~d~~v~~~~a~-~~l~~~~a 317 (320) T protein:vir:10 270 A--------TLN--LGTPTE-P-NFVS----LWQHNLVAVRVEAEYAFHNNDKDAF-VKLTNVVT 317 (320) T ss_pred c--------eee--eccccc-c-ccch----hhhcCcEEEEEEEeeccEEecccce-EEEEeccC Confidence 0 000 000000 0 0000 1123667778888889999999985 68888888 No 68 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=57.05 E-value=0.44 Score=22.52 Aligned_cols=275 Identities=11% Similarity=0.073 Sum_probs=102.0 Q ss_pred CccccccchhHH-HHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC---ccccccceeE Q lcl|NC_019501. 1 MALNEGQLVTYA-LDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN---ATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt~~-~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~---~~di~e~~V~ 76 (431) ||++-+.++-.- .+++|+.++.+.++.++++..+ - .+..+++|+-.... ..+|...+. .++..=.++. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~-~------~~~~~~~p~~~~~~-~a~~v~Eg~~~~~~~~~f~~v~ 72 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKP-I------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceee-c------cCCceEEEEEecCc-ceEEeeCCccccccccceeEEE Confidence 999999988554 5899999999999888874222 1 12345666532221 123333222 1122112222 Q ss_pred EEeccccccceEeeHHHhh-----HHHHHHhhhh-HHHHHHHHHHHHHHHHhhcc----ccceeec-----cC--CCCCC Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLR-----DERSYRRRIQ-ASAKKLANNIESAIAKQATE----MGSLVVH-----DT--RAIGP 139 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~-----i~~~s~~~L~-~Am~~LAn~Id~dl~~~~~~----~~~~v~t-----~~--~t~~~ 139 (431) ++..+-. .-+.+ ++||. ...-.+++|+ .-.++++.++|..++.-... -....++ .. ..... T Consensus 73 l~~~k~~-~~~~i-S~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 73 MVPIKVE-YGARI-SDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred EeeeEEE-Eeeeh-hHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccc Confidence 2221111 11233 45542 1223445554 45578888999888832100 0000110 00 01111 Q ss_pred ccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccch-hhhHhhccccccchhhhhhHhcCCC Q lcl|NC_019501. 140 STGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVT-EDAYRNGTIQRQIAGFDEILRSPKL 218 (431) Q Consensus 140 ~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~-~~a~r~g~i~r~~~Gfd~~~~s~~v 218 (431) ....+.|.++..+-..|.....+. ...++||.+.+.+.+-. +..++-. ......|.-+ .+.|+- ++-+..+ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~-tl~G~P-V~~~~~v 222 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAKQK---DLQGNALFPELKWGATPD-TINGLP-VDVNKTV 222 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHHhh---ccCCCeeecCcccCCCCc-eeccee-eEEeccc Confidence 122234566776766666554442 34889998887763311 1111100 0011122222 255654 4455555 Q ss_pred ccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCc-eEE Q lcl|NC_019501. 219 PAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDA-TFS 297 (431) Q Consensus 219 ~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq-~fv 297 (431) |.-.. +.....+-|--+. ++.+ -....-.++--|-.+..|. .+.--...+ .|+ T Consensus 223 ~~~~~-~~~~~~~~Gdfs~---~~~~----------------~~~~~~~~~~~~~~~~d~~------~~~~f~~~~v~~r 276 (298) T protein:vir:94 223 SDMSL-TQRDRAIIGDFAN---GFKW----------------GYAKEVPLEVIQYGDPDNS------GLDLKGYNQVYIR 276 (298) T ss_pred ccccC-CCccEEEEeeccc---eEEE----------------EEecCceEEEeecCCCcCc------chhhhhcCcEEEE Confidence 52111 1001111111100 0000 0000001111000001110 000000011 122 Q ss_pred EEEeecCceeEEeecccccccccccccccccceeeccccc Q lcl|NC_019501. 298 ITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLAD 337 (431) Q Consensus 298 Vta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~ 337 (431) ++.-.+ --...|.- ++-++ .|+ T Consensus 277 ~~~r~~--~~~~~~~a--------------~~~l~--~~t 298 (298) T protein:vir:94 277 AELFLG--WGILDATK--------------FARVT--EAN 298 (298) T ss_pred EEEEec--cEeecccc--------------eEEEE--ecC Confidence 222111 00011110 00000 000 No 69 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=44.74 E-value=0.8 Score=21.13 Aligned_cols=287 Identities=10% Similarity=0.013 Sum_probs=104.9 Q ss_pred Cccccccch--hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC---cccccccee Q lcl|NC_019501. 1 MALNEGQLV--TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN---ATGILELSV 75 (431) Q Consensus 1 ~~~~~~~~l--t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~---~~di~e~~V 75 (431) |+...|..| +.+.+++|+.+++..++.+++++. +- .+..+++|+-.... ..+|...+. .++..=.++ T Consensus 2 at~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i-~~------~~~~~~~p~~~~~~-~a~wv~Eg~~~~~~~~~f~~v 73 (311) T protein:vir:81 2 VALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE-PQ------EFGEQQYMTLTAPP-RGEVVGEGAQKSESTATFAPV 73 (311) T ss_pred ceecCCceEcchhHHHHHHHHHHhcchhhhhccee-ec------CCCceEEEEEeCCc-eeEEeecCcccccccceeeEE Confidence 556677776 555699999999999998888432 21 22456776643222 223333222 112222233 Q ss_pred EEEeccccccceEeeHHHhh-----HHHHHHhhhh-HHHHHHHHHHHHHHHHhh-----c---cccceeeccC--CCCCC Q lcl|NC_019501. 76 KCNMGDPDNDFFELRADDLR-----DERSYRRRIQ-ASAKKLANNIESAIAKQA-----T---EMGSLVVHDT--RAIGP 139 (431) Q Consensus 76 ~v~ld~~k~v~f~lt~keL~-----i~~~s~~~L~-~Am~~LAn~Id~dl~~~~-----~---~~~~~v~t~~--~t~~~ 139 (431) .++..+-. ..+.+ ++||. .....+++|+ ...++|+.++|..++.-- . .+........ ..... T Consensus 74 ~l~~~kl~-~~~~i-S~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~ 151 (311) T protein:vir:81 74 TAIPRKVQ-VTQRF-SQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) T ss_pred EEeeEEEE-Eeehh-hHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecc Confidence 33222221 11222 45542 1223556664 455788999998887321 0 0111100000 01111 Q ss_pred ccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccch-hhhHhhccccccchhhhhhHhcCCC Q lcl|NC_019501. 140 STGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVT-EDAYRNGTIQRQIAGFDEILRSPKL 218 (431) Q Consensus 140 ~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~-~~a~r~g~i~r~~~Gfd~~~~s~~v 218 (431) ......+.++..+-..+...+.. ...+++||.+.+.+.+- .+..++-. ......+. +..+.|+- ++.+..+ T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~---~~~~vmn~~~~~~l~~l---kd~~G~~l~~~~~~~~~-~~tl~G~P-v~~~~~i 223 (311) T protein:vir:81 152 GTSATPDLAVEAAVGLVLGDNLS---PDGVALDNTFSFMLATQ---RDSQGRKLYPELGFGTD-VASFAGLN-AAVSDTV 223 (311) T ss_pred cccchHHHHHHHHHHHhhhcCCC---ceEEEEcHHHHHHHHhh---hccCCCeeecCccccCC-Cceeccee-EEecccc Confidence 11122344444444444333322 13478999998776321 00111000 00000111 11122332 2222222 Q ss_pred ccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEE Q lcl|NC_019501. 219 PAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSI 298 (431) Q Consensus 219 ~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvV 298 (431) |...... .+ .........++..-+.| +-.+|++ T Consensus 224 ~~~~~~~--------~~--------------------------~~~~~~~~~~~~~~~~g-------------Dfs~~~i 256 (311) T protein:vir:81 224 RGGPEAV--------TA--------------------------STGVYRTTNPNVKAIAG-------------DFSAFRW 256 (311) T ss_pred ccccccc--------cc--------------------------ccchhcccCCccEEEEE-------------ecccEEE Confidence 2111000 00 00000011122222222 3344444 Q ss_pred EEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcce-eeeeccceeEEEeecccC Q lcl|NC_019501. 299 TRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTA-NVFWADDSIRLLSQPIPV 369 (431) Q Consensus 299 ta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~-Nl~fhr~A~aLat~pl~~ 369 (431) .. ...-++++.+-.-+. ++++. .....|.+... .+. =-..|++||+..+..-.. T Consensus 257 ~~-~~~~~~~~~~~~~~~------------~~~~~--~~~~~v~~r~~--~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 257 GV-QVSIPLELIEFGDPD------------GLGDL--KRQNQIAIRAE--VVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred EE-eccceEEEeccCCCC------------cchhh--hhcCcEEEEEE--EEeccEeecccceEEEEeeccC Confidence 22 333455554432110 00000 00011111000 000 012333444433222111 No 70 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=37.78 E-value=1.1 Score=20.36 Aligned_cols=275 Identities=10% Similarity=0.061 Sum_probs=95.2 Q ss_pred Cccccccch--hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccccccc--------Ccccc Q lcl|NC_019501. 1 MALNEGQLV--TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTG--------NATGI 70 (431) Q Consensus 1 ~~~~~~~~l--t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~--------~~~di 70 (431) -....+.++ ++..+++|+.++...++.+++. .++... .+..+.||+-........|...+ ..+++ T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~-~~~~~~----~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~ 234 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCP-TEPLPG----GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDL 234 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhc-eeeecC----CcceeEEEEEecCcceeeeeccCccccccccccccc Confidence 011123333 2345889999999999988873 222111 13456666532222111122211 11122 Q ss_pred ccceeEEEeccccccceEeeHHHh-hHHH-HHHhhhhHHH-HHHHHHHHHHHHH---hhccccceeeccCCC--CCCccc Q lcl|NC_019501. 71 LELSVKCNMGDPDNDFFELRADDL-RDER-SYRRRIQASA-KKLANNIESAIAK---QATEMGSLVVHDTRA--IGPSTG 142 (431) Q Consensus 71 ~e~~V~v~ld~~k~v~f~lt~keL-~i~~-~s~~~L~~Am-~~LAn~Id~dl~~---~~~~~~~~v~t~~~t--~~~~~~ 142 (431) .-..+.++..+..+ .+.+ +++| ++.. ..+.+|+..+ .+|+..+|..++. .....-+ +.+..+. ...... T Consensus 235 ~f~~i~~~~~k~~~-~~~i-S~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~G-i~~~~~~~~~~~~~~ 311 (477) T protein:vir:84 235 TDGFVQANVKTIAG-QQGI-AIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVG-VRATAGITQVTATSA 311 (477) T ss_pred ceeeEEEeeeeEEe-eeHH-HHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccce-eeecccccccccccc Confidence 22333333322221 1222 5554 4423 3677776544 6899999998882 1111111 1111110 000000 Q ss_pred hhhhhhHHHH-HHHHHhh-cCCcc---CCcEEEeChHHHhhhhhh--hhhhhccccc----h-----hhhHhhccccccc Q lcl|NC_019501. 143 LSGWDFVSDA-ERLMFSR-ELNRD---MGISYFLNPDDYRKAGRN--LVDGDIFGRV----T-----EDAYRNGTIQRQI 206 (431) Q Consensus 143 ~~~~~d~a~a-~~~L~~~-~vP~~---~~r~~v~np~~~a~~~~~--~~~~~~~~~~----~-----~~a~r~g~i~r~~ 206 (431) ...|.+.... ..+++.+ .++.. .....++||.+++.+..- ..++-..+.. . ...+-++.-++ + T Consensus 312 ~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~-l 390 (477) T protein:vir:84 312 GSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQ-M 390 (477) T ss_pred ccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccch-h Confidence 0112211111 1111111 11111 123577899887765331 1111000000 0 01122333333 6 Q ss_pred hhhhhhHhcCCCcccccccccc--ceecccccccceeeeeecccccccccceeeEEEeeccceee--eccEEEEccee-- Q lcl|NC_019501. 207 AGFDEILRSPKLPAVTKSTATG--VTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFK--RGDKISFTGVK-- 280 (431) Q Consensus 207 ~Gfd~~~~s~~v~~~t~gt~~~--~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lk--aGDv~TiaGV~-- 280 (431) .|+. ++.+..+|...+. ++. ..+=|--. .+.+...|.. +.++ ..... -...|.+-|.. T Consensus 391 ~G~p-Vv~s~~~p~~~~~-~~d~~~i~~gd~~----~~~i~~~~~~---------~~~~-~~~~~~~~~~~~~v~~~~~~ 454 (477) T protein:vir:84 391 HGLP-VVTDPTLPTTLGT-GTDQDVIHVLRAS----DLALFESSVR---------MRAL-QETRAENLSVLLQVYGYLAF 454 (477) T ss_pred cccc-eEecCcccccccc-cCCcceEEEEEec----eEEEEeecee---------EEec-cccccccceeeeeehhhhhh Confidence 6885 6777888864321 111 11111110 1111111100 0000 00000 00011111110 Q ss_pred -ee-ccccccccCCCceEEEEEee-cCceeE Q lcl|NC_019501. 281 -FL-SQMAKNVLTDDATFSITRVI-DSTHIE 308 (431) Q Consensus 281 -~v-n~~tk~~~~~lq~fvVta~~-~a~ti~ 308 (431) ++ ||..=.. +|..+ ++-+.. T Consensus 455 ~~~r~~~afv~--------~t~~~~~~~~~~ 477 (477) T protein:vir:84 455 TAARFPQSVVE--------IGGTALTAPTFA 477 (477) T ss_pred hhhccccceEE--------eecccccccccC Confidence 11 2222111 11111 111111 No 71 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=35.91 E-value=1.2 Score=20.14 Aligned_cols=271 Identities=13% Similarity=0.071 Sum_probs=101.1 Q ss_pred Cccccccchh-HHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC---ccccccceeE Q lcl|NC_019501. 1 MALNEGQLVT-YALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN---ATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~~lt-~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~---~~di~e~~V~ 76 (431) ||.+-+.++- -+..|+|+.++...++.+++++. +- .+..++||+-.... ...|...+. .+++.=.++. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~-~~------~~~~~~ip~~~~~~-~a~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK-PI------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee-ec------cCCceEEEEEecCc-ceEEecCCccccccccceeEEE Confidence 9999999884 44689999999999988888422 21 13446677643222 223433322 1122222222 Q ss_pred EEeccccccceEeeHHHhh---H--HHHHHhhhh-HHHHHHHHHHHHHHHHhh---ccccc------eeec--cCCCCCC Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLR---D--ERSYRRRIQ-ASAKKLANNIESAIAKQA---TEMGS------LVVH--DTRAIGP 139 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~---i--~~~s~~~L~-~Am~~LAn~Id~dl~~~~---~~~~~------~v~t--~~~t~~~ 139 (431) ++..+-. .-+.+ ++||. . ....+++|+ .-.++++..+|..++.-. ...+. .+.. ....... T Consensus 73 l~~~k~a-~~~~i-S~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 73 MVPIKVE-YGARI-SDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred EeeeeEE-Eeehh-hHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccc Confidence 2222211 11333 45542 1 123445554 455788889999888321 00000 0000 0111112 Q ss_pred ccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCc Q lcl|NC_019501. 140 STGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLP 219 (431) Q Consensus 140 ~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~ 219 (431) ....+.+.++..+...+.....+. ..+++||.+.+.+..- +++. ||.+ | +..+ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~l---------------kd~~-G~~i----~----~~~~ 203 (298) T protein:vir:16 151 RGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAKQ---------------KDLQ-DNAL----F----PELK 203 (298) T ss_pred cccccHHHHHHHHHHHhhhcCCCc---cEEEEcHHHHHHHHHh---------------hccC-CCee----e----cCcc Confidence 222234556666666666554442 2488999888766221 1111 2322 0 0000 Q ss_pred cccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEE Q lcl|NC_019501. 220 AVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSIT 299 (431) Q Consensus 220 ~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVt 299 (431) ..+. ..++.|-+- .+ .+..... ..+....+-.|| -.+++.. T Consensus 204 --~~~~--~~~l~G~PV------~~-~~~v~~~--------~~~~~~~~~~GD--------------------fs~~~~~ 244 (298) T protein:vir:16 204 --WGAT--PDTINGLPV------DV-NKTVSDM--------SLTQRDRAIIGD--------------------FANGFKW 244 (298) T ss_pred --cCCC--Cceecceee------EE-ecccccc--------cCCCccEEEEee--------------------ccceEEE Confidence 0000 012222210 00 0000000 000000111222 1111111 Q ss_pred EeecCceeEEeeccccccccccccc------ccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecc Q lcl|NC_019501. 300 RVIDSTHIEITPKPIALDDASLTKE------EKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPI 367 (431) Q Consensus 300 a~~~a~ti~I~Paii~~~~~t~~~~------~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl 367 (431) .....-+++|.+-.-+ +...+... -++...+.. ...|++||+....-= T Consensus 245 ~~~~~~~~~~~~~~~~-~~~~~~~f~~~~v~~ra~~r~d~-------------------~v~~~~a~~~l~~at 298 (298) T protein:vir:16 245 GYAKEVPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGW-------------------GILDATKFARVTEAN 298 (298) T ss_pred EEecCceEEEeeccCC-cCcchhhhhcCcEEEEEEEEEcc-------------------EeecccceEEEeecC Confidence 1122223444332100 00000000 000111111 122223332221110 No 72 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=35.55 E-value=1.2 Score=20.10 Aligned_cols=260 Identities=12% Similarity=0.053 Sum_probs=99.2 Q ss_pred Ccccccc-chhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccccccc---CccccccceeE Q lcl|NC_019501. 1 MALNEGQ-LVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTG---NATGILELSVK 76 (431) Q Consensus 1 ~~~~~~~-~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~---~~~di~e~~V~ 76 (431) -.-..|. +.+-.++++|+.++...++.++|+..+ - .+.++.+|+-........|...+ ...+..=.++. T Consensus 117 ~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~-~------~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 189 (390) T protein:vir:10 117 AAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGR-T------DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKT 189 (390) T ss_pred cccccccccchhHHHHHHHHHHhhchhhhhcceee-c------cCCceEEEEEecCCcceeeecCCccccccccceeEEE Confidence 1111222 333346899999999999988884221 1 13455665532222112222221 12233333444 Q ss_pred EEeccccccceEeeHHHhhHHHHHHhhhhHH-HHHHHHHHHHHHHHh---hccc---cceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 77 CNMGDPDNDFFELRADDLRDERSYRRRIQAS-AKKLANNIESAIAKQ---ATEM---GSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 77 v~ld~~k~v~f~lt~keL~i~~~s~~~L~~A-m~~LAn~Id~dl~~~---~~~~---~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) ++..+-. ..+.+|.+-|.+..+.+.+|... ..+++..+|..++.= .... -+..+. ...+......+.+.++ T Consensus 190 ~~~~k~~-~~~~is~ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~-~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:10 190 DTTHVIA-HTMKATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATT-YAAPTTIAGATRVDQL 267 (390) T ss_pred EeeEEEE-EeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccc-ccccccccccchHHHH Confidence 4443322 23334433344444667788654 468999999988831 1111 111111 1111112222345666 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhh--hhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNL--VDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTAT 227 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~--~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~ 227 (431) ..+-..|.....+. -.+++||.+++.+..-. .+...... .. .+. +..+.|+. ++.+..+|.-+ T Consensus 268 ~~~~~~l~~~~~~~---~~~v~n~~~~~~L~~lkd~~g~~l~~~----~~-~~~-~~~l~G~p-v~~~~~~p~~~----- 332 (390) T protein:vir:10 268 RLAMLQASLAEYPA---SGIVINPIDWAAIELAKDANNQYLIGN----AR-GTL-TPTLWGLP-VVATQAMAPGE----- 332 (390) T ss_pred HHHHHhhccccCCC---CEEEEcHHHHHHHHHhhcCCCceeecC----Cc-CcC-Cceeccee-eEEcCCCCCCc----- Confidence 66666666655543 34789999887653210 00000000 00 111 11244553 34444444210 Q ss_pred cceecccccccceeeee-ecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCce Q lcl|NC_019501. 228 GVTVSGAQKFKPQAYTL-DTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTH 306 (431) Q Consensus 228 ~~tv~gA~q~~~~~~~v-~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~t 306 (431) + +-|--. .++.+ +..+ .++..+... +-|+ .+.-.|++..-.+. T Consensus 333 -~-~~gdf~---~~~~~~~~~~---------~~i~~~~~~-----~~~~---------------~~~~~~r~~~r~d~-- 376 (390) T protein:vir:10 333 -F-LVGAFD---LAAQIFDQWD---------ARVEIGYVN-----DDFQ---------------RNMVTVLAEERLAL-- 376 (390) T ss_pred -E-EEEecc---ceEEEEEecc---------eEEEEeecc-----cccc---------------cCcEEEEEEEeecc-- Confidence 0 001000 00000 0000 001111000 0000 01112333332221 Q ss_pred eEEeecccccccccccccccccceeecc Q lcl|NC_019501. 307 IEITPKPIALDDASLTKEEKAYANVNTS 334 (431) Q Consensus 307 i~I~Paii~~~~~t~~~~~~~y~nVsa~ 334 (431) -...|. ++..++-+ T Consensus 377 ~v~~~~--------------a~~~~~~a 390 (390) T protein:vir:10 377 VVYRPE--------------ALISGSFA 390 (390) T ss_pred EEeccc--------------cEEEEEeC Confidence 111111 11111211 No 73 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=34.08 E-value=1.3 Score=19.93 Aligned_cols=274 Identities=12% Similarity=0.036 Sum_probs=106.3 Q ss_pred Ccc---c-cccch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccc--cccccccccC--cc--c Q lcl|NC_019501. 1 MAL---N-EGQLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAP--TQTGWNLTGN--AT--G 69 (431) Q Consensus 1 ~~~---~-~~~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~--~~~g~~~s~~--~~--d 69 (431) |+. . -+.++ +....++|+.++...++.++++.. +- .+.+..++.+.... ....|...+. ++ + T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 181 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVE-NV------TTLTGSRVYEKWADITGLAKLDDEAGSIGTNDD 181 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhcee-ec------cCCcceEEEEeecCCCcceeeeccccccccccc Confidence 211 1 12233 344589999999999998888422 21 12333344332111 1111222211 11 1 Q ss_pred cccceeEEEeccccccceEeeHHH-hhHHHH-HHhhhhH-HHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhh Q lcl|NC_019501. 70 ILELSVKCNMGDPDNDFFELRADD-LRDERS-YRRRIQA-SAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGW 146 (431) Q Consensus 70 i~e~~V~v~ld~~k~v~f~lt~ke-L~i~~~-s~~~L~~-Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~ 146 (431) ..=..|.++..+.. ..+.+ +++ |.+..+ .+.+|+. -..+++..+|..++. |+ ++..+......| T Consensus 182 ~~~~~v~~~~~k~~-~~~~i-S~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~---------G~--g~~~~~~~~~~~ 248 (397) T protein:vir:48 182 PKLYPIRYAIKRYA-GISTV-TNSLLADSAENILAWLSGWIAKKVVVTRNKAILE---------AI--ATLPTKPTLTKW 248 (397) T ss_pred cceeeEEeeheeee-eehhh-HHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh---------cc--cccccccccccH Confidence 11223333332221 12233 444 333233 4666644 446788888888772 22 112222334567 Q ss_pred hhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccc Q lcl|NC_019501. 147 DFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTA 226 (431) Q Consensus 147 ~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~ 226 (431) .++.++...|.....+. -.+++||.+++.+.. ++++. ||++. . |... .+ T Consensus 249 d~i~~~~~~l~~~~~~~---a~~v~n~~~~~~L~~---------------lkd~~-G~~i~------~----~~~~--~~ 297 (397) T protein:vir:48 249 DDIIDLQAKVDPAIKQT---SFFLTNTSGFTALKK---------------VKNAF-GDYLM------E----RDVK--SP 297 (397) T ss_pred HHHHHHHHHhhhhhcCC---CEEEECHHHHHHHHH---------------hhcCC-Cceee------c----cCcC--CC Confidence 88887777777655442 357889988866522 12221 44331 1 0011 12 Q ss_pred ccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCce Q lcl|NC_019501. 227 TGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTH 306 (431) Q Consensus 227 ~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~t 306 (431) +..++.|-+-. .++.. . -.....|+...|-| +..+|+...+-.+-+ T Consensus 298 ~~~~l~G~PV~-----~~~~~-~---------------~~~~~~~~~~~~~g-------------d~~~~~~~~~~~~~~ 343 (397) T protein:vir:48 298 TGYSIDGFAVK-----EVADR-W---------------LANASSGAMPLYFG-------------DLKQAVTLFDRQQMS 343 (397) T ss_pred CCceeccceeE-----Eeccc-c---------------cCCcCCCceEEEEE-------------eccceEEEEeecceE Confidence 22344444310 00000 0 00011222222223 223332222222223 Q ss_pred eEEeecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecc-cCCCCCcceeeE Q lcl|NC_019501. 307 IEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPI-PVTHELFAGMKT 379 (431) Q Consensus 307 i~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl-~~p~g~~~a~~~ 379 (431) +.+.+-.- .-+....+.+....-.- =-..|++||+.++..= +-|++...+... T Consensus 344 i~~~~~~~-------------------~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 344 LLSTNIGG-------------------GAFETDTTKIRVIDRFD-VVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEeccch-------------------hhhhcCceeEEEEeeec-cEEecccceEEEEecccccCCCCccccCC Confidence 33322100 00011111111100000 1234555665554322 233332222111 No 74 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=33.75 E-value=1.3 Score=19.89 Aligned_cols=239 Identities=15% Similarity=0.134 Sum_probs=84.5 Q ss_pred Cccccccch--hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC--c--cccccce Q lcl|NC_019501. 1 MALNEGQLV--TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN--A--TGILELS 74 (431) Q Consensus 1 ~~~~~~~~l--t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~--~--~di~e~~ 74 (431) .-...+..| +....++|+.+++..++.++++... - .+.+..+|+........+|...+. + ++..=.. T Consensus 131 ~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~-~------~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 203 (394) T protein:vir:97 131 IKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ-A------KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) T ss_pred cccccccccChHHHHHHHHHHhhhhhhhhhhceeee-c------cCcceEEEEEecCCCccceeccccccccccccccee Confidence 111223322 2334678888888888877774221 1 123455665432221222222211 1 1111223 Q ss_pred eEEEeccccccceEeeHHHh-hHHHH-HHhhhhH-HHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhHHH Q lcl|NC_019501. 75 VKCNMGDPDNDFFELRADDL-RDERS-YRRRIQA-SAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFVSD 151 (431) Q Consensus 75 V~v~ld~~k~v~f~lt~keL-~i~~~-s~~~L~~-Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~a~ 151 (431) |.++..+.. ..+.+ ++|| .+..+ .+.+|+. -..+|+..+|..++.- ..++++... ..+.++.. T Consensus 204 v~l~~~k~~-~~i~i-s~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g---------~~~~~~~~~---~~~~~~~~ 269 (394) T protein:vir:97 204 VAWNIDTYR-GAIPL-SQESIDDADVDLVGIVSESISQIKVNTTNDAIAKV---------LKSFTTKTV---KNLDEIKA 269 (394) T ss_pred EEeehhhee-eehhh-HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhc---------ccccccccc---ccHHHHHH Confidence 333332221 11223 3443 33222 5556644 3457777777776631 111222211 23455443 Q ss_pred HHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCcccccccccccee Q lcl|NC_019501. 152 AERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTV 231 (431) Q Consensus 152 a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv 231 (431) +-..+- -| ..+-.+++||.+++.+.. ++++. ||++. . .. .+.| ++.++ T Consensus 270 ~~~~~~---~~-~~~a~~v~n~~~~~~l~~---------------lkd~~-G~~i~------~-~~---~~~~--~~~~l 317 (394) T protein:vir:97 270 LLNGGF---DP-AYNVSLIVSQSFYQTLDT---------------LKDGN-GRYLL------Q-DD---ITAV--SGKVL 317 (394) T ss_pred HHHhhh---hh-hhCCEEEEcHHHHHHHHH---------------hhccC-CCeee------e-cC---cCCC--CCcee Confidence 332221 12 223458899998766522 11221 33221 0 00 0111 11223 Q ss_pred cccccccceeeeeecccccccccceeeEEEeeccceeeeccE---EEEc---c--eeeeccccccccCCCceEEEEEeec Q lcl|NC_019501. 232 SGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDK---ISFT---G--VKFLSQMAKNVLTDDATFSITRVID 303 (431) Q Consensus 232 ~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv---~Tia---G--V~~vn~~tk~~~~~lq~fvVta~~~ 303 (431) .|-+- ...+.... +.+.+.-||. +.|. | +.+... ....+.|++..-.+ T Consensus 318 ~G~pv-----~~~~~~~~--------------~~~~~~~gd~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~r~d 373 (394) T protein:vir:97 318 LGKPV-----FVLSDEVL--------------GANKAFIGDFKRGVLFADRKDLGLRWADN-----EIYGQYLQAVLRFG 373 (394) T ss_pred cccee-----EEeccccc--------------CCccEEEeeccccEEEEEecceEEEEecc-----cccceeEEEEEEEc Confidence 33221 00000000 0011111220 0000 0 000000 01112244333221 Q ss_pred C--------ceeEEeeccccc Q lcl|NC_019501. 304 S--------THIEITPKPIAL 316 (431) Q Consensus 304 a--------~ti~I~Paii~~ 316 (431) . -.++++|...|. T Consensus 374 ~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 374 VSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred cEEecccceEEEEecccccCC Confidence 1 135555554443 No 75 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=33.55 E-value=1.3 Score=19.87 Aligned_cols=284 Identities=13% Similarity=0.069 Sum_probs=109.1 Q ss_pred Ccccccc-------------chhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC- Q lcl|NC_019501. 1 MALNEGQ-------------LVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN- 66 (431) Q Consensus 1 ~~~~~~~-------------~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~- 66 (431) ||-++.+ +.+-..+++|+.++...++.++++. .+ -.+..+++|+-.... ...|...+. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~-~~------~~~~~~~~p~~~~~~-~a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARK-VP------MGPTGISIPHWTGAV-SASWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcce-ee------ccCCceEEEEEcCCc-ceeEecCCCc Confidence 5544433 2344468999999999998888731 11 124557777654332 223433322 Q ss_pred --ccccccceeEEEeccccccceEeeHHHh-hH-HHHHHhhhhH-HHHHHHHHHHHHHHH----------hhccccc--e Q lcl|NC_019501. 67 --ATGILELSVKCNMGDPDNDFFELRADDL-RD-ERSYRRRIQA-SAKKLANNIESAIAK----------QATEMGS--L 129 (431) Q Consensus 67 --~~di~e~~V~v~ld~~k~v~f~lt~keL-~i-~~~s~~~L~~-Am~~LAn~Id~dl~~----------~~~~~~~--~ 129 (431) .++..=.++.++..+- ...+.+ ++|| .+ ....+.+|+. -.++++.++|..++. +...... . T Consensus 73 ~~~~~~~f~~i~~~~~k~-~~~~~i-s~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~ 150 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKI-TTIFAE-SAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVS 150 (330) T ss_pred cccccceeeEEEEeEEEE-EEeehh-hHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccce Confidence 1121112222222111 112233 4554 32 2335666644 557899999999882 1111111 1 Q ss_pred eeccCCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhcccc------chhhhHhhcccc Q lcl|NC_019501. 130 VVHDTRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGR------VTEDAYRNGTIQ 203 (431) Q Consensus 130 v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~------~~~~a~r~g~i~ 203 (431) .......+......+.+.++..+-..|.....+. ..+++||.....+..-- +..++ ......... -+ T Consensus 151 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~~~-~~ 223 (330) T protein:vir:77 151 LADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW---TGTLLDNVTEPILNTAV---DGNGRPLFVESTYTEQVGAI-RE 223 (330) T ss_pred eecccccccccccchhHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHHHh---ccCCceeecCcccccccccc-CC Confidence 1111122222333345666666666666555442 35889999987764311 11111 000000001 11 Q ss_pred ccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEc-ceeee Q lcl|NC_019501. 204 RQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFT-GVKFL 282 (431) Q Consensus 204 r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~Tia-GV~~v 282 (431) ..+.|+- ++.+.++|.-+.+.- ..-+-|--+ .+.+. +-...++.++--..++-|+.-... .-..+ T Consensus 224 ~~l~G~P-V~~~~~~p~~~~~~~-~~~~~gd~s----~~~i~--------~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~ 289 (330) T protein:vir:77 224 GRILGRP-TYVADNVVNGTVGNR-VVGVMGDFS----QVIWG--------QIGGLSFDVTDQATLDFGEEQGGVWVPKLI 289 (330) T ss_pred ceeccee-eEEeccccCCCCCCc-cEEEEEecc----eEEEE--------EecCcEEEEeecceeeeccccccccccccc Confidence 2245664 556666664211110 000111100 00000 000112223322333333211100 00001 Q ss_pred ccccccccCCCceEEEEEeecC-----ce-eEEeec--cccccccccccccc Q lcl|NC_019501. 283 SQMAKNVLTDDATFSITRVIDS-----TH-IEITPK--PIALDDASLTKEEK 326 (431) Q Consensus 283 n~~tk~~~~~lq~fvVta~~~a-----~t-i~I~Pa--ii~~~~~t~~~~~~ 326 (431) +.-.+ +...|+++.-.+. .. +.|..+ .-+| +.. T Consensus 290 ~~f~~----~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~-------~~~ 330 (330) T protein:vir:77 290 SLWQH----NMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDP-------EEE 330 (330) T ss_pred chhhc----CcEEEEEEEEeccEEecccceEEEEeccCCcCC-------CCC Confidence 11111 1112333332221 11 222222 1111 111 No 76 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=28.68 E-value=1.7 Score=19.28 Aligned_cols=273 Identities=10% Similarity=-0.012 Sum_probs=110.5 Q ss_pred Ccccccc-------chhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccc--cccccccccccc-C-c-- Q lcl|NC_019501. 1 MALNEGQ-------LVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQ--EAPTQTGWNLTG-N-A-- 67 (431) Q Consensus 1 ~~~~~~~-------~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~--~~~~~~g~~~s~-~-~-- 67 (431) -+++.++ +=+.+.+++|+.++...++.++|+. .+-. +.+..++.|. .......|...+ . + T Consensus 114 ~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~-~~~~------~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 186 (404) T protein:vir:39 114 KTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV-ESVS------TSNGSRVYEKWTDVTPLTVMDAEDGKIPDL 186 (404) T ss_pred hhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcce-eecc------CCcceEEEEeecCCccceeeecCccccccc Confidence 1111111 2234458888889988888888742 2211 1223333331 111111122221 1 1 Q ss_pred cccccceeEEEeccccccceEeeHHHh-hHHHH-HHhhhhH-HHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchh Q lcl|NC_019501. 68 TGILELSVKCNMGDPDNDFFELRADDL-RDERS-YRRRIQA-SAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLS 144 (431) Q Consensus 68 ~di~e~~V~v~ld~~k~v~f~lt~keL-~i~~~-s~~~L~~-Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~ 144 (431) ++..=.++.+++.+..+ .+.+ ++|| .+..+ .+.+|.. -..+++..+|..++. |+..+.+ ..... T Consensus 187 ~~~~f~~i~~~~~k~~~-~~~i-S~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~---------g~g~~~~--~~~~~ 253 (404) T protein:vir:39 187 DNPRLTIIKYLIKRYAG-IITA-TNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIA---------AMGTVPK--KPTIA 253 (404) T ss_pred cccceeeEEeeeeeEEe-eehh-HHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHh---------ccccccc--ccccc Confidence 11222344444433332 2233 4454 33223 4667754 446888899988873 2212222 12223 Q ss_pred hhhhHHHHHH-HHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccc Q lcl|NC_019501. 145 GWDFVSDAER-LMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTK 223 (431) Q Consensus 145 ~~~d~a~a~~-~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~ 223 (431) .|+++..+-. .+.....+ +-.+++||.+++.+.. ++++ .||.+. . +..+. T Consensus 254 ~~~~i~~~~~~~~~~~~~~---~a~~v~n~~~~~~L~~---------------lkd~-~G~~l~------~----~~~~~ 304 (404) T protein:vir:39 254 KFDDVITMINTSVDPAIIA---TSSLLTNQSGLNKLAL---------------VKTA-EGKYLL------E----PDPTK 304 (404) T ss_pred cHHHHHHHHHHhhhhhhcc---CCEEEEcHHHHHHHHH---------------hhcc-CCceee------c----cCcCC Confidence 4566555432 33322222 2347789988866632 1122 244331 1 01111 Q ss_pred cccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeec Q lcl|NC_019501. 224 STATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVID 303 (431) Q Consensus 224 gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~ 303 (431) ++..++.|.+- +..+... -+....++...|-| +..+|++..+-. T Consensus 305 --~~~~~l~G~pV-----~~~~~~~----------------~~~~~~~~~~~~~g-------------d~~~~~~~~~~~ 348 (404) T protein:vir:39 305 --PNSYLIKGKKV-----IVVADRW----------------LPNSGSTVYPLYYG-------------DMSQAITLFDRE 348 (404) T ss_pred --CCcceecceeE-----EEecccc----------------cCccCCCccEEEEE-------------eccccEEEEeec Confidence 12223444431 0100000 01111223223333 445555444444 Q ss_pred CceeEEeecccccccccccccccccceeecccccCceeEEeccCCcc-eeeeeccceeEEEeecccCCCCCcceeeE Q lcl|NC_019501. 304 STHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTT-ANVFWADDSIRLLSQPIPVTHELFAGMKT 379 (431) Q Consensus 304 a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~-~Nl~fhr~A~aLat~pl~~p~g~~~a~~~ 379 (431) +-++.+.+-.-.. .....+.+.. ..+ -=-..|++||+.....-..+.++.+..-. T Consensus 349 ~~~i~~~~~~~~~-------------------~~~~~~~~r~--~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 349 NMSLLPTNIGAGA-------------------FETDTTKIRV--IDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred ceEEEEeccchhh-------------------hhhceeeEEE--EeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 4456655432100 0001112211 001 02467788888887666555443222111 No 77 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=28.48 E-value=1.7 Score=19.26 Aligned_cols=263 Identities=6% Similarity=-0.018 Sum_probs=98.7 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeec--cc-----ccccccc--cccccCccccc Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPV--EQ-----EAPTQTG--WNLTGNATGIL 71 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~--p~-----~~~~~~g--~~~s~~~~di~ 71 (431) +.+++=+++- ..+++.+ .+..++.++ +|.+.. +.+-.|.-.. |. ...+..| .+.+... T Consensus 20 ~ll~~P~~I~---~~i~e~~-~~~~iad~l--f~~~~a---~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~---- 86 (318) T protein:vir:10 20 ELVGNPLWIP---TALKKMM-VNQFISESL--FRNGGA---NPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGA---- 86 (318) T ss_pred HhhCCchhHH---HHHHHHH-hccchhhhh--hhcccc---cccceeEEEecccccccCcHhhccCcccccccCCC---- Confidence 3344333333 3444444 445556666 555432 2222222211 10 0001111 0111111 Q ss_pred cceeEEEeccccccceEeeHHHhh--HHHHHHhhhhHHHHHHHHHHHHHHHHhhccc--cceeeccCCCCCCccchhhhh Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADDLR--DERSYRRRIQASAKKLANNIESAIAKQATEM--GSLVVHDTRAIGPSTGLSGWD 147 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~keL~--i~~~s~~~L~~Am~~LAn~Id~dl~~~~~~~--~~~v~t~~~t~~~~~~~~~~~ 147 (431) .+.-.+-..+-.+..|.+|+...+ ..+..+|.++.+.++++.++|+..+....+. ++.-...++. + ..+-.+ T Consensus 87 ~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~-~---~~~~~~ 162 (318) T protein:vir:10 87 RGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWD-N---GGKVRT 162 (318) T ss_pred CCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCC-C---cccccc Confidence 111112111233445677755443 4667777777888888888888887644321 1110000000 0 011112 Q ss_pred hHHHHHHHHHh-------hc-----CCcc-CCcEEEeChHHHhhhhhhhhhhhccc-cch--hhhHh-hccccccchhhh Q lcl|NC_019501. 148 FVSDAERLMFS-------RE-----LNRD-MGISYFLNPDDYRKAGRNLVDGDIFG-RVT--EDAYR-NGTIQRQIAGFD 210 (431) Q Consensus 148 d~a~a~~~L~~-------~~-----vP~~-~~r~~v~np~~~a~~~~~~~~~~~~~-~~~--~~a~r-~g~i~r~~~Gfd 210 (431) |.+.|...... .+ .+.+ .--.+|++|+..+-+.++..-+.... +.. ....+ .|.+++.+.|++ T Consensus 163 d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~ 242 (318) T protein:vir:10 163 DIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMGLN 242 (318) T ss_pred cchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeeceE Confidence 33333322211 00 0000 00258899999988766544333221 111 11222 466767778998 Q ss_pred hhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccc---cc Q lcl|NC_019501. 211 EILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQM---AK 287 (431) Q Consensus 211 ~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~---tk 287 (431) +..+.++|.-+ .+ .++.... |++.-.--++..+-|.=+.+ .+ T Consensus 243 -vi~s~~~p~~~-----al-------------vlq~g~v----------------G~~~d~~pl~~t~~~~egg~~~g~~ 287 (318) T protein:vir:10 243 -VIRSRTFPIDR-----VL-------------IMERGTV----------------GFYSDTRPLQFTALYPEGNGPNGGP 287 (318) T ss_pred -EeecCccCCCe-----eE-------------EEecCCc----------------ceeeccccceeeecccCCCCCCCCc Confidence 46778888532 12 2221110 11110000122222210000 11 Q ss_pred cccCCCceEEEEEee--c-CceeEEeeccccc Q lcl|NC_019501. 288 NVLTDDATFSITRVI--D-STHIEITPKPIAL 316 (431) Q Consensus 288 ~~~~~lq~fvVta~~--~-a~ti~I~Paii~~ 316 (431) ..+...+-+.+++-. . ..-+.|. -|+.+ T Consensus 288 ~~s~~~~~~~~~~~~V~~PkA~~~it-gi~~~ 318 (318) T protein:vir:10 288 TESYRADASHKRALAVDQPKAALWLT-GIVTP 318 (318) T ss_pred chhhheehheeeeeeeeCcceeEEEe-eccCC Confidence 222222222232211 1 1112221 11111 No 78 >protein:vir:3426 Length: 117 # NCBI annotation: head-tail joining protein # Family: family:all:1908 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040589;genbank:gi:9626253;genbank:GeneID:2703484 Probab=24.29 E-value=1.4 Score=19.73 Aligned_cols=113 Identities=19% Similarity=0.191 Sum_probs=36.6 Q ss_pred cEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeec Q lcl|NC_019501. 167 ISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDT 246 (431) Q Consensus 167 r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~ 246 (431) ..=+-|+.|.+.+.-+-+-+...+....=-.+.|. ++.+-| +|+.. +.+... ..|+-+.. T Consensus 1 m~~~dNlfd~a~~~aD~~i~~~fg~~a~i~~~~g~-~~~i~g---VFDdP------------~~~~~~----~gG~~i~~ 60 (117) T protein:vir:34 1 MADFDNLFDAAIARADETIRGYMGTSATITSGEQS-GAVIRG---VFDDP------------ENISYA----GQGVRVEG 60 (117) T ss_pred CCcccchhHHHHhhcchhhHhhcCeeEEEEeCCCc-ceEEEE---EecCc------------cchhhc----cCCEEeec Confidence 22334444544443221111111100000001111 111222 11211 111101 11122211 Q ss_pred ccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeec--Cc--eeEEeeccccccccccc Q lcl|NC_019501. 247 DGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVID--ST--HIEITPKPIALDDASLT 322 (431) Q Consensus 247 ~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~--a~--ti~I~Paii~~~~~t~~ 322 (431) +. ....+--+--..|+++|.++|+| ++|.|+...- .+ .|.+..- -|| T Consensus 61 s~-------P~L~vk~aDv~~l~r~D~v~I~G---------------~~y~V~~~~PD~~G~~~l~L~rg-~pp------ 111 (117) T protein:vir:34 61 SS-------PSLFVRTDEVRQLRRGDTLTIGE---------------ENFWVDRVSPDDGGSCHLWLGRG-VPP------ 111 (117) T ss_pred CC-------cEEEeeechhhccCCCCEEEECC---------------CeeEeeecccCCCceEEEEeecC-CCC------ Confidence 11 01111112235899999999998 7788765221 22 3444322 111 Q ss_pred ccccccceeecccccCcee Q lcl|NC_019501. 323 KEEKAYANVNTSLADNTPV 341 (431) Q Consensus 323 ~~~~~y~nVsa~~A~~aav 341 (431) +++-.= T Consensus 112 -------------~~~~~~ 117 (117) T protein:vir:34 112 -------------AVNRRR 117 (117) T ss_pred -------------ccccCC Confidence 111000 No 79 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=22.44 E-value=2.4 Score=18.45 Aligned_cols=277 Identities=11% Similarity=0.028 Sum_probs=98.6 Q ss_pred Cccc---cc-cch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeecccccccccccccccC--c-cc-cc Q lcl|NC_019501. 1 MALN---EG-QLV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTGN--A-TG-IL 71 (431) Q Consensus 1 ~~~~---~~-~~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~~--~-~d-i~ 71 (431) |+.. +| -++ +.+.+++|+.++...++.++++..+--.. .| ++.+++-........|...+. + ++ .. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~----~g-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 79 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTL----TG-SRVYEKWTDITGLANIDDEAGKIADIDDPK 79 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCC----cc-eEEEEeecCCCcceeeecCCcccccccccc Confidence 3221 12 232 33357899999999999887742221110 01 223332222211222333221 1 11 12 Q ss_pred cceeEEEeccccccceEeeHHHhhHHHH-HHhhhhH-HHHHHHHHHHHHHHHhhccccceeeccCCCCCCccchhhhhhH Q lcl|NC_019501. 72 ELSVKCNMGDPDNDFFELRADDLRDERS-YRRRIQA-SAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLSGWDFV 149 (431) Q Consensus 72 e~~V~v~ld~~k~v~f~lt~keL~i~~~-s~~~L~~-Am~~LAn~Id~dl~~~~~~~~~~v~t~~~t~~~~~~~~~~~d~ 149 (431) =..+.++..+-.. .+.+|.+-|++..+ .+.+|+. -.++++...|..+++-.. +.++ ......|.++ T Consensus 80 ~~~i~l~~~k~~~-~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~---------~~~~--~~~~~~~d~i 147 (293) T protein:vir:48 80 LSLIKYTIKRYAG-ISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVD---------KLPT--KPTLTKWDDI 147 (293) T ss_pred eeEEEEeeeEEEE-eehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccc---------cccc--cccccCHHHH Confidence 2333333322221 23344333444333 4566643 446777788877774221 1111 1222357888 Q ss_pred HHHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccc Q lcl|NC_019501. 150 SDAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGV 229 (431) Q Consensus 150 a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~ 229 (431) .++...|.....+ .-..++||..++.+.. ++++ .||.+ +. ..+ +. ++.. T Consensus 148 ~~~~~~l~~~~~~---~a~~vmn~~~~~~L~~---------------lkd~-~g~~l------~~-~~~---~~--~~~~ 196 (293) T protein:vir:48 148 IDLEAKVDPAIKQ---TSFFLTNTSGFTALKK---------------VKNA-LGDYL------ME-RDV---KS--PTGY 196 (293) T ss_pred HHHHHhhhhhhcC---CCEEEEcHHHHHHHHH---------------hhcc-CCceE------ee-cCc---CC--CCCc Confidence 8777777654332 2347789988876522 1111 13322 11 000 11 1112 Q ss_pred eecccccccceeeeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeecCceeEE Q lcl|NC_019501. 230 TVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVIDSTHIEI 309 (431) Q Consensus 230 tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~a~ti~I 309 (431) ++.|-+- ..++... .+....|+...+-| +.+++++..+-..-++.+ T Consensus 197 ~l~G~Pv-----~~~~~~~----------------~~~~~~~~~~~~~g-------------d~~~~~~~~~~~~~~i~~ 242 (293) T protein:vir:48 197 SIAGFAV-----KEISDRW----------------LPNASSGVMPLYFG-------------DLKQAVTLFDRQQMSLLS 242 (293) T ss_pred eecceee-----EEecccc----------------cCCccCCceEEEEE-------------eccceEEEEEecceEEEE Confidence 2333321 0000000 01111233322222 222322222222222332 Q ss_pred eecccccccccccccccccceeecccccCceeEEeccCCcceeeeeccceeEEEe-ecccCCCCCcceeeE Q lcl|NC_019501. 310 TPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLS-QPIPVTHELFAGMKT 379 (431) Q Consensus 310 ~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat-~pl~~p~g~~~a~~~ 379 (431) .+-.- .-.....+.+....-. -=...|++||.+.. -..+-|++....... T Consensus 243 ~~~~~-------------------~~~~~~~~~~r~~~r~-d~~~~~~~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 243 TNIGG-------------------GAFETDTTKVRVIDRF-DVVATDTEAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred ecccc-------------------hhhhcCeEEEEEEEee-CcEEecccceEEEEeeccccCCccccccCC Confidence 22100 0000001111100000 00223344444433 222233321111100 No 80 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=22.43 E-value=2.4 Score=18.45 Aligned_cols=360 Identities=14% Similarity=0.062 Sum_probs=116.4 Q ss_pred CccccccchhHHHHHHHHHHHhhcccchhh-ccCCCchHHHhh-cccEEE--eeccccccccccc-ccccC-----cccc Q lcl|NC_019501. 1 MALNEGQLVTYALDEIIETVQNLTPMASKV-TKYTPPAESMQR-SSNTVW--MPVEQEAPTQTGW-NLTGN-----ATGI 70 (431) Q Consensus 1 ~~~~~~~~lt~~~~evi~~len~lvma~~V-~~~r~y~~e~~k-~GdTv~--ip~p~~~~~~~g~-~~s~~-----~~di 70 (431) -..-++.-|. .|.|++.+.+=--. +.|..|-+=..+ .-.||. .-.-.+..+-++. ....- -.++ T Consensus 34 ~tq~~~~AlR------~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~ 107 (467) T protein:vir:80 34 DTQTDAGALR------REFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNI 107 (467) T ss_pred ccccCcchhh------hhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccccCCCce Confidence 0111111111 12233333331111 111111110000 112221 1111222222221 11111 1122 Q ss_pred ccceeEEEeccc-cccceEeeHHHhhHHHHHHhhhhHHHHHHHHHHHHHHHH-hhccc-------------------cce Q lcl|NC_019501. 71 LELSVKCNMGDP-DNDFFELRADDLRDERSYRRRIQASAKKLANNIESAIAK-QATEM-------------------GSL 129 (431) Q Consensus 71 ~e~~V~v~ld~~-k~v~f~lt~keL~i~~~s~~~L~~Am~~LAn~Id~dl~~-~~~~~-------------------~~~ 129 (431) ....+.+++=-. +.|.... .+-..+.|..++.-+-||..||+.||.-..= -.... +-+ T Consensus 108 ~r~~~~~k~l~~~~~vs~~~-~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~en 186 (467) T protein:vir:80 108 RQKTVNMKFASDTKNISIAA-GLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDN 186 (467) T ss_pred EEEEEEeeeeeeeeeehhhh-hhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCCc Confidence 222233332111 1111110 1223466777777789999999999987761 00000 112 Q ss_pred eeccCCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhhhhh-hhhhhhccccchhhh-----------H Q lcl|NC_019501. 130 VVHDTRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRKAGR-NLVDGDIFGRVTEDA-----------Y 197 (431) Q Consensus 130 v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~-~~~~~~~~~~~~~~a-----------~ 197 (431) |.+..|..-.. +.++..+ .....-.|-|.| ++++.-.++..+. .|.........+... - T Consensus 187 viDa~G~~ls~---~~lneaa--~~i~~gfG~~td----~~~p~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v~g~~s 257 (467) T protein:vir:80 187 VHDARGASLTE---SLLNQAA--VMISKGYGTPTD----AYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHS 257 (467) T ss_pred eeccCCCccCH---HHHHHHh--hhccccccChhh----hhcchhHHhhhhhhhcCceEEEEcCCCCceeeeecccceec Confidence 22222211110 0000101 001111222221 2222223333311 111100000000000 0 Q ss_pred hhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeecccccccccceeeEEEeeccceeeeccEE--E Q lcl|NC_019501. 198 RNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTTGFKRGDKI--S 275 (431) Q Consensus 198 r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~--T 275 (431) -.|.|. +.| ..+|.+.+.+.+..-.... |++ ...+. ++.....+|....||.- . T Consensus 258 a~G~I~--l~g-s~il~~~~~l~~~~~~~~~-----Aps----p~~vs------------aT~~~~~~g~~~~~~~a~y~ 313 (467) T protein:vir:80 258 ARGFIK--LHG-STVMENEQILDERILALPT-----APQ----PAKVT------------ATQEAGKKGQFRAEDLAAHE 313 (467) T ss_pred ceeeee--ecC-ceeeccccCCCcccccccc-----ccc----CCccc------------eeeecccCCcccCCCcceEE Confidence 022221 112 1244444554443311110 110 00000 11111112223334311 1 Q ss_pred EcceeeeccccccccC-CCceEEEEEeecCceeEEeecccccccccccccccccceeecccccCceeEEeccCCcceeee Q lcl|NC_019501. 276 FTGVKFLSQMAKNVLT-DDATFSITRVIDSTHIEITPKPIALDDASLTKEEKAYANVNTSLADNTPVNVLNVATTTANVF 354 (431) Q Consensus 276 iaGV~~vn~~tk~~~~-~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~ 354 (431) -+ |-.+|... +... ...+.+|++...+.+++|.+..++.. ...|-++=..-+++.-.- T Consensus 314 Y~-v~~vs~~G-ES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~-------~p~yv~IYR~~~gg~~f~------------ 372 (467) T protein:vir:80 314 YK-VVVSSDDA-ESIASEVATATVTAKDDGVKLEIELAPMYSS-------RPQFVSIYRKGAETGLFY------------ 372 (467) T ss_pred EE-EEEECCCC-ccccccceEEEecCcccceeEEEEecCCCCC-------cceEEEEEEeCCCCccee------------ Confidence 10 11223322 3333 33556666655666777776554432 123444444433332222 Q ss_pred eccceeEEEeecccCCCCCcceeeEEEee---cCcce--------------EEEEEEecccccccceEEEEEee-ccc-e Q lcl|NC_019501. 355 WADDSIRLLSQPIPVTHELFAGMKTSSFS---IPGIG--------------VNGIFATQGDINTLSGKCRIAVW-YSA-C 415 (431) Q Consensus 355 fhr~A~aLat~pl~~p~g~~~a~~~~t~~---~~g~g--------------lslrv~~~yd~~~~~~~~r~dvl-yG~-~ 415 (431) .+...|...-.+ +..+..+. +||.+ ++++..++.|--+-...+++-+| ||. . T Consensus 373 ------li~~va~~~a~~---gt~tf~D~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygala 443 (467) T protein:vir:80 373 ------LIARVPASKAEN---NVITFYDLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALA 443 (467) T ss_pred ------EeeeEeeeecCC---CeEEEEcCCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHh Confidence 222222211000 11111100 01111 12344455555555556666554 777 6 Q ss_pred ecccceeEEecCCCCC Q lcl|NC_019501. 416 AVRPEAIGVGLPNQTA 431 (431) Q Consensus 416 ~v~PElagv~i~~q~~ 431 (431) +..|+. .++|-+-+- T Consensus 444 l~~Pk~-~~~ikNv~~ 458 (467) T protein:vir:80 444 LRAPKK-WVRIRNVKY 458 (467) T ss_pred hhcccc-ceEEEEeee Confidence 667887 578776554 No 81 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=22.09 E-value=2.5 Score=18.40 Aligned_cols=363 Identities=14% Similarity=0.074 Sum_probs=117.6 Q ss_pred Cc-------cccccchhHHHHHHHH-----------------------HHHhhcccchhhc-cCCCchHHHhh--cccEE Q lcl|NC_019501. 1 MA-------LNEGQLVTYALDEIIE-----------------------TVQNLTPMASKVT-KYTPPAESMQR--SSNTV 47 (431) Q Consensus 1 ~~-------~~~~~~lt~~~~evi~-----------------------~len~lvma~~V~-~~r~y~~e~~k--~GdTv 47 (431) |- +|++++=. .+|.+. .||..+.+=--.+ .|..| ++..| .-.|| T Consensus 1 ~~~~~~~~~~~~~~~~~--~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~-~di~k~~a~stv 77 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNS--VQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY-KDIAKKPATSTV 77 (468) T ss_pred CCCCcchhhccccChhH--HHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhh-hhcccchhhhhh Confidence 21 11111111 122222 2333333311111 11111 11111 11222 Q ss_pred E--eeccccccccccc-ccccCc-----cccccceeEEEeccc-cccceEeeHHHhhHHHHHHhhhhHHHHHHHHHHHHH Q lcl|NC_019501. 48 W--MPVEQEAPTQTGW-NLTGNA-----TGILELSVKCNMGDP-DNDFFELRADDLRDERSYRRRIQASAKKLANNIESA 118 (431) Q Consensus 48 ~--ip~p~~~~~~~g~-~~s~~~-----~di~e~~V~v~ld~~-k~v~f~lt~keL~i~~~s~~~L~~Am~~LAn~Id~d 118 (431) . +-.-.+..+-++. ....-. .++....+.+++=-. +.|.... .+-.++.|..++.-+-||.+||+.||.- T Consensus 78 ~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~-~l~n~i~d~~~~~~~~ai~~~a~tiE~a 156 (468) T protein:vir:63 78 AKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAA-GLVNNIQDPMQILTDDAIVNIAKTIEWA 156 (468) T ss_pred hhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhh-hhhcchhhHHHHHHHHHHHHHHHHHHHH Confidence 1 1111222222221 111111 122222222332111 1111110 1223466777777789999999999987 Q ss_pred HHH-hhccc-------------------cceeeccCCCCCCccchhhhhhHHHHHHHHHhhcCCccCCcEEEeChHHHhh Q lcl|NC_019501. 119 IAK-QATEM-------------------GSLVVHDTRAIGPSTGLSGWDFVSDAERLMFSRELNRDMGISYFLNPDDYRK 178 (431) Q Consensus 119 l~~-~~~~~-------------------~~~v~t~~~t~~~~~~~~~~~d~a~a~~~L~~~~vP~~~~r~~v~np~~~a~ 178 (431) ..= -.... +-+|.+..|..-.. +.++..+ .....-.|-|.| ++++.-.++. T Consensus 157 ~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~---~~lneaa--~~i~~gfG~~td----~~~~~~v~a~ 227 (468) T protein:vir:63 157 SFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTE---SLLNQAA--VMISKGYGTPTD----AYMPVGVQAD 227 (468) T ss_pred hhhcccccccCCCccccccccceeEEecCCceeccCCCccCH---HHHHHHh--hhccccccChhh----hhcchhHHhh Confidence 761 00000 11222222211110 0000101 001111222221 2222223333 Q ss_pred hhh-hhhhhhccccchhhh-----------HhhccccccchhhhhhHhcCCCccccccccccceecccccccceeeeeec Q lcl|NC_019501. 179 AGR-NLVDGDIFGRVTEDA-----------YRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDT 246 (431) Q Consensus 179 ~~~-~~~~~~~~~~~~~~a-----------~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~tv~gA~q~~~~~~~v~~ 246 (431) .+. .|.........+... --.|.|. +.| ..+|.+.+.+.+..-.... |++ ...+. T Consensus 228 ~~~~~L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~--l~g-s~il~~~~~l~~~~~~~~~-----Aps----p~~vs- 294 (468) T protein:vir:63 228 FVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIK--LHG-STVMENEQILDERILALPT-----APQ----PAKVT- 294 (468) T ss_pred hhhhhcCceEEEEcCCCCceeeeecccceecceeeee--ecC-ceeeccccCCCcccccccc-----ccc----CCccc- Confidence 311 111100000000000 0022221 112 1244444554443311110 110 00000 Q ss_pred ccccccccceeeEEEeeccceeeeccEEE--EcceeeeccccccccC-CCceEEEEEeecCceeEEeecccccccccccc Q lcl|NC_019501. 247 DGNKENVDNRVATVTVSSTTGFKRGDKIS--FTGVKFLSQMAKNVLT-DDATFSITRVIDSTHIEITPKPIALDDASLTK 323 (431) Q Consensus 247 ~g~~~~~d~~~~~i~~s~tg~lkaGDv~T--iaGV~~vn~~tk~~~~-~lq~fvVta~~~a~ti~I~Paii~~~~~t~~~ 323 (431) ++.....+|....||.-+ -+ |-.+|... +... ...+.+|++...+.+++|.+..++.. T Consensus 295 -----------aT~~~~~~g~~~~~~~a~y~Y~-v~~vs~~G-ES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~------ 355 (468) T protein:vir:63 295 -----------ATQEAGKKGQFRAEDLAAHEYK-VVVSSDDA-ESIASEVATATVTAKDDGVKLEIELAPMYSS------ 355 (468) T ss_pred -----------eeeecccCCcccCCCcceEEEE-EEEECCCC-ccccccceEEEecCcccceeEEEEecCCCCC------ Confidence 111111122233343111 10 11223322 3333 33556666655666777776554432 Q ss_pred cccccceeecccccCceeEEeccCCcceeeeeccceeEEEeecccCCCCCcceeeEEEee---cCcce------------ Q lcl|NC_019501. 324 EEKAYANVNTSLADNTPVNVLNVATTTANVFWADDSIRLLSQPIPVTHELFAGMKTSSFS---IPGIG------------ 388 (431) Q Consensus 324 ~~~~y~nVsa~~A~~aavTv~g~~s~~~Nl~fhr~A~aLat~pl~~p~g~~~a~~~~t~~---~~g~g------------ 388 (431) ...|-++=..-+++.-.- .+...|...-.+ +..+..+. +||.+ T Consensus 356 -~p~yv~IYR~~~gg~~f~------------------li~~va~~~a~~---gt~tf~D~n~~iPgT~~~fVgem~~~~i 413 (468) T protein:vir:63 356 -RPQFVSIYRKGAETGLFY------------------LIARVPASKAEN---NVITFYDLNDSIPETVDVFVGEMSANVV 413 (468) T ss_pred -cceEEEEEEeCCCCccee------------------EeeeEeeeecCC---CeEEEEcCCcccCCCcceeeeecChhHH Confidence 123444444433332222 222222211000 11111100 01111 Q ss_pred --EEEEEEecccccccceEEEEEee-ccc-eecccceeEEecCCCCC Q lcl|NC_019501. 389 --VNGIFATQGDINTLSGKCRIAVW-YSA-CAVRPEAIGVGLPNQTA 431 (431) Q Consensus 389 --lslrv~~~yd~~~~~~~~r~dvl-yG~-~~v~PElagv~i~~q~~ 431 (431) ++++..++.|--+-...+++-+| ||. .+..|+. .++|-+-+- T Consensus 414 ~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~-~~~ikNv~~ 459 (468) T protein:vir:63 414 HLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKK-WVRIRNVKY 459 (468) T ss_pred HHHHHhccccCChhHhccchhhhhhhhhHHhhhcccc-ceEEEEeee Confidence 12344455555555556666554 777 6667887 578776554 No 82 >protein:vir:395 Length: 117 # NCBI annotation: gp10 # Family: family:all:1908 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046905;genbank:gi:9630475;genbank:GeneID:1261649 Probab=21.02 E-value=2.4 Score=18.50 Aligned_cols=110 Identities=15% Similarity=0.095 Sum_probs=35.4 Q ss_pred cEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCC-Ccccccc-cccc---ceeccccccccee Q lcl|NC_019501. 167 ISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPK-LPAVTKS-TATG---VTVSGAQKFKPQA 241 (431) Q Consensus 167 r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~-v~~~t~g-t~~~---~tv~gA~q~~~~~ 241 (431) ..=+-|+.|.+.+.-+- -|.+.+ |+..++.+++ .+++-.| .... +.+.|.+ T Consensus 1 m~~~dNlFd~ama~aD~-----------------aI~~~~-g~~a~i~~g~~~~rti~gVFDdP~~~~~~aggg------ 56 (117) T protein:vir:39 1 MADFDNLFDEAMSRADG-----------------AIRGVM-GTEAKVMSGTLSGATLVGVFDDPENIGYAGAGI------ 56 (117) T ss_pred CCcccchHHHHHHhhhH-----------------HHHHhc-CceEEEEeCCCCceEEEEEecCccccccccCce------ Confidence 22333444444432111 121212 2222222222 1121111 0000 0011111 Q ss_pred eeeecccccccccceeeEEEeeccceeeeccEEEEcceeeeccccccccCCCceEEEEEeec--CceeEEeecccccccc Q lcl|NC_019501. 242 YTLDTDGNKENVDNRVATVTVSSTTGFKRGDKISFTGVKFLSQMAKNVLTDDATFSITRVID--STHIEITPKPIALDDA 319 (431) Q Consensus 242 ~~v~~~g~~~~~d~~~~~i~~s~tg~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fvVta~~~--a~ti~I~Paii~~~~~ 319 (431) -|..+ .....+--+-...|++||.+||.| ++|.|+...- .++=.|+-+ .. T Consensus 57 -~ie~s-------aP~LfvktaDv~gl~r~D~vtI~g---------------~~y~V~~~~pDg~G~~~l~L~-----rg 108 (117) T protein:vir:39 57 -RVEGT-------SPTLFVKTSTVSQLQRMDTLTING---------------RQFWVDRVGPDDCGSCHIWLG-----NG 108 (117) T ss_pred -EEecc-------CcEEEEeeccccccCCCCEEEECC---------------CceEEeeeccCCCceEEEEee-----cC Confidence 11000 001111112245799999999998 7787766322 222222211 00 Q ss_pred cccccccccceeecccccCcee Q lcl|NC_019501. 320 SLTKEEKAYANVNTSLADNTPV 341 (431) Q Consensus 320 t~~~~~~~y~nVsa~~A~~aav 341 (431) .. |+++-.= T Consensus 109 -----~p--------p~~~~~~ 117 (117) T protein:vir:39 109 -----TP--------PASSRRR 117 (117) T ss_pred -----CC--------CCccCCC Confidence 10 1110000 No 83 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=20.00 E-value=2.8 Score=18.09 Aligned_cols=266 Identities=11% Similarity=0.128 Sum_probs=100.7 Q ss_pred Ccccccc-ch-hHHHHHHHHHHHhhcccchhhccCCCchHHHhhcccEEEeeccccccccccccccc---Ccccccccee Q lcl|NC_019501. 1 MALNEGQ-LV-TYALDEIIETVQNLTPMASKVTKYTPPAESMQRSSNTVWMPVEQEAPTQTGWNLTG---NATGILELSV 75 (431) Q Consensus 1 ~~~~~~~-~l-t~~~~evi~~len~lvma~~V~~~r~y~~e~~k~GdTv~ip~p~~~~~~~g~~~s~---~~~di~e~~V 75 (431) |+.+.+. +| +...+++|+.++...++.++++ +.+- .|.++.+|+-..... ..|...+ ...++.-.++ T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~-~~~~------~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~f~~v 101 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK-YEPM------EGTEKKFTFWADKPG-AYWVGEGQKIETSKATWVNA 101 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhcc-eeec------cCCceEEEEEecCcc-eeeecCCccccccccceeEE Confidence 4444333 44 3345899999999999888874 2221 145677776422211 1222221 1223333333 Q ss_pred EEEeccccccceEeeHHHhhH-HHHHHhhhh-HHHHHHHHHHHHHHHHhh-cc-ccc-eeeccCCCCCCccchhhhhhHH Q lcl|NC_019501. 76 KCNMGDPDNDFFELRADDLRD-ERSYRRRIQ-ASAKKLANNIESAIAKQA-TE-MGS-LVVHDTRAIGPSTGLSGWDFVS 150 (431) Q Consensus 76 ~v~ld~~k~v~f~lt~keL~i-~~~s~~~L~-~Am~~LAn~Id~dl~~~~-~~-~~~-~v~t~~~t~~~~~~~~~~~d~a 150 (431) .++..+-. ..+.+|.+-|+. ....+++|+ .-..+++.++|..++.=- .. .+. ...+.............|.++. T Consensus 102 ~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) T protein:vir:96 102 TMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNII 180 (324) T ss_pred EEEeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHH Confidence 33332211 223333333433 223456664 455689999999887210 00 000 0000011111111223467777 Q ss_pred HHHHHHHhhcCCccCCcEEEeChHHHhhhhhhhhhhhccccchhhhHhhccccccchhhhhhHhcCCCccccccccccce Q lcl|NC_019501. 151 DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVTEDAYRNGTIQRQIAGFDEILRSPKLPAVTKSTATGVT 230 (431) Q Consensus 151 ~a~~~L~~~~vP~~~~r~~v~np~~~a~~~~~~~~~~~~~~~~~~a~r~g~i~r~~~Gfd~~~~s~~v~~~t~gt~~~~t 230 (431) .+...|....... -.+++||.....+.. +. +..++ -.+..+.- ..+.|+. +..++..+.. .+ . . T Consensus 181 ~~~~~i~~~~~~~---~~~i~n~~~~~~L~~-lk--d~~G~---~~~~~~~~-~~l~G~P-V~~~~~~~~~-~~---~-~ 244 (324) T protein:vir:96 181 DLEALLEDDELEA---NAFISKTQNRSLLRK-IV--DPETK---ERIYDRNS-DSLDGLP-VVNLKSSNLK-RG---E-L 244 (324) T ss_pred HHHHhhhhccCCC---CEEEEcHHHHHHHHH-hh--CCCCC---eeecCCCC-Cccccee-eEeecCCCCC-cc---e-E Confidence 7766666554432 348899988776532 11 11111 11222222 2244543 2222221110 00 0 0 Q ss_pred ecccccccceeeeeecccccccccceeeEEEeeccc-------------eeeeccEEEEcceeeeccccccccCCCceEE Q lcl|NC_019501. 231 VSGAQKFKPQAYTLDTDGNKENVDNRVATVTVSSTT-------------GFKRGDKISFTGVKFLSQMAKNVLTDDATFS 297 (431) Q Consensus 231 v~gA~q~~~~~~~v~~~g~~~~~d~~~~~i~~s~tg-------------~lkaGDv~TiaGV~~vn~~tk~~~~~lq~fv 297 (431) +-|.-+. +.+...+ ...+-++--. .+..-|.+.|-....+.- . ..+..-|+ T Consensus 245 ~~gd~s~----~~~~~~~--------~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~---~-v~~~~a~~ 308 (324) T protein:vir:96 245 ITGDFDK----LIYGIPQ--------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL---H-IADDKAFA 308 (324) T ss_pred EEEecce----EEEEEec--------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc---E-EecccceE Confidence 1111000 0000000 0001111000 011223333222111111 0 01122355 Q ss_pred EEEeec------Ccee Q lcl|NC_019501. 298 ITRVID------STHI 307 (431) Q Consensus 298 Vta~~~------a~ti 307 (431) +...+. .+.+ T Consensus 309 ~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 309 KLVPADKRTDSVPGEV 324 (324) T ss_pred EEecccccCCCCCCCC Confidence 544333 2334 Done!