Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) #182

ankostis · 2014-09-24T03:01:02Z

For issue #158:

benchmark branch: Added a benchmark with meta_schemas cheking & extensive sample-schema with references.
Implemented two optimizations:
2.a. split_scopes branch: Keep URIs in 2 parts (url, fragment) to avoid fragging/defragging since fragments do not join, they always override (~1.1 faster)
2.b. unroll_scopes branch: Replace resolver.in_scope() context-manager with a push/pop stack of scopes, that does nothing it when no id property exists (~1.5 faster).
split_unroll_scopes branch: Combined both optimizations (~1.6 faster).
For future reference, the timed-results are engraved on the respective methods of wltp/test/test_benchmarks.py TC (unfortunately they are host-specific).

ALL TCs RUN AS OK (apart from split_frags which was post-edited on the combined branch)

* Improve statistics print-outs. * Engrave timming results in benchmark docstrings.

…ing into a list whe not null (instead of using a context-manager each time) Roughly x 1.5 faster

…g by keeping fragments separated from URL (and avoid redunant frag/defrag).

…ization-branches. * FIX 2 forgotten test-case on resolver-URIs from split_scopes. x 1.8 faster in big referenced model.

…s_stack empty when iteration breaks (no detectable performance penalty). * Replace non-python-2.6 DefragResult with named-tuple. * Add test-case checking scopes_stack empty.

…e with hand-made stats-funcs.

Julian · 2014-09-24T14:56:13Z

Hey -- thanks a lot, I haven't obviously gotten a chance to really read through most of this yet, but thanks for tackling this.

The one thing I notice so far is that the tests and methods (in_scope in particular) that were removed are public API unfortunately, so we need to keep supporting them even if we don't use them internally.

For the benchmarks, what we'll have to do is basically baseline each machine (there are some tools to do that or we can do it manually) in order for the performance numbers to scale to wherever you run the tests, but it looks like you've definitely got a good start there.

I'll have a read through but it'll probably be a few days, but definitely thanks for working on this.

Neglibly slower, BUT reduced stdev and simpler main-loop code.

dnephin · 2015-02-27T23:36:24Z

This changes improve the performance on my benchmark by nearly 2x.

What can I do to help get this merged? It looks like the tests are failing and it needs to be rebased with msater.

cgurnik · 2015-04-20T19:29:23Z

@Julian Would it be possible for you to take a look at this branch? Our project has a relatively simple schema, and validation using jsonschema is frequently the slowest part of our code. A performance increase of 2x would be a significant benefit to us. As @dnephin mentioned, what can I do to help get this merged in?

Julian · 2015-04-20T20:02:32Z

Hi @cgurnik -- @dnephin already successfully managed to get in some performance changes -- can you give your project a shot on current master and see if you get any difference?

I'm trying to find some time to do a release but I was overseas for a few days.

If you don't see results can you possibly include some profiling output, would love to have a look!

dnephin · 2015-04-20T21:09:27Z

Most of the changes from this branch were included in my branch, so it might even be appropriate to close this one.

ankostis · 2015-04-20T23:08:05Z

@dnephin Do you mean that this dnephin/jsonschema@1c15827 (branch: perf_take2) contains roughly the 2 perf-improvements mentioned in this PR, but not #184?

dnephin · 2015-04-21T00:39:57Z

Right, I included many of them in dnephin@2fda155

ankostis added 5 commits September 23, 2014 20:06

issue python-jsonschema#158: Add benchmarks based on V3 &V4 schemas.

d6f29b8

issue python-jsonschema#158: Add Big model with use of $ref and $id.

f0422be

* Improve statistics print-outs. * Engrave timming results in benchmark docstrings.

issue python-jsonschema#158: Unroll scope-resolution optionally appen…

a22789a

…ing into a list whe not null (instead of using a context-manager each time) Roughly x 1.5 faster

issue python-jsonschema#158: TRY to speed-up scope & $ref url-handlin…

0306213

…g by keeping fragments separated from URL (and avoid redunant frag/defrag).

issue python-jsonschema#158: Merge unroll_scopes & split_scopes optim…

179656d

…ization-branches. * FIX 2 forgotten test-case on resolver-URIs from split_scopes. x 1.8 faster in big referenced model.

ankostis changed the title ~~Optimizations: Split and unroll scopes (x1.6 faster)~~ Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) Sep 24, 2014

ankostis added 3 commits September 24, 2014 11:48

issue python-jsonschema#158: Use try-finally to ensure resolver scope…

4c4da0d

…s_stack empty when iteration breaks (no detectable performance penalty). * Replace non-python-2.6 DefragResult with named-tuple. * Add test-case checking scopes_stack empty.

issue python-jsonschema#158: Replace python-3.4 only statistics modul…

fa13f74

…e with hand-made stats-funcs.

issue python-jsonschema#158: Fix unicode issue with python-2.7.

f0c6f8f

ankostis force-pushed the split_unroll_scopes branch from 4eba788 to f0c6f8f Compare September 24, 2014 13:11

ankostis mentioned this pull request Sep 24, 2014

Performance Problems #158

Closed

ankostis mentioned this pull request Sep 24, 2014

Optimize: Allow rules to break error-loop (x2 faster) #184

Closed

ankostis referenced this pull request in ankostis/jsonschema Sep 28, 2014

Possible to breakout of loop for a single node (ie for $ref).

d0609d9

Neglibly slower, BUT reduced stdev and simpler main-loop code.

This was referenced Feb 28, 2015

Fix flaky errors with py3 #201

Closed

Performance - reduce urljoin/urldefrag overhead #202

Closed

ankostis force-pushed the split_unroll_scopes branch 2 times, most recently from ddbc519 to f0c6f8f Compare March 1, 2015 23:41

dnephin mentioned this pull request Mar 2, 2015

Perfornance - Cache expensive url operations #203

Merged

Julian closed this Jun 8, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) #182

Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) #182

ankostis commented Sep 24, 2014

Julian commented Sep 24, 2014

dnephin commented Feb 27, 2015

cgurnik commented Apr 20, 2015

Julian commented Apr 20, 2015

dnephin commented Apr 20, 2015

ankostis commented Apr 20, 2015

dnephin commented Apr 21, 2015

Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) #182

Add Benchmarks & Split/Unroll-scopes optimizations (x1.6 faster) #182

Conversation

ankostis commented Sep 24, 2014

Julian commented Sep 24, 2014

dnephin commented Feb 27, 2015

cgurnik commented Apr 20, 2015

Julian commented Apr 20, 2015

dnephin commented Apr 20, 2015

ankostis commented Apr 20, 2015

dnephin commented Apr 21, 2015