Comments (5)
Thanks - I'm now at 13s (0.01ms/pair), memory issues seem gone.
Looking great!
from ijson.
@ltalirz could you check this one? As expected I'm getting about ~4x improvement on kvitems
times for the C backend compared to the previous method, and (also as expected) kvitems
seems to be slightly faster than items
.
I also sped up a bit the python implementation of kvitems
, which now seems to be ~10, 20% faster depending on the use case, as far as I could measure.
from ijson.
Hi @rtobar - thanks a lot for your efforts!
A few observations:
- Parsing the first 1M items of my JSON now takes ~16s (0.02ms/pair). That is within a factor of 2 of my revised timings for
json.load
, which is great. - It seems like the C implementation leaks memory (it grows up to 2GB in my test)
- I checked that the pure python implementation uses ~13MB of memory throughout, i.e. the problem seems to be in the C extension
from ijson.
@ltalirz oops, bad reference counting there... I just pushed some further fixes to the kvitems
branch (which also boosts performance by yet another tiny bit, at least in my local benchmarks), please give it another nudge when possible.
from ijson.
Code merged to the master
branch, closing issue now.
from ijson.
Related Issues (20)
- yajl2_c backend crashes on PyPy3 HOT 19
- Is there a way to recursively iterate the key? HOT 4
- ijson.items(file, prefix) waits for EOF HOT 8
- Wheels for Python 3.12 with yajl2_c backend HOT 4
- Include array index HOT 2
- High level interface to iterate over lists HOT 3
- HighLevelAPI: Raise an error if the prefix does not exist HOT 2
- Is it possible to use multiple prefix HOT 8
- yajl2_c backend for lambda function HOT 2
- How to use ijson to covert string to dict? HOT 3
- How to read json records in chunks using ijson? HOT 4
- Question: is it possible that returing bytes instead of str could speedup parsing? HOT 3
- Thread safety HOT 9
- Full support for byte stream generator HOT 9
- Allow to use ijson package by a relative import HOT 4
- How can I most-efficiently check for a key in the top-level of a json object? HOT 3
- Python3.12 compilation error: ‘PyGenObject’ has no member named ‘gi_code’ HOT 5
- Is it possible to use isjon with Jsonl, ndjson ? HOT 5
- Memory leak on exception handling with yajl2_c backend HOT 6
- _yajl2 backend broken with Python 3.12 HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ijson.