Comments (6)
Here's my style, feel free to grab: https://github.com/Komzpa/furry-sansa/blob/master/assets/osm2pgsql.style
Also it may be useful to have some other lines pulled from there, like area:highway -> phstore.
from osm2pgsql.
Right now https://github.com/openstreetmap/osm2pgsql/blob/4802a525fd/output-pgsql.c#L162 raises an error if phstore is used without hstore mode. Perhaps we should change that so we can add area:highway and other tags to default.style?
Probably a longer-term issue
from osm2pgsql.
Adding more of those "delete"s to the default style probably does make sense. On the other hand, does anyone have an estimate of how much disk space is saved by not including them in the hstore? Is it worth the change and the potential loss in backwards compatibility?
from osm2pgsql.
I doubt it would break backwards compatibility in any practical way. It is unlikely that anyone is using tags like tiger:upload_uuid except for the db on errol. People doing very unusual things with osm2pgsql should not use default.style from the package for consuming minutely updates, or else we would never be able to change it.
It's also worth noting that many of these tags are generally only ever dropped in minutely diffs, because most editors remove them.
from osm2pgsql.
> On the other hand, does anyone have an estimate of how much disk space is saved by not including them in the hstore?
The tags hstore column for the linestring table on errol is 17GB. Assuming most ways don't have enough tags to be TOASTed (reasonable, unless there's a lot of tags, in which case they're probably droppable), here are some numbers:
There are 13.5m TIGER ways. With the droppable TIGER tags, a rough estimate puts the count at 100 million tags dropped, with an average key length of 15 bytes. If you assume 30 bytes total per tag, that's 3GB, for a reduction of about 18% from TIGER alone. The tags index is about twice the size, and it'd reduce that too.
Yahoo is another big set, with about 40 million tags. They're not 8-bit characters, but it's a few percent.
Overall I'd guess 25%-50% of the tags hstore could be gotten rid of.
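The TIGER arithmetic above can be reproduced with a short back-of-the-envelope sketch. The input figures are the ones quoted in this comment; the function names are made up for illustration, and treating 1GB as 10^9 bytes is an assumption, so take the result as a rough estimate rather than a measurement.

```python
# Rough estimate of hstore space saved by dropping TIGER tags.
# Input figures come from the comment above; 1 GB = 1e9 bytes is an assumption.

HSTORE_SIZE_GB = 17                # tags hstore column, linestring table
TIGER_TAGS_DROPPED = 100_000_000   # rough count of droppable TIGER tags
BYTES_PER_TAG = 30                 # assumed total cost per tag (key plus value)


def savings_gb(tag_count, bytes_per_tag):
    """Space freed, in GB, by dropping tag_count tags of the given size."""
    return tag_count * bytes_per_tag / 1e9


def savings_fraction(tag_count, bytes_per_tag, column_gb):
    """Freed space as a fraction of the current column size."""
    return savings_gb(tag_count, bytes_per_tag) / column_gb


tiger_gb = savings_gb(TIGER_TAGS_DROPPED, BYTES_PER_TAG)
tiger_share = savings_fraction(TIGER_TAGS_DROPPED, BYTES_PER_TAG, HSTORE_SIZE_GB)
print(f"TIGER: {tiger_gb:.1f} GB saved, about {tiger_share:.0%} of the column")
```

The same two functions can be called with the Yahoo tag count once a per-tag size is assumed for it.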
from osm2pgsql.
Fixed in 1331b6c
from osm2pgsql.