Comments (3)
I am willing to work on this issue. Please guide me further.
from weather-tools.
Sure thing, @pranay101. Sometimes users want to download data in one chunk instead of smaller shards. This should be possible today, but isn't. Ideally, users should be able to pass a target path with no template placeholders, and thus have no need to write partition_keys at all. The work for this issue involves updating weather-dl's parser (around these lines) such that:
- users can create a target path with no templates
- if the target path has no templates, users do not need to specify any partition key argument
- error messages guide the user to the right config, given these changes
- documentation related to these parser changes is updated
- unit tests are updated and added to check this behavior
Further, we need to test that the rest of the pipeline works e2e when there is only one shard's worth of work. To my knowledge, this should be the case, but it's always worthwhile to test manually (this can be done with a local run).
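For anyone picking this up, the parser check described above could look roughly like the sketch below. This is not weather-dl's actual parser code: the function names `count_template_fields` and `validate_partitioning` are hypothetical, and the rules that the number of `{}` fields must equal the number of partition keys and that keys without a template are rejected are assumptions made for illustration.

```python
from string import Formatter
from typing import List, Optional


def count_template_fields(target_path: str) -> int:
    """Count `{}`-style replacement fields in a target path (hypothetical helper)."""
    return sum(
        1
        for _, field_name, _, _ in Formatter().parse(target_path)
        if field_name is not None
    )


def validate_partitioning(
    target_path: str, partition_keys: Optional[List[str]] = None
) -> List[str]:
    """Return the partition keys to use, or raise a descriptive error.

    Assumed rules (not confirmed against weather-dl):
    - no template fields: no partition keys are required (single shard)
    - otherwise: one partition key per template field
    """
    keys = list(partition_keys or [])
    num_fields = count_template_fields(target_path)

    if num_fields == 0:
        if keys:
            raise ValueError(
                f"Target path {target_path!r} has no templates, so "
                "'partition_keys' should be omitted."
            )
        return []  # Everything is downloaded as one chunk.

    if num_fields != len(keys):
        raise ValueError(
            f"Target path {target_path!r} has {num_fields} template(s) but "
            f"{len(keys)} partition key(s) were given; these must match."
        )
    return keys
```

With these assumed rules, `validate_partitioning("out/all.nc")` passes with no keys, while a templated path such as `"out/{}-{}.nc"` still requires two partition keys.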
Fixed with #90