Comments (3)
Wouldn't SQS make most sense as a broker for Celery? Seems like it may doable from this old article here:
https://www.caktusgroup.com/blog/2011/12/19/using-django-and-celery-amazon-sqs/
from airflow.
It does make sense for use with Celery, but that is a different use-case than what I have in mind.
Our data pipeline uses SQS : Agari's Data Pipeline
We use SNS+SQS actually - we publish S3 object-created notifications over SNS+SQS. SNS has push-based topics with 0 day message retention and SQS has pull-based queues with max 14 day message retention. When we publish to SNS, SNS pushes to multiple SQS queues. We have several data importers that load this data into different DBs.
So, as part of my data pipeline, I would like to detect that a queue receives a message (it signals that it is receiving data). If that sensor returns true, I advance to the next stage : checking whether my db is receiving data. If that passes, I advance to checking if the SQS queue is drained (end of data load). If any of these fail, I want email notification. If the last step succeeds, I would like to send a "Data Load Successfully Completed" email notification.
from airflow.
Interesting! An alternative would be to check on whether the file exists in s3 (you may have to use a trigger file to signify the file is fully loaded, or load into a tmp key and rename it).
But I know nothing about your setup so I'm sure you have a much better understanding of the components you need. BTW it's very easy to create hooks and operators, and we're excited for the community to extend the portfolio of external systems we integrate with.
from airflow.
Related Issues (20)
- No LocalWorkers started when scheduler is launched in daemon mode. HOT 4
- Create `S3ToDynamoDBOperator` HOT 1
- Add ability to interact with OpenSearch in AWS HOT 3
- Issues with configuring airflow 2.6.3/python3.11 with LDAP HOT 4
- Display key in dynamic task mapping using dict in UI view of Mapped Tasks' map index
- Bug: `DataprocCreateBatchOperator` with `result_retry` raises `AttributeError` HOT 2
- Bad rendering of an inline code in the documentation HOT 2
- Extra metrics are being sent to DataDog when using Allow list with pattern matching enabled
- AirFlow Unit Testing not working as described in the documentation: HOT 1
- Undesired "<SomeOperator>.execute cannot be called outside TaskInstance!" warning HOT 6
- Airflow fully supports multi-tenancy HOT 1
- Task Killed because Recorded pid does not match the current pid HOT 1
- Wrong schedule for hourly dag HOT 10
- apache-airflow-providers-amazon added xmlsec as new dependency and pinned to a version that doesn't have wheels for new python versions HOT 19
- Task groups are not being represented in bold letters anymore HOT 4
- XCom unable to parse tuple response from DatabricksSQLOperator on SQL query execution HOT 1
- webserver: Additional property base_url is not allowed HOT 1
- DAGs are able to see historical dataset events when created new HOT 3
- Add Support for GitHub App Installation Authentication in `GithubHook` HOT 1
- Add a new ExternalAPITaskSensor to monitor external DAGs via Airflow REST API HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from airflow.