Comments (3)
Hi @CraigWilson-ZOE! Thanks for bringing this to our attention. I think you made a great point about what the approach should be here. We agree that reducing package run time is crucial in a development sandbox, particularly if you are not planning to use this data.
Also, thanks for submitting the proposed PR! We will review it in a future sprint and see whether there are more holistic changes we can incorporate to meet resourcing requirements when running dbt.
We will keep you in the loop about further updates here and about the changes we plan to make to optimize package runs.
from dbt_mixpanel.
Hello @CraigWilson-ZOE, thank you for your suggestion regarding the filter. I am working on enhancing the overall performance of this Mixpanel package and have added additional updates on a test branch. I've eliminated the stg_mixpanel__event_tmp model in favor of transforming stg_mixpanel__event into an incremental model, with the intention that subsequent runs after the initial would not incur much compute. I've also made adjustments to some of the clustering aspects and updated the incremental strategies for all the models.
If you would be able to try out the test branch and let me know your thoughts and feedback, it would be appreciated! I am curious to know how much of an impact these updates make. Thank you!
To install the test branch, you can use the following in your packages.yml:
packages:
  - git: https://github.com/fivetran/dbt_mixpanel.git
    revision: feature/performance-enhancement
    warn-unpinned: false
Hi @CraigWilson-ZOE, following up on my last message: we still removed the tmp view, but decided to keep the stg model as ephemeral. This prevents duplicate tables while also allowing the start date to be set by the user. I have gone ahead and released v0.9.0 and will mark this issue as closed. However, if you have any additional feedback, please feel free to tag us in this thread or open a new issue!
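For anyone moving from the test branch to the release, a minimal sketch of the configuration might look like the following. The version range is an assumption based on the v0.9.0 release mentioned above, and the `date_range_start` variable name is assumed from the package README rather than confirmed in this thread, so check both against the README for the version you install.

```yaml
# packages.yml
# Assumed version range: pins to the 0.9.x line referenced in this thread.
packages:
  - package: fivetran/mixpanel
    version: [">=0.9.0", "<0.10.0"]
```

```yaml
# dbt_project.yml
# Assumption: `date_range_start` is the variable that controls the start date;
# the date below is a placeholder, not a recommended value.
vars:
  mixpanel:
    date_range_start: '2023-01-01'
```

After editing packages.yml, run `dbt deps` to pull the pinned version before the next `dbt run`.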