Comments (3)
I like the Join count analysis, I had to do this in my project (mentioned in #10808) in a really hacky way. We can have 2 sorts of counts:
- Count every individual join
- Count all groups of joins (subsequent joins that are related) with their respective count(this would have been useful for me)
I do have a question.
How can we show these results? Because Analyzer rules only return transformed LogicalPlans, or do I understand something in a wrong way?
from arrow-datafusion.
Thank you @LorrensP-2158466 -- sorry for the delay . I have been traveling
I do have a question.
How can we show these results? Because Analyzer rules only return transformed LogicalPlans, or do I understand something in a wrong way?
I think in an example, we can use println!
which is what we do in other examples:
datafusion/datafusion-examples/examples/rewrite_expr.rs
Lines 45 to 48 in 3773fb7
Normally the idea is to give a well documented example that shows the basic pattern that people can start with
For this one, maybe you could show how to use TreeNode::apply
to walk the tree. Something like (totally untested)
let mut join_count = 0;
plan.apply(|child| {
if matches!(child, LogicalPlan::Join(_)) {
join_count += 1;
}
});
println!("Found {join_count} joins in the plan");
from arrow-datafusion.
Thanks for the reply!
That's exactly what I have made, I'll open up a PR later today or tomorrow.
from arrow-datafusion.
Related Issues (20)
- SMJ producing different results than HashJoin when doing a semi join HOT 7
- Construction of user-defined table functions (UDTFs) should be async to allow for async schemas HOT 1
- Real-time streaming support HOT 1
- Data set which is much bigger than RAM HOT 5
- ci: clippy failed on main
- Convert `Grouping` to UDAF HOT 4
- Convert `BitAnd`, `BitOr`, `BitXor` to UDAF HOT 2
- CTE in a UNION query can escape its scope HOT 1
- Unclear error message when calling a function with no parameters.
- [Epic] Implement support for `StringView` in DataFusion HOT 8
- Implement equality `=` and inequality `<>` support for `StringView` HOT 6
- Implement `arrow_cast` support for `StringView` and `BinaryView` HOT 1
- use StringViewArray when reading String columns from Parquet HOT 11
- [EPIC] Continued correct and improved extracting Parquet statistics into ArrayRefs HOT 8
- Update ListingTable to use `StatisticsConverter`
- `StatisticsConverter::row_group_null_counts` incorrect for missing column HOT 4
- Support extracting `Int8`, `Int16`, `Int32` statistics from Parquet Data Pages HOT 2
- Do we need to escape search string as it's used in regexp? Wondering what's the result of `contains("abcdefg", ".*")` HOT 6
- Add a benchmark for extracting parquet data page statistics HOT 1
- Push down filters below `Unnest` in sub queries HOT 11
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from arrow-datafusion.