Comments (18)
See datafusion-contrib/datafusion-functions-json#26 - support for custom SQL operators in datafusion-functions-json
using #11208.
from datafusion.
hello @alamb Just checking in on the remaining tasks. Is there anything specific we're waiting on before we create issues ? If we're all set, i would be happy to jump in and get started to pick up few tasks.
Hi @dharanad I don't think there is anything from my perspective. Thank you for offering
In fact it seems as if @xinlifoobar has already started with #11215 ❤️
from datafusion.
I've created issues for a couple of tasks. Please let me know if you think anything needs updating in the descriptions. I'm new here and learning from shadowing the experienced folks
from datafusion.
FWIW in general @dharanad I have had the best luck with writing a description on tickets that requires as little context as possible (aka distill down what is needed into the the description, rather than assuming the new contributor will read the epic and get all the backstory)
The rationale for this duplication is to lower the barrier to new contrbutors
from datafusion.
Given how much
UserDefinedSQLPlanner
is being used for existing stuff within datafusion, perhaps it should be called justSQLPlanner
orCustomSQLPlanner
?
I agree
Or maybe something like ExprPlanner
🤔 as it is being used to plan specific exprs.
from datafusion.
Given #11220 and #11243, those are very similar APIs with UDF plans. I am trying to draft an API, e.g.,
// Plan the user defined function, returns origin expression arguments if not possible
fn plan_udf(
&self,
_sql: &sqlparser::ast::Expr,
args: Vec<Expr>,
) -> Result<PlannerResult<Vec<Expr>>> {
Ok(PlannerResult::Original(args))
}
to uniform the usages.
I have created a draft PR #11263 to discuss this. The flaw here is that the parameter sql
is partially borrowed and has to be cloned at the very beginning. Maybe we should consider using references if possible.
from datafusion.
Given #11220 and #11243, those are very similar APIs with UDF plans. I am trying to draft an API, e.g.,
// Plan the user defined function, returns origin expression arguments if not possible fn plan_udf( &self, _sql: &sqlparser::ast::Expr, args: Vec<Expr>, ) -> Result<PlannerResult<Vec<Expr>>> { Ok(PlannerResult::Original(args)) }
to uniform the usages.
I have created a draft PR #11263 to discuss this. The flaw here is that the parameter
sql
is partially borrowed and has to be cloned at the very beginning. Maybe we should consider using references if possible.
Eventually, I made this #11263, please let me know your thoughts. Thanks :)
CC @jayzhan211 @dharanad @alamb
from datafusion.
I think we can claim we are done 🎉
thanks everyone
from datafusion.
#11208 allows user defined sql planners to be defined.
from datafusion.
hello @alamb Just checking in on the remaining tasks. Is there anything specific we're waiting on before we create issues ?
If we're all set, i would be happy to jump in and get started to pick up few tasks.
from datafusion.
I've created issues for a couple of tasks. Please let me know if you think anything needs updating in the descriptions. I'm new here and learning from shadowing the experienced folks
thank you @dharanad -- this is very helpful 🙏
from datafusion.
FWIW in general @dharanad I have had the best luck with writing a description on tickets that requires as little context as possible (aka distill down what is needed into the the description, rather than assuming the new contributor will read the epic and get all the backstory)
The rationale for this duplication is to lower the barrier to new contrbutors
Thanks for the feedback! I really appreciate. You're right, making the ticket description concise and self-contained will definitely help reduce the barrier for new contributors. I'll update the description to include the necessary context. Thanks you
from datafusion.
Create issues for the remaining tasks, tried adding a description based on my understanding of the issue. Also update the same for the older ones
from datafusion.
Given how much UserDefinedSQLPlanner
is being used for existing stuff within datafusion, perhaps it should be called just SQLPlanner
or CustomSQLPlanner
?
from datafusion.
ExprPlanner
sounds good.
from datafusion.
ExprPlanner
sounds good.
Filed #11304
from datafusion.
I think we are pretty close to calling this done.
I just double checked and sql_compound_identifier_to_expr
is the only thing that needs this treatment to remove the call to get_function_meta
:
datafusion/datafusion/sql/src/expr/identifier.rs
Lines 138 to 139 in bfd8156
That appears to be the last issue
https://github.com/search?q=repo%3Aapache%2Fdatafusion+get_function_meta+path%3A%2F%5Edatafusion%5C%2Fsql%5C%2F%2F&type=code
from datafusion.
Filed #11473
from datafusion.
Related Issues (20)
- Add min_by and max_by aggregate functions HOT 1
- A simple count() query caused Internal Error in PhysicalOptimizer (SQLancer) HOT 1
- Add additional regexp function `regexp_count()`
- Use GenericStringBuilder to improve performance of UDF split_part
- Document "how to read an explain plan" HOT 9
- Thread panics in SpawnedTask during shutdown. HOT 6
- SMJ full join produces a null join row for a buffered row even the buffered row is joined with streamed row and passes join filter
- Implement GroupsAccumulator for stddev and var aggregaters
- Cannot infer common string type for string concat operation Dictionary(Int32, Utf8) || Dictionary(Int32, Utf8) HOT 2
- Bug detecting datatype in VALUES tuples
- Handle downstream impacts to union's behavioral changes. HOT 5
- Aggregation fuzz testing HOT 4
- Systematic fuzz testing for parquet predicate pushdown HOT 1
- Support protobuf serialization for `ScalarValue::Utf8View` and `ScalarValue::BinaryView`
- Support substrait serialization for `ScalarValue::Utf8View` and `ScalarValue::BinaryView` HOT 1
- Add config flag to convert `Utf8View`/`BinaryView` --> `Utf8` / `Binary` at output
- Add the possibility to work with plan clones directly in `inline_table_scan` HOT 3
- Parquet statistics missing when reading `Utf8` as `Utf8View`
- Panic in `substring()` scalar function (SQLancer)
- Fuse operations in `equal_rows_arr`
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datafusion.