Git Product home page Git Product logo

Comments (14)

alamb avatar alamb commented on June 30, 2024 2

DataFusion 37.0.0 is released: #9682 🌮

from arrow-datafusion.

matthewmturner avatar matthewmturner commented on June 30, 2024 1

Really interested to see your thoughts on custom indices, its something were currently working on to improve performance of listing table.

from arrow-datafusion.

Omega359 avatar Omega359 commented on June 30, 2024 1

aliases for character_length are defined in https://github.com/apache/arrow-datafusion/blob/4bd7c137e0e205140e273a7c25824c94b457c660/datafusion/functions/src/unicode/character_length.rs#L45 and is used during udf registration @ https://github.com/apache/arrow-datafusion/blob/4bd7c137e0e205140e273a7c25824c94b457c660/datafusion/core/src/execution/context/mod.rs#L2137

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024 1

BTW something cool for next week -- @milenkovicm WASM UDFs #9326 / #9326 (comment) / https://github.com/milenkovicm/wasaffi

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024 1

Really interested to see your thoughts on custom indices, its something were currently working on to improve performance of listing table.

In case anyone else is following along, here are some items related to indexing:

While I had this on the brain, @matthewmturner I also filed #9964 as I suspect others will be interested in helping make ListingTable faster too

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Review queue

Arrow

DataFusion

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Arrow:

DataFusion

from arrow-datafusion.

seddonm1 avatar seddonm1 commented on June 30, 2024

Sorry for the location and possibly dumb question, but shouldn't all functions be exported here: https://github.com/apache/arrow-datafusion/blob/main/datafusion/functions/src/unicode/mod.rs#L138

I am probably using the library incorrectly but (for example) LENGTH will not be registered by the central context here: https://github.com/apache/arrow-datafusion/blob/2f550032140d42d1ee6d8ed86f7790766fa7302e/datafusion/core/src/execution/context/mod.rs#L1450

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Hi @seddonm1 👋

Sorry for the location and possibly dumb question, but shouldn't all functions be exported here: https://github.com/apache/arrow-datafusion/blob/main/datafusion/functions/src/unicode/mod.rs#L138

Yes that is the intention

I am probably using the library incorrectly but (for example) LENGTH will not be registered by the central context here:

https://github.com/apache/arrow-datafusion/blob/2f550032140d42d1ee6d8ed86f7790766fa7302e/datafusion/core/src/execution/context/mod.rs#L1450

I think length is registered as an alias of character_length

from arrow-datafusion.

seddonm1 avatar seddonm1 commented on June 30, 2024

Thanks @alamb . If you look at the first link I cannot see anywhere where the length alias is actually registered. My queries which previously worked started failing after this update. I haven't investigated if more functions have also not been registered.

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Arrow:

DataFusion

from arrow-datafusion.

seddonm1 avatar seddonm1 commented on June 30, 2024

Thanks @Omega359 . I was thrown as the aliases are not registered here https://github.com/apache/arrow-datafusion/blob/3ae029988754c3fd3eb000abd4b76e643b9cbc7b/datafusion/execution/src/registry.rs#L174

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Reviews:
@jayzhan211 is starting to pull aggregate functions out of the core: #9960

from arrow-datafusion.

alamb avatar alamb commented on June 30, 2024

Next week #10002

from arrow-datafusion.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.