-
Notifications
You must be signed in to change notification settings - Fork 1.7k
feat: support Spark concat
string function
#18063
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me -- thanks @comphead
I suspect we can make this function faster but it can be done as a follow on PR
let arrays = arrays?; | ||
|
||
// Compute NULL mask | ||
let mut null_mask = vec![false; array_len]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can probably compute the null mask more efficiently using a NullBullfer::union: https://docs.rs/arrow/latest/arrow/buffer/struct.NullBuffer.html#method.union
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @alamb I'm planning to merge it to hop into the 50.3.0 release and then create a small follow up on the comment
* chore: Extend backtrace coverage * fmt * part2 * feedback * clippy * feat: support Spark `concat` * clippy * comments * test * doc
Which issue does this PR close?
Rationale for this change
Apache Spark
concat
has some differences comparing to Datafusion, namely:What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?