Open
Labels
EPIC (A larger project, actively underway, with sub tasks), enhancement (New feature or request)
Description
This is a follow-on to #3058, as we have made significant progress since @kmitchener originally posted it.
“Write cool software and tell people about it” – Paul Dix @pauldix (Founder and CTO of InfluxData)
Call to action:
The DataFusion community has invested a lot in the cool software; this ticket tracks tasks to improve the "tell people about it" part.
User Guide
- Getting started guide for new users (who want to use DataFusion in their project) #7014
- Automate production of SQL and DataFrame references for SQL functions / Expressions #3092
- Write a case study about using DataFusion #7015
- Document SQL operators (like `@>` and `~`) #7153
- Document streaming usecase (like `UNBOUNDED` tables) #9016
- Make it easier to install / get started with DataFusion #7297
- Document DML statements #7319
- Hide boilerplate in documentation examples #14557
- Add a "Gentle Introduction to Arrow / Record Batches" #11336
- Improvements to user guide for DataFrames #11388
- Run / Test all examples in Documentation #14435
Rust Docs (docs.rs)
- Update SQL functions rustdocs with example usage
- Automate production of SQL and DataFrame references for SQL functions / Expressions #3092
- Include the latest rustdocs (e.g. `cargo doc` output on https://arrow.apache.org/datafusion/) #5981
Examples
- Review current examples and see how they can be improved
- Include examples reading from object stores (S3, Azure, GCS)
- Async User Defined Functions (UDF) #6518
- Add example for writing a `FileFormat` #11079
- Add a section to the documentation explaining that PGO can help up substantially (25%) and maybe offer some tips for users to use it? #9561
- Add example for writing an `AnalyzerRule` #10855
- Add example for writing an SQL analysis pass #10871
- [EPIC] Improve examples to make them easier to navigate #11172
- Provide an example of using a remote catalog #13714
Conference Papers
Blog posts
- Write a blog post fast Vectorized grouping for high cardinality #6988
- Blog post with DataFusion Jun - Sep 2023 #6780
- Blog post about user defined window functions #6781
Other