GH-2914: Add documentation for the Java library #3275
base: master
cc @wgtmac @shangxinli, could you please review? Thanks!
Sorry for approving it by accident. I need to leave a comment to retract it.
private static void writeSalesData(String filename, MessageType schema) throws IOException {
    Path file = new Path(filename);

    try (ParquetWriter<Group> writer = ExampleParquetWriter.builder(file)
I am just hesitant to use ExampleParquetWriter in examples, since it is not meant for production use. Adding an example module also incurs more maintenance burden, so I don't think this is a good idea TBH.
I see. We can remove the sub-module and keep this as a reference-only example. That should still address the documentation concerns raised in the issue. WDYT?
Removed the dependency on ExampleParquetWriter and removed the pom to eliminate maintenance overhead.
Gentle reminder, cc @wgtmac @ggershinsky. Thanks!
TBH, I don't think adding some random examples would really help users, because they are pretty similar to what's already in the unit tests. What I have in mind is something like https://arrow.apache.org/cookbook/, which requires a lot of effort to craft examples and keep them in sync. Today LLMs are smart enough to produce code like this (I believe this PR is doing exactly that, right?).
Thanks, a cookbook is a great idea. I would like to implement it for Parquet Java; let me add it in a separate change. I came up with the examples in this PR to help beginners understand the basics. I faced issues myself a while back when onboarding to Parquet.
cc @wgtmac @Fokko @gszadovszky @shangxinli, I just wanted to get a sense of the community's thoughts on a cookbook as a follow-up to this PR. I think better documentation for Parquet will help users adopt the project faster and would be a good addition to the ecosystem in general. If you are open to this, I'd like to add it and maintain it in the future. Thanks for the suggestion, @wgtmac.
I have created an issue to track this: #3284. It would be great if folks could review it and add suggestions or feedback. Thanks!
examples for different use cases in the future.