fix: remove unnecessary batching for avoiding OOM #47078
base: master
Summary of Changes (Gemini Code Assist)
This pull request addresses a potential Out-Of-Memory (OOM) issue by streamlining the document addition process.
[APPROVALNOTIFIER] This PR is NOT APPROVED.
This pull-request has been approved by: zhagnlu
Code Review
This pull request removes unnecessary manual batching when adding documents, correctly relying on Tantivy's internal buffering. This simplifies the code. However, the new implementation introduces a performance concern by creating a new vector for each document inside a loop. I've provided a suggestion to address this by using a more direct and efficient method for adding documents.
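To make the trade-off in this review concrete, here is a minimal sketch of the two approaches. It does not use the real Tantivy API: `BufferedWriter`, `add_with_manual_batching`, and `add_directly` are hypothetical names, and the writer only models the assumption that documents are buffered until a byte budget is reached and then flushed.

```rust
/// Hypothetical stand-in for an index writer with its own bounded buffer.
/// This is NOT the Tantivy API; it only models "buffer until a byte budget
/// is reached, then flush".
pub struct BufferedWriter {
    budget_bytes: usize,
    buffered_bytes: usize,
    buffer: Vec<String>,
    pub flushes: usize,
    pub total_docs: usize,
}

impl BufferedWriter {
    pub fn new(budget_bytes: usize) -> Self {
        BufferedWriter {
            budget_bytes,
            buffered_bytes: 0,
            buffer: Vec::new(),
            flushes: 0,
            total_docs: 0,
        }
    }

    /// Buffers one document and flushes once the byte budget is reached.
    pub fn add_document(&mut self, doc: &str) {
        self.buffered_bytes += doc.len();
        self.buffer.push(doc.to_owned());
        self.total_docs += 1;
        if self.buffered_bytes >= self.budget_bytes {
            self.flush();
        }
    }

    /// Drops the buffered documents, standing in for writing them out.
    pub fn flush(&mut self) {
        if !self.buffer.is_empty() {
            self.buffer.clear();
            self.buffered_bytes = 0;
            self.flushes += 1;
        }
    }
}

/// Caller-side batching: copies documents into an extra Vec before handing
/// them to the writer. Returns the largest number of copies the caller held.
/// `batch_size` must be non-zero, because `chunks(0)` panics.
pub fn add_with_manual_batching(
    writer: &mut BufferedWriter,
    docs: &[&str],
    batch_size: usize,
) -> usize {
    let mut peak_held = 0;
    for chunk in docs.chunks(batch_size) {
        // Extra allocation that exists only on the caller side.
        let batch: Vec<String> = chunk.iter().map(|d| d.to_string()).collect();
        peak_held = peak_held.max(batch.len());
        for doc in &batch {
            writer.add_document(doc);
        }
    }
    peak_held
}

/// Direct path: no intermediate Vec; the writer's own budget bounds memory.
pub fn add_directly(writer: &mut BufferedWriter, docs: &[&str]) {
    for doc in docs {
        writer.add_document(doc);
    }
}
```

In this model both paths hand the writer the same documents in the same order, so the flush count is identical. The manual batch only adds a second copy of up to `batch_size` documents on the caller side, and the direct path also avoids building any per-document vector inside the loop.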
internal/core/thirdparty/tantivy/tantivy-binding/src/index_writer_v7/index_writer.rs
Codecov Report
✅ All modified and coverable lines are covered by tests.
❌ Your project check has failed because the head coverage (76.41%) is below the target coverage (77.00%). You can increase the head coverage or adjust the target coverage.
Additional details and impacted files
@@            Coverage Diff             @@
##           master   #47078      +/-   ##
==========================================
+ Coverage   76.39%   76.41%   +0.01%
==========================================
  Files        2004     2004
  Lines      320988   321447     +459
==========================================
+ Hits       245224   245630     +406
- Misses      67858    67909      +51
- Partials     7906     7908       +2
Signed-off-by: luzhang <[email protected]>
1d648cf to 26fea2f
/ci-rerun-ut-go
/lgtm
issue: #47001
Effect of different batch sizes on the same test: