Add bloom checking endpoints #1243

tomchop · 2025-04-01T16:03:49Z

No description provided.

Copilot

Pull Request Overview

This PR adds endpoints for checking bloom filters against a microservice along with corresponding tests.

Introduces new API endpoints (/search and /search/raw) in the bloom module.
Updates the main FastAPI router to include the bloom endpoints.
Adds comprehensive tests to verify successful and error responses from the bloom endpoints.

Reviewed Changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated no comments.

File	Description
tests/apiv2/bloom.py	Adds tests for the new bloom endpoints, including error handling scenarios.
core/web/webapp.py	Registers the new bloom endpoints in the API router.
core/web/apiv2/bloom.py	Implements the bloom checking endpoints interacting with the microservice.

Files not reviewed (1)

yeti.conf.sample: Language not supported

Comments suppressed due to low confidence (2)

tests/apiv2/bloom.py:21

[nitpick] The test method 'testSomething' is ambiguous; consider renaming it to clearly indicate its purpose.

def testSomething(self) -> None:

tests/apiv2/bloom.py:34

The test 'testConnectionError' does not simulate a connection error scenario; consider patching requests.post to raise a ConnectionError to properly test the error handling branch.

def testConnectionError(self) -> None:
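A minimal sketch of the patching approach suggested above, using only the standard library so it runs anywhere. The `http` namespace stands in for the `requests` module, and `search`, `BloomUnavailable`, and the service URL are hypothetical names for illustration, not the code in this PR. In the real test the patch target would presumably be `requests.post`, with `requests.exceptions.ConnectionError` as the side effect.

```python
from types import SimpleNamespace
from unittest import mock

# Stand-in for the `requests` module so the sketch runs without third-party
# packages. Assumption: the real code calls requests.post instead.
http = SimpleNamespace(post=lambda url, json=None, timeout=None: None)


class BloomUnavailable(Exception):
    """Hypothetical error raised when the bloom service cannot be reached."""


def search(values: list[str]) -> list[dict]:
    """Hypothetical, simplified version of the endpoint body."""
    try:
        response = http.post(
            "http://bloomcheck.example/check",  # hypothetical URL
            json={"values": values},
            timeout=5,
        )
    except ConnectionError as exc:
        # The branch the test is supposed to exercise.
        raise BloomUnavailable("Error connecting to bloomcheck") from exc
    return response.json()


def check_connection_error_is_handled() -> bool:
    """Patch the HTTP call so it raises, then confirm the error branch runs."""
    with mock.patch.object(
        http, "post", side_effect=ConnectionError("connection refused")
    ) as fake_post:
        try:
            search(["abc"])
        except BloomUnavailable:
            # The patched call was attempted exactly once before failing.
            return fake_post.call_count == 1
    return False
```

Setting `side_effect` to an exception makes the mock raise it when called, so the test reaches the `except` branch without needing a real service that is down.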

jleaniz

LGTM, left some comments. Mostly just some clarifications to see if I understood how this works under the hood.

jleaniz · 2025-04-01T21:40:44Z

core/web/apiv2/bloom.py

+def search(httpreq: Request, request: BloomSearchRequest) -> list[BloomHit]:
+    """Checks the bloomcheck microservice for hits."""
+    try:
+        response = requests.post(


This is a bit weird to me, but if I understand it correctly, the Yeti API server makes an HTTP call to the bloom service (which is separate) to get the hit / no-hit status?

tomchop · 2025-04-09

That's right! How can I make it less weird?

jleaniz · 2025-04-01T21:43:51Z

core/web/apiv2/bloom.py

+    """Checks the bloomcheck microservice for hits."""
+    values = await httpreq.body()
+    try:
+        response = requests.post(


Isn't this call synchronous? The FastAPI method is async, though.

tomchop · 2025-04-09

body() is async, so we have to await it to get the result.
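For context on the question: a synchronous call such as `requests.post` inside an `async def` handler blocks the event loop for as long as the call takes, regardless of whether the request body was awaited first. One possible way to keep the handler async while still using a blocking client is to run the call in a worker thread. The sketch below assumes Python 3.9+ for `asyncio.to_thread`; `blocking_post` is a hypothetical stand-in for the `requests.post` call, and `search_raw` here is a simplified illustration rather than the endpoint from this PR.

```python
import asyncio
import time


def blocking_post(payload: bytes) -> dict:
    """Hypothetical stand-in for a synchronous requests.post call."""
    time.sleep(0.2)  # simulates network latency
    return {"received": len(payload)}


async def search_raw(payload: bytes) -> dict:
    # asyncio.to_thread runs the blocking function in a worker thread,
    # so the event loop stays free to serve other requests meanwhile.
    return await asyncio.to_thread(blocking_post, payload)


async def main() -> tuple[dict, int]:
    ticks = 0

    async def ticker() -> None:
        # Counts how often the event loop gets to run during the request.
        nonlocal ticks
        while True:
            await asyncio.sleep(0.01)
            ticks += 1

    task = asyncio.create_task(ticker())
    result = await search_raw(b"value1\nvalue2")
    task.cancel()
    return result, ticks
```

Running `asyncio.run(main())` returns the service result together with a non-zero tick count, which shows that other coroutines kept running during the blocking call. Other options would be an async HTTP client, or FastAPI's own thread pool helper; which one fits best depends on the project.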

udgover

No specific feedback on the code itself. However, I have two questions :)

I was thinking bloomcheck would be run from an analytics plugin rather than a dedicated API endpoint. Nevertheless, I don't have strong opinions about refactoring.
I wonder how we should deal with false positives. I have these two examples in mind:
- If I'm checking a malicious sample against a "known good" bloom filter, there's a chance it will be seen as known good. If I'm building an export of malicious hashes to enrich logs and this sample was dropped or executed, it won't be matched, resulting in a false negative.
- If I'm checking an ordinary operating system file against a "known bad" bloom filter, there's a chance this non-malicious file will be flagged as malicious. If I'm building an export of malicious hashes to enrich logs, this file will be included and could raise lots of false positives.
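The behaviour behind both examples can be illustrated with a toy bloom filter: every value that was added is always reported as present, while a value that was never added is occasionally reported as present too. This is only a sketch to show the property; the class, its parameters (1024 bits, 3 hash functions), and the sample values are assumptions for illustration and not how the bloomcheck service is implemented.

```python
import hashlib


class BloomFilter:
    """Toy bloom filter; sizes are illustrative, not tuned."""

    def __init__(self, size_bits: int = 1024, num_hashes: int = 3) -> None:
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((size_bits + 7) // 8)

    def _positions(self, value: str):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, value: str) -> None:
        for pos in self._positions(value):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, value: str) -> bool:
        # True only if every bit for this value is set.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(value)
        )


known = [f"known-{i}" for i in range(100)]
bloom = BloomFilter()
for value in known:
    bloom.add(value)

# Added values are never missed by the filter itself.
missed = [value for value in known if value not in bloom]

# Values that were never added may still match by chance.
probes = [f"unseen-{i}" for i in range(1000)]
false_positive_rate = sum(probe in bloom for probe in probes) / len(probes)
```

Here `missed` is empty, while `false_positive_rate` is small but generally not zero. Whether a chance match turns into a detection false positive or a detection false negative depends on whether the filter represents "known bad" or "known good" data, which is the distinction drawn in the two examples.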

tomchop · 2025-04-10T15:54:17Z

We'll adjust the documentation to be super clear. But given that bloom filters are there to provide a yes / no answer and to condense bigger databases into Yeti, I don't think it's Yeti's job to provide more context about a hit (otherwise, we'd just use our database / index).

udgover

LGTM!

Add bloom checking endpoints

4d81312

tomchop requested review from udgover and Copilot April 1, 2025 16:03

Copilot AI reviewed Apr 1, 2025

Tweak test

63e8f01

tomchop requested a review from sebdraven April 1, 2025 17:37

jleaniz approved these changes Apr 1, 2025

tomchop requested a review from jleaniz April 9, 2025 22:05

udgover reviewed Apr 10, 2025

tomchop added 4 commits April 10, 2025 15:34

Remove bloom from logging

f7578bc

Ensure endpoint config is set

95e8fb5

Adjust tests

cc39933

Sort imports

48c27ae

udgover approved these changes Apr 11, 2025

Formatting

8a090a7

tomchop merged commit 3cf07d5 into main Apr 11, 2025
2 checks passed

tomchop deleted the bloom branch April 11, 2025 09:25

tomchop added the enhancement label Apr 11, 2025
