(Reaching out to the mentors) Interest in GSOC 2025 Project 9 - Interactive Multimodal Data Explorer #29308

geeky33 · 2025-03-06T06:39:47Z


I am Aarya Pandey, a second-year Computer Science student at VJTI, with experience in web development and a strong interest in Machine Learning and AI. I have previously contributed to OpenVINO; some of my merged PRs and resolved issues are PR1, PR2, PR3, PR4, PR5, PR6.
I took a short break because of my university exams and will be working on more issues by this weekend.

I am very interested in working on GSoC 2025 Project 9, "Interactive Multimodal Data Explorer", and would love to contribute to enhancing Datumaro’s dataset visualization capabilities. To get started, I am considering Streamlit for the front end, to create an interactive interface where users can explore joint embedding spaces generated by models like CLIP, LLaVA, and GPT-4V.

My initial thoughts are:

Embedding Visualization: Using Streamlit and Plotly to display a 3D interactive visualization of embeddings.

Filtering & Navigation: Implementing pan, zoom, and filter functionalities for better dataset exploration.

Annotation Interface: Allowing users to tag noisy or incorrect data directly from the interface.

Integration with Datumaro: Utilizing Datumaro for dataset handling and OTX for embedding computation.

Similarity Search Feature: Implementing a feature where users can select a sample and retrieve the most visually or semantically similar images/texts within the dataset. This can be useful for finding duplicate data, spotting anomalies, and understanding dataset clusters.
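
To make the similarity search idea concrete, here is a minimal sketch of a brute-force nearest-neighbour lookup over embeddings using cosine similarity. It uses only the Python standard library; the helper names (`cosine_similarity`, `most_similar`) and the toy three-dimensional vectors are assumptions for illustration, not part of Datumaro or OTX, and real CLIP-style embeddings would have hundreds of dimensions.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query_id, embeddings, k=3):
    """Return the k (sample_id, score) pairs closest to query_id, best first."""
    query = embeddings[query_id]
    scored = (
        (sample_id, cosine_similarity(query, vec))
        for sample_id, vec in embeddings.items()
        if sample_id != query_id  # never return the query itself
    )
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Hypothetical toy vectors standing in for real image/text embeddings.
embeddings = {
    "cat_1.jpg": [0.9, 0.1, 0.0],
    "cat_2.jpg": [0.8, 0.2, 0.1],
    "dog_1.jpg": [0.1, 0.9, 0.0],
    "caption: a cat": [0.85, 0.15, 0.05],
}

neighbours = most_similar("cat_1.jpg", embeddings, k=2)
for sample_id, score in neighbours:
    print(f"{sample_id}: {score:.3f}")
```

Because images and captions share one embedding space in this sketch, the same lookup can return either modality. Very high scores between distinct samples would hint at duplicates, while samples whose best score is low could be candidates for anomaly review.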

Does this direction align with your vision for the project? If so, I would be happy to refine it further and proceed with the proposal.

Looking forward to your feedback!

Thank you for your time and consideration.

Warm regards,
Aarya.

geeky33 · 2025-03-06T07:46:11Z

Author

@p-wysocki @rkazants @dmitry-gorokhov @andrei-kochin @adrianboguszewski

Could one of the mentors reply here so that we can connect?
Thanks.

4 replies

adrianboguszewski Mar 6, 2025
Collaborator

They should respond within a few days :)

rajeshgangireddy Mar 17, 2025

Hi @geeky33
I am a co-mentor for other topics and am giving an initial response until Laurens is back at work.
Overall, your ideas sound good and are well aligned with the project.

A few points from my side:

Please use fully open source models.
You can also research existing annotation tools to see which annotation features would be best for this topic :)
Please also consider whether your methods would still work smoothly when scaling to large datasets (more than 50K data points?)
You can also have a look at the FAISS library for similarity search.
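
As one illustration of the scaling point, a 3D scatter plot can stay responsive if only a uniform random subset of points is drawn. The sketch below uses reservoir sampling (Algorithm R) from the standard library, so it works on a stream of unknown length without loading everything into memory. The function name `reservoir_sample` and the 5,000-point budget are assumptions for illustration, not a Datumaro feature or a recommended setting.

```python
import random


def reservoir_sample(items, k, seed=None):
    """Uniformly sample up to k items from an iterable of unknown length."""
    rng = random.Random(seed)
    reservoir = []
    for index, item in enumerate(items):
        if index < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the new item with probability k / (index + 1).
            slot = rng.randint(0, index)
            if slot < k:
                reservoir[slot] = item
    return reservoir


# Hypothetical example: 200,000 point indices reduced to a 5,000-point plot budget.
points_to_plot = reservoir_sample(range(200_000), k=5_000, seed=0)
print(len(points_to_plot))
```

Sampling only limits what is rendered; similarity search over the full dataset would still need an index, which is where a library such as FAISS comes in.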

We look forward to your proposal.
Thanks and let us know if you have any questions.

laurenshogewegintel Mar 18, 2025

Dear Aarya,

Your approach and demo look very good. This is the approach we're interested in.

We're especially interested in novel approaches for computing, visualising, and exploring joint embeddings. I think CLIP-like models are an excellent start for the computation. On the visualization side, I think we can brainstorm about possibilities when you join the project.

Best,

Laurens

geeky33 Mar 22, 2025
Author

Dear @laurenshogewegintel @rajeshgangireddy @samet-akcay

Thank you for the positive feedback on my ideas!

I have written the first draft of the proposal and would love to know whether there are any prerequisite tasks I need to complete alongside the proposal.

I would appreciate your response at the earliest!

Best.

Aarya :)

geeky33 · 2025-04-20T10:20:43Z

Author

Hello @rajeshgangireddy @laurenshogewegintel @adrianboguszewski @samet-akcay
I hope my proposals have reached you through the GSoC site.
I have also sent them to you by email.
Please let me know if you have any questions about them;
I would be happy to address them.

Best,

Aarya.

0 replies
