(Reaching out to the mentors) Interest in GSOC 2025 Project 9 - Interactive Multimodal Data Explorer #29308
geeky33
started this conversation in
Google Summer of Code
Replies: 2 comments 4 replies
-
@p-wysocki @rkazants @dmitry-gorokhov @andrei-kochin @adrianboguszewski Can I get a connection with a reply from the mentors here? |
Beta Was this translation helpful? Give feedback.
4 replies
-
Hello @rajeshgangireddy @laurenshogewegintel @adrianboguszewski @samet-akcay Best, Aarya. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dear @samet-akcay, @laurenshogewegintel
I am Aarya Pandey, a second-year Computer Science student at VJTI, with an experience in web development and a strong interest in Machine Learning and AI. I have previously contributed to OpenVINO, and here are some of my merged and resolved issues: PR1, PR2, PR3, PR4, PR5, PR6.
I will be working on more issues by this weekend, because of my university exams, I took a small break.
I am very interested in working on the GSOC 2025 Project 9 "Interactive Multimodal Data Explorer" project and would love to contribute to enhancing Datumaro’s dataset visualization capabilities. To get started, I am considering using Streamlit as the front end to create an interactive interface where users can explore joint embedding spaces generated from models like CLIP, LLaVa, and GPT-4V.
My initial thoughts are:
Embedding Visualization: Using Streamlit and Plotly to display a 3D interactive visualization of embeddings.
Filtering & Navigation: Implementing pan, zoom, and filter functionalities for better dataset exploration.
Annotation Interface: Allowing users to tag noisy or incorrect data directly from the interface.
Integration with Datumaro: Utilizing Datumaro for dataset handling and OTX for embedding computation.
Similarity Search Feature: Implementing a feature where users can select a sample and retrieve the most visually or semantically similar images/texts within the dataset. This can be useful for finding duplicate data, spotting anomalies, and understanding dataset clusters.
Does this direction align with your vision for the project? If so, I would be happy to refine it further and proceed with the proposal.
Looking forward to your feedback!
Thank you for your time and consideration.
Warm regards,
Aarya.
Beta Was this translation helpful? Give feedback.
All reactions