This project explores multimodal AI by integrating audio transcription, NLP, and finance-oriented data processing. Using OpenAI Whisper for speech-to-text, Sentence Transformers for semantic similarity, and LangChain with OpenAI models for analysis, it extracts insights from three data types: text, audio, and PDFs.
- OpenAI Whisper – Speech-to-text transcription
- LangChain & OpenAI APIs – NLP and LLM-based analysis
- Sentence Transformers – Semantic similarity search
- PDF Processing – Extracting text and images from financial documents
- PyTorch & scikit-learn – Machine learning utilities
- 🔊 Audio Transcription: Convert spoken content into text (see the Whisper sketch below)
- 📄 PDF Parsing: Extract structured data from documents (see the pdf2image sketch below)
- 🔍 Semantic Search: Identify key insights using embeddings (see the Sentence Transformers sketch below)
- 📈 Finance-Oriented Analysis: Apply ML/NLP techniques to financial data (see the LangChain sketch below)
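As a rough illustration of the transcription step, here is a minimal openai-whisper sketch; the model size and audio filename are placeholders, not files from this repository:

```python
import whisper

# Load a small Whisper checkpoint; larger models ("medium", "large") are more
# accurate but slower. The model choice here is illustrative.
model = whisper.load_model("base")

# "earnings_call.mp3" is a hypothetical input file.
result = model.transcribe("earnings_call.mp3")
print(result["text"])
```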
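For PDF parsing, the dependency list points to pdf2image (backed by poppler, installed in the setup commands below); this sketch renders the pages of a hypothetical report to images:

```python
from pdf2image import convert_from_path

# Render each page of a hypothetical PDF to a PIL image
# (requires the poppler-utils system package).
pages = convert_from_path("starbucks_report.pdf", dpi=200)
for i, page in enumerate(pages):
    page.save(f"page_{i + 1}.png", "PNG")
```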
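For semantic search, a small Sentence Transformers sketch; the model name and example passages are illustrative assumptions:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

# Hypothetical passages, e.g. chunks of a transcribed earnings call.
passages = [
    "Consolidated net revenues grew 11% year over year.",
    "The company opened 480 net new stores this quarter.",
    "Operating margin contracted due to higher labor costs.",
]
passage_embs = model.encode(passages, convert_to_tensor=True)

# Embed the query and retrieve the closest passage by cosine similarity.
query_emb = model.encode("How did revenue change?", convert_to_tensor=True)
hits = util.semantic_search(query_emb, passage_embs, top_k=1)[0]
print(passages[hits[0]["corpus_id"]], hits[0]["score"])
```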
The project uses Starbucks financial data in multiple modalities: audio, text, and PDF documents.
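Tying the modalities together, a hedged sketch of the analysis step with langchain-openai; the model name, prompt, and input text are assumptions, and an OPENAI_API_KEY environment variable is required:

```python
from langchain_openai import ChatOpenAI

# Model choice is illustrative; temperature=0 keeps summaries deterministic.
llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

# "transcript_text" stands in for Whisper output or extracted PDF text.
transcript_text = "Consolidated net revenues grew 11% year over year..."
response = llm.invoke(
    "Summarize the key financial takeaways from this excerpt:\n" + transcript_text
)
print(response.content)
```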
```bash
# Python dependencies
pip install openai langchain langchain-openai langchain-community openai-whisper sentence-transformers pdf2image

# System dependency required by pdf2image (Debian/Ubuntu; may need sudo)
apt-get install poppler-utils

# Ensure a recent Pillow for image handling
pip install --upgrade Pillow
```
- Clone the repository:
```bash
git clone https://github.com/ctournas/multimodal-starbucks-finance.git
cd multimodal-starbucks-finance
```
- Install dependencies (see installation section above)
- Run the Jupyter notebook (e.g., launch `jupyter notebook` from the project root and open the included notebook)
This project is licensed under the MIT License. Feel free to modify and use it for your own projects!
Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.
For questions or collaborations, feel free to reach out via GitHub or email at [email protected].