This repository contains experiments and explorations with NVIDIA's Vision Language Model (VLM) API. Each experiment lives in its own directory and is documented in a detailed README file.
- experiments/: Contains individual experiment directories
- common/: Shared utilities and helper functions
- requirements.txt: Project dependencies
- ID Card Information Extraction: Extract structured information from Spanish ID cards using the VLM API
- Clone this repository
- Create a .env file in the project root
- Add your NVIDIA API key:
  NVIDIA_API_KEY=your_actual_api_key_here
- Install dependencies:
  pip install -r requirements.txt
- NEVER commit your API keys to version control
- Use environment variables to manage sensitive credentials
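As a minimal sketch of the environment-variable approach, the snippet below reads NVIDIA_API_KEY from the process environment and falls back to the .env file in the project root. The helper names `load_env_file` and `get_api_key` are hypothetical and are not taken from this repository's common/ utilities. The sketch uses only the standard library and handles only simple KEY=VALUE lines.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict.

    Hypothetical helper: returns an empty dict if the file does not exist.
    """
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value
        values[key.strip()] = value.strip().strip("\"'")
    return values


def get_api_key(env_path=".env"):
    """Return NVIDIA_API_KEY, preferring the process environment over .env."""
    key = os.environ.get("NVIDIA_API_KEY") or load_env_file(env_path).get("NVIDIA_API_KEY")
    if not key:
        raise RuntimeError("NVIDIA_API_KEY is not set; add it to .env or export it")
    return key
```

Failing loudly when the key is missing keeps a misconfigured run from reaching the API with an empty credential, and the key itself never appears in source code.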
Each new experiment should:
- Have its own directory under experiments/
- Include a detailed README.md
- Document findings and learnings
- Include sample data (if not sensitive)
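The checklist above could be bootstrapped with a small script like the following. This is only a sketch: the function name `scaffold_experiment` and the README stub contents are assumptions made for illustration, not an existing tool in this repository.

```python
from pathlib import Path


def scaffold_experiment(name, root="experiments"):
    """Create <root>/<name>/ with a README.md stub and return the directory path.

    Hypothetical helper; the stub sections mirror the checklist above.
    """
    exp_dir = Path(root) / name
    exp_dir.mkdir(parents=True, exist_ok=True)
    readme = exp_dir / "README.md"
    if not readme.exists():  # never overwrite an existing write-up
        readme.write_text(
            f"{name}\n\nGoal:\n\nFindings and learnings:\n\nSample data:\n",
            encoding="utf-8",
        )
    return exp_dir
```

For example, `scaffold_experiment("receipt_parsing")` would create experiments/receipt_parsing/README.md (the experiment name here is made up), leaving any existing README untouched.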
MIT License - See LICENSE file for details