A tool that analyzes academic papers and automatically creates containerized environments based on their computational requirements. This project helps researchers reproduce computational experiments by extracting development information from scientific papers and generating appropriate container configurations.
- Analyzes PDF papers to identify computational requirements
- Validates if the paper contains sufficient development information
- Extracts tools, frameworks, and implementation details
- Generates containerized environments based on paper specifications
- Supports both Python and R environments
- Utilizes dual LLM support (Gemini Pro and Ollama's LLaMA 3.2) for enhanced analysis and generation
- Python 3.8 or higher
- pip package manager
- Google API key for Gemini Pro model (set in environment variables)
- Ollama with LLaMA 3.2 model installed locally
- Clone the repository
- Install the required dependencies:
pip install -r requirements.txt- Set up your environment variables:
- Create a
.envfile in the project root - Add your Google API key:
GOOGLE_API_KEY=your_key_here
- Create a
Run the tool by providing a PDF file path:
python main.py <path_to_paper.pdf>The tool will:
- Analyze the paper for computational content
- Extract development information
- Create a workspace with appropriate container configurations
- Generate necessary environment files
The tool generates:
- Validation results of the paper analysis
- Docker and docker-compose configurations
- Environment setup files
- Implementation recommendations
- Ensure the PDF file is readable and contains computational research content
- The tool works best with papers that include clear implementation details
- Generated container configurations may need manual adjustments based on specific requirements