D.AI is an open-source Streamlit application for both manual and AI-powered data analysis. Developed by Tripathi Aditya Prakash, it offers a seamless blend of traditional data exploration techniques and state-of-the-art AI-driven tools.
- Clean and preprocess data with options like removing duplicates, handling missing values, and transforming data.
- Perform statistical operations and generate correlation matrices.
- Visualize data interactively using Plotly and Seaborn.
- Generate detailed insights and summaries using OpenAI's GPT.
- Create custom graphs with natural language prompts using the
seabornailibrary. - Analyze Python code in datasets with
Checkify. - Process and clean text data with the
cleantxtylibrary.
- CSV: Upload and explore spreadsheets easily.
- Excel: Handle
.xlsand.xlsxfiles. - JSON: Parse and analyze structured data.
- TXT: Clean raw text data for NLP tasks.
- Download processed datasets in Excel format.
To run D.AI locally, follow these steps:
-
Clone the repository:
git clone https://github.com/yourusername/dai.git cd dai -
Install dependencies:
pip install -r requirements.txt
-
Run the Streamlit app:
streamlit run dai_app.py
-
Open your browser at
http://localhost:8501.
-
Upload Your File
Use the file uploader to select datasets in CSV, Excel, JSON, or TXT format. -
Choose Analysis Mode
- Manual Analysis: Explore and clean data manually.
- AI Analysis: Use natural language prompts and AI features.
-
Perform Operations
- Clean and transform data.
- Visualize relationships with AI-powered graphs.
- Analyze Python code or generate summaries using GPT.
-
Download Results
Export the processed dataset in Excel format.
- Python 3.7+
- Streamlit
- pandas
- numpy
- Plotly
- seabornai
- cleantxty
- checkify
- OpenAI Python SDK
- mixpanel
- matplotlib
This project is licensed under the MIT License.
Access the live Mixpanel dashboard: Mixpanel Dashboard
Check out the live application: DAI Smarter Data
For questions or suggestions, please contact Tripathi Aditya Prakash at [email protected].