Data Sweeper is a Streamlit-based web application that converts files between CSV and Excel formats, with built-in data cleaning and visualization. It streamlines data processing by letting users upload, clean, visualize, and convert files in one place.
- Upload Multiple Files: Supports CSV and Excel file uploads.
- Data Cleaning:
  - Remove duplicate entries.
  - Fill missing numeric values with column-wise mean.
- Column Selection: Choose specific columns to retain for further processing.
- Data Visualization:
  - Generate bar charts for numerical data.
- File Conversion:
  - Convert CSV to Excel and vice versa.
  - Download the processed file in the selected format.
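
The cleaning and column-selection features listed above can be sketched with pandas roughly as follows. This is a minimal illustration, not the application's actual code: the function name `clean_dataframe`, its `keep_columns` parameter, and the sample data are assumptions made for the example.

```python
import pandas as pd


def clean_dataframe(df, keep_columns=None):
    """Hypothetical helper: dedupe rows, fill numeric gaps, keep chosen columns."""
    # Remove exact duplicate rows first so they do not skew the column means.
    cleaned = df.drop_duplicates().copy()

    # Fill missing values in numeric columns with each column's mean.
    numeric_cols = cleaned.select_dtypes(include="number").columns
    if len(numeric_cols) > 0:
        cleaned[numeric_cols] = cleaned[numeric_cols].fillna(
            cleaned[numeric_cols].mean()
        )

    # Optionally retain only the columns the user selected.
    if keep_columns is not None:
        cleaned = cleaned[list(keep_columns)]
    return cleaned


# Made-up sample data: one duplicate row and one missing score.
sample = pd.DataFrame(
    {
        "name": ["a", "a", "b", "c"],
        "score": [1.0, 1.0, None, 3.0],
        "age": [10, 10, 20, 30],
    }
)
result = clean_dataframe(sample, keep_columns=["name", "score"])
print(result)
```

In this sketch duplicates are dropped before the means are computed; doing it in the other order would give repeated rows extra weight in the fill value.
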
To run the application locally, follow these steps:
- Clone this repository:

      git clone https://github.com/yourusername/data-sweeper.git

- Navigate to the project directory:

      cd data-sweeper

- Install the required dependencies:

      pip install -r requirements.txt

- Run the application:

      streamlit run app.py
- Open the application in a browser.
- Upload a CSV or Excel file.
- Perform data cleaning operations if needed.
- Select specific columns to retain.
- View the data preview and visualizations.
- Convert the file to the desired format (CSV or Excel).
- Download the processed file.
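
The conversion and download steps above could be implemented along these lines. This is a sketch under stated assumptions rather than the app's real implementation: `convert_for_download` and its `"CSV"`/`"Excel"` labels are hypothetical names chosen for the example.

```python
import io

import pandas as pd


def convert_for_download(df, target_format):
    """Hypothetical helper: serialize df and return (bytes, extension, MIME type)."""
    if target_format == "CSV":
        data = df.to_csv(index=False).encode("utf-8")
        return data, ".csv", "text/csv"
    if target_format == "Excel":
        buffer = io.BytesIO()
        # Writing .xlsx requires the openpyxl package to be installed.
        df.to_excel(buffer, index=False, engine="openpyxl")
        mime = "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"
        return buffer.getvalue(), ".xlsx", mime
    raise ValueError(f"Unsupported format: {target_format!r}")


frame = pd.DataFrame({"a": [1, 2]})
payload, extension, mime_type = convert_for_download(frame, "CSV")
print(extension, mime_type, len(payload))
```

In a Streamlit app, the returned bytes, a file name built from the extension, and the MIME type could then be passed to `st.download_button`.
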
- Python 3.7+
- Streamlit
- Pandas
- OpenPyXL (for Excel processing)
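
To check the requirements above before launching the app, a small script like the following may help. It is only a sketch: the helper names are made up for the example, and the import names `streamlit`, `pandas`, and `openpyxl` are the usual module names for these packages.

```python
import importlib.util
import sys

# Import names for the packages listed in the requirements.
REQUIRED_MODULES = ["streamlit", "pandas", "openpyxl"]


def missing_modules(modules=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


def python_version_ok(minimum=(3, 7)):
    """Check that the running interpreter meets the minimum Python version."""
    return sys.version_info >= minimum


print("Python version OK:", python_version_ok())
print("Missing modules:", missing_modules())
```
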
Contributions are welcome! Feel free to submit pull requests or report issues.
This project is licensed under the MIT License.
For inquiries or support, reach out to [[email protected]].