Mart Sales Prediction

Description

This project aims to predict the sales of products in various outlets of Big Mart using machine learning techniques. The dataset contains information about products and their sales in different outlets. The goal is to build a predictive model that can estimate the sales of products based on various features.

Project Structure

Big Mart Sale_Final.R: The main R script that contains the data loading, preprocessing, and modeling code.
Train_UWu5bXk.csv: The training dataset containing historical sales data.
Test_u94Q5KV.csv: The test dataset for which sales predictions need to be made.
SampleSubmission_TmnO39y.csv: A sample submission file in the required format for submission.
README.md: This file, providing an overview of the project.

Requirements

The following R packages are required to run the project:

data.table: For reading and manipulating data.
dplyr: For data manipulation and joining.
ggplot2: For plotting.
caret: For modeling.
corrplot: For making correlation plots.
xgboost: For building the XGBoost model.
cowplot: For combining multiple plots.

Installation

To install the required packages, you can use the following commands in R:

install.packages("data.table")
install.packages("dplyr")
install.packages("ggplot2")
install.packages("caret")
install.packages("corrplot")
install.packages("xgboost")
install.packages("cowplot")

Usage

Load Packages: The script starts by loading the necessary packages.
Read Datasets: The training and test datasets are read using the fread function from the data.table package.
Explore Data: The script displays the column names and structure of the training and test datasets.
Preprocess Data: The script adds a new column Item_Outlet_Sales to the test dataset and performs other preprocessing steps (not shown in the excerpt).

Example

Here is an example of how to run the script:

# Load packages
library(data.table)
library(dplyr)
library(ggplot2)
library(caret)
library(corrplot)
library(xgboost)
library(cowplot)

# Read datasets
train = fread("Train_UWu5bXk.csv")
test = fread("Test_u94Q5KV.csv")
submission = fread("SampleSubmission_TmnO39y.csv")

# Display column names
names(train)
names(test)

# Display structure of datasets
str(train)
str(test)

# Add Item_Outlet_Sales to test data
test[, Item_Outlet_Sales := NA]

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

The dataset is provided by Big Mart for the purpose of this competition.
The R community for providing the necessary packages and documentation.

Contact

For any questions or issues, please contact Rayyan Ahmed at [email protected] or https://www.linkedin.com/in/rayyan-ahmed9477/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mart Sales Prediction

Description

Project Structure

Requirements

Installation

Usage

Example

License

Acknowledgements

Contact

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Big Mart Sale_Final.R		Big Mart Sale_Final.R
LICENSE		LICENSE
Readme.md		Readme.md
SampleSubmission_TmnO39y.csv		SampleSubmission_TmnO39y.csv
Test_u94Q5KV.csv		Test_u94Q5KV.csv
Train_UWu5bXk.csv		Train_UWu5bXk.csv

License

Rayyan9477/Mart-Sales-Forecasting-Machine-Learning-using-R

Folders and files

Latest commit

History

Repository files navigation

Mart Sales Prediction

Description

Project Structure

Requirements

Installation

Usage

Example

License

Acknowledgements

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages