Energy-Grid-Load-Forecasting

This study uses machine learning to predict electricity load in Spain from weather data recorded in Spanish cities, with the aim of improving grid management and renewable energy integration. It works through the challenges of data cleaning, model selection, and feature engineering, and examines how well ML models capture the complex relationships between weather and demand.

Why is this relevant?

The electrical grid is a complex network of power generation, transmission, and distribution systems that delivers electricity to end users. Demand for electricity fluctuates with factors such as time of day, weather conditions, and economic activity. Load forecasting predicts future electricity demand so that resource allocation, infrastructure development, and energy market operations can be planned in advance. Accurate forecasts help prevent imbalances between supply and demand, which keeps the grid reliable and stable.

Data Sources and Data Extraction

All data sources used in this research are publicly available, and the data consists of hourly records from January 2015 to December 2018. The data is composed of the following:

Historical electricity load, taken from the hourly post-dispatch reports published by ENTSO-E (European Network of Transmission System Operators for Electricity), which publishes transmission system operator data for Spain.
Weather variables, such as temperature, relative humidity, precipitation, cloud cover, and wind speed and direction, from five main provinces in Spain. These were gathered from the OpenWeather API and made available on Kaggle.

All the required data is present in the data folder, which includes 3 .csv files:
energy_og: The original dataset.
energy_data: The original dataset with two new columns, Fossil Total and Hydro Total, each the sum of its respective sub-columns.
weather_features: The weather dataset.

Furthermore, the correlation of electricity consumption with time lags and weather variables is investigated to check the strength of association between these variables.
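The correlation check against time lags and weather variables can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the notebooks' actual code: the column names `load` and `temp`, the helper `lagged_correlations`, and the chosen lags (1, 24, and 168 hours) are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Synthetic hourly data standing in for the merged energy/weather table.
# Column names ("load", "temp") are hypothetical, not the dataset's own.
rng = np.random.default_rng(0)
index = pd.date_range("2015-01-01", periods=24 * 60, freq=pd.Timedelta(hours=1))
hours = np.arange(len(index))
temp = 15 + 8 * np.sin(2 * np.pi * hours / 24) + rng.normal(0, 1, len(index))
load = 25000 + 300 * temp + rng.normal(0, 500, len(index))
df = pd.DataFrame({"load": load, "temp": temp}, index=index)


def lagged_correlations(frame, target, lags):
    """Pearson correlation between `target` and every column shifted by each lag (in hours)."""
    rows = {}
    for lag in lags:
        shifted = frame.shift(lag)  # each row now holds the value seen `lag` hours earlier
        rows[lag] = shifted.corrwith(frame[target])
    return pd.DataFrame(rows).T.rename_axis("lag_hours")


# One row per lag, one column per variable.
corr_table = lagged_correlations(df, "load", lags=[1, 24, 168])
print(corr_table.round(2))
```

Lags or variables with a correlation close to zero are weak candidates for input features, while strongly correlated ones (for example, the load 24 hours earlier) are natural candidates to keep.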

Data Pre-processing

The energy dataset has no duplicate values. It does, however, contain some NaNs that need to be handled. Since this is a time series task, we cannot simply drop the rows with missing values, because that would break the hourly sequence; it is better to fill the missing values by interpolation. We used linear forward interpolation. Only a small part of the input data is filled this way, so the resulting noise should not affect performance noticeably. The data is split into training and test sets while maintaining the order of observations: the first 70% of the records form the training set and the remaining 30% form the test set.
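A minimal sketch of these two steps, assuming pandas and a toy hourly series. The column name `load` and the helper `chronological_split` are hypothetical and only illustrate the idea; the notebooks may implement it differently.

```python
import numpy as np
import pandas as pd

# Toy hourly series with gaps; "load" is a hypothetical column name.
index = pd.date_range("2015-01-01", periods=10, freq=pd.Timedelta(hours=1))
df = pd.DataFrame(
    {"load": [100.0, np.nan, 120.0, 130.0, np.nan, np.nan, 160.0, 170.0, 180.0, 190.0]},
    index=index,
)

# Linear interpolation, filling gaps in the forward direction only.
filled = df.interpolate(method="linear", limit_direction="forward")


def chronological_split(frame, train_fraction=0.7):
    """Split without shuffling: the earliest rows train, the latest rows test."""
    cutoff = int(round(len(frame) * train_fraction))
    return frame.iloc[:cutoff], frame.iloc[cutoff:]


train, test = chronological_split(filled)
print(len(train), len(test))
```

Keeping the split chronological means the model is always evaluated on observations that come after everything it was trained on, which mirrors how a forecast is used in practice.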

Machine Learning Model

Various models were used and compared. The notebooks are included in the repository files.

SVR.ipynb: Contains the data pre-processing and dataset merging, along with the implementation of the SVR model.
All Models.ipynb: Contains the data pre-processing and dataset merging, along with the implementation of the other ML models.
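For orientation, an SVR fit on a chronological 70/30 split might look like the sketch below. It assumes scikit-learn and uses synthetic data; the features (`temp`, hour of day), the hyperparameters (`C`, `epsilon`, RBF kernel), and the mean-value baseline are illustrative assumptions, not the settings used in SVR.ipynb.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Synthetic stand-in for the merged dataset (hypothetical features).
rng = np.random.default_rng(0)
n = 500
hours = np.arange(n)
temp = 15 + 8 * np.sin(2 * np.pi * hours / 24) + rng.normal(0, 1, n)
load = 25000 + 300 * temp + rng.normal(0, 200, n)

X = np.column_stack([temp, hours % 24])  # temperature and hour of day
y = load

# Chronological 70/30 split, no shuffling.
cutoff = int(round(n * 0.7))
X_train, X_test = X[:cutoff], X[cutoff:]
y_train, y_test = y[:cutoff], y[cutoff:]

# SVR is sensitive to scale, so both the features and the target are standardised.
model = TransformedTargetRegressor(
    regressor=make_pipeline(StandardScaler(), SVR(kernel="rbf", C=10.0, epsilon=0.1)),
    transformer=StandardScaler(),
)
model.fit(X_train, y_train)

mae = mean_absolute_error(y_test, model.predict(X_test))
# Naive reference: always predict the mean load seen in training.
baseline_mae = mean_absolute_error(y_test, np.full_like(y_test, y_train.mean()))
print(f"SVR MAE: {mae:.1f}  baseline MAE: {baseline_mae:.1f}")
```

Comparing each model against the same naive baseline on the same held-out period gives a common yardstick when several models are evaluated side by side.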


16kushaal/Energy-Grid-Load-Forecasting
