LAVA is a tool to calculate the available area in a user-defined study region for building renewable energy generators like utility-scale solar PV and onshore wind.
First, all required data is preprocessed into the right format. The preprocessed data can then be analyzed to get a better understanding of the study region. Finally, the land eligibility analysis is performed with the help of atlite or GLAES (GLAES support is not fully working yet).
a) clone the repository (using Git Bash):
git clone https://github.com/jome1/LAVA.git
After cloning, navigate to the top-level folder of the repo in your command window.
b) install Python dependencies
The Python package requirements to use the LAVA tool are in the requirements.yaml file. You can install these requirements in a new environment using conda:
conda env create -f envs/requirements.yaml
Then activate this new environment using
conda activate lava
You are now ready to run the scripts in this repository.
The folders for the data input are already created in this repo. Download the needed data to the correct place within the folder "Raw_Spatial_Data".
The following data must be downloaded (partly manually 🚧, partly automatically 🤖 by the script):
- DEM 🚧: GEBCO Gridded Bathymetry Data is a continuous, global terrain model for ocean and land with a spatial resolution of 15 arc seconds (ca. 450 m). Use the download tool and select a larger area around your study region. Tick the GeoTIFF option under "Grid" and download the file from the basket. Put the file into the folder "DEM" (digital elevation model) and name it gebco_cutout.tif. This data provides the elevation of each pixel. It is also possible to use a different dataset.
- Landcover 🚧 🤖: The user can decide to automatically fetch ESA WorldCover data (resolution of ~10 m) via the openEO API (see instructions later on this page) or to use a local file. This needs to be specified in the config.yaml file. A recommended local file with global landcover data is the Copernicus Global Land Cover dataset, although it only has a resolution of ~100 m. If you want to use it, download the file from zenodo named PROBAV_LC100_global_v3.0.1_2019-nrt_Discrete-Classification-map_EPSG-4326.tif. Leave the name as it is and put it in the "Raw_Spatial_Data" folder.
⚠️ Attention: the file size is 1.7 GB. You can also use landcover data from a different data source (then the coloring needs to be adjusted).
- OSM 🚧: OpenStreetMap Shapefile contains tons of geodata. Download the file of the country or region where your study region is located: click on the relevant continent and then on the country to download the `.shp.zip`. Sometimes you can go even more granular by clicking on the country. It is best to use the smallest available area that still contains your study region, to save storage space. Be aware of the file naming. Unzip the downloaded OSM data folder and put it inside the "OSM" folder.
The OSM data is used to extract railways, roads and airports. Be aware that these files can quickly become large, making the calculations slow. Handle roads with caution: often there are many roads, which leads to big files.
- Coastlines 🚧: Global Oceans and Seas contains all oceans and seas. It is used to buffer the coastlines and is only needed when the study region has a coastline. Click on "Global Oceans and Seas" and download the geopackage. Unzip it, name the file "goas.gpkg" and put it into the folder "GOAS" in the "Raw_Spatial_Data" folder.
- Protected Areas 🚧 🤖: The World Database on Protected Areas is a global database of protected areas. Search for the country your study area is located in. Click on "Download" > "File Geodatabase" > "Continue" under Non Commercial Use, then right-click the download button and copy its link address. Paste this link into config.yaml behind "WDPA_url". The script will then automatically download, clip and reproject the protected areas within your study area. You can also use your own, locally stored file with protected areas by setting the right options in config.yaml.
- Mean wind speeds 🤖: The Global Wind Atlas has data on mean wind speeds with high spatial resolution. The data is automatically downloaded and processed by the script (see the sketch after this list). If this does not work, check whether the 3-letter country code used in config.yaml matches the one used by the Global Wind Atlas.
- Solar radiation 🤖: The Global Solar Atlas has data on the long-term yearly average of potential photovoltaic electricity production (PVOUT) in kWh/kWp with high spatial resolution. The data is automatically downloaded and processed by the script.
⚠️ For some areas there is no data, especially for many areas north of 60°N (e.g. Greenland, Iceland, parts of Sweden, Norway, Finland, Russia).
⚠️ For some countries the default measure "LTAym_YearlyMonthlyTotals" cannot be downloaded, which causes the script to fail. Check the available measure directly in the download area of the Global Solar Atlas and replace it in config.yaml under "advanced details" (e.g. "LTAy_YearlySum" instead of "LTAym_YearlyMonthlyTotals").
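For illustration, the following sketch mimics the automatic wind speed download referenced above. The URL pattern and the height parameter are assumptions based on the public Global Wind Atlas GIS endpoint, not necessarily what LAVA calls internally; file names are illustrative.

```python
# Sketch of an automated Global Wind Atlas download; the URL pattern and
# height parameter are assumptions, file names are illustrative.
import requests
import geopandas as gpd
import rasterio
import rasterio.mask

iso3 = "DEU"    # 3-letter country code, must match the one in config.yaml
url = f"https://globalwindatlas.info/api/gis/country/{iso3}/wind-speed/100"

with open("wind_speed.tif", "wb") as f:
    f.write(requests.get(url, timeout=300).content)

# Clip the country-wide raster to the study region polygon
region = gpd.read_file("region_polygon.gpkg")          # illustrative file
with rasterio.open("wind_speed.tif") as src:
    clipped, transform = rasterio.mask.mask(
        src, region.to_crs(src.crs).geometry, crop=True
    )
```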
Note
Landcover data can be read from a local file or automatically fetched via the openEO API.
This powerful API can connect to multiple back-ends, and data processing can be done on the back-end if desired. Here is some general information about openEO:
In the LAVA tool, the openEO connection to ESA WorldCover data is implemented. In order to use it, you need to be registered with the Copernicus Data Space Ecosystem. Follow these instructions to register. When running the LAVA tool for the first time, you will be asked to authenticate using your Copernicus account. Click on the link printed by the script and log in to authenticate. When running the tool again, a locally stored refresh token is used for authentication, so you don't have to log in again.
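A minimal sketch of such a fetch with the openeo Python client, assuming the CDSE collection id ESA_WORLDCOVER_10M_2021_V2 and its MAP band (check the back-end's collection listing; LAVA's internal call may differ):

```python
# Minimal openEO sketch; collection id and band name are assumptions,
# verify them against the back-end's collection listing.
import openeo

connection = openeo.connect("openeo.dataspace.copernicus.eu")
connection.authenticate_oidc()  # prints a login link on the first run,
                                # reuses a stored refresh token afterwards

cube = connection.load_collection(
    "ESA_WORLDCOVER_10M_2021_V2",   # assumed collection id on CDSE
    spatial_extent={"west": 5.0, "south": 50.0, "east": 6.0, "north": 51.0},
    bands=["MAP"],                  # assumed band name
)
cube.download("landcover_cutout.tif")  # synchronous request, small areas only
```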
More information about the openEO connection with Copernicus:
- Every user gets 10,000 credits per month to carry out processes on this back-end. In your Copernicus Data Space account you can see your credit balance and how many credits your jobs cost.
- Copernicus Dataspace Forum
- General openEO limitations: tested up to 100x100 km at 10 m resolution; on the free tier, synchronous requests and batch jobs are limited to 2 concurrent requests/jobs.
- openEO Web Editor, where you can see all your batch jobs.
Note
DEM (digital elevation model) is just a generic term. To be more precise, one can distinguish between Digital Surface Models (DSM) and Digital Terrain Models (DTM). DSMs also include vegetation like forests as well as buildings. Since ground-mounted PV and wind turbines are built on the ground and not on trees, a DTM is much better suited. When deriving the slope map from a high-resolution DSM, you can get pixels with high slopes at the edges of forests, fields and buildings; for the tool's purpose this is false data. Unfortunately, many easily accessible DEMs are DSMs rather than DTMs. The above-mentioned GEBCO dataset is in fact a DTM, but with a rather low resolution. There may be local DTMs with a higher resolution, which can also be used in the tool.
The Copernicus Global 30 meter Digital Elevation Model dataset is accessible through the openEO API, but it is only a DSM. The MERIT DEM comes close to being a global DTM: it is based on SRTM with forests removed, but buildings are not removed. The best global DTM is FABDEM. It is based on the Copernicus Global 30 m DEM with buildings and forests removed. Unfortunately it is commercial; under certain circumstances, free API access is possible. FABDEM originated at the University of Bristol, so the tiles are freely accessible here. However, since the FABDEM API is not always freely accessible, FABDEM is not implemented in the LAVA tool.
Note
Be careful with landcover data: this type of data is good for estimating the potential, but it is still far from a precise local measurement. Landcover data is derived from satellite images and prone to errors. Note that there is often only the category "built-up area", which includes all areas where something is built on the land. So there is no differentiation between urban areas and stand-alone industrial or agricultural complexes, which may not need as much buffer distance to renewable energy installations. Sometimes even parts of roads or solar PV parks are classified as "built-up area" in the landcover data. For a detailed analysis, e.g. to derive priority zones for renewables, detailed local geospatial data is needed, which has a high resolution and differentiates between area types in more detail.
It is possible to use gridded population data as a proxy dataset, but that might miss some areas with houses. Careful consideration must be given to which data is used.
Note
Take additional care when using a study region with a coastline. Coastlines can be very difficult and complex. Always cross-check the results.
In the "configs"-folder copy the file config_template.yaml and rename it to config.yaml.
In the config.yaml file you can configure the data preprocessing and the land exclusion. You can also copy and rename the config.yaml if you want to test multiple settings. Be aware that the name of the config.yaml file needs to be the same in the scripts.
In the config.yaml file you can choose which data you want to consider in the preprocessing.
You also have to select your study region. When using the automatic download from gadm.org, you have to specify the name of the region (region_name) and the GADM level as used by gadm.org. Ideally, you download the geopackage of the country you are interested in from gadm.org and load it into QGIS to find the right gadm_level and region_name. For some countries there is trouble downloading administrative boundaries from gadm.org; then you must use your own custom study area file instead.
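If the automatic download fails, the lookup can be reproduced by hand. A sketch using geopandas on a GADM 4.1 geopackage; the layer name ADM_ADM_1 and attribute NAME_1 follow GADM's conventions, and the region is illustrative:

```python
# Sketch: find gadm_level and region_name by inspecting a GADM geopackage.
# Layer/attribute names follow GADM 4.1 conventions; verify for your file.
import geopandas as gpd

gdf = gpd.read_file("gadm41_DEU.gpkg", layer="ADM_ADM_1")  # level-1 boundaries
print(sorted(gdf["NAME_1"]))                  # candidate region_name values
region = gdf[gdf["NAME_1"] == "Bayern"]       # illustrative region
region.to_file("custom_study_area.gpkg", driver="GPKG")
```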
Finally, you have to specify the land exclusions and buffer zones.
At the bottom of the config.yaml file you can find the settings for advanced details.
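For orientation, a hypothetical excerpt of what such a configuration can look like. Only region_name, gadm_level, landcover_source and WDPA_url are keys mentioned in this guide; the rest is illustrative, so consult config_template.yaml for the real structure:

```yaml
# Illustrative excerpt only -- see config_template.yaml for the real keys.
region_name: "Bayern"              # as used by gadm.org
gadm_level: 1                      # administrative level on gadm.org
landcover_source: "esa_worldcover" # or a local landcover file
WDPA_url: "https://..."            # link copied from the WDPA download button
exclusions:                        # hypothetical block: buffers in metres
  roads: 200
  railways: 300
  protected_areas: 500
```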
The script spatial_data_prep.py performs multiple data preprocessing steps to facilitate the land analysis and land eligibility study:
- download the administrative boundary of the study region from gadm.org using the package pygadm, or use a custom polygon instead if desired (the custom polygon needs to be put into the right folder within the "Raw_Spatial_Data" folder) (alternative sources for administrative boundaries here; a nice tool with online visualization and download here)
- calculate the local UTM zone (you can also set the projected CRS manually)
- clip coastlines
- process OSM data either from Overpass or from Geofabrik (clipped to the study region)
- clip additional exclusion vector and raster data
- clip and reproject landcover data and elevation data.
- create a slope map from the elevation data (calculated internally using richdem)
- create an aspect map from the elevation data (calculated internally using richdem)
- co-register elevation, slope and aspect to the landcover data using nearest-neighbour resampling (more on working with multiple raster files, resampling and registering: here)
- create a map showing pixels with slope greater than X° and aspect between Y° and Z° (north-facing pixels with a slope where you would not build PV; see the sketch after this list) (default: X=10°, Y=310°, Z=50°)
- download and clip protected areas from WDPA or use your own local file
- download, clip and co-register mean wind speed data to landcover (used to exclude areas with low wind speeds)
- download, clip and co-register potential solar PV generation data to landcover (used to exclude areas with low solar PV production potential)
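A sketch of the slope/aspect steps and the north-facing mask from the list above, assuming richdem's TerrainAttribute API; file names are illustrative and the thresholds are the defaults named above:

```python
# Sketch of the slope/aspect derivation with richdem; file names are
# illustrative, thresholds are the defaults named above.
import richdem as rd

dem = rd.LoadGDAL("DEM_clipped.tif")                      # projected DEM
slope = rd.TerrainAttribute(dem, attrib="slope_degrees")
aspect = rd.TerrainAttribute(dem, attrib="aspect")        # degrees from north

# North-facing pixels with significant slope: the aspect window wraps
# around north, i.e. from 310° through 0° to 50°
mask = (slope > 10) & ((aspect >= 310) | (aspect <= 50))
print(f"{mask.sum()} north-facing steep pixels flagged")
```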
For the preprocessing, some functions are used which are defined in the files in the folder "utils".
The processed data is saved to a folder within the "data"-folder named according to the study region.
With the Jupyter notebook data_exploration.ipynb you can inspect the spatial data of your study region.
In the second code cell, just put the name of your study region, i.e. the name of the folder with the preprocessed data. Additionally, set the right landcover_source to fetch the correct legend and color dictionary.
You also need to run this notebook to store the land use codes and the pixel size in separate .json files.
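These .json files are plain dictionaries and can be read back later, roughly like this (file names and the example entry are illustrative):

```python
# Sketch: reading back the stored land use codes and pixel size
# (file names and the example entry are illustrative).
import json

with open("landuses.json") as f:
    landuses = json.load(f)     # e.g. {"10": "Tree cover", ...}
with open("pixel_size.json") as f:
    pixel_size = json.load(f)   # resolution of the co-registered rasters
```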
With the script Exclusion.py you can finally derive the available area of your study region. You can use the predefined exclusions in the config.yaml or customize them yourself.
The code automatically recognizes when a file does not exist and then does not take the respective file into account for the exclusion (e.g. there is no coastlines file for a study region without a coast).
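Conceptually, this boils down to an existence check before each exclusion layer, roughly (paths are illustrative):

```python
# Rough illustration of the skip-if-missing behaviour (paths illustrative).
from pathlib import Path

for layer in [Path("data/Bayern/coastlines.gpkg"),
              Path("data/Bayern/protected_areas.gpkg")]:
    if not layer.exists():
        print(f"Skipping {layer.name}: file not found")
        continue
    # ... apply the exclusion with its configured buffer ...
```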
create_qgis_project.py puts all files together into a QGIS project to display them easily. This script needs to be run in the 'QGIS environment', which you can install from requirements_qgis.yaml. Additionally, you have to install the QGIS Python package into that environment with the same version as your current QGIS installation. Sometimes you need to uninstall the Python version in the environment first and then install the QGIS package in order to avoid trouble with package dependencies.
conda install -c conda-forge qgis=VERSIONNUMBER
Once the available land rasters are created you can combine them across study
regions. The script simple_results_analysis.py scans all folders under
data/**/available_land/ for files matching *_available_land_*.tif. Files are
grouped by technology and scenario. All rasters in a group are reprojected to
EPSG:4326, merged, and then converted to polygons. The resulting geometries are
written to a GeoPackage.
python simple_results_analysis.py --output aggregated_available_land.gpkg
If run from outside the repository root, provide --root PATH/TO/REPO. The
output GeoPackage will contain one layer per technology and scenario combination.
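A condensed sketch of that pipeline for one technology/scenario group, assuming rasterio and geopandas and that the inputs have already been reprojected to EPSG:4326; the actual script may differ in details:

```python
# Condensed sketch of the aggregation for one technology/scenario group,
# assuming inputs already reprojected to EPSG:4326 (the script may differ).
import glob
import geopandas as gpd
import rasterio
from rasterio.features import shapes
from rasterio.merge import merge
from shapely.geometry import shape

paths = glob.glob("data/*/available_land/*_available_land_*.tif")
sources = [rasterio.open(p) for p in paths]
mosaic, transform = merge(sources)

# Polygonize all non-zero (available) pixels of the merged first band
geoms = [shape(geom)
         for geom, value in shapes(mosaic[0], transform=transform)
         if value != 0]
gdf = gpd.GeoDataFrame(geometry=geoms, crs="EPSG:4326")
gdf.to_file("aggregated_available_land.gpkg",
            layer="pv_base", driver="GPKG")   # hypothetical layer name
```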
Folder structure of the repository (original from here):
LAVA/
├── 📁 configs
│   ├── config_template.yaml
│   └── config.yaml
├── 📁 envs
├── 📁 other
├── 📁 utils
├── 📁 Raw_Spatial_Data/
│   ├── 📁 additional_exclusion_polygons
│   ├── 📁 custom_study_area
│   ├── 📁 DEM
│   ├── 📁 global_solar_wind_atlas
│   ├── 📁 GOAS
│   ├── 📁 landcover
│   ├── 📁 OSM
│   └── 📁 protected_areas
└── 📁 data/
    └── 📁 "region_name"/
        ├── *DEM*
        ├── *region_polygon*
        ├── *solar*
        ├── *wind*
        ├── *protected_areas*
        ├── *landcover*
        ├── *EPSG*
        ├── *landuses*
        ├── *pixel_size*
        ├── *OSM_files*
        ├── 📁 derived_from_DEM/
        │   ├── *slope*
        │   └── *aspect*
        ├── 📁 additional_exclusion_polygons/
        └── 📁 available_land/
- Terrascope API: not implemented because of limited functionality (e.g. it only downloads tiles; data cannot be clipped to the area of interest). API documentation, ESA WorldCover Product
- Download DEMs in QGIS for a Specified Extent with the OpenTopography DEM Downloader Plugin
- Meadows et al. conclude: "In conclusion, we found FABDEM to be the most accurate DEM overall (especially for forests and low-slope terrain), suggesting that its error correction methodology is effective at reducing large positive errors in particular and that it generalises well to new application sites. Where FABDEM is not an option (given licensing costs for commercial applications), GLO-30 DGED is the clear runner-up under most conditions, with the exception of forests, where NASADEM (re-processed SRTM data) is more accurate." For a more nuanced assessment read the article (for some applications FABDEM might not be the most accurate one).
- GEDTM30: GEDTM30 is a global 1-arc-second (~30 m) Digital Terrain Model (DTM) built using a machine-learning-based data fusion approach. It can be used as an alternative to the GEBCO DEM. GEDTM30 will hopefully be integrated with openEO soon.
- Global Lakes and Wetlands Database: comprehensive global map of 33 different types of wetlands around the world.