While the RMI dataset is actually a fully open dataset, we want to use it to prototype how other data providers might share some data transparently while restricting other data to specific uses and users.
I believe the correct place for such documentation is the newly added section 3 of this document: https://github.com/os-climate/os_c_data_commons/blob/main/docs/create-ingestion-pipeline.md
Once such documentation is created, we can adjust access controls to keep secret production data while publishing other data transparently. We can also create masks so that some subset of the data is fully transparent, whether production data or not.