Description
We have some images in an Azure storage account. All of those images (blobs) have metadata.
By calling download we can download the files from the provided path, but the metadata associated with those files is missing.
The metadata contains important information about each image (e.g. tags) that we need in our ML pipeline.
Is there any way to keep the metadata of a blob when we download it?
E.g.:
from azureml.core import Datastore, Dataset
from azureml.data.datapath import DataPath

# Register the blob container as a datastore
blob_datastore = Datastore.register_azure_blob_container(workspace=aml_workspace,
                                                         datastore_name=blob_datastore_name,
                                                         container_name=container_name,
                                                         account_name=account_name,
                                                         account_key=account_key)
def_blob_store = Datastore(aml_workspace, "train")
datastore_path = [DataPath(def_blob_store, '*.jpeg')]
file_dataset = Dataset.File.from_files(path=datastore_path)
# Downloads the files, but not their blob metadata
file_dataset.download("files")
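One possible workaround, in case the dataset download itself cannot carry the metadata: read the metadata directly from the storage account and save it next to the downloaded files. The following is only a minimal sketch under that assumption. It uses the separate azure-storage-blob package rather than anything in azureml, and the helper names (write_metadata_sidecars, list_blobs_with_metadata) and the ".metadata.json" sidecar naming are hypothetical, chosen just for illustration.

```python
import fnmatch
import json
from pathlib import Path


def write_metadata_sidecars(blobs, target_dir, pattern="*.jpeg"):
    """Write one '<blob name>.metadata.json' file per matching blob.

    `blobs` is any iterable of objects exposing `.name` and `.metadata`,
    such as the items yielded by ContainerClient.list_blobs.
    Returns a dict mapping blob name -> path of the sidecar file written.
    """
    target = Path(target_dir)
    written = {}
    for blob in blobs:
        # Skip blobs that do not match the file pattern used for the dataset.
        if not fnmatch.fnmatch(blob.name, pattern):
            continue
        sidecar = target / (blob.name + ".metadata.json")
        sidecar.parent.mkdir(parents=True, exist_ok=True)
        # Blobs without metadata get an empty JSON object.
        sidecar.write_text(json.dumps(blob.metadata or {}, indent=2))
        written[blob.name] = sidecar
    return written


def list_blobs_with_metadata(account_name, account_key, container_name):
    """List the blobs of a container together with their metadata."""
    # Imported here so the helper above can be used without the Azure SDK.
    from azure.storage.blob import BlobServiceClient

    service = BlobServiceClient(
        account_url=f"https://{account_name}.blob.core.windows.net",
        credential=account_key,
    )
    container = service.get_container_client(container_name)
    # include=["metadata"] asks the service to return each blob's metadata.
    return container.list_blobs(include=["metadata"])
```

With the same account_name, account_key and container_name as in the example above, calling write_metadata_sidecars(list_blobs_with_metadata(account_name, account_key, container_name), "files") after file_dataset.download("files") would leave a JSON file with the tags beside each image, assuming the downloaded files keep their blob names as relative paths.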
Document Details
⚠ Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.
- ID: b080f459-b191-a60e-e327-720ef095165c
- Version Independent ID: 53d1b412-0c47-1044-3c6c-7c8c0f1eb0c0
- Content: azureml.data.FileDataset class - Azure Machine Learning Python
- Content Source: AzureML-Docset/stable/docs-ref-autogen/azureml-core/azureml.data.FileDataset.yml
- Service: machine-learning
- Sub-service: core
- GitHub Login: @DebFro
- Microsoft Alias: debfro