Extend my dataset #3

@jodelaure

1. Use Case Title:

  • Extend my dataset

2. User Story/Scenario:

  • As a user assisted by dishacled software, I want to search all datasets available on the internet for matches against a given shape, expressed as an RDF graph, to find relevant datasets that may enrich or complement my local dataset.
  • The dishacled-powered search may hit datasets published as feeds (LDES), as data dumps (TTL), as dynamic APIs (SPARQL), or as a combination of these three datasource types.
  • The matching result may be exact or partial. In the case of a partial match, I expect an option to let the search continue discovering other relevant datasets that cover more of the concepts in the query shape. In this way, a series of datasets may be found that complement each other to match the queried shape.

3. Problem Statement:

  • The sheer number of available datasets that match a subject filter like "traffic" or "street" makes manual browsing impractical.
  • Datasets may be interesting, but I need to be able to join them with my current data model and setup.
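
To make the partial-match scenario more concrete, here is a minimal sketch of how a search could keep adding datasets until the query shape is covered. It is only an illustration, not how dishacled works: the function name `cover_shape`, the `ex:` terms, and the dataset names are all hypothetical, and it assumes each candidate dataset has already been summarised as the set of classes and properties it uses. The selection is a plain greedy set cover, picking the dataset that covers the most still-missing concepts at each step.

```python
from typing import Dict, List, Set, Tuple


def cover_shape(
    shape_terms: Set[str],
    dataset_terms: Dict[str, Set[str]],
) -> Tuple[List[Tuple[str, Set[str]]], Set[str]]:
    """Greedily select datasets that together cover the terms of a shape.

    Returns the selected datasets (each with the terms it newly covers)
    and the set of shape terms that no dataset could cover.
    """
    remaining = set(shape_terms)
    candidates = dict(dataset_terms)
    selected: List[Tuple[str, Set[str]]] = []

    while remaining and candidates:
        # Pick the dataset covering the most uncovered terms.
        # Iterating over sorted names makes ties deterministic.
        best = max(
            sorted(candidates),
            key=lambda name: len(candidates[name] & remaining),
        )
        gained = candidates.pop(best) & remaining
        if not gained:
            break  # no remaining dataset adds coverage: partial match
        selected.append((best, gained))
        remaining -= gained

    return selected, remaining


# Hypothetical shape concepts and dataset summaries, for illustration only.
shape = {"ex:TrafficRegulation", "ex:validFrom", "ex:Street", "ex:geometry"}
datasets = {
    "regulations-feed": {"ex:TrafficRegulation", "ex:validFrom"},
    "street-register": {"ex:Street", "ex:geometry", "ex:name"},
    "parking-zones": {"ex:geometry"},
}

selected, missing = cover_shape(shape, datasets)
for name, terms in selected:
    print(name, sorted(terms))
print("uncovered:", sorted(missing))
```

An empty `missing` set corresponds to an exact match by a combination of datasets; a non-empty one tells the user which concepts of the shape are still unmatched. Whether the selected datasets can actually be joined is a separate question that this sketch does not address.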

4. Desired Outcome/Goal:

  • A list of usable datasets

5. Data Catalog(s) Involved:

  • data.europa.eu
  • a shape based on local decisions about traffic regulations

6. Data/Service Requirements:

  • What kind of data or data services are being sought?
  • What are the required characteristics of the data/services (e.g., temporal coverage, spatial extent, theme)?
  • Do any related SHACL shapes exist already?

7. Discovery Criteria/Filters:

  • What criteria or filters should be used to automatically discover the required datasets/services?
  • Examples:
    • Keywords: "traffic"
    • Spatial extent: Bounding box or geometry
    • Temporal range: Start and end dates
    • License: Open Data, CC-BY
    • Theme/Category: Environment, Science
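
As an illustration of how such filters might be turned into a catalogue query, the sketch below builds a SPARQL query string over DCAT metadata. It is a rough example under several assumptions: the helper `build_discovery_query` is hypothetical, the catalogue is assumed to describe datasets with `dcat:keyword` and `dcat:theme` and to attach `dct:license` to distributions (as DCAT-AP does), and the spatial and temporal filters are left out for brevity. The code only constructs the query text; it does not contact any endpoint.

```python
from typing import Optional

PREFIXES = (
    "PREFIX dcat: <http://www.w3.org/ns/dcat#>\n"
    "PREFIX dct: <http://purl.org/dc/terms/>\n"
)


def build_discovery_query(
    keyword: str,
    license_uri: Optional[str] = None,
    theme_uri: Optional[str] = None,
    limit: int = 20,
) -> str:
    """Build a SPARQL SELECT query for datasets matching simple filters."""
    # Escape backslashes and double quotes so the keyword is a valid literal.
    safe = keyword.lower().replace("\\", "\\\\").replace('"', '\\"')

    patterns = [
        "?dataset a dcat:Dataset ;",
        "         dct:title ?title ;",
        "         dcat:keyword ?keyword .",
        f'FILTER(CONTAINS(LCASE(STR(?keyword)), "{safe}"))',
    ]
    if theme_uri is not None:
        patterns.append(f"?dataset dcat:theme <{theme_uri}> .")
    if license_uri is not None:
        # Assumption: the license is stated on the distribution.
        patterns.append("?dataset dcat:distribution ?distribution .")
        patterns.append(f"?distribution dct:license <{license_uri}> .")

    body = "\n  ".join(patterns)
    return (
        f"{PREFIXES}"
        "SELECT DISTINCT ?dataset ?title WHERE {\n"
        f"  {body}\n"
        "}\n"
        f"LIMIT {limit}"
    )


# The license URI below is a placeholder, not a value taken from a catalogue.
print(build_discovery_query("traffic", license_uri="http://example.org/license/cc-by"))
```

The resulting string could then be sent to a catalogue's SPARQL endpoint, if it offers one. The URIs a given catalogue actually uses for licenses and themes would need to be checked against that catalogue.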

8. Expected Output/Results:

  • What is the expected output or result of the automatic discovery process?
  • Should the output be a list of datasets/services, metadata records, or direct data access?

9. Additional Information:

  • Include any other relevant information or context that may be helpful for understanding and implementing this use case.
