Skip to content

Source Data

Hoa Le edited this page May 23, 2023 · 3 revisions

Source Data (aka source_data)

This table is all about defining your source objects of data. Here you configure the names of the sources, where they are located in the database, which metadata they have available, etc.

Columns

The following columns are required in the table/worksheet called "source_data" inside your metadata:

Name Explanation
Source_Table_Identifier A globally unique identifier of this one source object.
Source_System The name of this object's source system. Will be used for the GUI selection.
Source_Object The name of this object inside the source system. Will be used for the GUI selection.
Source_Schema_Physical_Name The physical name of the schema within the database, where this object is located.
Source_Table_Physical_Name The physical name of the table that holds the data of this source object.
Record_Source_Column The name of the column, that holds info about the record source of each entry. Could also be a technical metadata column, like "_FILE_NAME" in BigQuery.
Static_Part_of_Record_Source_Column

The static part inside the record source column. Check here for more information.

Load_Date_Column The name of the column that holds info about the load date of the data.
Group_Name(optional) Defines to which group this entry belongs, for the visualization of the entities

Example

See the following example values for each column. In total, they describe the object "Solution" from the source system "Salesforce" and the object "Part" from the source system "TPC-H".

Source_Table_Identifier Source_System Source_Object Source_Schema_Physical_Name Source_Table_Physical_Name Record_Source_Column Static_Part_of_Record_Source_Column Load_Date_Column Group_Name
SRC0001 SFDC Solution SALESFORCE_STAGE stage_sfdc_solution rsrc_file */SALESFORCE/06sIPY/Solution/* edwLoadDate solution
SRC0002 TPCH Part TPC-H_SF1 Part !Part Part GETDATE() Part

Clone this wiki locally