You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: metadata-ingestion/docs/sources/hex/README.md
+9-1Lines changed: 9 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -20,4 +20,12 @@ Currently, the [Hex API](https://learn.hex.tech/docs/api/api-reference) has some
20
20
21
21
2.**Metadata Access**: There is no direct method to retrieve metadata for Collections, Status, or Categories. This information is only available indirectly through references within Projects and Components.
22
22
23
-
Please keep these limitations in mind when working with the Hex connector.
23
+
Please keep these limitations in mind when working with the Hex connector.
24
+
25
+
For the Dataset - Hex Project lineage, the connector relies on the
Therefore, in order to extract lineage information, the required setup must include:
28
+
29
+
- A separated warehouse ingestor (_eg_ BigQuery, Snowflake, Redshift, ...) with `use_queries_v2` enabled in order to fetch Queries.
30
+
This will ingest the queries into DataHub as `Query` entities and the ones triggered by Hex will include the corresponding _Hex query metadata_.
31
+
- A DataHub server with version >= SaaS `0.3.10` or > OSS `1.0.0` so the `Query` entities are properly indexed by source (Hex in this case) and so fetched and processed by the Hex ingestor in order to emit the Dataset - Project lineage.
description="Set ownership identity from owner/creator email",
95
104
)
105
+
include_lineage: bool=Field(
106
+
default=True,
107
+
description='Include Hex lineage, being fetched from DataHub. See "Limitations" section in the docs for more details about the limitations of this feature.',
108
+
)
109
+
lineage_start_time: Optional[datetime] =Field(
110
+
default=None,
111
+
description="Earliest date of lineage to consider. Default: 1 day before lineage end time. You can specify absolute time like '2023-01-01' or relative time like '-7 days' or '-7d'.",
112
+
)
113
+
lineage_end_time: Optional[datetime] =Field(
114
+
default=None,
115
+
description="Latest date of lineage to consider. Default: Current time in UTC. You can specify absolute time like '2023-01-01' or relative time like '-1 day' or '-1d'.",
116
+
)
117
+
datahub_page_size: int=Field(
118
+
default=DATAHUB_API_PAGE_SIZE_DEFAULT,
119
+
description="Number of items to fetch per DataHub API call.",
0 commit comments