Replies: 1 comment
The filter needs to be enabled in order to take effect.
or, if you want to enable other filters at the same time
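A minimal sketch of what enabling the filter might look like from the command line. The property names `spark.spline.postProcessingFilter=userExtraMeta` and the `composite` variant in the comments are assumptions inferred from the naming pattern in `spline.default.properties`, so check them against the file for your agent version. The rules JSON is a shortened version of the one in the question. It is wrapped in single quotes here because, inside double quotes, the shell expands `$js`, `$env` and `$jvm` as shell variables (normally to empty strings) before Spark ever sees them.

```shell
#!/bin/sh
# Single quotes keep $js and $env literal; the escaped commas (\,) are kept
# exactly as they appear in the question.
RULES='{"executionPlan":{"foo":{"$js":"executionPlan.name()"}\,"bar":{"$env":"JAVA_HOME"}\,"qux":42}}'

# Assumption: selecting the filter by name is what enables it.
FILTER_CONF="spark.spline.postProcessingFilter=userExtraMeta"
RULES_CONF="spark.spline.postProcessingFilter.userExtraMeta.rules=${RULES}"

# Assumed alternative when other filters should stay enabled as well:
#   spark.spline.postProcessingFilter=composite
#   spark.spline.postProcessingFilter.composite.filters=userExtraMeta,<other filter names>

# Print the two values so they can be inspected before being passed to spark-submit.
printf '%s\n' "$FILTER_CONF" "$RULES_CONF"
```

Both values can then be added to the `spark-submit` call from the question as `--conf "$FILTER_CONF" --conf "$RULES_CONF"`, next to the existing listener and dispatcher settings.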
Version
spark: [email protected]
spline-agent: za.co.absa.spline.agent.spark:spark-3.1-spline-agent-bundle_2.12:0.6.1
Background
After following the steps in the blog post https://www.capitalone.com/tech/software-engineering/spline-spark-data-lineage/, I've been able to extract data lineage from Spark and upload it to the Spline server. The next thing I want to do is inject metadata into the lineage using a code-less configuration. Based on the Spline agent's default properties https://github.com/AbsaOSS/spline-spark-agent/blob/develop/core/src/main/resources/spline.default.properties, I tried to configure spark.spline.postProcessingFilter.userExtraMeta.rules, but I have not been successful.
Question
What is the correct way to configure spark.spline.postProcessingFilter.userExtraMeta.rules using a code-less configuration?
Setup
spark-submit \
  --packages za.co.absa.spline.agent.spark:spark-3.1-spline-agent-bundle_2.12:0.6.1 \
  --conf spark.sql.queryExecutionListeners=za.co.absa.spline.harvester.listener.SplineQueryExecutionListener \
  --conf spark.spline.producer.url=http://localhost:8080/producer \
  --conf spark.spline.lineageDispatcher=console \
  --conf spark.spline.postProcessingFilter.userExtraMeta.rules"={\"executionPlan\":{\"foo\":{\"$js\":\"executionPlan.name()\"}\,\"bar\":{\"$env\":\"JAVA_HOME\"}\,\"baz\":{\"$jvm\":\"some.jvm.prop\"}\,\"qux\":42\,\"tags\":[\"aaa\"\,\"bbb\"\,\"ccc\"]}}" \
  my_spark_job.py
Output [Extra metadata has not been added]