Skip to content

Add SDE page to TOC. Begin adding how-to steps based on latest design…

4e27631
Select commit
Loading
Failed to load commit list.
Draft

[Hold][WIP] Document data extractor #747

Add SDE page to TOC. Begin adding how-to steps based on latest design…
4e27631
Select commit
Loading
Failed to load commit list.
Mintlify / Mintlify Deployment succeeded Nov 19, 2025 in 3m 33s

Deployment Succeeded

Details

Verified update permissions
Fetching and validating config file...
Successfully validated docs.json
Fetched all file paths
Fetched 0 OpenApi file(s)
Fetched 0 AsyncApi file(s)
Generated OpenAPI pages for navigation
Skipped AsyncAPI navigation generation
No stale files found
Updating all paths...
Successfully updated deployment
Successfully saved config
No stale tracked assets found
Updating navigation...
Navigation updated
Skipped search indexing.
Successfully deleted stale OpenAPI document(s)
Successfully deleted stale AsyncAPI document(s)
Cached valid paths
Starting page revalidation...
Revalidating all pages...
Revalidating paths:
  api-reference/destinations/create-destination-connection-check
  api-reference/destinations/create-destination-connector
  api-reference/destinations/delete-destination-connector
  api-reference/destinations/get-destination-connector
  api-reference/destinations/get-the-latest-destination-connector-connection-check
  api-reference/destinations/list-destination-connectors
  api-reference/destinations/update-destination-connector
  api-reference/general/summary
  api-reference/jobs/cancel-job
  api-reference/jobs/download-job-output
  api-reference/jobs/get-job
  api-reference/jobs/get-job-failed-files
  api-reference/jobs/get-job-processing-details
  api-reference/jobs/list-jobs
  api-reference/legacy-api/aws
  api-reference/legacy-api/azure
  api-reference/legacy-api/overview
  api-reference/overview
  api-reference/partition/api-parameters
  api-reference/partition/api-validation-errors
  api-reference/partition/chunking
  api-reference/partition/document-elements
  api-reference/partition/examples
  api-reference/partition/extract-image-block-types
  api-reference/partition/generate-schema
  api-reference/partition/get-chunked-elements
  api-reference/partition/get-elements
  api-reference/partition/output-bounding-box-coordinates
  api-reference/partition/overview
  api-reference/partition/partitioning
  api-reference/partition/post-requests
  api-reference/partition/quickstart
  api-reference/partition/sdk-jsts
  api-reference/partition/sdk-python
  api-reference/partition/speed-up-large-files-batches
  api-reference/partition/text-as-html
  api-reference/partition/transform-schemas
  api-reference/sources/create-source-connection-check
  api-reference/sources/create-source-connector
  api-reference/sources/delete-source-connector
  api-reference/sources/get-source-connector
  api-reference/sources/get-the-latest-source-connector-connection-check
  api-reference/sources/list-available-source-connectors
  api-reference/sources/update-source-connector
  api-reference/supported-file-types
  api-reference/troubleshooting/api-key-url
  api-reference/workflow/destinations/astradb
  api-reference/workflow/destinations/azure-ai-search
  api-reference/workflow/destinations/couchbase
  api-reference/workflow/destinations/databricks-delta-table
  api-reference/workflow/destinations/databricks-volumes
  api-reference/workflow/destinations/delta-table
  api-reference/workflow/destinations/elasticsearch
  api-reference/workflow/destinations/google-cloud
  api-reference/workflow/destinations/ibm-watsonxdata
  api-reference/workflow/destinations/kafka
  api-reference/workflow/destinations/local
  api-reference/workflow/destinations/milvus
  api-reference/workflow/destinations/mongodb
  api-reference/workflow/destinations/motherduck
  api-reference/workflow/destinations/neo4j
  api-reference/workflow/destinations/onedrive
  api-reference/workflow/destinations/overview
  api-reference/workflow/destinations/pinecone
  api-reference/workflow/destinations/postgresql
  api-reference/workflow/destinations/qdrant
  api-reference/workflow/destinations/redis
  api-reference/workflow/destinations/s3
  api-reference/workflow/destinations/snowflake
  api-reference/workflow/destinations/weaviate
  api-reference/workflow/errors
  api-reference/workflow/jobs
  api-reference/workflow/migration
  api-reference/workflow/overview
  api-reference/workflow/retries
  api-reference/workflow/sources/azure-blob-storage
  api-reference/workflow/sources/box
  api-reference/workflow/sources/confluence
  api-reference/workflow/sources/couchbase
  api-reference/workflow/sources/databricks-volumes
  api-reference/workflow/sources/dropbox
  api-reference/workflow/sources/elasticsearch
  api-reference/workflow/sources/google-cloud
  api-reference/workflow/sources/google-drive
  api-reference/workflow/sources/jira
  api-reference/workflow/sources/kafka
  api-reference/workflow/sources/local
  api-reference/workflow/sources/mongodb
  api-reference/workflow/sources/onedrive
  api-reference/workflow/sources/outlook
  api-reference/workflow/sources/overview
  api-reference/workflow/sources/postgresql
  api-reference/workflow/sources/s3
  api-reference/workflow/sources/salesforce
  api-reference/workflow/sources/sharepoint
  api-reference/workflow/sources/slack
  api-reference/workflow/sources/snowflake
  api-reference/workflow/sources/zendesk
  api-reference/workflow/workflows
  api-reference/workflows/create-workflow
  api-reference/workflows/delete-workflow
  api-reference/workflows/get-workflow
  api-reference/workflows/list-workflows
  api-reference/workflows/run-workflow
  api-reference/workflows/update-workflow
  business/aws/dedicated-instance-privatelink
  business/aws/onboard
  business/aws/overview
  business/azure/onboard
  business/azure/overview
  business/bare-metal/onboard
  business/bare-metal/overview
  business/gcp/onboard
  business/gcp/overview
  business/overview
  business/security-compliance/overview
  examplecode/codesamples/api/huggingchat
  examplecode/codesamples/api/mlk-research
  examplecode/codesamples/apioss/table-extraction-from-pdf
  examplecode/codesamples/oss/multi-files-api-processing
  examplecode/codesamples/oss/overview
  examplecode/codesamples/oss/table-source-connector
  examplecode/codesamples/oss/vector-database
  examplecode/notebooks
  examplecode/tools/azure-storage-events
  examplecode/tools/crewai
  examplecode/tools/databricks-volumes-events
  examplecode/tools/firecrawl
  examplecode/tools/gcs-events
  examplecode/tools/google-drive-events
  examplecode/tools/jq
  examplecode/tools/langflow
  examplecode/tools/mcp
  examplecode/tools/mcp-partition
  examplecode/tools/neo4j-chatbot
  examplecode/tools/onedrive-events
  examplecode/tools/pii
  examplecode/tools/s3-events
  examplecode/tools/s3-vectors
  examplecode/tools/sharepoint-events
  examplecode/tools/snowflake-streamlit
  examplecode/tools/vectorshift
  faq/faq
  open-source/best-practices/chunking
  open-source/best-practices/embedding
  open-source/concepts/document-elements
  open-source/concepts/glossary
  open-source/concepts/models
  open-source/concepts/partitioning-strategies
  open-source/core-functionality/chunking
  open-source/core-functionality/cleaning
  open-source/core-functionality/embedding
  open-source/core-functionality/extracting
  open-source/core-functionality/overview
  open-source/core-functionality/partitioning
  open-source/core-functionality/staging
  open-source/examples/multi-files-api-processing
  open-source/examples/overview
  open-source/examples/table-source-connector
  open-source/examples/vector-database
  open-source/how-to/embedding
  open-source/how-to/examples
  open-source/how-to/extract-image-block-types
  open-source/how-to/filter-files
  open-source/how-to/get-chunked-elements
  open-source/how-to/get-elements
  open-source/how-to/set-ocr-agent
  open-source/how-to/speed-up-large-files-batches
  open-source/how-to/text-as-html
  open-source/ingestion/destination-connectors/astradb
  open-source/ingestion/destination-connectors/azure
  open-source/ingestion/destination-connectors/azure-ai-search
  open-source/ingestion/destination-connectors/box
  open-source/ingestion/destination-connectors/chroma
  open-source/ingestion/destination-connectors/couchbase
  open-source/ingestion/destination-connectors/databricks-delta-table
  open-source/ingestion/destination-connectors/databricks-volumes
  open-source/ingestion/destination-connectors/delta-table
  open-source/ingestion/destination-connectors/dropbox
  open-source/ingestion/destination-connectors/duckdb
  open-source/ingestion/destination-connectors/elasticsearch
  open-source/ingestion/destination-connectors/google-cloud-service
  open-source/ingestion/destination-connectors/ibm-watsonxdata
  open-source/ingestion/destination-connectors/kafka
  open-source/ingestion/destination-connectors/kdbai
  open-source/ingestion/destination-connectors/lancedb
  open-source/ingestion/destination-connectors/local
  open-source/ingestion/destination-connectors/milvus
  open-source/ingestion/destination-connectors/mongodb
  open-source/ingestion/destination-connectors/motherduck
  open-source/ingestion/destination-connectors/neo4j
  open-source/ingestion/destination-connectors/onedrive
  open-source/ingestion/destination-connectors/opensearch
  open-source/ingestion/destination-connectors/overview
  open-source/ingestion/destination-connectors/pinecone
  open-source/ingestion/destination-connectors/postgresql
  open-source/ingestion/destination-connectors/qdrant
  open-source/ingestion/destination-connectors/redis
  open-source/ingestion/destination-connectors/s3
  open-source/ingestion/destination-connectors/sftp
  open-source/ingestion/destination-connectors/singlestore
  open-source/ingestion/destination-connectors/snowflake
  open-source/ingestion/destination-connectors/sqlite
  open-source/ingestion/destination-connectors/vectara
  open-source/ingestion/destination-connectors/weaviate
  open-source/ingestion/ingest-cli
  open-source/ingestion/ingest-configuration/chunking-configuration
  open-source/ingestion/ingest-configuration/embedding-configuration
  open-source/ingestion/ingest-configuration/overview
  open-source/ingestion/ingest-configuration/partition-configuration
  open-source/ingestion/ingest-configuration/processor-configuration
  open-source/ingestion/ingest-dependencies
  open-source/ingestion/overview
  open-source/ingestion/python-ingest
  open-source/ingestion/source-connectors/airtable
  open-source/ingestion/source-connectors/astradb
  open-source/ingestion/source-connectors/azure
  open-source/ingestion/source-connectors/box
  open-source/ingestion/source-connectors/confluence
  open-source/ingestion/source-connectors/couchbase
  open-source/ingestion/source-connectors/databricks-volumes
  open-source/ingestion/source-connectors/delta-table
  open-source/ingestion/source-connectors/discord
  open-source/ingestion/source-connectors/dropbox
  open-source/ingestion/source-connectors/elastic-search
  open-source/ingestion/source-connectors/github
  open-source/ingestion/source-connectors/gitlab
  open-source/ingestion/source-connectors/google-cloud-storage
  open-source/ingestion/source-connectors/google-drive
  open-source/ingestion/source-connectors/jira
  open-source/ingestion/source-connectors/kafka
  open-source/ingestion/source-connectors/local
  open-source/ingestion/source-connectors/mongodb
  open-source/ingestion/source-connectors/notion
  open-source/ingestion/source-connectors/one-drive
  open-source/ingestion/source-connectors/opensearch
  open-source/ingestion/source-connectors/outlook
  open-source/ingestion/source-connectors/overview
  open-source/ingestion/source-connectors/postgresql
  open-source/ingestion/source-connectors/s3
  open-source/ingestion/source-connectors/salesforce
  open-source/ingestion/source-connectors/sftp
  open-source/ingestion/source-connectors/sharepoint
  open-source/ingestion/source-connectors/singlestore
  open-source/ingestion/source-connectors/slack
  open-source/ingestion/source-connectors/snowflake
  open-source/ingestion/source-connectors/sqlite
  open-source/ingestion/source-connectors/zendesk
  open-source/ingestion/supported-file-types
  open-source/installation/docker-installation
  open-source/installation/full-installation
  open-source/installation/overview
  open-source/integrations
  open-source/introduction/overview
  open-source/introduction/quick-start
  open-source/introduction/supported-file-types
  support/how-to/data-privacy
  support/how-to/invoice-issue
  support/how-to/overview
  support/issues/api-connector-secrets
  support/issues/authorization-permissions
  support/issues/cannot-locate-credentials
  support/issues/configuration-resource
  support/issues/data-format-schema-validation
  support/issues/document-processing
  support/issues/get-authenticated-user-error
  support/issues/google-drive-schema-validation
  support/issues/internal-file-handling
  support/issues/network-connection-timeout
  support/issues/no-fast-partitioning-for-images
  support/issues/overview
  support/issues/quota-billing-rate-limiting
  support/issues/workflow-job-in-progress
  support/request
  support/shared-responsibility
  support/status
  ui/account/api-key-url
  ui/account/billing
  ui/account/organizations
  ui/account/overview
  ui/account/roles
  ui/account/usage
  ui/account/workspaces
  ui/chunking
  ui/connectors
  ui/data-extractor
  ui/destinations/astradb
  ui/destinations/azure-ai-search
  ui/destinations/chroma
  ui/destinations/couchbase
  ui/destinations/databricks-delta-table
  ui/destinations/databricks-volumes
  ui/destinations/delta-table
  ui/destinations/elasticsearch
  ui/destinations/google-cloud
  ui/destinations/ibm-watsonxdata
  ui/destinations/kafka
  ui/destinations/milvus
  ui/destinations/mongodb
  ui/destinations/motherduck
  ui/destinations/neo4j
  ui/destinations/onedrive
  ui/destinations/opensearch
  ui/destinations/overview
  ui/destinations/pinecone
  ui/destinations/pinecone-destination-quickstart
  ui/destinations/postgresql
  ui/destinations/qdrant
  ui/destinations/redis
  ui/destinations/s3
  ui/destinations/snowflake
  ui/destinations/weaviate
  ui/document-elements
  ui/embedding
  ui/enriching/generative-ocr
  ui/enriching/image-descriptions
  ui/enriching/ner
  ui/enriching/overview
  ui/enriching/table-descriptions
  ui/enriching/table-to-html
  ui/examples
  ui/jobs
  ui/overview
  ui/partitioning
  ui/quickstart
  ui/sources/azure-blob-storage
  ui/sources/box
  ui/sources/confluence
  ui/sources/couchbase
  ui/sources/databricks-volumes
  ui/sources/dropbox
  ui/sources/dropbox-source-quickstart
  ui/sources/elasticsearch
  ui/sources/google-cloud
  ui/sources/google-drive
  ui/sources/jira
  ui/sources/kafka
  ui/sources/mongodb
  ui/sources/onedrive
  ui/sources/opensearch
  ui/sources/outlook
  ui/sources/overview
  ui/sources/postgresql
  ui/sources/s3
  ui/sources/salesforce
  ui/sources/sftp-storage
  ui/sources/sharepoint
  ui/sources/slack
  ui/sources/snowflake
  ui/sources/zendesk
  ui/summarizing
  ui/supported-file-types
  ui/walkthrough
  ui/walkthrough-2
  ui/workflows
  welcome
  snippets/general-shared-text/enrichment-images-tables-hi-res-only
  enterprise/aws/dedicated-instance-privatelink
  enterprise/aws/onboard
  enterprise/aws/overview
  enterprise/azure/onboard
  enterprise/azure/overview
  enterprise/bare-metal/onboard
  enterprise/bare-metal/overview
  enterprise/gcp/onboard
  enterprise/gcp/overview
  enterprise/overview
  enterprise/security-compliance/overview
  support/how-to/cancel-renewal
  /api-reference/api-services/accessing-unstructured-api
  /api-reference/api-services/api-parameters
  /api-reference/api-services/api-validation-errors
  /api-reference/api-services/aws
  /api-reference/api-services/azure
  /api-reference/api-services/chunking
  /api-reference/api-services/document-elements
  /api-reference/api-services/examples
  /api-reference/api-services/free-api
  /api-reference/api-services/overview
  /api-reference/api-services/partition-via-api
  /api-reference/api-services/partitioning
  /api-reference/api-services/post-requests
  /api-reference/api-services/saas-api-development-guide
  /api-reference/api-services/sdk-jsts
  /api-reference/api-services/sdk-python
  /api-reference/api-services/supported-file-types
  /api-reference/best-practices/speed-up-large-files-batches
  /api-reference/general/pipeline-1
  /api-reference/how-to/:slug*
  /api-reference/ingest/:slug*
  /api-reference/legacy/free-api
  /glossary/glossary
  /ingestion/:slug*
  /open-source/ingest/:slug*
  /open-source/ingestion/how-to/:slug*
  /platform/:slug*
  /platform/api/:slug*
  /platform-api/api/:slug*
  /platform-api/legacy-api/:slug*
  /platform-api/partition-api/choose-hi-res-model
  /platform-api/partition-api/choose-partitioning-strategy
  /platform-api/partition-api/embedding
  /platform-api/partition-api/filter-files
  /platform-api/partition-api/:slug*
  /self-hosted/:slug*
  /ui/billing
Page revalidation complete
Successfully deleted stale tracked asset(s)
Queued update of llms-full.txt
Skipping Vercel revalidation (subdomain not in revalidation list)