Ingest Pipelines

Open Crawler uses an Elasticsearch ingest pipeline to power several content extraction features. The default pipeline, ent-search-generic-ingestion, is automatically created when Elasticsearch first starts. This pipeline pre-processes the documents that Open Crawler sends before they are indexed into Elasticsearch. See "Ingest pipelines for Search indices" in the Elastic documentation for more details on this pipeline.
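As a rough illustration of how a pipeline is addressed on the Elasticsearch side, the sketch below builds two request URLs: one for the get-pipeline API, which returns a pipeline's definition, and one for indexing a document through a named pipeline using the `pipeline` query parameter. It is not Open Crawler's own code. The host `http://localhost:9200`, the index name `my-crawler-index`, and the helper function names are assumptions made for this example, and the sketch only constructs URLs without sending any request.

```python
from urllib.parse import quote, urlencode

# Pipeline name taken from the text above.
DEFAULT_PIPELINE = "ent-search-generic-ingestion"


def pipeline_definition_url(base_url: str, pipeline: str = DEFAULT_PIPELINE) -> str:
    """Build the URL for the get-pipeline API: GET /_ingest/pipeline/<name>."""
    return f"{base_url.rstrip('/')}/_ingest/pipeline/{quote(pipeline, safe='')}"


def index_document_url(base_url: str, index: str, pipeline: str = DEFAULT_PIPELINE) -> str:
    """Build the URL for indexing a document through a pipeline:
    POST /<index>/_doc?pipeline=<name>."""
    query = urlencode({"pipeline": pipeline})
    return f"{base_url.rstrip('/')}/{quote(index, safe='')}/_doc?{query}"


if __name__ == "__main__":
    base = "http://localhost:9200"  # assumed local Elasticsearch host
    print(pipeline_definition_url(base))
    print(index_document_url(base, "my-crawler-index"))  # hypothetical index name
```

Sending a GET request to the first URL is one way to confirm that the default pipeline exists on a given cluster before running a crawl.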