CONTENT AND DATA INGESTION

Index for success

Elastic provides all the tools you need — out of the box tooling or APIs for building robust, flexible ingest mechanisms for all types of data and content. It’s quick to set up, with plenty of options for enriching, transforming, and manipulating data as you go, so you can focus on building powerful search applications.

Start free trial

Download now

The Open Web Crawler is in beta. Learn how to set up crawl and extraction rules and combine it with semantic text search.

Learn more

Get started indexing data using Elasticsearch APIs.

See guide

See the ways you can connect with all types of tools and any kind of data.

View integrations

DATA INGESTION ENGINE

Variety is the spice of ingest

Get complete control over your ingest pipeline with powerful prebuilt, yet fully configurable, data ingestion tools and exposed APIs that let you index and manage data your way.

Data extraction
Discover, extract, index, and sync of all your website content — including PDFs! Use Elastic Open Web Crawler to transform your web pages into searchable data.
Learn about Open Crawler
Data connectors
Make use of connectors to popular productivity tools, plus handy APIs to build connectors for your data sources, too.
Learn about data connectors
Ingestion APIs
Employ convenient indexing endpoints to build custom ingestion pipelines, with popular language clients like JavaScript, Java, and Python.
Learn about ingestion APIs
Data pipelines
Keep data ingestion pipelines and management in place with existing Elasticsearch indices or the Elasticsearch query syntax.
Learn about ingest pipelines

ADD SEARCH TO YOUR WEBSITE

The fastest way to index web content

Configure crawls with flexible APIs the way you'd like. With Elastic's Open Web Crawler, you are in control of your crawls.

Learn more

Elasticsearch — the most widely deployed vector database

Copy to try locally in two minutes

curl -fsSL https://elastic.co/start-local | sh

Read docs

Deploy for production

Start free cloud trial

Or, download on-prem

Start crawling now!

Set up and deploy a crawler for your web content with a terminal and Elasticsearch.

View GitHub

Run Docker image
Deploy web crawler code on your own infrastructure by running from Source or Docker.
Set up
Set URL for crawl
Set one or more URLs you want to crawl.
Configure and connect
Identify and correct any challenges impacting crawl stability, content discovery, and content extraction and indexing.
Configure

UNIFIED SEARCH APPLICATIONS

Come one content source, come all

Flexibly and efficiently capture, index, and sync the docs, files, fields, metadata, and other key info in your database or content management system. Use API ingestion, prebuilt connectors, or configurable connector packages to ingest this data into Elastic quickly. Choose which objects to synchronize — and when — with an intuitive UI and simple rules during data ingestion.

Azure Blob Storage
Confluence Cloud & Server
Dropbox
GitHub & GitHub Enterprise Server
Google Cloud Storage
Google Drive
Jira Cloud & Server
Microsoft SQL
MongoDB
MySQL
Network drive
OneDrive
Oracle
PostgreSQL
S3
Salesforce
ServiceNow
SharePoint Online
Box
Customized connector
Gmail
Outlook
SharePoint Server
Slack
Teams
Zoom

CONNECT WITH CONFIDENCE

The connective tissue for your search experience

With several secure paths to connecting and syncing content from your critical data sources, you can customize the ingest pipeline for all your tools that require indexing.

Learn about data enrichment

Go out of the box
Take advantage of prebuilt connectors to popular content sources to streamline indexing and syncing.
Learn about connectors
Build your own
Self-managed connectors and APIs facilitate connections to homegrown data platforms, legacy systems, and more.
Learn about self-managed connectors
Control access
Secure proper access with document-level permissions to ensure that the right people see the right content.
Learn about document level security

CONTENT AND DATA INGESTION

Index for success

DATA INGESTION ENGINE

Variety is the spice of ingest

Data extraction

Data connectors

Ingestion APIs

Data pipelines

ADD SEARCH TO YOUR WEBSITE

The fastest way to index web content

Elasticsearch — the most widely deployed vector database

Copy to try locally in two minutes

Deploy for production

Start crawling now!

Run Docker image

Set URL for crawl

Configure and connect

UNIFIED SEARCH APPLICATIONS

Come one content source, come all

Azure Blob Storage

Confluence Cloud & Server

Dropbox

GitHub & GitHub Enterprise Server

Google Cloud Storage

Google Drive

Jira Cloud & Server

Microsoft SQL

MongoDB

MySQL

Network drive

OneDrive

Oracle

PostgreSQL

S3

Salesforce

ServiceNow

SharePoint Online

Box

Customized connector

Gmail

Outlook

SharePoint Server

Slack

Teams

Zoom

CONNECT WITH CONFIDENCE

The connective tissue for your search experience

Go out of the box

Build your own

Control access

Try it now on Elastic Cloud