how should I think about search engine indices?

search indexing information-retrieval elasticsearch

In Elasticsearch, an index consists of one or more primary shards, where a shard is a Lucene instance. Each primary shard can have zero or more replicas, whose existence gives you high availability and increased search performance.

A single shard can hold a lot of data. However, with multiple shards it is easier to distribute the workload across multiple processors and multiple servers.

That said, you need a balance. The right number of shards depends on your data and context. Shards aren't free, so while it is useful to have thousands of shards if you're running a 100 node cluster, you don't want that on a single node.

In Elasticsearch, as well as having indices, you have the concept of types. Think of an index as being like a database, and a type being like a table.

Using different types has no overhead, and fits better with your example than having separate indices.

You can still search across all types (or a selected list of types) and across all indices (or a selected list) or any combination.

Each type can have its own fields (like the columns in a table) .

So in your example, I'd have one index containing 3 types, each with its own fields. Start with default number of primary shards (5) and the default number of replicas (1) and change these only when you understand your data better.

Note: don't confuse an index in Elasticsearch with an index in a database

CodeHunter

how should I think about search engine indices?

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last