Cassandra timeout during read query at consistency ONE (1 responses were required but only 0 replica responded)

hadoop cassandra apache-spark datastax datastax-java-driver

Usually when you get a timeout error it means you are trying to do something that isn't scaling well in Cassandra. The fix is often to modify your schema.

I suggest you monitor the nodes while running your query to see if you can spot the problem area. For example, you can run "watch -n 1 nodetool tpstats" to see if any queues are backing up or dropping items. See other monitoring suggestions here.

One thing that might be off in your configuration is that you say you have five Cassandra nodes, but only 3 spark workers (or are you saying you have three spark workers on each Cassandra node?) You'll want at least one spark worker on each Cassandra node so that loading data into spark is done locally on each node and not over the network.

It's hard to tell much more than that without seeing your schema and the query you are running. Are you reading from a single partition? I started getting timeout errors in the vicinity of 300,000 rows when reading from a single partition. See question here. The only workaround I have found so far is to use a client side hash in my partition key to break the partitions up into smaller chunks of around 100K rows. So far I have not found a way to tell Cassandra to not timeout for a query that I expect to take a long time.

hadoop cassandra apache-spark datastax datastax-java-driver

Don't think configuration is a root cause, but data model issue.

It would be cool to see a structure of section_ks.encounters table.

Suggested to think carefully about what concrete queries expected to run before design table(s) structure.

As far as I see, those two queries expects different structure of section_ks.encounters to run them with good performance.

Let's review each provided query and try to design tables:

First one:

SELECT encounter_uuid, encounter_start_date FROM section_ks.encounters WHERE patient_id = '1234' AND encounter_start_date >= '" + formatted_documentation_date + "' ALLOW FILTERING;

First point, if Cassandra forces you to add ALLOW FILTERING, that is a symhtome of non-optimal query or table structure.
Second point. Primary key. An awesome explanation about what are primary keys in Cassandra Given query would work fast & without mandatory ALLOW FILTERING statement if patient_id column and encounter_start_date column would form a composite primary key. Enumerating of columns inside PRIMARY KEY() statement should correspond to order of filtering in your query.
Why ALLOW FILTERING mandatory in original query? By partition key Cassandra knows on which node data located. In case when patient_id column is not partition key, Cassandra had to scan all 5 nodes for find requested patient. When we have a lot of data across nodes, such full scan usually fails by timeout.

Here is an example of table structure fits effectively with given query:

create table section_ks.encounters(    patient_id bigint,     encounter_start_date timestamp,     encounter_uuid text,    some_other_non_unique_column text,    PRIMARY KEY (patient_id, encounter_start_date));

patient_id column would be a "partition key". Responsible for data distribution across Cassandra nodes. In simple words(omitting replication feature): different ranges of patients would be stored on different nodes.
encounter_start_date column would be a "clustering key" Responsible for data sorting inside partition.

ALLOW FILTERING now can be removed from query:

SELECT encounter_uuid, encounter_start_date FROM section_ks.encounters WHERE patient_id = '1234' AND encounter_start_date >= '2017-08-19';

Second query:

UPDATE section_ks.encounters SET testproblem_uuid_set = testproblem_uuid_set + {'1256'} WHERE encounter_uuid = 'abcd345';

Table structure should look like close to:

create table section_ks.encounters(    encounter_uuid text, -- partition key    patient_id bigint,    testproblem_uuid_set text,     some_other_non_unique_column text,    PRIMARY KEY (encounter_uuid));

If we definitively would like to make a quick filtering only by encounter_uuid, it should be defined as partition key.

Good articles about designing of effective data model:

CodeHunter

Cassandra timeout during read query at consistency ONE (1 responses were required but only 0 replica responded)

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last