Cassandra table read puts stress on 1 node while reads on other nodes are balanced

Description

Hi,

The attached file shows the range reads when reading a table from Cassandra.

It shows that one Cassandra node receives many more read requests than the other nodes.

It is always the same Cassandra node (VPC-CASSANDRA-005).

Once its reads finish, that node receives 0 reads until the task ends.

Read requests to all other nodes are balanced.

Is this a connector bug, or something external, such as configuration?
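To put a number on the imbalance described above, per-node read counts could be compared against the cluster median. The following is only a minimal sketch: the function name `find_hot_nodes`, the `factor` threshold, the `other-node-*` names, and all counts are hypothetical placeholders, not measurements from this cluster or anything provided by the connector.

```python
from statistics import median


def find_hot_nodes(reads_per_node, factor=3.0):
    """Return the nodes whose read count exceeds `factor` times the median.

    `reads_per_node` maps a node name to the number of range reads it served.
    """
    if not reads_per_node:
        return {}
    baseline = median(reads_per_node.values())
    if baseline <= 0:
        return {}
    return {
        node: count
        for node, count in reads_per_node.items()
        if count > factor * baseline
    }


# Placeholder counts for illustration only (hypothetical, not real data).
sample = {
    "other-node-1": 1000,
    "other-node-2": 1100,
    "other-node-3": 950,
    "VPC-CASSANDRA-005": 9000,
}

print(find_hot_nodes(sample))
```

With counts taken from the monitoring graph, a result that always contains the same single node would confirm the pattern reported here, independent of how the graph is scaled.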

Thanks,

Shai

Environment

  • Amazon Linux AMI 2018.03

  • java version "1.8.0_131"

  • spark-core_2.12-3.1.1.jar

  • spark-cassandra-connector_2.12-3.1.0.jar

  • [cqlsh 5.0.1 | DSE 5.1.22 | CQL spec 3.4.4 | DSE protocol v1]

  • ~40 Cassandra servers

  • Spark runs on a remote server - 12 instances - 5 cores each

Pull Requests

None

Attachments

1

Activity

Shai Wolf January 17, 2023 at 9:26 AM

Hi Guys,

Is there any news about this issue?

I can still see it using connector 3.2.0.

Thanks,

Shai

Details

Created July 17, 2022 at 12:06 PM
Updated January 17, 2023 at 9:26 AM