Cassandra Table count retrieves different values when running with multiple executors.

Description

When running the statement above on a single machine I get always the same number (2073831).

Running the job with multiple machines in the spark cluster, retrieves a kind of approximate value of the number of entries in the table (2070976) which is always a bit different.

I tried the same program with both of the version 2.0.6 and 2.0.9 of the connector and got similar (inconsistent) results.

Is the number of the entries of a table actually an approximation linked to spark-cassandra-connector?

Environment

None

Pull Requests

None

Status

Assignee

Unassigned

Reporter

Marius Grama

Labels

None

Reviewer

None

Reviewer 2

None

Tester

None

Pull Request

None

Priority

Major
Configure