Allow Configuring Max Connections Per Executor

Description

Talking with some of the driver team we discussed how for some use-cases it would be beneficial for the connector to use a larger connection pool.

For ease of use I suggest we automatically set the host pool to
math.max(defaultParallelism/ numExecutors, defaultParallelism) at runtime when creating
and RDD. This will be the default behavior unless the end user sets a specific param

spark.cassandra.connection.connections_per_executor_max

For users not running a Spark Context we'll fall back to just 1.

Pull Requests

https://github.com/datastax/spark-cassandra-connector/pull/1083

Activity

Fixed

Details

Assignee

Reporter

Reviewer

Components

Fix versions

Affects versions

Priority

Created February 13, 2017 at 11:41 PM
Updated February 19, 2017 at 6:51 PM
Resolved February 15, 2017 at 6:46 PM

Flag notifications