ArangoDB Spark Connector - Getting Started
Maven
Spark 3.X + 2.4.1 and above
<dependencies>
<dependency>
<groupId>com.arangodb</groupId>
<artifactId>arangodb-spark-connector_2.12</artifactId>
<version>1.1.0</version>
</dependency>
...
</dependencies>
Spark 2.4.0 and below
<dependencies>
<dependency>
<groupId>com.arangodb</groupId>
<artifactId>arangodb-spark-connector_2.11</artifactId>
<version>1.1.0</version>
</dependency>
...
</dependencies>
SBT
Spark 3.X + 2.4.1 and above
libraryDependencies += "com.arangodb" % "arangodb-spark-connector_2.12" % "1.1.0"
Spark 2.4.0 and below
libraryDependencies += "com.arangodb" % "arangodb-spark-connector_2.11" % "1.1.0"
Configuration
property-key | description | default value |
---|---|---|
arangodb.hosts | comma separated list of ArangoDB hosts | 127.0.0.1:8529 |
arangodb.user | basic authentication user | root |
arangodb.password | basic authentication password | |
arangodb.protocol | network protocol | VST |
arangodb.useSsl | use SSL connection | false |
arangodb.ssl.keyStoreFile | SSL certificate keystore file | |
arangodb.ssl.passPhrase | SSL pass phrase | |
arangodb.ssl.protocol | SSL protocol | TLS |
arangodb.maxConnections | max number of connections per host | 1 |
arangodb.acquireHostList | auto acquire list of available hosts | false |
arangodb.loadBalancingStrategy | load balancing strategy to be used | NONE |
Setup SparkContext
Scala
val conf = new SparkConf()
.set("arangodb.hosts", "127.0.0.1:8529")
.set("arangodb.user", "myUser")
.set("arangodb.password", "myPassword")
...
val sc = new SparkContext(conf)
Java
SparkConf conf = new SparkConf()
.set("arangodb.hosts", "127.0.0.1:8529")
.set("arangodb.user", "myUser")
.set("arangodb.password", "myPassword");
...
JavaSparkContext sc = new JavaSparkContext(conf);