Apache Spark
Connecting to Spark SQL
Specify the following to establish a connection with Spark SQL:
Server: Set this to the host name or IP address of the server hosting Spark SQL.
Port: Set this to the port for the connection to the Spark SQL instance.
TransportMode: The transport mode used to communicate with the Spark SQL server. Accepted values are BINARY and HTTP. BINARY is the default.
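As a minimal sketch, the three properties above could be collected into a semicolon-delimited connection string. The `Key=Value;` format, the helper name `build_connection_string`, the host `spark.example.com`, and the port 10000 are assumptions made for illustration. Check your provider's documentation for the exact syntax it accepts.

```python
def build_connection_string(props):
    """Join connection properties into a 'Key=Value;' string (assumed format)."""
    # Skip properties left unset (None) so the provider's defaults apply.
    return "".join(
        f"{key}={value};" for key, value in props.items() if value is not None
    )


# Hypothetical values: replace with your own host and port.
basic_props = {
    "Server": "spark.example.com",
    "Port": 10000,
    "TransportMode": "BINARY",  # BINARY is the default; HTTP is the other accepted value
}

connection_string = build_connection_string(basic_props)
print(connection_string)
```

Keeping the properties in a dictionary until the last moment makes it easy to add the security and authentication settings described in the following sections before the string is built.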
Securing Spark SQL Connections
To enable TLS/SSL in the provider, set UseSSL to True.
Authenticating to Spark SQL
The provider can authenticate to the service using the PLAIN, LDAP, NOSASL, or KERBEROS auth scheme.
PLAIN
To authenticate with PLAIN, set the following connection properties:
AuthScheme: Set this to PLAIN.
User: Set this to the user to log in as.
Password: Set this to the password of the user.
To authenticate, set User and Password.
LDAP
To authenticate with LDAP, set the following connection properties:
AuthScheme: Set this to LDAP.
User: Set this to the user to log in as.
Password: Set this to the password of the user.
To authenticate, set User, Password, and AuthScheme.
NOSASL
When using NOSASL, no authentication is performed. Set the following connection properties:
AuthScheme: Set this to NOSASL.
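The requirements of the PLAIN, LDAP, and NOSASL schemes described above can be checked before a connection is attempted. The following is a sketch only: the names `REQUIRED_BY_SCHEME` and `missing_auth_properties` are hypothetical and not part of the provider, and KERBEROS is deliberately left out because it is documented elsewhere.

```python
# Required properties per scheme, taken from the descriptions above.
REQUIRED_BY_SCHEME = {
    "PLAIN": ("User", "Password"),
    "LDAP": ("User", "Password"),
    "NOSASL": (),  # no authentication is performed
}


def missing_auth_properties(props):
    """Return the names of required auth properties that are absent or empty."""
    scheme = str(props.get("AuthScheme", "")).upper()
    if scheme not in REQUIRED_BY_SCHEME:
        # This sketch covers only the three schemes listed in the table above.
        raise ValueError(f"AuthScheme not handled by this sketch: {scheme!r}")
    return [name for name in REQUIRED_BY_SCHEME[scheme] if not props.get(name)]


# Hypothetical example: an LDAP configuration with the password left out.
print(missing_auth_properties({"AuthScheme": "LDAP", "User": "alice"}))
```

Running a check like this first turns a missing property into a clear local message instead of a failed login against the server.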
Kerberos
See the Apache HDFS connector documentation for details on how to authenticate with Kerberos.
Connecting to Databricks
To connect to a Databricks cluster, set the properties as described below. Note: To find the required values in your Databricks instance, navigate to 'Clusters', select the desired cluster, and open the JDBC/ODBC tab under 'Advanced Options'.
Server: Set to the Server Hostname of your Databricks cluster.
Port: 443
TransportMode: HTTP
HTTPPath: Set to the HTTP Path of your Databricks cluster.
UseSSL: True
AuthScheme: PLAIN
User: Set this to the user to log in as.
Password: Set this to your personal access token. To obtain the token, navigate to the User Settings page of your Databricks instance and select the Access Tokens tab.
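The Databricks settings listed above can be gathered in one place, as in the sketch below. The function name `databricks_properties`, the host name, the HTTP path, the user name, and the `DATABRICKS_TOKEN` environment variable are all hypothetical placeholders, and the `Key=Value;` connection-string format is an assumption rather than something this page specifies.

```python
import os


def databricks_properties(server_hostname, http_path, user, access_token):
    """Collect the Databricks settings listed above into one dictionary."""
    return {
        "Server": server_hostname,
        "Port": 443,
        "TransportMode": "HTTP",
        "HTTPPath": http_path,
        "UseSSL": True,
        "AuthScheme": "PLAIN",
        "User": user,
        "Password": access_token,  # the personal access token
    }


# Hypothetical placeholders: copy the real values from the JDBC/ODBC tab.
props = databricks_properties(
    server_hostname="dbc-example.cloud.databricks.com",
    http_path="/example/http/path",
    user="example_user",
    # Read the token from the environment so it is not hard-coded in source.
    access_token=os.environ.get("DATABRICKS_TOKEN", "<personal-access-token>"),
)

# Assumed 'Key=Value;' connection-string format.
connection_string = "".join(f"{key}={value};" for key, value in props.items())
```

The fixed values (port 443, HTTP transport, SSL enabled, PLAIN scheme) are set inside the function so that only the cluster-specific values need to be supplied by the caller.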