About the Easysoft ODBC-Apache Spark Driver
The Easysoft ODBC-Apache Spark Driver provides real-time access to Apache Spark data from any application that supports ODBC.
ODBC API and scalar functions
API functions
Use this table to find out what ODBC API functions the Easysoft ODBC-Apache Spark Driver supports:
Function | Status |
---|---|
(ODBC API function names missing from this copy of the table) |
Scalar functions
The Easysoft ODBC-Apache Spark Driver supports the scalar functions listed in the sections below.
Use the SQL-92 syntax with scalar functions. For example:
SELECT
Invoice_Id,
Customer_Name,
EXTRACT(YEAR FROM Due_Date) as "Year"
FROM
Invoice
String functions
The Easysoft ODBC-Apache Spark Driver supports these string functions:
- ASCII(string_exp)
- BIT_LENGTH(string_exp)
- CHAR(code)
- CHAR_LENGTH(string_exp)
- CHARACTER_LENGTH(string_exp)
- CONCAT(string_exp1, string_exp2)
- LCASE(string_exp)
- LEFT(string_exp, count)
- LENGTH(string_exp)
- LOCATE(string_exp1, string_exp2[, start])
- LTRIM(string_exp)
- OCTET_LENGTH(string_exp)
- POSITION(char_exp IN char_exp)
- REPEAT(string_exp, count)
- REPLACE(string_exp1, string_exp2, string_exp3)
- RIGHT(string_exp, count)
- RTRIM(string_exp)
- SOUNDEX(string_exp)
- SPACE(count)
- SUBSTRING(string_exp, start, length)
- TRIM(string_exp)
- UCASE(string_exp)
Numeric functions
The Easysoft ODBC-Apache Spark Driver supports these numeric functions:
- ABS(numeric_exp)
- ACOS(float_exp)
- ASIN(float_exp)
- ATAN(float_exp)
- CEILING(numeric_exp)
- COS(float_exp)
- COT(float_exp)
- DEGREES(numeric_exp)
- EXP(float_exp)
- FLOOR(numeric_exp)
- LOG(float_exp)
- LOG10(float_exp)
- MOD(integer_exp1, integer_exp2)
- PI()
- POWER(numeric_exp, integer_exp)
- RADIANS(numeric_exp)
- RAND([integer_exp])
- ROUND(numeric_exp, integer_exp)
- SIGN(numeric_exp)
- SIN(float_exp)
- SQRT(float_exp)
- TAN(float_exp)
Time, date, and interval functions
The Easysoft ODBC-Apache Spark Driver supports these time, date, and interval functions:
- CURRENT_DATE()
- CURRENT_TIMESTAMP[(timestamp-precision)]
- DAYNAME(date_exp)
- DAYOFMONTH(date_exp)
- DAYOFWEEK(date_exp)
- DAYOFYEAR(date_exp)
- HOUR(time_exp)
- MINUTE(time_exp)
- MONTH(date_exp)
- QUARTER(date_exp)
- SECOND(time_exp)
- WEEK(date_exp)
- YEAR(date_exp)
System functions
The Easysoft ODBC-Apache Spark Driver supports these system functions:
- IFNULL(exp, value)
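Because SQLite implements an IFNULL function with the same name and semantics, the behavior is easy to try from Python's standard library. This is a stand-in sketch using the sqlite3 module, not the Easysoft driver itself; the table and column names are made up for illustration:

```python
import sqlite3

# Stand-in demo: SQLite's IFNULL(X, Y) returns Y when X is NULL,
# matching the ODBC scalar function IFNULL(exp, value).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE Customers (CompanyName TEXT, Phone TEXT)")
cur.execute("INSERT INTO Customers VALUES ('PowerGroup', NULL)")
cur.execute("SELECT IFNULL(Phone, 'no phone on file') FROM Customers")
print(cur.fetchone()[0])  # -> no phone on file
conn.close()
```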
Conversion functions
The Easysoft ODBC-Apache Spark Driver supports the SQL-92 CAST function for conversion between compatible data types.
Data type mapping
The Easysoft ODBC-Apache Spark Driver maps Apache Spark data types to ODBC data types in this way:
Apache Spark data type | ODBC data type |
---|---|
(Data type mappings missing from this copy of the table) |
Finding out more about data types on Windows
If you need more information about a data type, for example its precision and scale, use Microsoft's ODBC Test to find it.
- Download the version of ODBC Test that matches your application's architecture.
- Copy both files to a folder on the machine where the Easysoft ODBC-Apache Spark Driver is installed.
- Double-click odbcte32.exe.
- Choose Con > Full Connect.
- Choose your Easysoft ODBC-Apache Spark Driver data source from the list.
- Choose Catalog > SQLGetTypeInfo.
- In the DataType list, either choose SQL_ALL_TYPES=0 (1.0) or a specific data type.
- Choose Results > Get Data All.
SQL support
The Easysoft ODBC-Apache Spark Driver supports these SQL statements, clauses, and operators:
- SELECT
- SELECT DISTINCT
- WHERE
- ORDER BY
- AND
- OR
- NOT
- INSERT INTO
- NULL
- UPDATE
- DELETE
- FIRST
- MIN
- MAX
- COUNT
- SUM
- AVG
- LIKE
- WILDCARDS
- IN
- BETWEEN
- ALIASES
- JOINS
- UNION
- GROUP BY
- HAVING
- EXISTS
- CASE
Example queries
- To fetch all records from a table, use the asterisk symbol (*) in your queries. For example:

  SELECT * FROM Customers

- To fetch only records whose values are different, use DISTINCT. For example:

  -- Which different sales regions are there?
  SELECT DISTINCT Region AS Different_Regions FROM SalesOrders

  -- How many different sales regions are there?
  SELECT COUNT(DISTINCT Region) AS Different_Regions FROM SalesOrders

- To filter records, use WHERE. For example:

  SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE Region = 'Eastern'

  SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE Region = 'Eastern' OR Region = 'Western'

  You can also supply a WHERE clause value as a parameter. For example, to do this in Python:

  cursor.execute("SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE Region = ?", ['Eastern'])
- To fetch records that don't match the WHERE clause pattern, use NOT. For example:

  SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE NOT Region = 'Eastern'

- To sort the result set in ascending or descending order, use ORDER BY. For example:

  SELECT * FROM SalesOrders ORDER BY OrderDate ASC

  SELECT * FROM Contacts ORDER BY (CASE WHEN Surname IS NULL THEN Title ELSE Surname END);

- To group a result set into summary rows, use GROUP BY. For example:

  SELECT COUNT(Id) AS "Number", ProductID FROM SalesOrderItems GROUP BY ProductID

  SELECT COUNT(Id) AS "Number", ProductID FROM SalesOrderItems GROUP BY ProductID HAVING COUNT(Id) > 100;

- To do calculations based on result set values, use the SQL aggregate functions MIN(), MAX(), COUNT(), SUM(), and AVG(). For example:

  SELECT MAX(Quantity) FROM SalesOrderItems

  SELECT SUM(Quantity) FROM SalesOrderItems
- To convert between compatible data types, use CAST. For example:

  SELECT CAST(Quantity AS CHAR(100)) FROM SalesOrderItems

- To fetch records whose column values fall within a given range, use BETWEEN. For example:

  SELECT ProductID FROM SalesOrderItems WHERE Quantity BETWEEN 10 AND 20

- To combine the result sets of two or more SELECT statements, use UNION. For example:

  SELECT City FROM Contacts UNION SELECT City FROM Customers

- To combine rows from two or more tables, use JOIN. For example:

  SELECT SalesOrders.ID, Customers.Surname, SalesOrders.OrderDate FROM SalesOrders INNER JOIN Customers ON SalesOrders.CustomerID = Customers.ID;
- To fetch records whose column values match a search pattern, use LIKE. For example:

  SELECT Surname, GivenName FROM Customers WHERE CompanyName LIKE 'R%'

  SELECT Surname, GivenName FROM Customers WHERE CompanyName LIKE '_he'

- To search for columns without a value (NULL) or with a value (non-NULL), use either IS NULL or IS NOT NULL. For example:

  SELECT * FROM Customers WHERE CompanyName IS NULL

- To specify multiple values in a WHERE clause, you can use IN as an alternative to OR. For example:

  SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE Region = 'Eastern' OR Region = 'Western' OR Region = 'Central'

  can be replaced with:

  SELECT OrderDate, SalesRepresentative FROM SalesOrders WHERE Region IN ('Eastern', 'Western', 'Central')

- To set the maximum number of records to return, use LIMIT. For example:

  SELECT * FROM Customers LIMIT 10

- To test for the existence of records in a subquery, use EXISTS. For example:

  SELECT Name FROM Products WHERE EXISTS (SELECT * FROM SalesOrderItems WHERE Products.ID = SalesOrderItems.ProductID AND Quantity < 20)
Example inserts, updates, and deletes
- To insert an Apache Spark record, use INSERT INTO. For example:

  INSERT INTO Customers (Surname, GivenName, City, Phone, CompanyName) VALUES ('Devlin', 'Michaels', 'Kingston', '2015558966', 'PowerGroup')

- Here's a SQL Server linked server example:

  EXEC ('INSERT INTO Customers (Surname, GivenName, City, Phone, CompanyName) VALUES (''Devlin'', ''Michaels'', ''Kingston'', ''2015558966'', ''PowerGroup'')')

- Here's an Oracle linked table example:

  DECLARE
    num_rows integer;
  BEGIN
    num_rows := DBMS_HS_PASSTHROUGH.EXECUTE_IMMEDIATE@SparkLink ('INSERT INTO Customers (Surname, GivenName, City, Phone, CompanyName) VALUES (''Devlin'', ''Michaels'', ''Kingston'', ''2015558966'', ''PowerGroup'')');
  END;
  /

- The Easysoft ODBC-Apache Spark Driver also supports parameterized inserts. Here's an example of doing this in Perl:

  my $sth = $dbh->prepare(q/INSERT INTO Customers (Surname, GivenName, City, Phone, CompanyName) VALUES (?, ?, ?, ?, ?)/)
      or die "Can't prepare statement: $DBI::errstr";
  $sth->execute('Devlin', 'Michaels', 'Kingston', '2015558966', 'PowerGroup');

- To update an Apache Spark record, use UPDATE. For example:

  UPDATE Customers SET Surname = 'Jones' WHERE Account_Id = 'PowerGroup'

  The Easysoft ODBC-Apache Spark Driver also supports parameterized updates. Here's an example of doing this in Perl:

  my $sth = $dbh->prepare('UPDATE Customers SET Surname = \'Jones\' WHERE CompanyName = ?')
      or die "Can't prepare statement: $DBI::errstr";
  $sth->execute('PowerGroup');

- To delete an Apache Spark record, use DELETE. For example:

  -- Delete (mark inactive) a bank account
  DELETE FROM Customers WHERE CompanyName = 'PowerGroup'

  The Easysoft ODBC-Apache Spark Driver also supports parameterized deletes. Here's an example of doing this in Python:

  sql = "DELETE FROM Customers WHERE CompanyName = ?"
  cursor.execute(sql, 'PowerGroup')
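The parameterized insert, update, and delete patterns above can be sketched end to end in Python. This uses the stdlib sqlite3 module as a stand-in database, since no live Spark DSN is assumed; with the Easysoft driver you would get the cursor from a pyodbc connection instead, and the execute calls keep the same shape:

```python
import sqlite3

# Stand-in for an ODBC cursor: the '?' parameter style matches what the
# DBI and pyodbc examples above pass through the driver.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE Customers (Surname TEXT, GivenName TEXT, City TEXT, Phone TEXT, CompanyName TEXT)")

# Parameterized insert
cur.execute(
    "INSERT INTO Customers (Surname, GivenName, City, Phone, CompanyName) VALUES (?, ?, ?, ?, ?)",
    ("Devlin", "Michaels", "Kingston", "2015558966", "PowerGroup"),
)

# Parameterized update
cur.execute("UPDATE Customers SET Surname = ? WHERE CompanyName = ?", ("Jones", "PowerGroup"))
cur.execute("SELECT Surname FROM Customers WHERE CompanyName = ?", ("PowerGroup",))
print(cur.fetchone()[0])  # -> Jones

# Parameterized delete
cur.execute("DELETE FROM Customers WHERE CompanyName = ?", ("PowerGroup",))
cur.execute("SELECT COUNT(*) FROM Customers")
print(cur.fetchone()[0])  # -> 0
conn.close()
```

One difference worth noting: sqlite3 requires the parameters to be a sequence (a tuple or list), while pyodbc also accepts a single bare value as in the last example above.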