Drill Introduction

May 9, 2017

Drill is an Apache open-source SQL query engine for Big Data exploration. Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data coming from modern Big Data applications, while still providing the familiarity and ecosystem of ANSI SQL, the industry-standard query language. Drill provides plug-and-play integration with existing Apache Hive and Apache HBase deployments.

What's New in Apache Drill 1.10

Drill 1.10 provides the following new features and improvements:

  • Support for the CREATE TEMPORARY TABLE AS (CTTAS) command.
  • A JDBC connection option that improves fault tolerance when connecting directly to a Drill node from a client.
  • The Web Console displays the Drill version and additional query profile statistics.
  • Drill implicitly interprets the INT96 timestamp data type in Parquet files.
  • Support for Kerberos authentication between the client and drillbit.
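
As a rough illustration of the CTTAS command listed above, the following Python sketch only composes the statement text; it does not connect to Drill. The helper name build_cttas, the table name recent_orders, and the file path are hypothetical, and the statement shape (CREATE TEMPORARY TABLE name [PARTITION BY (...)] AS query) should be checked against the Drill SQL reference for your version.

```python
def build_cttas(table_name, select_query, partition_by=None):
    """Compose a CREATE TEMPORARY TABLE AS (CTTAS) statement as a string.

    Hypothetical helper for illustration only; it performs no validation
    or quoting of the table name or the query.
    """
    parts = [f"CREATE TEMPORARY TABLE {table_name}"]
    if partition_by:
        # Optional PARTITION BY clause placed before the AS keyword.
        parts.append("PARTITION BY (" + ", ".join(partition_by) + ")")
    parts.append("AS " + select_query)
    return " ".join(parts)


# Assumed example data: a JSON file of orders reachable through the dfs plugin.
stmt = build_cttas(
    "recent_orders",
    "SELECT o.id, o.total FROM dfs.`/data/orders.json` o WHERE o.total > 100",
)
print(stmt)
```

A temporary table created this way is intended to last only for the session that created it, so a sketch like this is mostly useful for staging intermediate results.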

What's New in Apache Drill 1.9

Drill 1.9 provides the following new features:

  • Asynchronous Parquet reader
  • Parquet filter pushdown
  • Dynamic UDF support
  • HTTPD format plugin

What's New in Apache Drill 1.8

Drill 1.8 provides the following new features and changes:

  • Metadata cache pruning
  • IF EXISTS parameter with the DROP TABLE and DROP VIEW commands
  • DESCRIBE SCHEMA command
  • Multi-byte delimiter support
  • New parameters for filter selectivity estimates
  • Changes to the configuration and launch scripts (see Configuration and Launch Script Changes)
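
The IF EXISTS parameter and the DESCRIBE SCHEMA command from the list above can be sketched as plain statement strings. This Python snippet is a minimal sketch that only builds the text; the helper names and the dfs.tmp schema and object names are assumptions chosen for illustration.

```python
def drop_if_exists(kind, name):
    """Build a DROP TABLE or DROP VIEW statement that uses IF EXISTS.

    Hypothetical helper: with IF EXISTS, the drop is meant to succeed
    quietly when the object is absent instead of raising an error.
    """
    kind = kind.upper()
    if kind not in ("TABLE", "VIEW"):
        raise ValueError("kind must be 'TABLE' or 'VIEW'")
    return f"DROP {kind} IF EXISTS {name}"


def describe_schema(schema):
    """Build a DESCRIBE SCHEMA statement for a schema such as dfs.tmp."""
    return f"DESCRIBE SCHEMA {schema}"


# Assumed object names, for illustration only.
statements = [
    drop_if_exists("table", "dfs.tmp.staging_orders"),
    drop_if_exists("view", "dfs.tmp.orders_view"),
    describe_schema("dfs.tmp"),
]
for s in statements:
    print(s)
```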

What's New in Apache Drill 1.7

Drill 1.7 provides the following new features:

  • Monitoring via JMX
  • Hive CHAR data type support
  • HBase 1.x support

What's New in Apache Drill 1.6

Drill 1.6 provides the following new features:

  • Inbound impersonation
  • Additional custom window frames

What's New in Apache Drill 1.5

Drill 1.5 provides the following new features:

  • Authentication and security for the Web interface and REST API
  • Experimental query support for Apache Kudu (incubating)
  • An improved memory allocator
  • Configurable caching for Hive metadata

What's New in Apache Drill 1.4

Drill 1.4 introduces the following improvements:

  • SELECT with options, which you use in queries to change storage plugin settings
  • Improved behavior when parsing CSV file header names
  • A variable that enables non-pretty (for example, compact) printing of JSON
  • Better drillbit.log files that include query text

Drill 1.4 fixes an error that occurred when you query a Hive table using the HBaseStorageHandler (DRILL-3739). To successfully query a Hive table using the HBaseStorageHandler, you need to configure the Hive storage plugin as described in the Hive storage plugin documentation.

What's New in Apache Drill 1.3

This release fixes issues and adds a number of enhancements, including the following:

What's New in Apache Drill 1.2

This release of Drill fixes many issues and introduces a number of enhancements, including the following ones:

What's New in Apache Drill 1.1

Apache Drill 1.1 includes many enhancements, among them the following key features:

What's New in Apache Drill 1.0

Apache Drill 1.0 offers the following new features:

  • Many performance improvements in query planning and execution.
  • Updated Drill shell now formats query results.
  • Query audit logging for getting the query history on a Drillbit.
  • Improved connection handling.
  • New Errors tab in the Query Profiles UI that facilitates troubleshooting and distributed storing of profiles.
  • Support for a new storage plugin input format: Avro.

In this release, Drill disables the DECIMAL data type, including casting to DECIMAL and reading DECIMAL types from Parquet and Hive. You can enable the DECIMAL type, but this is not recommended.

Apache Drill Key Features

Key features of Apache Drill are:

  • Low-latency SQL queries
  • Dynamic queries on self-describing data in files (such as JSON, Parquet, text) and HBase tables, without requiring metadata definitions in the Hive metastore.
  • ANSI SQL
  • Nested data support
  • Integration with Apache Hive (queries on Hive tables and views, support for all Hive file formats and Hive UDFs)
  • BI/SQL tool integration using standard JDBC/ODBC drivers
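
To make the "dynamic queries on self-describing data" and "nested data support" items more concrete, here is a minimal Python sketch that prepares a query against a JSON file through Drill's REST API. Several things here are assumptions: a Drillbit listening on localhost:8047, a dfs storage plugin that can read the hypothetical file /data/customers.json, records that contain a nested address.city field, and no Web authentication enabled. The block only builds the HTTP request; run_query is defined but not called.

```python
import json
import urllib.request

# Assumed endpoint: the Web/REST port of a local Drillbit.
DRILL_QUERY_URL = "http://localhost:8047/query.json"

# The JSON file is queried directly, with no schema defined up front.
# Nested fields are reached with dot notation through a table alias.
SQL = (
    "SELECT t.name, t.address.city AS city "
    "FROM dfs.`/data/customers.json` t "
    "LIMIT 10"
)


def build_request(sql, url=DRILL_QUERY_URL):
    """Build (but do not send) a POST request carrying one SQL query."""
    payload = {"queryType": "SQL", "query": sql}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql):
    """Send the query and return the decoded JSON response.

    Requires a reachable Drillbit, so it is not invoked in this sketch.
    """
    with urllib.request.urlopen(build_request(sql)) as response:
        return json.load(response)


request = build_request(SQL)
print(request.get_method(), request.full_url)
```

The same SQL text could equally be submitted from the Drill shell or through the JDBC/ODBC drivers; the REST call is used here only because it needs nothing beyond the Python standard library.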

If you've never used Drill, visit these links to get a jump start: