The start-up options for Drill reside in a HOCON configuration file format, which is a hybrid between a properties file and a JSON file. Drill start-up options consist of a group of files with a nested relationship. At the bottom of the file hierarchy are the default files that Drill provides, starting with
drill-default.conf file is overridden by one or more
drill-module.conf files that Drill’s internal modules provide. The
drill-module.conf files are overridden by the
drill-override.conf file that you define.
You can provide overrides on each drillbit using system properties of the form
-Dname=value passed on the command line:
./drillbit.sh start -Dname=value
You can see the following group of files throughout the source repository in Drill:
common/src/main/resources/drill-default.conf common/src/main/resources/drill-module.conf contrib/storage-hbase/src/main/resources/drill-module.conf contrib/storage-hive/core/src/main/resources/drill-module.conf contrib/storage-hive/hive-exec-shade/src/main/resources/drill-module.conf exec/java-exec/src/main/resources/drill-module.conf distribution/src/resources/drill-override.conf
These files are listed inside the associated JAR files in the Drill distribution tarball.
Each Drill module has a set of options that Drill incorporates. Drill’s modular design enables you to create new storage plugins, set new operators, or create UDFs. You can also include additional configuration options that you can override as needed.
When you add a JAR file to Drill, you must include a
in the root directory of the JAR file that you add. The
file tells Drill to scan that JAR file or associated object and include it.
Viewing Start-Up Options
Run the following query to see a list of the available start-up options:
SELECT * FROM sys.boot;
Configuring Start-Up Options
You can configure start-up options for each Drillbit in
The summary of start-up options, also known as boot options, lists default values. The following descriptions provide more detail on key options that are frequently reconfigured:
Introduced in Drill 1.15. Sets batch sizing (number of rows per batch) for the Parquet reader. The default value is 4,000 rows. If rows are large and queries return out-of-memory errors, reduce the value. You can set this option at the system level in
<drill_home>/conf/drill-override.confor at the session level using the SET command, as shown:
SET `store.parquet.complex.batch.num_records`=2000; +-------+---------------------------------------------------+ | ok | summary | +-------+---------------------------------------------------+ | true | store.parquet.complex.batch.num_records updated. | +-------+---------------------------------------------------+
Introduced in Drill 1.2. Enables or disables HTTPS support. Settings are TRUE and FALSE, respectively. The default is FALSE.
Defines the persistent storage (PStore) provider. The PStore holds configuration and profile data.
Defines the amount of memory available, in terms of record batches, to hold data on the downstream side of an operation. Drill pushes data downstream as quickly as possible to make data immediately available. This requires Drill to use memory to hold the data pending operations. When data on a downstream operation is required, that data is immediately available so Drill does not have to go over the network to process it. Providing more memory to this option increases the speed at which Drill completes a query.
Introduced in Drill 1.11. The default file system on the local machine into which the Sort, Hash Aggregate, and Hash Join operators spill data. This is the recommended option to use for spilling. You can configure this option so that data spills into a distributed file system, such as hdfs. For example, "hdfs:///". The default setting is "file:///". See Sort-Based and Hash-Based Memory Constrained Operators for more information.
Introduced in Drill 1.11. The list of directories into which the Sort, Hash Aggregate, and Hash Join operators spill data. The list must be an array with directories separated by a comma, for example ["/fs1/drill/spill" , "/fs2/drill/spill" , "/fs3/drill/spill"]. This is the recommended option for spilling to multiple directories. The default setting is ["/tmp/drill/spill"]. See Sort-Based and Hash-Based Memory Constrained Operators for more information.
Provides Drill with the ZooKeeper quorum to use to connect to data sources. Change this setting to point to the ZooKeeper quorum that you want Drill to use. You must configure this option on each Drillbit node.
Introduced in Drill 1.11. When set to TRUE, enables Drill to store query profiles in memory instead of writing the query profiles to disk. When set to FALSE, Drill writes the profile for each query to disk, which is either the local file system or a distributed file system, such as HDFS. For sub-second queries, writing the query profile to disk is expensive due to the interactions with the file system. Enable this option if you want Drill to store the profiles of sub-second queries in memory instead of writing them to disk. When you enable this option, Drill stores the profiles in memory for as long as the drillbit runs. When the drillbit restarts, the profiles no longer exist. You can set the maximum number of most recent profiles to retain in memory through the
drill.exec.profiles.store.capacityoption. Settings are TRUE and FALSE. Default is FALSE. See Persistent Configuration Storage for more information.
Introduced in Drill 1.11. Sets the maximum number of most recent profiles to retain in memory when the
drill.exec.profiles.store.inmemoryoption is enabled. Default is 1000.
Introduced in Drill 1.14. Determines what happens to the
storage-plugins-override.conffile after Drill successfully updates storage plugin configurations. If you do not want Drill to apply the configurations after restarting, set the option to
"remove". This option accepts the following values:
"none"- (Default) The file remains in the directory after Drill uses the file.
storage-plugins-override.conffile name is changed to
storage-plugins-override-[current_timestamp].confafter Drill uses the file.
"remove"- The file is removed after Drill uses the file for the first time.