Interface PartitionLocation

All Known Implementing Classes:
DFSDirPartitionLocation, DFSFilePartitionLocation, HivePartitionLocation, ParquetPartitionLocation, SimplePartitionLocation

public interface PartitionLocation
Interface that defines a partition. A partition can be simple, in which case it represents the basic unit of partitioning as determined by the underlying storage plugin; on a file system, a simple partition represents a file. A partition can also be composite, consisting of other partitions; for a file system storage plugin, a composite partition corresponds to a directory. A simple partition location keeps track of the string representation of the partition and also stores the values of the individual partition keys for that partition. A composite partition location keeps track of the common partition keys, but it does not keep track of the string representation of the partition, leaving that to each of the simple partitions it consists of.
  • Method Details

    • getPartitionValue

      String getPartitionValue(int index)
      Returns the value of the partition column at the given index.
    • getEntirePartitionLocation

      org.apache.hadoop.fs.Path getEntirePartitionLocation()
      Returns the path of this partition. Only a non-composite partition supports this.
    • getPartitionLocationRecursive

      List<SimplePartitionLocation> getPartitionLocationRecursive()
      Returns the list of the non-composite partitions that this partition consists of.
    • isCompositePartition

      boolean isCompositePartition()
      Returns true if this is a composite partition, false if it is a simple one.
    • getCompositePartitionPath

      org.apache.hadoop.fs.Path getCompositePartitionPath()
      Returns the path consisting of directory names only, for a composite partition.
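
The simple/composite distinction described above can be illustrated with a minimal, self-contained sketch. It is not the actual implementation: `PartitionSketch`, `FilePartitionSketch`, `DirPartitionSketch`, and `PartitionSketchDemo` are hypothetical stand-ins, paths are plain `String` values rather than `org.apache.hadoop.fs.Path` so the code runs without Hadoop, and throwing `UnsupportedOperationException` for an unsupported call is an assumption about how implementations might behave.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for PartitionLocation (String paths instead of Hadoop Path).
interface PartitionSketch {
    String getPartitionValue(int index);
    String getEntirePartitionLocation();
    List<FilePartitionSketch> getPartitionLocationRecursive();
    boolean isCompositePartition();
    String getCompositePartitionPath();
}

// Simple partition: a single file plus the values of its partition keys.
class FilePartitionSketch implements PartitionSketch {
    private final String file;
    private final String[] keys;

    FilePartitionSketch(String file, String... keys) {
        this.file = file;
        this.keys = keys;
    }

    public String getPartitionValue(int index) {
        // Assumption: an index outside the known keys yields null.
        return index >= 0 && index < keys.length ? keys[index] : null;
    }
    public String getEntirePartitionLocation() { return file; }
    public List<FilePartitionSketch> getPartitionLocationRecursive() {
        return Collections.singletonList(this);
    }
    public boolean isCompositePartition() { return false; }
    public String getCompositePartitionPath() {
        // Assumption: unsupported calls throw instead of returning null.
        throw new UnsupportedOperationException("simple partition has no composite path");
    }
}

// Composite partition: a directory holding other partitions; it only
// tracks the partition keys shared by everything underneath it.
class DirPartitionSketch implements PartitionSketch {
    private final String dirPath;
    private final List<PartitionSketch> children;
    private final String[] commonKeys;

    DirPartitionSketch(String dirPath, List<PartitionSketch> children, String... commonKeys) {
        this.dirPath = dirPath;
        this.children = children;
        this.commonKeys = commonKeys;
    }

    public String getPartitionValue(int index) {
        return index >= 0 && index < commonKeys.length ? commonKeys[index] : null;
    }
    public String getEntirePartitionLocation() {
        throw new UnsupportedOperationException("composite partition has no single location");
    }
    public List<FilePartitionSketch> getPartitionLocationRecursive() {
        // Flatten the tree into the simple partitions it consists of.
        List<FilePartitionSketch> leaves = new ArrayList<>();
        for (PartitionSketch child : children) {
            leaves.addAll(child.getPartitionLocationRecursive());
        }
        return leaves;
    }
    public boolean isCompositePartition() { return true; }
    public String getCompositePartitionPath() { return dirPath; }
}

class PartitionSketchDemo {
    // Collects the file paths of all simple partitions under any partition.
    static List<String> leafPaths(PartitionSketch p) {
        List<String> paths = new ArrayList<>();
        for (FilePartitionSketch leaf : p.getPartitionLocationRecursive()) {
            paths.add(leaf.getEntirePartitionLocation());
        }
        return paths;
    }

    public static void main(String[] args) {
        PartitionSketch jan = new DirPartitionSketch("/t/2024/01", Arrays.asList(
                new FilePartitionSketch("/t/2024/01/a.parquet", "2024", "01"),
                new FilePartitionSketch("/t/2024/01/b.parquet", "2024", "01")), "2024", "01");
        PartitionSketch feb = new DirPartitionSketch("/t/2024/02", Arrays.asList(
                new FilePartitionSketch("/t/2024/02/c.parquet", "2024", "02")), "2024", "02");
        PartitionSketch year = new DirPartitionSketch("/t/2024", Arrays.asList(jan, feb), "2024");
        System.out.println(leafPaths(year));
    }
}
```

In this sketch, a caller that needs file paths goes through `getPartitionLocationRecursive()` rather than calling `getEntirePartitionLocation()` directly, since the latter is documented as supported only by non-composite partitions; checking `isCompositePartition()` first is the other way to stay on the supported side.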