Class ParquetGroupScanStatistics<T extends BaseMetadata & LocationProvider>

java.lang.Object
org.apache.drill.exec.store.parquet.ParquetGroupScanStatistics<T>

public class ParquetGroupScanStatistics<T extends BaseMetadata & LocationProvider> extends Object
Holds common statistics about the data in a Parquet group scan, including the total row count, per-column value counts, and partition columns.
  • Constructor Details

    • ParquetGroupScanStatistics

      public ParquetGroupScanStatistics(Collection<T> rowGroupInfos)
    • ParquetGroupScanStatistics

      public ParquetGroupScanStatistics(ParquetGroupScanStatistics<T> that)
  • Method Details

    • getColumnValueCount

      public long getColumnValueCount(SchemaPath column)
    • getPartitionColumns

      public List<SchemaPath> getPartitionColumns()
    • getTypeForColumn

      public TypeProtos.MajorType getTypeForColumn(SchemaPath schemaPath)
    • getRowCount

      public long getRowCount()
    • getPartitionValue

      public Object getPartitionValue(org.apache.hadoop.fs.Path path, SchemaPath column)
    • getPartitionPaths

      public Map<org.apache.hadoop.fs.Path,Object> getPartitionPaths(SchemaPath column)
    • collect

      public void collect(Collection<T> metadataList)
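
The listing above gives signatures only. As a rough illustration of the kind of aggregation that a row count and per-column value counts imply, here is a minimal, self-contained sketch. It does not use Drill's classes and is not Drill's implementation: `GroupScanStatsSketch`, `RowGroup`, and the `with` helper are hypothetical names, columns are plain strings rather than `SchemaPath`, and two behaviours are assumptions of the sketch: `collect` rebuilds the totals from scratch, and an unknown column reports a count of 0.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;

// Simplified stand-in for illustration only; these are NOT Drill classes.
class GroupScanStatsSketch {

    /** Hypothetical per-row-group metadata: a row count plus value counts per column. */
    static final class RowGroup {
        final long rowCount;
        final Map<String, Long> valueCounts = new HashMap<>();

        RowGroup(long rowCount) {
            this.rowCount = rowCount;
        }

        /** Records the value count for one column and returns this row group for chaining. */
        RowGroup with(String column, long count) {
            valueCounts.put(column, count);
            return this;
        }
    }

    private long rowCount;
    private final Map<String, Long> columnValueCounts = new HashMap<>();

    GroupScanStatsSketch(Collection<RowGroup> rowGroups) {
        collect(rowGroups);
    }

    /** Assumption of this sketch: totals are rebuilt from scratch on every call. */
    void collect(Collection<RowGroup> rowGroups) {
        rowCount = 0;
        columnValueCounts.clear();
        for (RowGroup rowGroup : rowGroups) {
            rowCount += rowGroup.rowCount;
            for (Map.Entry<String, Long> e : rowGroup.valueCounts.entrySet()) {
                // Sum the per-column counts across all row groups.
                columnValueCounts.merge(e.getKey(), e.getValue(), Long::sum);
            }
        }
    }

    long getRowCount() {
        return rowCount;
    }

    /** Assumption of this sketch: a column that was never seen reports 0. */
    long getColumnValueCount(String column) {
        return columnValueCounts.getOrDefault(column, 0L);
    }

    public static void main(String[] args) {
        GroupScanStatsSketch stats = new GroupScanStatsSketch(Arrays.asList(
                new RowGroup(100).with("id", 100).with("name", 90),
                new RowGroup(50).with("id", 50).with("name", 50)));

        System.out.println(stats.getRowCount());                // 150
        System.out.println(stats.getColumnValueCount("id"));    // 150
        System.out.println(stats.getColumnValueCount("name"));  // 140
    }
}
```

In this sketch a column's value count can be lower than the row count (`name` above), which is how missing values in some row groups would show up in the totals.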