Interface ParquetMetadataProvider

All Superinterfaces:
TableMetadataProvider
All Known Subinterfaces:
ParquetTableMetadataProvider
All Known Implementing Classes:
BaseParquetMetadataProvider, DeltaParquetTableMetadataProvider, HiveParquetTableMetadataProvider, MetastoreParquetTableMetadataProvider, ParquetTableMetadataProviderImpl

public interface ParquetMetadataProvider extends TableMetadataProvider
Interface for providing table, partition, file, and other metadata for a specific table backed by Parquet files.
  • Method Details

    • getEntries

      List<ReadEntryWithPath> getEntries()
      Returns a list of ReadEntryWithPath instances that represent the paths of the files to be scanned.
      Returns:
      list of ReadEntryWithPath instances with file paths
    • getRowGroupsMeta

      List<RowGroupMetadata> getRowGroupsMeta()
      Returns a list of RowGroupMetadata instances, each of which provides metadata for a specific row group and its columns.
      Returns:
      list of RowGroupMetadata instances
    • getLocations

      List<org.apache.hadoop.fs.Path> getLocations()
      Returns the list of file paths that belong to the current table.
      Returns:
      list of file paths
    • getRowGroupsMetadataMap

      org.apache.drill.shaded.guava.com.google.common.collect.Multimap<org.apache.hadoop.fs.Path,RowGroupMetadata> getRowGroupsMetadataMap()
      Returns a multimap of RowGroupMetadata instances keyed by file location; each instance provides metadata for a specific row group and its columns.
      Returns:
      multimap of RowGroupMetadata instances
    • getFileSet

      Set<org.apache.hadoop.fs.Path> getFileSet()
      Returns the set of file locations for the table.
      Returns:
      set of file locations for the table
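
The return shapes described above (a flat list of row groups, a path-to-row-group multimap, and a set of file locations) can be illustrated with a small standalone sketch. It deliberately avoids the real Drill, Hadoop, and Guava types so that it runs with only the JDK: `RowGroup` is a hypothetical stand-in for RowGroupMetadata, a `String` stands in for org.apache.hadoop.fs.Path, and a `Map<String, List<RowGroup>>` stands in for the Multimap. The sketch shows only how the shapes relate to one another; it makes no claim about how Drill's implementing classes actually populate them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class RowGroupIndexSketch {

    // Hypothetical stand-in for RowGroupMetadata; holds only what this sketch needs.
    static final class RowGroup {
        final String path;    // stand-in for org.apache.hadoop.fs.Path
        final int index;      // position of the row group within its file
        final long rowCount;  // number of rows in the row group

        RowGroup(String path, int index, long rowCount) {
            this.path = path;
            this.index = index;
            this.rowCount = rowCount;
        }
    }

    // Groups row groups by file path, similar in shape to getRowGroupsMetadataMap().
    static Map<String, List<RowGroup>> byFile(List<RowGroup> rowGroups) {
        Map<String, List<RowGroup>> result = new LinkedHashMap<>();
        for (RowGroup rowGroup : rowGroups) {
            result.computeIfAbsent(rowGroup.path, key -> new ArrayList<>()).add(rowGroup);
        }
        return result;
    }

    // Collects the distinct file paths, similar in shape to getFileSet().
    static Set<String> fileSet(List<RowGroup> rowGroups) {
        Set<String> files = new LinkedHashSet<>();
        for (RowGroup rowGroup : rowGroups) {
            files.add(rowGroup.path);
        }
        return files;
    }

    // Sums row counts across all row groups.
    static long totalRows(List<RowGroup> rowGroups) {
        long total = 0;
        for (RowGroup rowGroup : rowGroups) {
            total += rowGroup.rowCount;
        }
        return total;
    }

    public static void main(String[] args) {
        // Made-up sample data: two row groups in one file, one in another.
        List<RowGroup> rowGroups = Arrays.asList(
            new RowGroup("/table/a.parquet", 0, 100L),
            new RowGroup("/table/a.parquet", 1, 150L),
            new RowGroup("/table/b.parquet", 0, 200L));

        for (Map.Entry<String, List<RowGroup>> entry : byFile(rowGroups).entrySet()) {
            System.out.println(entry.getKey() + " has " + entry.getValue().size() + " row group(s)");
        }
        System.out.println("files: " + fileSet(rowGroups));
        System.out.println("total rows: " + totalRows(rowGroups));
    }
}
```

In code written against the real interface, the same traversal would presumably iterate the Multimap returned by getRowGroupsMetadataMap() rather than building a map by hand; whether its keys always coincide with getFileSet() is an assumption that the descriptions above do not confirm.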