Interface ParquetMetadataProvider
- All Superinterfaces:
TableMetadataProvider
- All Known Subinterfaces:
ParquetTableMetadataProvider
- All Known Implementing Classes:
BaseParquetMetadataProvider
,DeltaParquetTableMetadataProvider
,HiveParquetTableMetadataProvider
,MetastoreParquetTableMetadataProvider
,ParquetTableMetadataProviderImpl
Interface for providing table, partition, file etc. metadata for specific table with parquet files.
-
Method Summary
Modifier and TypeMethodDescriptionReturns list ofReadEntryWithPath
instances which represents paths to files to be scanned.Set<org.apache.hadoop.fs.Path>
Returns list of file locations for table.List<org.apache.hadoop.fs.Path>
Returns list of file paths which belong to current table.Returns list ofRowGroupMetadata
instances which provides metadata for specific row group and its columns.org.apache.drill.shaded.guava.com.google.common.collect.Multimap<org.apache.hadoop.fs.Path,
RowGroupMetadata> Returns multimap ofRowGroupMetadata
instances which provides metadata for specific row group and its columns mapped to their locations.Methods inherited from interface org.apache.drill.metastore.metadata.TableMetadataProvider
checkMetadataVersion, getFileMetadata, getFilesForPartition, getFilesMetadataMap, getNonInterestingColumnsMetadata, getPartitionColumns, getPartitionMetadata, getPartitionsMetadata, getSegmentsMetadataMap, getTableMetadata
-
Method Details
-
getEntries
List<ReadEntryWithPath> getEntries()Returns list ofReadEntryWithPath
instances which represents paths to files to be scanned.- Returns:
- list of
ReadEntryWithPath
instances whith file paths
-
getRowGroupsMeta
List<RowGroupMetadata> getRowGroupsMeta()Returns list ofRowGroupMetadata
instances which provides metadata for specific row group and its columns.- Returns:
- list of
RowGroupMetadata
instances
-
getLocations
List<org.apache.hadoop.fs.Path> getLocations()Returns list of file paths which belong to current table.- Returns:
- list of file paths
-
getRowGroupsMetadataMap
org.apache.drill.shaded.guava.com.google.common.collect.Multimap<org.apache.hadoop.fs.Path,RowGroupMetadata> getRowGroupsMetadataMap()Returns multimap ofRowGroupMetadata
instances which provides metadata for specific row group and its columns mapped to their locations.- Returns:
- multimap of
RowGroupMetadata
instances
-
getFileSet
Set<org.apache.hadoop.fs.Path> getFileSet()Returns list of file locations for table.- Returns:
- list of file locations for table
-