VectorizedColumnReader

VectorizedColumnReader is a vectorized column reader that VectorizedParquetRecordReader uses for Vectorized Parquet Decoding.

VectorizedColumnReader is created exclusively when VectorizedParquetRecordReader is requested to checkEndOfRowGroup (when requested to read next rows into a columnar batch).

Once created, VectorizedColumnReader is requested to read rows as a batch (when VectorizedParquetRecordReader is requested to read next rows into a columnar batch).

VectorizedColumnReader is given a WritableColumnVector to store rows read as a batch.

VectorizedColumnReader takes the following to be created:

  • Parquet ColumnDescriptor

  • Parquet OriginalType

  • Parquet PageReader

  • TimeZone (for timezone conversion to apply to int96 timestamps. null for no conversion)

Reading Rows As Batch — readBatch Method

void readBatch(
  int total,
  WritableColumnVector column) throws IOException

readBatch…​FIXME

Note
readBatch is used exclusively when VectorizedParquetRecordReader is requested to read next rows into a columnar batch.

results matching ""

    No results matching ""