Function readParquet

  • Read into a DataFrame from a parquet file.

    Parameters

    • pathOrBody: string | Buffer
    • Optionaloptions: Partial<ReadParquetOptions>
      • columns

        Columns to select. Accepts a list of column indices (starting at zero) or a list of column names.

      • numRows

        Stop reading from parquet file after reading numRows.

      • parallel

        Any of 'auto' | 'columns' | 'row_groups' | 'none' This determines the direction of parallelism. 'auto' will try to determine the optimal direction. Defaults to 'auto'

      • rowCount

        Add row count as column

    Returns pl.DataFrame