Dataset#
Dataset represents a collection of files.
To create a dataset:
dataset = ParquetDataSet("path/to/dataset/*.parquet")
DataSets#
|
The base class for all datasets. |
|
A set of files. |
|
A set of parquet files. |
|
A set of csv files. |
|
A set of json files. |
|
An arrow table. |
|
A pandas dataframe. |
|
A dataset that is partitioned into multiple datasets. |
|
The result of a sql query. |