Skip to content

[Python][Doc] Improve docs to specify that source argument on parquet.read_table can also be a list of strings #46374

Open
@raulcd

Description

@raulcd

Describe the enhancement requested

The docstring for parquet.read_table doesn't specify that the source can be a list of strings:

source : str, pyarrow.NativeFile, or file-like object
If a string passed, can be a single file name or directory name. For
file-like objects, only read a single file. Use pyarrow.BufferReader to
read a file contained in a bytes or buffer-like object.

There is also a #TODO for a check on the source argument to validate is of the valid type in case dataset is not enabled.

if filesystem is not None:
source = filesystem.open_input_file(path)
# TODO test that source is not a directory or a list
dataset = ParquetFile(

This should only tackle those. There is another issue to test the fallback case successfully:

Component(s)

Documentation, Python

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions