DOC: Provide examples of using read_parquet

This issue has been created since 2022-11-17.

Pandas version checks

  • I have checked that the issue still exists on the latest versions of the docs on main here

Location of the documentation

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_parquet.html

Documentation problem

For the pyarrow engine, there are some important features behind the kwargs that aren't aren't described here, and it might not be obvious to users where to look in PyArrow. For example:

  • Using filters, users can prune which files and/or row groups are read.
  • Using filesystem, users can configure a filesystem such as S3

Suggested fix for documentation

At the very least, we should document for each engine where those kwargs are passed. But it might even be worthwhile to provide examples of filters, reading partitioned datasets, and configuring remote filesystems. Does that seem reasonable?

rhshadrach wrote this answer on 2022-11-17

Thanks for the report! +1 on saying the function that's called from pandas and linking to it's documentation. However, if we were to document which kwargs there's a bit more maintenance burden keeping it in sync (e.g. what can be passed for filters just changed in 10.0.0) and I don't think it provides significant benefit to the user.

phofl wrote this answer on 2022-11-17

Yep agreed, would rather link to the functions itself too

More Details About Repo
Owner Name pandas-dev
Repo Name pandas
Full Name pandas-dev/pandas
Language Python
Created Date 2010-08-24
Updated Date 2022-12-07
Star Count 36164
Watcher Count 1118
Fork Count 15472
Issue Count 3683

YOU MAY BE INTERESTED

Issue Title Created Date Comment Count Updated Date
forge script --verify error 6 2022-06-12 2022-10-21
make watch-storefront doesn't work 3 2021-05-22 2022-04-04
Allow installation with different default language than english 3 2021-05-11 2022-09-15
A route with a path parameter is not matched if the path parameter value contains a dot 2 2022-01-23 2022-10-24
Server Start Warns that "no such table: Services" for NW sample database 5 2021-03-25 2022-11-06
Add support for RFC 6532: "Internationalized Email Headers" 0 2019-04-02 2022-11-26
Failing on simple example. 0 2022-03-14 2022-10-12
Page loading performance problem 2 2021-01-18 2022-08-23
Enable toggling of direct VNC server access. 2 2019-05-25 2022-12-06
Tabs do not store when Better-onetab tab is pinned 0 2021-07-25 2022-10-04
Mongoose timastamps not supported 1 2021-06-22 2022-12-03
Some French translations 2 2022-02-26 2022-10-20
SalesForce Chrome Plugin Error 4 2022-08-04 2022-08-10
Can't close port after switch to bootloade mode 10 2022-02-08 2022-12-06
This port appears to have been shutdown or disconnected. 2 2022-01-25 2022-12-05
test issue 0 2021-04-29 2021-12-18
Refresh documentation 0 2021-09-01 2022-11-11
microwatt BIOS hangs in ISR 6 2022-04-01 2022-11-14
how to set analyzer to haystack with elasticsearch 5 2012-09-05 2022-11-19
Using solr as backend fails on first search 41 2015-05-01 2022-11-23
Color skin glitch with iOS devices 0 2022-09-08 2022-11-26
Text adjustment in lateral panel 0 2022-09-17 2022-10-07
How to use with ssr? 17 2018-07-05 2022-11-20
can't see any text.. [dietpi bullseye] [pinebook pro] 2 2022-06-30 2022-12-06
Monitorian crashes when started on Windows 10 20H2 6 2021-02-21 2022-11-19
Config randomly changes specific, numbered cpu bars to cpu average bars 0 2022-09-17 2022-12-01
Image Updates 1 2021-11-10 2022-01-14
DB/Creature - Stormwind City - Craggle Wobbletop 0 2022-10-14 2022-11-16
Add a helper method for create a new bundle entry 0 2022-04-04 2022-12-02
Docs: Fix Alert Example Inconsistensies 0 2022-06-17 2022-08-13
runing avalanchego can't finish 2 2021-12-12 2022-09-28
Transfer data from grid to server. 4 2021-08-31 2022-11-30
Support an ignore / reject option for the `pnpm update` and `pnpm outdated` commands 5 2022-09-14 2022-11-21
Accessibility improvement 0 2022-03-02 2022-10-19
I am missing the webhook processing 1 2022-01-11 2022-10-07
Selected version in the version selector is too subtle 0 2022-09-12 2022-12-04
[Bug] ie_api.pyx:357: RuntimeError: dimension (4) in node dim must be a non-negative integer: at offset 6 2022-03-30 2022-11-02
Assets inside a .pck file created from one project becomes null when imported in another project 4 2022-01-25 2022-08-13
Timeout takes into account session scope fixture 3 2021-07-27 2022-11-19
Rename UnAuthorizedException to ForbiddenException 1 2021-01-12 2022-11-22
hwdec d3d11va fails when seeking multiple times 6 2021-11-27 2022-09-30
Incorrect handling query parameters containing semicolons 4 2021-10-28 2022-11-14
[Extensibility bug] Customer List 3 2022-01-27 2022-11-19
Prevent runtime type error due to wrong return value configuration 2 2018-02-20 2022-12-01
Support other OAUTH / Identity Providers 3 2020-10-05 2022-11-30
Using right-click 'Jump to declaration' menu item causes Processing to lock-up in some sketches. 12 2022-08-04 2022-11-25
[v2.0.0-rc2] Server candidate checking does not silence STDERR 0 2021-10-01 2022-09-30
Gifs should not loop endlessly 2 2021-08-26 2022-10-05
custom-secret-scripts refers to the incorrect secret name in lookup() 1 2022-08-04 2022-11-23
Markdown Visualization Autoupdate 2 2022-01-07 2022-10-14