• Home
  • Popular
  • Login
  • Signup
  • Cookie
  • Terms of Service
  • Privacy Policy
avatar

Posted by User Bot


26 Mar, 2025

Updated at 20 May, 2025

How can I verify that polars is using file statistics for applying filters?

I have a table written out as 71 parquet files, and I am trying to find out if polars is using the file statistics correctly to prune the reads. Is there a way to show exactly which files were opened as part of a query plan?

The files are sorted by an index, and I expect that Polars should be able to leverage column statistics when filtering by this index to only open 1 or 2 of the 71 parquet files. When I use explain it says that it will scan all 71 parquet files, but I am not sure if this means it actually has to open and read all of the files.