You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: Show the number of matched Parquet pages in DataSourceExec (#19977)
## Which issue does this PR close?
- Closes#19875.
## Rationale for this change
Show the number of matched (and pruned) pages in the explain analyze
plan to help make decisions about file optimization.
Example:
```sql
DataSourceExec: ..., metrics=[
...
page_index_rows_pruned=1.00 K total → 100 matched,
page_index_pages_pruned=100 total → 10 matched,
...
]
```
## What changes are included in this PR?
- Added `page_index_pages_pruned` metric to DataSourceExec.
- Updated and extended existing tests.
## Are these changes tested?
Yes.
## Are there any user-facing changes?
New metric in the explain plans.
Copy file name to clipboardExpand all lines: docs/source/user-guide/explain-usage.md
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -226,6 +226,7 @@ Again, reading from bottom up:
226
226
When predicate pushdown is enabled, `DataSourceExec` with `ParquetSource` gains the following metrics:
227
227
228
228
-`page_index_rows_pruned`: number of rows evaluated by page index filters. The metric reports both how many rows were considered in total and how many matched (were not pruned).
229
+
-`page_index_pages_pruned`: number of pages evaluated by page index filters. The metric reports both how many pages were considered in total and how many matched (were not pruned).
229
230
-`row_groups_pruned_bloom_filter`: number of row groups evaluated by Bloom Filters, reporting both total checked groups and groups that matched.
230
231
-`row_groups_pruned_statistics`: number of row groups evaluated by row-group statistics (min/max), reporting both total checked groups and groups that matched.
231
232
-`limit_pruned_row_groups`: number of row groups pruned by the limit.
0 commit comments