Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Login
From:
Jacob's blog
(Uncensored)
subscribe
Performance Tuning a Nested Data Generator for Parquet
https://jacobsherin.com/posts/2025-09-01-arrow-shredding-pipeline-perf/
links
backlinks
I built a CLI for procedurally generating nested Parquet data where the values follow a Zipfian-like distribution.