At PrestoCon Day 2025, Meta’s Presto team unveiled the Prestissimo extension, an enhancement designed to optimize AI training data normalization. This article explores the technical underpinnings and developer-centric features of this extension, providing a comprehensive understanding of how it supports large-scale AI workloads at Meta. Understanding AI Training Data Storage at Meta At... The post Prestissimo Extension for AI Training Data Normalization at Meta: A Deep Di...| PrestoDB
Dynamic loading in Presto C++ is revolutionizing how developers build and deploy user-defined functions (UDFs). At PrestoCon Day 2025, Soumya Duriseti| PrestoDB
At PrestoCon Day 2025, engineers from IBM presented a deep dive into how connectors in Presto C++ extend the engine’s modular capabilities, focusing on the newly implemented TPCDS benchmark connector. Connectors are central to Presto’s architecture, enabling the query engine to communicate seamlessly with external systems such as databases, file formats, or benchmark data generators.... The post Building Connectors in Presto C++: Deep Dive into the TPCDS Connector (Lightning Talk) appea...| PrestoDB
At PrestoCon Day 2025, Satej Sahu (Principal Data Engineer at Zalando SE) introduced the Self-Healing Query Connector for Presto, an AI-powered upgrade designed to make query troubleshooting faster, smarter, and more reliable. By combining Large Language Models with live query data, including logs, explain plans, and schema details, it delivers accurate, context-aware solutions that improve... The post Presto’s Intelligent Future: Leveraging RAG and LLM’s for Smarter Query Execution appe...| PrestoDB
At PrestoCon Day 2025, we unveiled the Presto Sidecar, a powerful enhancement for Presto C++ (Velox) clusters that transforms how coordinators interact with native workers. This innovation removes long-standing blind spots in query planning by giving the coordinator real-time visibility into native worker capabilities – such as supported functions, data types, session properties, and plan... The post Revolutionizing Presto C++: Unleashing Native Power with the Sidecar appeared first on Pr...| PrestoDB
At PrestoCon Day 2025, Uber presented their solution for optimizing petabyte-scale data analytics by deploying a distributed cache using Alluxio for Presto. Their journey was driven by significant challenges during a massive cloud migration, including slow reads and overwhelmed on-premises HDFS clusters, and later high GCS egress costs and file access charges in the... The post Unlocking Petabyte-Scale Performance: Uber’s Journey with Alluxio Distributed Cache appeared firs...| PrestoDB
At this year’s PrestoCon Day, Meta held a session to share the latest on what they’re doing with Presto. As you probably know, Meta has one of the largest data lakehouses in the world, and Presto is a critical piece of that data platform. It plays a critical role in serving vast and diverse... The post Unleashing Interactivity: Inside Meta’s Presto-Powered Data Warehouse Innovation appeared first on PrestoDB.| PrestoDB