VeloxCon 2025 Agenda Day 1

Day 1 | Day 2

Tuesday, April 15

8:00AM – 9:00AM	Registration & Breakfast
9:00AM – 9:10AM	Welcome & Opening Remarks	Sridhar Reddy Anumandla, Meta & Ali LeClerc, IBM
9:10AM – 9:40AM	Velox in the Age of Machine Learning	Pedro Pedreira, Velox Lead & Software Engineer at Meta
9:45AM – 10:15AM	Training Data Loading with Velox	Masha Basmanova, Co-creator of Velox & Software Engineer at Meta
10:15AM – 10:45AM	Break
10:45AM – 11:15AM	PySpark Meets Velox: Redefining Efficiency in ML Workloads	Konstantinos Karanasos, Software Engineer at Meta Kk Pulla, Software Engineer at Meta
11:20AM – 11:40AM	Metalake – Meta’s Mutable Data Lake for Scalable Pre-training Data Loading	Darren Fu, Software Engineer at Meta
11:45AM – 12:05PM	State of the Union: Apache Gluten	Binwei Yang, Co-creator of Apache Gluten & Software Engineer at IBM
12:05PM – 1:15PM	Lunch
1:15PM – 1:35PM	From Open Source to Enterprise: Real-World Applications of Velox and Apache Gluten in Dataproc’s NQE	Abhishek Modi, Senior Staff Software Engineer at Google
1:40PM – 1:50PM	Techniques for addressing UDF incompatibility between spark and Gluten/Velox	Shiyu Gan, Software Engineer at Bytedance
1:55PM – 2:05PM	Enhancing Pinterest’s Data Platform: A 2025 Update on Gluten Integration	Felix Loesing, Software Engineer at Pinterest
2:10PM – 2:20PM	Accelerate queries on Apache Hudi using Gluten and Velox	Shiyan Xu, Founding Team Member at Onehouse
2:25PM – 2:55PM	Presto C++: The next decade in SQL processing	Amit Dutta, Software Engineer at Meta
3:00PM – 3:10PM	PyVelox: Python bindings for Velox	Krishna Pai, Software Engineer at Meta
3:10PM – 3:30PM	Break
3:30PM – 3:45PM	Keynote: Accelerating the future of Open Data Engines – Velox at IBM	Remus Lazar, VP Software at IBM
3:50PM – 4:10PM	Velox at Uber	Sergey Makagonov, Staff Software Engineer at Uber
4:15PM – 4:35PM	Presto C++ Iceberg Support update	Ying Su, Software Engineer at IBM
4:40PM – 5:10PM	Presto C++ Feature Deep Dive	Ying Su, Software Engineer at IBM Aditi Pandit, Software Engineer at IBM Soumya Duriseti, Software Engineer at IBM
5:15PM – 5:45PM	Presto C++ Operational Efficiency: Memory Management, Metrics, and Caching	Christian Zentgraf, Software Engineer at IBM Deepak Majeti, Software Engineer at IBM Minhan Cao, Software Engineer at IBM
5:45PM – 7:00PM	Conference Reception

Session Details

Velox in the Age of Machine Learning

In this talk, Pedro will present Velox community updates and new developments in Meta’s internal adoption of Velox. He will discuss how Velox has been focused on accelerating important AI/ML workloads related to training data preparation and training data loading at Meta, and highlight some of the active development/research areas where traditional data management systems fall short when supporting novel AI/ML workloads – and what the Velox community is doing to address them.

Pedro Pedreira
Velox Lead & Software Engineer at Meta

Training Data Loading with Velox

AI has become the first consumer of data. To ensure data-hungry trainers are consistently fed with training datasets, data loading platforms have quickly evolved, growing in popularity and scale. This workload brings novel challenges to data management related to new data shapes (extreme width), new batching and shuffling requirements, and tensor consumption. In this talk, we will discuss important data management features for AI/ML data, and how Velox is helping accelerate large-scale training data loading workloads.

Masha Basmanova
Co-creator of Velox & Software Engineer at Meta

PySpark Meets Velox: Redefining Efficiency in ML Workloads

AI applications within Meta, including recommendation systems and GenAI, are pushing the limits of our batch compute engines. As demand has grown exponentially over the past few years, bespoke solutions have emerged, but they often present challenges in terms of maintenance, reusability, efficiency, and reliability. In this talk, we introduce PySpark/Velox, a new offering at Meta that combines Python’s ease-of-use, Spark’s scalability, and Velox’s efficiency. We will discuss the architecture, various use cases and results, and how it fits into our existing stack.

Konstantinos Karanasos
Software Engineer at Meta

Kk Pulla
Software Engineer at Meta

Metalake – Meta’s Mutable Data Lake for Scalable Pre-training Data Loading

While off-the-shelf data lake solutions provide robust management for large datasets, Meta faces unique efficiency challenges, such as efficiently merging full-column updates at petabyte scale. Metalake, Meta’s internal mutable data lake, is designed to address these needs, enabling logical deletes, efficient rich update semantics, all while optimizing compute efficiency.

In this talk, we’ll dive into how Metalake, powered by Velox and PySpark, accelerates feature injection pipelines and scales model training with minimal compute overhead. We’ll present how Velox-based optimizations like shallow copy partitions, auto compaction, and encoding preserved merge-time rebatching eliminate traditional join performance bottlenecks, and significantly reduce compute costs.

Darren Fu
Software Engineer at Meta

State of the Union: Apache Gluten

Get the latest updates on Apache Gluten, the open-source project accelerating Spark with Velox. This session will cover recent developments, performance improvements, and roadmap plans, highlighting how Gluten enhances query execution efficiency in Spark.

Binwei Yang
Co-creator of Apache Gluten & Software Engineer at IBM

From Open Source to Enterprise: Real-World Applications of Velox and Apache Gluten in Dataproc’s NQE

Google Dataproc’s Performance Boost, powered by the integration of Velox and Apache Gluten, is revolutionizing big data analytics for enterprise customers. This session delves into the real-world impact of our Velox, showcasing how we’ve scaled these open-source technologies to deliver unprecedented performance gains. We’ll explore the architecture of our Dataproc integration, highlighting how Velox’s vectorized execution and Apache Gluten’s glue layer combine to significantly reduce query latency and resource consumption. Furthermore, we’ll detail our key improvements to the Spark-Velox integration including broadcast hash join optimizations that dramatically improve the performance of join-heavy workloads. We also focused on the addition and stabilization of comprehensive ORC and Parquet test cases, ensuring robust and reliable performance for a critical data format.

To help customers quickly understand the potential benefits, we’ve developed a qualification tool that analyzes Spark event logs and accurately predicts the performance gains achievable with Velox pushdown. Through case studies with marquee customers, we’ll demonstrate the tangible benefits of NQE, including accelerated data processing, improved cost efficiency, and enhanced scalability. Join us to learn how we’re pushing the boundaries of big data analytics with Velox and Apache Gluten in Google Dataproc, and how our extensions and innovative tools are contributing to the broader open-source community.

Abhishek Modi
Sr. Staff Software Engineer at Google

Techniques for addressing UDF incompatibility between spark and Gluten/Velox

In this talk, Shiyu talks through challenges encountered in the productionization of Gluten/Velox around compatibility between Spark’s and Velox’s json and re (regular expression) UDF’s and then presents two techniques in solving these incompatibilities between UDF’s.

Shiyu Gan
Software Engineer at Bytedance

Enhancing Pinterest’s Data Platform: A 2025 Update on Gluten Integration

In this lightning talk, we will explore the advancements Pinterest has made to its query platform using Gluten over the past year. We’ll begin with a brief overview of how Gluten is integrated into Pinterest’s infrastructure, followed by a discussion on the improved querying speeds and increased efficiency resulting from this integration. We will conclude with our plans for future developments and areas we want to improve on.

Felix Loesing
Software Engineer at Pinterest

Accelerate queries on Apache Hudi using Gluten and Velox

In this session, we’ll explore how Hudi-rs, the native Rust implementation of Apache Hudi, enables powerful integration with Velox to dramatically improve query performance. We’ll cover the technical design behind this integration, highlighting key architectural decisions that optimize data access patterns and query execution. Attendees will learn about the performance benefits achieved through benchmarks and see how this integration can be applied to their own data environments.

Shiyan Xu
Founding Team Member at Onehouse & Apache Hudi PMC member

Presto C++: The next decade in SQL processing

In this talk, we will cover the journey of Presto C++’s (Prestissimo) general availability at Meta across a diverse set of workload. We will provide insights into performance optimizations, correctness, and numerous stability improvements done over the last year. This work has enabled us to run Presto ~2-4X faster with 50% less overall capacity costs and further established a strong path for consolidation across various systems

Amit Dutta
Software Engineer, Meta

PyVelox: Python bindings for Velox

In this talk, we’ll introduce PyVelox, a recently published native Python package that brings the power of Velox to the Python ecosystem.

PyVelox provides seamless bindings to Velox’s core APIs, including PlanBuilder, I/O, LocalRunner, and more. We’ll explore these bindings in-depth, showcasing how they enable Python developers to tap into Velox’s lightning-fast performance and scalability. We’ll also demonstrate just how easy it is to get started with PyVelox on your local machine. With a simple installation process and intuitive API, you’ll be creating and executing plans for running TPCH queries in no time.

Krishna Pai
Software Engineer, Meta

Keynote: Accelerating the future of Open Data Engines – Velox at IBM

IBM has been a key contributor to Velox, leveraging its high-performance execution engine to optimize data processing at scale. In this keynote, Remus will share how IBM is integrating Velox across its data platforms, key innovations driven by the community, and what’s next for Velox in accelerating modern data workloads.

Remus Lazar
VP Software at IBM

Velox at Uber

Join us as we explore the transformative journey of productionizing Presto C++ (Prestissimo) at Uber. This talk will detail our progress on the migration to the next-generation data query engine, highlighting the technical challenges we encountered along the way, key milestones that were achieved, and the overall impact of Velox on Uber’s data processing capabilities.

Sergey Makagonov
Staff Software Engineer at Uber

Presto C++ Iceberg Support update

Presto’s C++ engine continues to improve its support for Apache Iceberg. This talk will cover recent enhancements in metadata handling, query planning, support of Insert, and read performance, along with future improvements on the roadmap.

Ying Su
Software Engineer at IBM

Presto C++ Feature Deep Dive

Presto continues to evolve with key enhancements in its C++ engine, leveraging Velox for performance and extensibility. This session will cover recent developments in 1) Unfenced UDFs, a new approach to defining and executing user-defined functions in Presto, improving flexibility and performance, 2) PartitionOutput to optimize data distribution for large-scale query processing, and 3) TopNRank operator which optimizes TPC-DS Q67 by ~2.5x. Q67 is the longest running query in the TPC-DS workload (about 30% of the total runtime).

Ying Su
Software Engineer at IBM

Soumya Duriseti
Software Engineer at IBM

Aditi Pandit
Software Engineer at IBM

Presto C++ Operational Efficiency: Memory Management, Metrics, and Caching

Operational efficiency is crucial for running Presto at scale. This session will explore recent advancements in Presto’s C++ engine that improve observability, resource management, and performance:

LinuxMemoryChecker – A new mechanism for better memory tracking and enforcement, enhancing stability in resource-intensive workloads.
Stats and Metrics – Deep dive into runtime stats and metrics, including a live demo showcasing how these metrics can help optimize query performance.
SSDCache Usage – A deep dive into leveraging SSD-based caching to accelerate data access and reduce latency in Presto queries.

Christian Zentgraf
Software Engineer at IBM

Deepak Majeti
Velox Committer & Software Engineer at IBM

Minhan Cao
Software Engineer at IBM