Presentations
- Chill Data Summit Bay Area August 2024: Iceberg and the Deconstructed Database
- Data Council March 2024: Ten years of building open source standards: From Parquet to Arrow to OpenLineage
- Data & AI Summit 2023: Cross-Platform Data Lineage with OpenLineage
- Airflow Summit 2023: Nurturing an Open Source community is like tending a garden
- Data and AI Summit May 2021: Data lineage and observability with OpenLineage
- Data Driven January 2021: Data pipelines observability with OpenLineage and Marquez
- Subsurface 2020: Data Lineage and observability with Marquez
- OpenCore Summit: Observability for data pipelines with OpenLineage
- IEEE Infrastructure 2020: Data Platform Architecture Principles
- Strata NY 2018: From Flat Files to Deconstructed Database, The evolution and future of the Big Data ecosystem
- Data Eng Conf April 2018: From Flat Files to Deconstructed Database, The evolution and future of the Big Data ecosystem
- Strata NY 2017: The columnar roadmap, Apache Parquet and Apache Arrow
- Mulesoft March 2017: The future of column-oriented data processing with Arrow and Parquet
- Spark Summit 2017: Improving Python and Spark Performance and Interoperability with Apache Arrow
- Hadoop Summit 2017: The columnar roadmap, Apache Parquet and Apache Arrow
- Strata NY 2016: The future of column-oriented data processing with Arrow and Parquet
- Strata London 2016: The future of column-oriented data processing with Arrow and Parquet
- Data Eng Conf NY November 2016: The future of column-oriented data processing with Arrow and Parquet
- Big Data Apps meetup Jan 2016: SQL-on-Everything with Apache Drill
- Hadoop Summit 2015: How to use Parquet as a basis for ETL and analytics
- Strata 2015: How to use Parquet as a basis for ETL and analytics
- HPTS 2015: If you have your own Columnar format, stop now and use Parquet
- Twitter Open House: Parquet, An open columnar file format for Hadoop
- Efficient Data Storage for Analytics with Apache Parquet 2.0
- Hadoop Summit 2013: Parquet, Columnar storage for the people
- Strata Hadoop World 2013: Parquet, Columnar storage for the people
- Drill meetup: Parquet Overview
- Pig meetup: Embedding Pig in scripting languages
- Hadoop Summit 2011: Pig Scripting, Making Pig Turing-complete through embedding in a scripting language