Agenda

A FREE ONE-DAY EVENT FOR DATA PRACTITIONERS

Join us for a free, instructor-led workshop to build an open data lake. Choose from the sessions below on some of the hottest topics in the Big Data space and walk away with the knowledge needed to guide your team to data-driven solutions.

Data Engineering on Data Lakes
Managing Machine Learning Lifecycle on Data Lakes
10/12/202010/12/2020
11:00 - 11:45 PDT13:00 - 13:45 PDT
In this workshop, experts will guide you through:

  • Common Challenges faced by data engineering teams managing a data platform and ETL pipelines

  • Leveraging an open data lake platform for building a modern data architecture

  • Developing auto-scaling data pipelines

  • Best practices for deploying data engineering pipelines



This workshop will help you learn how to:

  • Build predictive Machine Learning models on the data lake using Apache Spark, including data prep, model training, and hyperparameter tuning

  • Provide meaningful and convenient ways to manage packages for Python and R dialects on distributed Spark clusters

  • Enable experiment tracking and model registry for iterative analysis

  • Accelerate end-to-end machine learning lifecycle by automating the process of deploying models to production


09:00 - 09:30 PDT

Ashish Thusoo

Co-founder/CEO at Qubole

Welcome & KeyNote

09:30 - 10:00 PDT

Debanjan Saha

VP/GM, Data Analytics at Google

Guest KeyNote

10:00 - 10:15 PDTChair Yoga (10 min)
TRACKSTHOUGHT LEADERSHIPDATA LAKES IN THE REAL WORLDFUNDAMENTALS AND BEST PRACTICESDATA LAKE TECHNOLOGYLIGHTNING TALKS
10:30 - 11:10 PDT
Rajat Monga - Co-Creator of TensorFlow
Advances in Artificial Intelligence - What’s in it for the Enterprise?
Sharath Babu - Razorpay
The Journey of a Modern Full-Stack Data Analyst
Brad Caffey - Expedia Group
Running Apache Spark jobs cheaper while maximizing performance
Joydeep Sen Sarma - Qubole
Managing Transactions in Data Lakes
Sirish Mandalika - Orbital Insight
Optimizing platform engineering with a scalable and efficient workflow management solution
11:15 - 11:55 PDT
Caleb Jones - The Walt Disney Company
Domain-driven Data Architecture
Tom Silverstrim - Adobe
Automated Dataset Monitoring in Adobe Experience Platform
Ivan Peng - Nextdoor
Doubling Down: Why Nextdoor Ditched a Data Warehouse for a Centralized Data Lake
Jorge Lopez - AWS
Data Lakes & Machine Learning: Driving Innovation with your data
Shubham Tagra - Qubole
Faster analytics on cloud with RubiX
12:00 - 12:20 PDT
Ask an Architect
Interactive ‘ask me anything’ session with Qubole Solution Architects.
12:30 - 13:00 PDT
BREAK
13:00 - 13:40 PDT
Raman Narasimhan - Cognizant
Decisioning in the New Normal: Leveraging AI and ML in Enterprises
Siddhant Srivastava - Swiggy
Powering Real-time decisions with Big Data and Microservices
Rohit Srivastava, Bitanshu Das - MiQ
Cost Optimization and Self-Service Reporting for a Data Lake Ecosystem
Martin Traverso, David Phillips - Presto Software Foundation
State and future of the Presto project and community
Eddie White - Google
The business value of Qubole’s Open Data Lake on Google Cloud
13:45 - 14:20 PDT
Sathish K S - Zeotap
Data Governance in Multi-Tenant Datalakes - A Tech Perspective
Matt Falk - Orbital Insight
Want real-time analytics? Model your storage right or bust
Ranjith Kuppala - Searce
Building Data Lake on AWS and GCP
Javier Luraschi - RStudio
Scaling Data Science with Spark and R
Bikash Singh, Jatin Kheradiya - MiQ
Data Minimization for Data Governance Strategy in GDPR
09:00 - 09:15 PDTWelcome and Opening Remarks
09:15 - 09:45 PDT

Chris Casey

General Manager Worldwide BD at AWS Data Exchange

Guest KeyNote

09:45 - 10:15 PDT

Kirk Borne

The Principal Data Scientist & Data Science Fellow, and Executive Advisor at Booz Allen Hamilton

Guest KeyNote

TRACKSTHOUGHT LEADERSHIPDATA LAKES IN THE REAL WORLDFUNDAMENTALS AND BEST PRACTICESDATA LAKE TECHNOLOGYLIGHTNING TALKS
10:30 - 11:10 PDT
Prabhu Prakash Ganesh - MiQ
Building and Scaling a Data and Analytics ecosystem - A Story of Decisions, Learnings and Recommendations
Mark Senerth, Mohan Naidu - The Walt Disney Company
A Hub and Spoke Approach to Scaling Storage
David Potes - AWS
Making the Data Lake the Foundation of your Data Strategy
Sean Knapp - Ascend.io
Declarative Pipelines & Intelligent Orchestration - Data’s Missing Link
Joel McKelvey - Looker
Modern Analytics and the Modern Data Lake
11:15 - 11:55 PDT
Sarvottam Darshan - Genpact
Digital Transformations: The Finance office of the future
Kent Buboltz - Expedia Group
Analytics on Analytics: Leveraging Metadata in the Big Data Landscape
Eddie White - Google
The Cloudscape of Data Lakes and Data Warehouses
Ajantha Bhat - Huawei Technologies
Apache CarbonData: Data Storage for ACID Ingest, Fast Query, and Machine Learning
Ori Reshef - Varada
Leverage the Power of Big Data Indexing to Optimize Price &
Performance
12:00 - 12:20 PDT
Ask an Architect
Interactive ‘ask me anything’ session with Qubole Solution Architects.
12:30 - 13:00 PDT
BREAK
13:00 - 13:40 PDT
Pravanjan Choudhury - Capillary Technologies
5 Reasons Why a Multi-tenant Data Lake is a Different Ballgame
Arnaud Prades - Acquia
Building an open, data first and machine learning forward platform
Shreya Pal - Cognizant
Data Lakes Fundamentals and Best Practices - Lessons learned in Planning, Strategy, and Execution
Dr. Srikanth Venkat - Privacera
Your Data Lake is Moving to the Cloud - What About Your Security Policies?
Rohit Karlupia - Qubole
Spark optimization with Sparklens
13:45 - 14:20 PDT
Sanjeev Pant - Presidio
Data Driven Decisions for Business Outcomes
Hugo Sosa - BigData4ALL
Migrating from a Legacy Datawarehouse to a Data Lake on Cloud – a business point of view
David Garty - spotad
Data Lakes in a Real-time bidding environment
Barr Moses - Montecarlo Data
The Rise of Data Downtime: Making Observability a Pillar of your Data Lake Strategy
Shantanu Shrivastava - Zeotap
Achieving Operational Excellence for data engineering
14:25 - 14:55 PDT
Hafiz Badrie - Bukalapak
Decentralised Data Platform at Bukalapak
Sumit Maheshwari - Twitter
Apache Airflow - The present and the future

THE DATA LAKE SUMMIT

THE DATA LAKE SUMMIT

Attend the definitive virtual conference for all things Data Lake