Shefali Aggarwal
The Definitive Guide to Data Lakes
The Data Lake Summit: Day 2 Recap
The Data Lake Summit 2020 has come to an end! The two-day virtual summit took a deeper dive into the latest trends around data analytics,…
The Data Lake Summit: Day 1 Recap
What an epic first day it has been for The Data Lake Summit. We hope you enjoyed it. The…
Part 3: Transactions on the Data Lake
Data Lakes are becoming increasingly central to the analytical operations of organizations. This brings in many more ‘transactional’ requirements on the pipeline architecture and the…
Announcing Keynotes for The Data Lake Summit
With just a week to go, we are excited to announce the keynote speakers for The Data Lake Summit, the definitive virtual conference for all…
Architecting Data Lakes for Scale and Speed – The Data Lake Summit Speaker Lineup
Cloud data lakes are enabling new business models and near real-time analytics to support better decision-making. However, as the number of workloads migrating to cloud…
Data Lake TCO Optimization – The Data Lake Summit Speaker Lineup
Running ad hoc analytics, streaming analytics, and machine learning workloads in the cloud offers unique cost, performance, and time-to-value advantages. But the unpredictability…
Part 2: Tuning the Data Ingestion process
In Part 1 of this series, we briefly touched upon the various design considerations to be made when architecting the Data Lake. We saw how…
Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup
Today’s applications for machine learning and real-time predictive analytics require a robust set of capabilities from the underlying data platform. These must meet the growing…
Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup
Artificial intelligence and machine learning workloads use multiple data formats, combining batch and real-time data, and require scalable computing resources. Leveraging data…
Data Lake Ingestion
Data Lakes are a core pillar in an organization’s data strategy. They make organizational data from different sources accessible to various end-users such as business…