Cloud Data Lakes – Four Must-have TCO Optimization Capabilities
Enterprises leverage cloud providers’ compute and storage services for their ad-hoc data analytics, streaming analytics, and ML use cases as cloud data lakes provide significant…
Enterprises leverage cloud providers’ compute and storage services for their ad-hoc data analytics, streaming analytics, and ML use cases as cloud data lakes provide significant…
All data-driven organizations use data in three ways: To report on the past To understand the present To predict the future Data warehouses and Business…
How to Optimize Spark Clusters on Qubole for Cost Reliability and Performance This second blog from the three-part series explains how a Spark cluster on…
Spot nodes on AWS (and preemptible VMs on Google Cloud Platform, GCP) are a great way to reduce your Total Cost of Ownership (TCO) for…
As a best practice, we recommend users create a few large Presto clusters that are shared between different teams, instead of creating multiple small clusters…
Qubole has provided Datadog as an integrated monitoring service for its clusters, including Presto clusters. This brings many improvements compared to the “old approach” for…
Data Lake Essentials, Part 3 – Data Lake Data Catalog, Metadata, and Search In this multi-part series, we will take you through the architecture of…
Data Lake Essentials, Part 2 – File Formats, Compression, and Security In this multi-part series, we will take you through the architecture of a Data…
What is Data Lake Architecture? In this multi-part series, we will take you through the architecture of a Data Lake. We can explore data lake…
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.