Shefali Aggarwal, Author at Qubole

Data Lake Storage

By Shefali Aggarwal | January 23, 2020

What is Data Lake Architecture? In this multi-part series, we will take you through the architecture of a Data Lake. We can explore data lake…

Open Data Lake Platform

By Shefali Aggarwal | January 22, 2020

Apache Spark Benchmark for Autoscaling: Qubole versus Competition

By Shefali Aggarwal | January 14, 2020

This blog covers new benchmark tests to better understand the Autoscaling behavior of concurrent Apache Spark applications. We believe that this will help in advancing…

Apache Spark Autoscaling Benchmarks

Streamlining Operations of Machine Learning Models

By Shefali Aggarwal

Guest authors: Jerry Xu, Co-founder and CEO, Datatron; Lekhni Randive, Product Manager, Datatron. Qubole author: Jorge Villamariona, Sr. Product Marketing Manager, Qubole. In today’s world,…

Data Modeling Data Platforms Data Science Workloads Devops Machine Learning (ML)

Apache Sqoop 1.4.7 – 9 reasons why you need it

By Shefali Aggarwal | January 10, 2020

The sixth release of Apache Sqoop, version 1.4.7, is out! This is one of the most significant updates to the Sqoop platform. We give you…

Apache Hadoop Apache Hive ETL Workloads Qubole Data Platforms

Analytics and ML simplified with Jupyter Notebooks and Apache Spark

By Shefali Aggarwal

Data scientists use Notebooks for data exploration, interactive data analytics, machine learning, and collaboration. Once set up, a Notebook provides a convenient way to save,…

Data Analyst Jupyter JupyterLab interface Qviz

Per-Bucket Configuration Support in Presto

By Shefali Aggarwal | January 8, 2020

Introduction: Presto can access S3 buckets using one of the following options: IAM roles provided in the configuration; access key/secret key provided in the configuration; credentials fetched…

Presto Presto on Cloud Presto on Qubole

Optimized Upscaling for Managing Workloads in Cloud

By Shefali Aggarwal | November 26, 2019

Introduction: Qubole provides powerful automation that optimizes underlying cloud compute management for data lakes. Qubole cluster management continuously optimizes both performance and cost by lowering…

Cluster Management Optimized Upscaling TCO Workloads

Qubole: The Super Powers of Support

By Shefali Aggarwal | November 22, 2019

Introducing Qubole Support: Qubole processes over 250 petabytes of data a month, and the diversity of the data we process, the cloud platforms we run on,…

Big Data Users Complexity Management Data Platforms Debugging Spark applications Performance Sentiment Analysis Technology

Practical Guide to Financial Governance of Data Lake Initiatives

By Shefali Aggarwal | November 12, 2019

Introduction: Enterprises today are becoming more data-driven, as their data fuels the innovation engine they rely on to build new products, outmaneuver the competition and…

Introducing Qubole Release 57

By Sheetal Kalra and Shefali Aggarwal | October 29, 2019

Each month, about an exabyte of data is processed using Qubole’s data platform on Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure,…

Apache Hive Apache Spark Autoscaling AWS Cloud Big Data Processing Cluster Management Data Admin Data Engineer Data Infrastructure Data Modeling Google Clou Heterogeneous Clusters Microsoft Azure Performance Presto Scalability Spot Instances

Calculating 30 billion speed estimates a week with Apache Spark on Qubole

By Shefali Aggarwal | October 25, 2019

This post is a guest publication written by Saba El-Hilo, a Senior Data Engineer at Mapbox. A version of this post first appeared as a…

Apache Spark Big Data Processing Case Studies Data Science Workloads Data Scientist Machine Learning (ML) Spark Scalability
