Presto is a high-performance, distributed SQL query engine for big data. It was originally designed and developed at Facebook so that its data analysts could run interactive queries on the company’s large data warehouse in Apache Hadoop.
Presto’s architecture allows users to query a variety of data sources such as Hadoop, AWS S3, Alluxio, MySQL, Cassandra, Kafka, and MongoDB. One can even query data from multiple data sources within a single query.
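As a rough illustration of querying multiple sources in one statement, the sketch below assembles a single SQL query that joins a table in one catalog with a table in another. Presto addresses tables as `catalog.schema.table`. The catalog, schema, table, and column names used here (`hive.web.page_views`, `mysql.crm.customers`, `customer_id`, `region`) are hypothetical and chosen only for illustration. The code only builds the query text and does not connect to a cluster.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a Presto-style fully qualified table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query() -> str:
    """Build one SQL statement that joins tables from two different catalogs."""
    # Hypothetical names, assumed for illustration only.
    page_views = qualified("hive", "web", "page_views")  # e.g. data in Hadoop or S3
    customers = qualified("mysql", "crm", "customers")   # e.g. data in MySQL
    return (
        "SELECT c.region, count(*) AS views\n"
        f"FROM {page_views} AS v\n"
        f"JOIN {customers} AS c ON v.customer_id = c.id\n"
        "GROUP BY c.region"
    )


if __name__ == "__main__":
    print(build_federated_query())
```

The resulting text could be submitted through any Presto client. Which catalogs are available depends on the connectors configured on the cluster.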
Qubole has offered a managed Presto service since 2014. We offer our customers multiple Presto versions and maintain a regular upgrade process. The offering is tailored to our customers’ needs: it blends the latest features from the open source community with Qubole’s proprietary solutions that boost performance, lower cost, improve user experience, and provide smooth administration of Presto clusters.
Performance Boost
Lower Cloud Operation Cost
Ease of Use
Enterprise-Ready
Qubole | Open Source | |
Graceful Low-cost Compute Shutdown * | ||
Spot (AWS) Rebalancing | ||
Spot Block (AWS) Support | ||
Workload-Aware Autoscaling | ||
User-Based Autoscaling | ||
Aggressive Downscaling with graceful decommissioning | ||
Heterogeneous Clusters | ||
Per-second billing | ||
Smart Query Retry | ||
Cost Explorer & Analysis | ||
Strict Mode (prevent runaway queries) |
* AWS Spot, Azure Low-cost VMs, Google Preemptible VMs
Qubole | Open Source | |
Compute Optimization for joins and filters | ||
Required Worker Node | ||
S3 Direct writes optimization | ||
S3 listing optimization | ||
Rubix (distributed caching) |
Qubole | Open Source | |
Versioning | ||
Scheduling | ||
Dashboarding (Presto Notebook) | ||
Collaboration and sharing |
Qubole | Open Source | |
Monitoring (Ganglia, DataDog, etc.) | ||
Intelligent Log Access |
Qubole | Open Source | |
Access control for notebooks, clusters, jobs, structured data | ||
Audit end-user activity logs | ||
Apache Ranger Integration | ||
SSO with SAML 2.0 support | ||
Data encryption | ||
HIPAA, SOC2 Type2, ISO-27001 compliant environments |
Qubole | Open Source | |
Custom Connector with BI tools (Tableau, Looker, etc.) | ||
REST API | ||
AWS Glue Support | ||
Data Source Connectors (Redshift, Postgres, Kinesis*, etc.) |
* Kinesis is being contributed back to OSS
Qubole | Open Source | |
24/7 support from our Presto experts | ||
Support multiple versions of Presto |