Presto Performance for Ad Hoc Workloads on AWS Instance Types
Public cloud platforms like AWS, Azure, and GCP provide more than 100 instance types, each with different characteristics in memory, CPU, and storage optimizations,…
This blog is the second installment in a two-part series about self-service access to data. Read the first post here. We all know the issue…
If big data frameworks had a popularity contest, Apache Spark would be the attractive, trendy option everyone wants to be seen with. First developed at…
The following is a recap from a Bellevue Artificial Intelligence Meetup event hosted by Qubole and held with Expedia on July 20th, 2018. The topics…
Have you ever wondered why you receive personalized promotions and offers in the mail from various retail and telecom giants? Many of these promotions are…
Last week’s announcement that Cloudera and Hortonworks will merge to form a single entity speaks volumes about the state of the big data and Machine…
Many companies start their big data cloud journey on Azure by testing Microsoft’s native offering HDInsight (HDI). With data already in Blob Storage or Azure…
Presto is a distributed ANSI SQL engine for processing big data ad hoc queries at tremendous speed and scale. The engine is used to run…
In a little over three years, iflix, the Malaysia-based OTT service, has become one of the world’s leading entertainment service providers for emerging markets. Today,…
For many data scientists and statisticians, R is their tool of choice. It provides many useful abstractions, is easy to script in, and has tons…
Introducing the 2018 Big Data Trends and Challenges Survey, sponsored by Qubole
In its second year, the 2018 Big Data Trends and Challenges Survey report*…
The big data ecosystem is insanely complex — just making sense of the right tools and technologies can be more difficult than data mining itself.…