Qubole now supports efficient updates and deletes for data stored in cloud data lakes. Users can run inserts, updates, and deletes on transactional Hive tables (tables defined over files in a data lake via Apache Hive) and query those tables from Apache Spark or Presto. Our changes to support reads on such tables from Apache Spark and Presto have been open sourced, and the ongoing work on multi-engine updates and deletes will be open sourced as well. In this video, Shubham Tagra, Sr. Staff Engineer at Qubole, describes the capabilities, design choices, implementation details, and future roadmap.