Congratulations! If you have completed each section of the Getting Started Guide, you are now well on your way to becoming an Apache Spark power user. As a good next step, visit the Qubole Developers page for more examples and helpful resources.
We also recommend that you download the O’Reilly Creating a Data-Driven Enterprise with DataOps eBook. The book covers important characteristics of data-driven organizations and where the big data industry has evolved with cloud and emerging Data Science technologies. Authored by Qubole co-founder’s Ashish Thusoo and JoyDeep Sen Sarma, the book also includes stories from Engineering leaders at Facebook, Linkedin, Uber, Ebay, and Twitter with steps your organization should consider when scaling out modern data platforms.
As a company founded out of open-source technologies, coming from the creators of Apache Hive, Qubole is constantly focused on contributing OSS tools back to the community to help build and scale Apache Spark applications in the cloud. Join our GitHub and check out other highlight Open-Source projects!
Access free Self-Service Spark Training, with self-paced courses for Users and Engineers, or live Instructor-Led Training every week.
Stay up to date with the latest improvements in big data analytics, AI, and ML in the cloud. Follow Qubole on Linkedin, Twitter, and Youtube!
Check out more Spark Notebook examples as well as other big data engines.
Highlight Notebooks:
Stay up to date with the latest releases and examples in the Apache Spark Engineering blog.
Data Science & Analytics
Data Engineering & Ops
Qubole Engineering & Product
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.