home ¦ Archives ¦ Atom ¦ RSS

Sparkin’ EMR

Link parkin’. How to layer Spark within an Amazon Elastic MapReduce cluster. Basic idea is to use a bootstrap script to deploy the toolkit.

In this article, we’ll explain how to install Shark and Spark on a cluster managed by Amazon EMR. By combining these technologies, you’ll be able to enjoy the speed enhancements of the Shark data warehouse as well as the operational and financial advantages of running your cluster on Amazon EMR.

© 2008-2024 C. Ross Jam. Built using Pelican. Theme based upon Giulio Fidente’s original svbhack, and slightly modified by crossjam.