EMR

AWS EMR is a managed service provided by AWS for running Spark, HDFS, Hive, and other selected software. We will use EMR to run our Spark and HDFS cluster.

Protip: Start the EMR cluster only after you have your project set up, to avoid unnecessary cost.

1. Go to AWS Services -> EMR.
2. Click Create cluster.
3. Click Go to advanced options.
4. Select the options shown, then copy and paste the config below into the Edit software settings section.
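If you would rather script the cluster than click through the console, the same steps map roughly onto a single `run_job_flow` call in boto3. The sketch below is only an illustration under stated assumptions: the cluster name, release label, instance type, node count, and region are placeholders that do not come from this post, and the role names assume the default EMR roles already exist in your account. The `configurations` argument is the same config you would paste into Edit software settings, passed in as a Python list.

```python
def build_emr_request(configurations,
                      name="spark-hdfs-cluster",    # assumption: placeholder name
                      release_label="emr-6.15.0",   # assumption: pick your own release
                      instance_type="m5.xlarge",    # assumption: placeholder size
                      core_count=2):                # assumption: placeholder node count
    """Build keyword arguments for boto3's EMR run_job_flow call."""
    return {
        "Name": name,
        "ReleaseLabel": release_label,
        # Software to install; HDFS ships as part of Hadoop.
        "Applications": [{"Name": "Spark"}, {"Name": "Hadoop"}, {"Name": "Hive"}],
        # Equivalent of the "Edit software settings" box in the console.
        "Configurations": configurations,
        "Instances": {
            "InstanceGroups": [
                {"Name": "Master", "InstanceRole": "MASTER",
                 "InstanceType": instance_type, "InstanceCount": 1},
                {"Name": "Core", "InstanceRole": "CORE",
                 "InstanceType": instance_type, "InstanceCount": core_count},
            ],
            # Keep the cluster running after startup so jobs can be submitted later.
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # Assumption: the default EMR roles have been created in this account.
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }


def launch_cluster(request, region="us-east-1"):  # assumption: placeholder region
    """Start the cluster and return its ID. Requires boto3 and AWS credentials."""
    import boto3  # imported here so the request can be built without boto3 installed

    client = boto3.client("emr", region_name=region)
    response = client.run_job_flow(**request)
    return response["JobFlowId"]
```

Calling `launch_cluster(build_emr_request(my_config))` starts billing right away, so the protip above applies here too: launch only once the project is ready, and terminate the cluster when you are done.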