Big Data:
Hadoop / Hive

Overview

Running Hive Queries on Hadoop Clusters

The first part of the project involved setting up a Hadoop environment on AWS. Hive queries were then executed on a large Movies dataset (one of its tables contains more than 1 million rows of movie ratings). Refer to the documentation below.
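
To give a sense of the kind of query involved, here is a minimal sketch of an aggregate over a ratings table. The table name `ratings`, its columns (`user_id`, `movie_id`, `rating`), and the sample rows are hypothetical, since the actual schema is not described here. So that the example is self-contained, it runs against an in-memory SQLite database rather than a Hive cluster; the SELECT statement itself uses only constructs (GROUP BY, HAVING, ORDER BY) that HiveQL also supports.

```python
import sqlite3

# Hypothetical schema and sample rows; the real dataset's table and
# column names are assumptions made for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE ratings (user_id INTEGER, movie_id INTEGER, rating REAL)"
)
conn.executemany(
    "INSERT INTO ratings VALUES (?, ?, ?)",
    [(1, 10, 4.0), (2, 10, 5.0), (1, 20, 3.0), (3, 20, 2.0), (2, 30, 4.5)],
)

# Average rating per movie, keeping only movies with at least two ratings.
QUERY = """
SELECT movie_id, COUNT(*) AS num_ratings, AVG(rating) AS avg_rating
FROM ratings
GROUP BY movie_id
HAVING COUNT(*) >= 2
ORDER BY avg_rating DESC
"""

rows = conn.execute(QUERY).fetchall()
for movie_id, num_ratings, avg_rating in rows:
    print(movie_id, num_ratings, avg_rating)
```

On a Hadoop cluster, the same statement would typically be submitted through the Hive CLI or Beeline, with the table stored in HDFS instead of SQLite.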

Project Documentation