Walking through a complex set of concepts that form the Big Data stack. Setting up Big Data environments, using efficient data management operations and running algorithms to the scale and speed required by Big Data datasets.

Github Link: Full repository can be found here

Implementing solutions to address Big Data problems that pertain to the following technologies:

  1. Bash
  2. SQL-NoSQL
  3. Hadoop
  4. Spark
  5. GraphX