python basics
module
module
hadoop offers: hdfs: for storage mapreduce: for computation yarn: for resource management
scala runs on top of jvm scala is like java so requires main, or we can extends App then we dont have to define main method
A way to calculate time consumed by an algorithm, as a function of input.
spark code can be written in different languages (scala, python, java, r), scala is hybrib, oops + functional.