big data Archives - Devrats Journal

Hive Performance Tuning

Sanjay Mishra 6th October 2019 Leave a Comment

The Apache Hive ™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL syntax. To know how to use Hive please read https://cwiki.apache.org/confluence/display/Hive/Tutorial…

Apache Spark Unit Testing

Sanjay Mishra 14th June 2018 2 Comments

Apache Spark is a lightning-fast unified analytics engine for big data and machine learning. Apache Spark is included in almost all of the Hadoop distributions. Apache Spark is the hottest…

Hadoop to explore data

Sanjay Mishra 18th November 2017 3 Comments

Big data by definition denotes datasets that are so large or complex that traditional data processing application frameworks and software are inadequate to deal with them. Hadoop is the answer…

Tag: big data

Hive Performance Tuning

Apache Spark Unit Testing

Hadoop to explore data