Student Reviews
( 5 Of 5 )
1 review
Video of 9.2. Pig Execution Modes in Apache Hadoop course by CloudxLab Official channel, video No. 48 free certified online
Pig can be run in two modes:
- MapReduce mode - In this mode, Pig loads and processes the data stored on HDFS. Pig Latin statements invoke a MapReduce job to perform the processing. It is the recommended mode in a production environment.
- Local mode - In this mode, Pig accesses files stored on the local file system. Data processing happens on the local machine. This mode is generally used for testing locally and speeding up development.
[Pig - MapReduce Mode] [Screencast pig_modes.mp4]
Let's launch Pig on CloudxLab. Login to the CloudxLab Linux console. Type pig on the command prompt. By default, Pig gets launched in MapReduce mode. Afer successfully launching Pig, Grunt shell appears. The Grunt shell of Apache Pig is mainly used to write Pig Latin scripts. We can control Hadoop from Grunt shell by running fs commands. We can also kill the jobs and execute pig scripts from grunt shell.
To see files in your home directory in HDFS, type ls. You can see all the files stored in your home directory in HDFS. Press "Control-D" to exit the shell.
[Pig - Local Mode]
Let's launch Pig in local mode on CloudxLab. Type "pig -x local". To see files in your home directory, type ls.
This Big Data Tutorial will help you learn HDFS, ZooKeeper, Hive, HBase, NoSQL, Oozie, Flume, Sqoop, Spark, Spark RDD, Spark Streaming, Kafka, SparkR, SparkSQL, MLlib, and GraphX from scratch. Everything in this course is explained with the relevant example thus you will actually know how to implement the topics that you will learn in this course.
Let us know in the comments below if you find it helpful.
In order to claim the certificate from E&ICT Academy, IIT Roorkee, visit https://bit.ly/cxlyoutube
________
Website https://www.cloudxlab.com
Facebook https://www.facebook.com/cloudxlab
Instagram https://www.instagram.com/cloudxlab
Twitter http://www.twitter.com/cloudxlab