Student Reviews
( 5 Of 5 )
1 review
Video of 9.4. Pig Relational Operators - Load, Store And Dump in Apache Hadoop course by CloudxLab Official channel, video No. 50 free certified online
[Pig - Load]
LOAD operator loads the data from the file system.
1. To load the NYSE_dividends dataset from HDFS, type
load '/data/NYSE_dividends';
Tab will be the default separator if we do not specify a separator while loading the data. Pig loads the values and automatically guesses the datatype
2. We can explicitly define the separator using the PigStorage function. To load the CSV file, type the command
load '/data/NYSE_dividends' using PigStorage(',');
3. We can also define data types explicitly. We can define name as chararray, stock_symbol as chararray, date as datetime and dividends as float
[Pig - Store / Dump]
Store operator is used to store the data to HDFS and other storages.
Dump prints the value on the screen. It is used for debugging.
This Big Data Tutorial will help you learn HDFS, ZooKeeper, Hive, HBase, NoSQL, Oozie, Flume, Sqoop, Spark, Spark RDD, Spark Streaming, Kafka, SparkR, SparkSQL, MLlib, and GraphX from scratch. Everything in this course is explained with the relevant example thus you will actually know how to implement the topics that you will learn in this course.
Let us know in the comments below if you find it helpful.
In order to claim the certificate from E&ICT Academy, IIT Roorkee, visit https://bit.ly/cxlyoutube
________
Website https://www.cloudxlab.com
Facebook https://www.facebook.com/cloudxlab
Instagram https://www.instagram.com/cloudxlab
Twitter http://www.twitter.com/cloudxlab