This video is part of the Spark learning Series. Repartitioning and Coalesce are very commonly used concepts, but a lot of us miss basics. So As part of this video, we are covering the following
what is Repartition
What is Coalesce
Difference between repartition and coalesce
How Spark's performance is impacted by repartition and coalesce
Here are a few Links useful for you
Git Repo: https://github.com/harjeet88/
Spark Interview Questions: https://www.youtube.com/playlist?listPL9sbKmQTkW05mXqnq1vrrT8pCsEa53std
If you are interested to join our community. Please join the following groups
Whatsapp: https://chat.whatsapp.com/KKUmcOGNiixH8NdTWNKMGZ
Telegram: http://t.me/bigdata_hkr
You can drop me an email for any queries at
[email protected] How Repartition works: 0:34
How Coalesce works: 1:19
Difference between Repartition and coalesce: 2:17
How Spark's performance is Impacted: 3:20
#apachespark #sparktutorial #bigdata
#spark #hadoop #hive