This video is part of the Spark learning series. Spark provides several ways to optimize query performance. In this video, we cover the following:
What is Partitioning
Hash Partitioning
Range Partitioning
Why we should choose the right way of partitioning
How Spark's performance is impacted by Dynamic Partition Pruning
Here are a few useful links:
Git Repo: https://github.com/harjeet88/
Spark Interview Questions: https://www.youtube.com/playlist?list=PL9sbKmQTkW05mXqnq1vrrT8pCsEa53std
Spark performance tuning:
If you are interested in joining our community, please join the following groups:
Telegram: http://t.me/bigdata_hkr
WhatsApp: https://chat.whatsapp.com/KKUmcOGNiixH8NdTWNKMGZ
You can email me with any queries at
[email protected]
#apachespark #sparktutorial #bigdata #spark #hadoop #spark3