This video is part of the Spark Learning Series. Spark provides several methods for optimizing query performance. In this video, we cover the following:
What is Partitioning
How does partitioning help to improve performance
What is Bucketing
How does bucketing help to improve performance
Difference between Partitioning and Bucketing
How Spark's performance is impacted by Dynamic Partition Pruning
Here are a few useful links:
Git Repo: https://github.com/harjeet88/
Spark Interview Questions: https://www.youtube.com/playlist?list=PL9sbKmQTkW05mXqnq1vrrT8pCsEa53std
Spark performance tuning:
If you are interested in joining our community, please join the following groups:
Telegram: http://t.me/bigdata_hkr
WhatsApp: https://chat.whatsapp.com/KKUmcOGNiixH8NdTWNKMGZ
You can drop me an email for any queries at [email protected]
#apachespark #sparktutorial #bigdata #spark #hadoop #spark3