RDDs in Apache Spark

ankitdixit · 26 March 2021 09:22

Hello everyone, I am new to this community and I want to understand the concept of an RDD (Resilient Distributed Dataset). Also, how can I create RDDs in Apache Spark? I am preparing for some Apache Spark interview questions and want to know about this. Can anyone explain it to me?

Fabby · 27 March 2021 09:03

There is an excellent online tutorial covering all of this, so RTFM…

Please note:

Before you start proceeding with this tutorial, we assume that you have prior exposure to Scala programming, database concepts, and any of the Linux operating system flavors.