Posts

How Does Spark Run on a Cluster?

 As mentioned in the last article ( https://dataengineeringfromscratch.blogspot.com/2021/02/what-is-apache-spark.html ), the Spark framework primarily comprises drivers, executors, and a cluster manager. Now it's time to look in more detail at how exactly Spark runs on a cluster, both internally and externally.

What is a Cluster?
Spark is a parallel processing framework in which multiple machines/servers take part in executing a task. A cluster is simply a formal term for a group of such machines/servers. This also explains why the Cluster Manager is called that: it manages the cluster, which comprises many machines.

What is a Worker Node?
Worker nodes are the machines on which the cluster manager places drivers and executors when a Spark application is submitted to it.

What is Execution Mode?
An execution mode determines how resources are physically allocated when you run a Spark application. There are three main modes: 1) Cluster Mode...
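The execution mode is usually chosen at submission time. As a rough sketch, assuming a hypothetical application jar, main class, and a YARN cluster manager, a `spark-submit` invocation might look like this (the jar name, class name, and resource sizes below are illustrative, not from the post):

```
# Cluster mode: the driver itself is launched on a worker node by the cluster manager
spark-submit \
  --class com.example.MyApp \        # hypothetical main class
  --master yarn \                    # cluster manager (YARN here)
  --deploy-mode cluster \            # driver runs inside the cluster
  --num-executors 4 \
  --executor-memory 2g \
  my-app.jar                         # hypothetical application jar

# Client mode differs only in where the driver runs: on the submitting machine
# (--deploy-mode client), while executors still run on worker nodes.
```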

What is Apache Spark?

 Data has become huge, and in the foreseeable future we expect it to grow exponentially. With this much data (Big Data), computing, in addition to storage, is a major bottleneck faced by organizations, which opens up opportunities for data engineers. Apache Spark is arguably by far the best way to tackle this in a UNIFIED and PARALLEL way.

UNIFIED
Apache Spark is not limited to transforming data. It is a one-stop solution for ingesting data (simple data loading), transforming data, querying data with SQL, machine learning, and streaming computation. All of these can be achieved with Spark. Users can work in the programming language of their choice - Scala, Java, Python, or R - each of which has libraries for the diverse tasks mentioned above.

PARALLEL
What makes Spark so special? Why is there such huge demand for Spark skills these days? Say the task is to clean a room. Assigning one cleaner to complete the task would definitely take more time ...
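The room-cleaning analogy is just divide-and-conquer: split the work into pieces and hand each piece to a separate worker. This is not Spark itself, but a minimal plain-Python sketch of the same idea using the standard library's thread pool (the `clean_section` function and the sample "room" are invented for illustration):

```python
from concurrent.futures import ThreadPoolExecutor

def clean_section(section):
    # Pretend "cleaning" a section means processing each of its items;
    # here we just count them so the example stays self-contained.
    return len(section)

def clean_room(sections, workers=4):
    # Split the room into sections and let several "cleaners" (threads)
    # work on them at once, much as Spark splits a job across executors
    # and combines their partial results.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(clean_section, sections))

if __name__ == "__main__":
    room = [["dust", "mop"], ["vacuum"], ["wipe", "polish", "tidy"]]
    print(clean_room(room))
```

Spark applies the same pattern at cluster scale: instead of threads on one machine, the units of work are tasks scheduled onto executors running on many machines.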