What is the Hadoop ecosystem and how does Apache Spark fit in?

hadoop

By ravikumar
November 11, 2022 in General Discussion

Followers 0

Recommended Posts

ravikumar

Posted November 11, 2022

- Share

Posted November 11, 2022

I'm having a lot of trouble grasping what exactly a 'Hadoop ecosystem' is conceptually. I understand that you have some data processing tasks that you want to run and so you use MapReduce to split the job up into smaller pieces but I'm unsure about what people mean when they say 'Hadoop Ecosystem'. I'm also unclear as to what the benefits of Apache Spark are and why this is seen as so revolutionary? If it's all in-memory calculation, wouldn't that just mean that you would need higher RAM machines to run Spark jobs? How is Spark different than writing some parallelized Python code or something of that nature.

Quote

Link to comment

Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Reply to this topic...

× Pasted as rich text. Paste as plain text instead

Only 75 emoji are allowed.

× Your link has been automatically embedded. Display as a link instead

× Your previous content has been restored. Clear editor

× You cannot paste images directly. Upload or insert images from URL.

Insert image from URL

Followers 0

Go to topic listing

Similar Content
- Difference between HBase and Hadoop/HDFS
  
  By ravikumar, October 28, 2022
  - hadoop
  - 0 replies
  - 766 views
Recently Browsing 0 members
- No registered users viewing this page.

Sign In

What is the Hadoop ecosystem and how does Apache Spark fit in?

Recommended Posts

ravikumar

Link to comment

Share on other sites

Join the conversation

Similar Content

Difference between HBase and Hadoop/HDFS

Recently Browsing 0 members

Browse

Latest Activity

Blog

Technical Support

Important Information