Beam vs Spark: A programming-model comparison

From a brief read of this “Comparison” (an overview article), Apache Beam appears to overtake Apache Spark when it comes to the programming model. In most of the examples, Beam accomplishes the same task in roughly half the lines of code. More digging is required, of course, but this looks like something to keep an eye on.

A little background:
Apache Beam (formerly known as Dataflow) grew out of an effort to unify the batch and stream processing models.
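
To make the "unified" idea a bit more concrete, here is a toy sketch in plain Python. It does not use the Beam SDK, and the names (count_words, endless_lines) are made up for illustration. The only point is that one processing function can be written once and fed either a bounded input (batch) or an unbounded one (stream).

```python
from collections import Counter
from itertools import islice
from typing import Iterable, Iterator, Tuple


def count_words(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Yield (word, running_count) for every word seen.

    The function never asks whether its input is bounded, so the same
    code serves a finite list (batch) and an endless generator (stream).
    """
    counts: Counter = Counter()
    for line in lines:
        for word in line.lower().split():
            counts[word] += 1
            yield word, counts[word]


def endless_lines() -> Iterator[str]:
    """Hypothetical unbounded source: repeats two lines forever."""
    while True:
        yield "beam spark"
        yield "beam"


# Batch: a bounded input, consumed to the end; later pairs overwrite
# earlier ones, so the dict holds the final count per word.
batch_result = dict(count_words(["beam spark", "beam"]))

# Stream: an unbounded input, so only the first five updates are taken.
stream_updates = list(islice(count_words(endless_lines()), 5))
```

A real Beam pipeline adds much more on top of this (windowing, triggers, distributed execution), so treat the sketch as an analogy for the model rather than a picture of how Beam works internally.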

Beam is modeled after several earlier technologies and builds directly on FlumeJava and MillWheel.
