Spark 2.0 – What’s New

Spark 2.0 – What’s New

Earlier this month DataBricks provided an overview of Apache Spark’s next major release, Spark 2.0. The following post shows some of the changes in the abstraction, API and Libraries. Spark 2.0 is expected to be released in early June 2016. What is Apache Spark...
Get Real with Attunity Replicate

Get Real with Attunity Replicate

Real-time analytics capabilities are all the rage in the Big Data world. Analytic database vendors love to bring up the “real-time capability” of their products. Before you get too excited, know that most of the time they should be adding “sort of” to that line. The...
The Internet of Things – Beyond the Hype

The Internet of Things – Beyond the Hype

Unless you have been living in a cave, the term Internet of Things (IoT) has reached your ears.  The term has been gaining traction, but a few questions come to mind when we talk about the IoT – 1. Is it just a Hype? and 2. Is there more to the IoT, or is this...
Expanding your Hadoop Ecosystem

Expanding your Hadoop Ecosystem

dewoods.com Author: Dillon Woods, CTO @ zData Inc.  Introduction Leveraging the predictive benefits of data science was not so long ago an under-the-radar secret of smart businesses that recognized the value of interpreting and projecting data. Today, although the...
Apache Storm vs. Apache Spark

Apache Storm vs. Apache Spark

Storm and Spark This is the last post in the series on real-time systems. In the first post we discussed Apache Storm and Apache Kafka. In the second post we discussed Apache Spark (Streaming). In both posts we examined a small Twitter Sentiment Analysis program....