SQL-on-Hadoop: The Paradox of Choice

SQL-on-Hadoop: The Paradox of Choice

Hadoop has been around for a little over 10 years now. It provides you a scale-out and cost-effective solution to store and process large amount of data – which we loosely refer to as “Big Data”. More enterprises are adopting Hadoop with an objective...
HDFS Heterogeneous Storage Model

HDFS Heterogeneous Storage Model

HDFS has proven to be a scalable, fault-tolerant and distributed storage solution which is quickly being adopted by various industries. The distributed storage along with the ability to scale-out in a linear way makes the entire Hadoop framework very cost...
Spark 2.0 – What’s New

Spark 2.0 – What’s New

Earlier this month DataBricks provided an overview of Apache Spark’s next major release, Spark 2.0. The following post shows some of the changes in the abstraction, API and Libraries. Spark 2.0 is expected to be released in early June 2016. What is Apache Spark...
Learning Git in Reverse

Learning Git in Reverse

  The content of this post is drafted from contents of a similarly titled presentation. It is certainly counter-intuitive to learn to drive by first going backwards, so why learn how to use Git in reverse? The short answer is: knowing the internals of Git should...
Learning Elixir by Kenny Ballou

Learning Elixir by Kenny Ballou

zData is excited to recognize Kenny Ballou for his book release – Learning Elixir Kenny Ballou is a life-long learner, developer, mathematician, and overall thinker. He enjoys solving problems, learning about technologies, and discussing new and different ideas....
The Internet of Things – Beyond the Hype

The Internet of Things – Beyond the Hype

Unless you have been living in a cave, the term Internet of Things (IoT) has reached your ears.  The term has been gaining traction, but a few questions come to mind when we talk about the IoT – 1. Is it just a Hype? and 2. Is there more to the IoT, or is this...