Apache Kafka – Usage Patterns

As the technology industry changes, new buzzwords appear: first Hadoop, then Spark, and now Kafka. What is Kafka? Kafka is an Apache top-level project. Apache Kafka is an open-source, unified, high-throughput, low-latency streaming platform which can handle real-time data...
SQL-on-Hadoop: The Paradox of Choice

Hadoop has been around for a little over 10 years now. It provides a scale-out, cost-effective solution for storing and processing large amounts of data – which we loosely refer to as “Big Data”. More enterprises are adopting Hadoop with an objective...
Spark 2.0 – What’s New

Earlier this month Databricks provided an overview of Apache Spark’s next major release, Spark 2.0. The following post shows some of the changes in the abstractions, APIs and libraries. Spark 2.0 is expected to be released in early June 2016. What is Apache Spark...
The DB Hack – Ambari Delete or Remove Service

In a previous post I discussed how you can delete or remove a service from Ambari. The process involved using the Ambari API to delete or remove the target service. In this article, we will dive into the other side of the Ambari interface – the database....