Big Data

5 data integration trends that will define the future of ETL in 2018

ETL refers to extract, transform, load and it is generally used for data warehousing and data …

Kubernetes has emerged as go to container orchestration platform for data engineering teams. …

To unlock the true value of data, organisations will need internal data services. Data services …

I have been playing with Apache Drill for quite some time now. In layman’s terms, Apache Drill …

Currently the majority of cloud based database and data warehouse services are provisioned with …

In 2005 Stonebraker et al. published a paper that outlined 8 key requirements for stream processing …

Apache Mesos is a popular open source cluster manager which enables building resource-efficient …

My notes and thoughts on Hadoop Ecosystem from book Hadoop Operations1. One of the major key take …

Notes plus thoughts from my recent read Cassandra: The Definitive Guide. Common ways to solve …