In this post I will be installing a three node Hadoop Cluster on Ubuntu 18.04.1 including HDFS.
In this post we will cover how to install MongoDB on Ubuntu 18.04. MongoDB is a NoSQL database that stores information in JSON like objects called documents.
In this post we will compare Apache Cassandra vs MongoDB. Both systems are being used for storing big data but they do it very differently.
In this post, I will outline how I created a big data pipeline for my web server logs using Apache Kafka, Python, and Apache Cassandra.
In this post, I’m going to install a complete ‘production ready’ Apache Cassandra cluster of three nodes.
In this post, we will discuss 3 awesome big data Python tools to increase your big data programming skills using production data.
In this post, we will be aggregating all of our logs into Google BigQuery Audit Logs. Using big data techniques we can simply our audit log aggregation in the cloud.
Big Data is everywhere these days. In this article I will give you some awesome real-life big data examples to demonstrate the utility of big data.
Continuing our Fast Data Architecture series, we will install Cassandra on Ubuntu 18.04 and configure it to run as a SystemD service.
Learn how to use Kafka Python to pull Google Analytics metrics and push them to your Kafka Topic. This will allow us to analyze this data later using Spark to give us meaningful business data.