What is the CAP Theorem?
The CAP theorem states that a distributed data store (database) can only guarantee 2 of the 3 following: consistency, availability, and partition tolerance. What is Consistency? Consistency means that all…
The CAP theorem states that a distributed data store (database) can only guarantee 2 of the 3 following: consistency, availability, and partition tolerance. What is Consistency? Consistency means that all…
Docker containers, when run in detached mode (the most common -d option), are designed to shut down immediately after the initial entrypoint command (program that should be run when container…
Apache Hadoop is a core big data technology. Running Hadoop on Docker is a great way to get up and running quickly. Below are the basic steps to create a…
Hadoop’s default output delimiter (character separating the output key and value) is a tab ("\t"). This post explains how to change the default Hadoop output delimiter. Output Delimiter Configuration Property…
Occasionally when programming in Java you will need a List too large to hold in memory. MapDB is an open source Java library that allows you to create a disk…
Occasionally when programming in Java you will need a Map too large to hold in memory. MapDB is an open source Java library that allows you to create a disk…
Uppercasing and lowercasing is a common task when working with text. Below are examples of doing this with standard Linux/Unix utilities. Lowercasing text with tr tr 'A-Z' 'a-z' Uppercasing text…