I have been working with Apache Spark for a while now and would like to share some UDF tips and tricks I have learned over the past year. Below is the sample data (i.e., people.json) used to demonstrate an example of UDF
It’s been a while since I wrote a blog post, so here you go. I have been researching Apache Spark lately and had to query a complex nested JSON data set. I encountered some challenges and ended up learning currently the best
This is a continuation of my previous post, where I explained how to run a Spring Boot app as a daemon inside a Docker container. The app uses MongoDB for storage, and the [/data/db] volume was mounted as a Docker container volume.
The more I use and learn about Docker, the more I feel like I can’t live without it. This post is about an amazing Docker feature: Volume Containers. I wanted to write a Spring Boot app and deploy it to the Docker
This post is all about real-time analytics on large data sets. I am sure everyone has heard of Apache Kafka (a distributed publish-subscribe messaging broker) and Apache Storm (a distributed real-time computation system), and if you were disappointed