This post will describe how can we ingest the geospatial data into Apache Solr for search and query. The pipeline is built with Apache Spark and Apache Spark Solr connector. The purpose of this project is to ingest and index
JSON Archive
Query Nested JSON via Spark SQL
It’s been a while since I wrote a blog so here you go. I have been researching with Apache Spark currently and had to query complex nested JSON data set, encountered some challenges and ended up learning currently the best