News

Pinterest revamped its data infrastructure by transitioning from a legacy Hadoop system to the Moka platform, leveraging Kubernetes and Spark on AWS EKS. This strategic shift enhances job ...
NYC Taxi Analytics: Spark ETL Pipeline on AWS EMR. Project summary: this project demonstrates the use of Amazon EMR (Elastic MapReduce) to process NYC taxi trip data with Apache Spark.