Exploring Wikipedia with Apache Spark: A Live Coding Demo

The real power of Apache Spark lies in building unified applications that combine batch analysis, stream analysis, SQL, machine learning, graph processing, and visualization. In this live coding demo, Sameer will use several Wikipedia datasets to build a dashboard showing what is happening in the world during his talk. The application will connect to Wikipedia's live edits stream and join it with other Wikipedia datasets to derive insights about what’s trending on the planet.


Location: Salon D
April 12th, 2016
4:00 PM - 5:00 PM