Week 8 Scalable Data Science - Detailed Analysis & Overview
GraphX and GraphFrames, Distributed Vertex Programs, Motif Finding, PageRank, etc.
Explores replication, partitioning and eventual consistency in distributed databases.
Spark Essentials: Working with Spark's Resilient Distributed Datasets (RDDs): creating RDDs, performing basic transformations ...
Discussion of Test 1, grading and questions. Also some discussion about future assignments, tests and topics. Questions and ...
Hussain Sultan, Director of Artificial Intelligence, and Ben Lindquist, Principal Software Engineer at MetroStar Systems, lead a talk ...
Introduction to Machine Learning, Decision trees (supervised classification of handwritten digits) and K-Means clustering of a ...