Media Summary: Notebook usage - Intro to spark and pySpark API - Using RDDs - Lambda functions - RDD actions, transformation, caching ... Review - Math review - Numpy and Spark - Lambda functions. Please complete the videos, exercise, and lab before the session on Thursday. Jul 14 - Virtual session - Intro ...
Hackon Data 2017 Workshop 1 - Detailed Analysis & Overview
Notebook usage - Intro to spark and pySpark API - Using RDDs - Lambda functions - RDD actions, transformation, caching ... Review - Math review - Numpy and Spark - Lambda functions. Please complete the videos, exercise, and lab before the session on Thursday. Jul 14 - Virtual session - Intro ... Create a RDD and pair RDD - Counting words - Finding unique words and mean value - Reference to regular expressions ... Text Analysis - Text similarity of Entity Resolution - Weighted bag-of-words - Cosine similarity - Scalable Entity Resolution ... Feature Hashing - One-Hot Encoding (OHE) - OHE Dictionary - Prediction and log loss evaluation - Feature reduction More info: ...
Participants Experience - HackOn(Data) 2017