Media Summary: Feature Hashing - One-Hot Encoding (OHE) - OHE Dictionary - Prediction and log loss evaluation - Feature reduction More info: ... Review - Math review - Numpy and Spark - Lambda functions. Notebook usage - Intro to spark and pySpark API - Using RDDs - Lambda functions - RDD actions, transformation, caching ...
Hackon Data 2017 Workshop 7 - Detailed Analysis & Overview
Feature Hashing - One-Hot Encoding (OHE) - OHE Dictionary - Prediction and log loss evaluation - Feature reduction More info: ... Review - Math review - Numpy and Spark - Lambda functions. Notebook usage - Intro to spark and pySpark API - Using RDDs - Lambda functions - RDD actions, transformation, caching ... Create a RDD and pair RDD - Counting words - Finding unique words and mean value - Reference to regular expressions ... complete rework of the demonstration for module