Apache Spark for Machine Learning on Large Data Sets • Juliet Hougland • YOW! 2017

Показать описание

This presentation was recorded at YOW! 2017. #GOTOcon #YOW

Juliet Hougland - Data Science Tech Lead for Engineering at Cloudera @juliethougland325

RESOURCES

ABSTRACT
Apache Spark is a general purpose distributed computing framework for distributed data processing. With MLlib, Spark’s machine learning library, fitting a model to a huge data set becomes very easy.

Similarly, Spark’s general purpose functionality enables application of a model across a large collection of observations. We’ll walk through fitting a model to a big data set using MLlib and applying a trained #scikitlearn model to a large data set. [...]

RECOMMENDED BOOKS

#ApacheSpark #Spark #MLlib #ML #MachineLearning #SoftwareEngineering #JulietHoughland #Programming #YOWcon

CHANNEL MEMBERSHIP BONUS
Join this channel to get early access to videos & other perks:

Looking for a unique learning experience?

SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily.

Рекомендации по теме

Комментарии

We are currently releasing older YOW! videos to serve as a valuable archive, preserving historical content. It is possible that a video is perceived as outdated. We believe it offers insightful glimpses into the past, enriching our understanding of history and development.

Looking for books & other references mentioned in this video?
Check out the video description for all the links!

Want early access to videos & exclusive perks?

Question for you: What’s your biggest takeaway from this video? Let us know in the comments! ⬇

GOTO-

In 38 minutes this video made my manager understand why I think Spark is wonderfull. Thanks for releasing it, even if its a few years old.

ebusdk

Apache Spark for Machine Learning on Large Data Sets • Juliet Hougland • YOW! 2017

Machine learning with Apache Spark | Machine Learning Essentials

Apache Spark in 100 Seconds

Apache Spark™ ML and Distributed Learning (1/5)

Apache Spark Machine Learning | Apache Spark Tutorial For Beginners | Simplilearn

Learn Apache Spark in 10 Minutes | Step by Step Guide

What Is Apache Spark?

Apache Spark for Machine Learning on Large Data Sets • Juliet Hougland • YOW! 2017

Building Machine Learning Model using Apache Spark | PySpark MLlib Tutorial

🚀 Build a Logistic Regression Model in Apache Spark ⚡ | PySpark 'MLlib' Tutorial for Begin...

Distributed Machine Learning with Apache Spark / PySpark MLlib

Introduction to Machine Learning with Apache Spark and Redis - Part 1 - Spark Basics

Spark MLlib Tutorial | Machine Learning On Spark | Apache Spark Tutorial | Simplilearn

Machine Learning using Apache Spark MLlib | PySpark Tutorial

PySpark Tutorial

Leveraging Apache Spark for Scalable Data Prep and Inference in Deep Learning

Apache Spark in 60 Seconds

Building Machine Learning Algorithms on Apache Spark - William Benton

Apache Spark - Computerphile

Building Genomic Data Processing and Machine Learning Workflows Using Apache Spark

Introduction to Spark for Data Science and Machine Learning [ Recorded Live Session]

Scalable Machine Learning on Big Data using Apache Spark

Apache SPARK MLlib - Machine Learning for Data Science and AI deployment on SPARK cluster

End to End Machine Learning pipeline using Apache Spark - Hands On

Virtualizing Apache Spark and Machine Learning (Justin Murray)