Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in Spark with Lorand Dali

This talk tells the story of implementing and optimizing a sparse logistic regression algorithm in Spark. I would like to share the lessons I learned and the steps I had to take to improve the execution speed and convergence of my initial naive implementation. The aim is not to convince the audience that logistic regression is great and my implementation is awesome; rather, the talk gives details about how it works under the hood, along with general tips for implementing an iterative parallel machine learning algorithm in Spark. The talk is structured as a sequence of "lessons learned", shown as code examples that build on the initial naive implementation. The performance impact of each "lesson" on execution time and speed of convergence is measured on benchmark datasets. You will see how to formulate logistic regression in a parallel setting, how to avoid data shuffles, when to use a custom partitioner, how to use the 'aggregate' and 'treeAggregate' functions, how momentum can accelerate the convergence of gradient descent, and much more. I will assume a basic understanding of machine learning and some prior knowledge of Spark. The code examples are written in Scala, and the code will be made available for each step in the walkthrough.

Session hashtag: #EUds9
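
For readers who want a concrete picture of the parallel formulation and the momentum update mentioned in the abstract, here is a minimal sketch. It is not the speaker's implementation: the talk's code is Scala on Spark, while this uses plain Python with lists standing in for partitions so that it runs without a cluster. The names `partition_gradient`, `merge` and `train`, the sparse-dict representation, and the default values for `steps`, `lr` and `momentum` are all assumptions made for illustration.

```python
import math
from functools import reduce


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def partition_gradient(rows, weights):
    """Sum the log-loss gradient over one partition.

    Each row is (features, label), where features is a sparse dict
    {index: value} and label is 0 or 1. Only indices that occur in
    the partition are touched, so the result stays sparse.
    """
    grad = {}
    for features, label in rows:
        margin = sum(weights.get(i, 0.0) * v for i, v in features.items())
        error = sigmoid(margin) - label
        for i, v in features.items():
            grad[i] = grad.get(i, 0.0) + error * v
    return grad


def merge(a, b):
    # Combine two partial gradients. This plays the role of the
    # combine function one would pass to an aggregate-style call.
    out = dict(a)
    for i, g in b.items():
        out[i] = out.get(i, 0.0) + g
    return out


def train(partitions, steps=200, lr=0.5, momentum=0.9):
    # Assumes at least one example across all partitions.
    n = sum(len(p) for p in partitions)
    weights, velocity = {}, {}
    for _ in range(steps):
        # "Map" each partition to a partial gradient, then reduce.
        total = reduce(
            merge, (partition_gradient(p, weights) for p in partitions), {}
        )
        # Classical momentum: decay the old velocity, add the new step.
        for i in set(velocity) | set(total):
            velocity[i] = (
                momentum * velocity.get(i, 0.0) - lr * total.get(i, 0.0) / n
            )
        for i, v in velocity.items():
            weights[i] = weights.get(i, 0.0) + v
    return weights
```

The point of the sketch is that each partition only produces a small partial gradient and the partials are combined by a function that is associative and commutative, so the training data itself never needs to move. In Spark, that combine step is the kind of work `aggregate` and `treeAggregate` are used for.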

About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
