filmov
tv
Koalas: pandas APIs on Apache Spark
Показать описание
# Abstract
In this talk, Reynold will present Koalas, a new open source project that was announced at the Spark + AI Summit in April. Koalas is a Python package that implements the pandas API on top of Apache Spark, to make the pandas API scalable to big data. Using Koalas, data scientists can make the transition from a single machine to a distributed environment without needing to learn a new framework.
Reynold will demonstrate Koalas' new functionalities since its initial release, discuss its roadmaps, and how he envisions Koalas could become the standard API for large scale data science.
# Speaker Bio
Reynold Xin is a cofounder and Chief Architect at Databricks. In the open source community, Reynold is known as a top contributor to the Apache Spark project, having designed many of its core user-facing APIs and execution engine features. Reynold received a PhD in Computer Science from UC Berkeley, where he worked on large-scale data processing systems.
In this talk, Reynold will present Koalas, a new open source project that was announced at the Spark + AI Summit in April. Koalas is a Python package that implements the pandas API on top of Apache Spark, to make the pandas API scalable to big data. Using Koalas, data scientists can make the transition from a single machine to a distributed environment without needing to learn a new framework.
Reynold will demonstrate Koalas' new functionalities since its initial release, discuss its roadmaps, and how he envisions Koalas could become the standard API for large scale data science.
# Speaker Bio
Reynold Xin is a cofounder and Chief Architect at Databricks. In the open source community, Reynold is known as a top contributor to the Apache Spark project, having designed many of its core user-facing APIs and execution engine features. Reynold received a PhD in Computer Science from UC Berkeley, where he worked on large-scale data processing systems.
Koalas: pandas APIs on Apache Spark
Koalas: Pandas API on Apache Spark - PyCon SG 2019
Koalas on Apache Spark - Pandas API
Koalas: Pandas on Apache Spark
Koalas: Making an Easy Transition from Pandas to Apache Spark
Koalas dataframe on SPARK = Pandas API supercharged!
Koalas: Pandas on Apache Spark -Tim Hunter, Brooke Wenig, Niall Turbitt (Databricks)
Master Databricks and Apache Spark Step by Step: Lesson 33 - Goodbye Koalas: Hello Pandas on Spark!
Koalas Introduction with PySpark practical | PySpark Koalas Transition from pandas API
Koalas Easy Transition from pandas to Apache Spark - Xiao Li
Master Databricks and Apache Spark Step by Step: Lesson 32 - Koalas: Pandas on Spark!
Koalas: Making an Easy Transition from Pandas to Apache Spark -Tim Hunter & Takuya Ueshin
Koalas: Easy Transition from Pandas to Spark - Ben Sadeghi
New Pandas API in Spark 3.2 for single node and multi node
Koalas: Interoperability Between Koalas and Apache Spark
Announcing Koalas Open Source Project | Reynold Xin (Databricks), Brooke Wenig (Databricks)
Amanda Moran: Pandas vs Koalas: The Ultimate Showdown! | PyData New York 2019
DataXDays 2020 - Koalas: Parallelising pandas with Apache Spark - Niall Turbitt
DataXDays 2020 Koalas Parallelising pandas with Apache Spark Niall Turbitt
Scaling Pandas with Apache Spark + Koalas for ML at Virgin Hyperloop One
Pandas API on Spark
Koalas: How Well Does Koalas Work?
How to use Pandas API on Spark 3.3.0 | Pandas API on Spark Tutorial
Koalas Introduction with PySpark Practical In Tamil | Koalas Transition from pandas API in Tamil
Комментарии