Apache Arrow and Go: A Match Made in Data - Matthew Topol

A presentation from ApacheCon 2022
Apache Arrow is fast becoming a standard for working with data, but most people know it mainly through the Python, C++ and Java libraries. This talk instead focuses on the Go implementations of Apache Arrow and Parquet. We'll cover getting started with the Go Arrow and Parquet libraries, creating an Arrow Flight server and client over gRPC, and integrating with other runtimes using the C Data API. Go's concurrency primitives make it well suited to building efficient pipelines for parallel processing of large amounts of data. We'll also cover some internals of the implementation to show how the Go Arrow and Parquet libraries achieve their performance, including how they benefit from SIMD.
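The point about Go's concurrency primitives can be illustrated with a minimal sketch. It is not taken from the talk and does not use the Arrow library: each `[]int64` below is only a stand-in for one column of a record batch, and `sumChunks` is a hypothetical helper name. It shows the general fan-out/fan-in shape, with goroutines pulling chunks from one channel and sending partial results to another.

```go
package main

import (
	"fmt"
	"sync"
)

// sumChunks fans column chunks out to `workers` goroutines and adds up
// the per-chunk partial sums. Each []int64 is an illustrative stand-in
// for one column of an Arrow record batch; no Arrow library is used.
func sumChunks(chunks [][]int64, workers int) int64 {
	if workers < 1 {
		workers = 1
	}
	jobs := make(chan []int64)
	partials := make(chan int64)

	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			// Each worker sums whole chunks until jobs is closed.
			for chunk := range jobs {
				var s int64
				for _, v := range chunk {
					s += v
				}
				partials <- s
			}
		}()
	}

	// Producer: feed chunks, then close jobs so the workers exit.
	go func() {
		for _, c := range chunks {
			jobs <- c
		}
		close(jobs)
	}()

	// Close partials once every worker is done so the loop below ends.
	go func() {
		wg.Wait()
		close(partials)
	}()

	var total int64
	for p := range partials {
		total += p
	}
	return total
}

func main() {
	chunks := [][]int64{{1, 2, 3}, {4, 5, 6}, {7, 8, 9, 10}}
	fmt.Println(sumChunks(chunks, 2))
}
```

With real Arrow data the same structure would presumably operate on record batches rather than plain slices; the channel and `sync.WaitGroup` plumbing is what carries over.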