Apache Arrow and Go A match made in Data - Matthew Topol

preview_player
Показать описание
Apache Arrow and Go A match made in Data - Matthew Topol
A presentation from ApacheCon 2022

With Apache Arrow fast becoming a standard for working with data, most people are primarily familiar with the Python, C++ and Java libraries. This talk instead focuses on the Golang implementations of Apache Arrow and Parquet. We'll cover getting started using the Go Arrow and Parquet libraries, creating an Arrow Flight server and client over gRPC, and integrating with other runtimes using the C Data API. The concurrency primitives in Go make it ideal for constructing efficient pipelines for parallel processing of large amounts of data. We'll also cover some of the internals of the implementation to demonstrate how the Go Arrow and Parquet libraries achieve their performance including benefiting from SIMD.
Рекомендации по теме
Комментарии
Автор

Great presentation! However I didn't get that when you said there's no serialization cost or deserialization cost across system at 3:18. Base on my knowledge every data across system need serializatiion and deserialization.if you mean the cost of ser/deser is relatively low then that make more sense.

mcyyuzd
join shbcf.ru