Maximizing the Performance of DNA Analysis Using Apache Arrow

preview_player
Показать описание
Speaker: Zaid Al-Ars, Director of Software Engineering, Voltron Data

DNA analysis is becoming an important tool for a wide range of applications from accurate cancer diagnostics to agricultural yield improvement. This talk presents an Arrow-based end-to-end implementation of the popular DNA analysis pipelines used in research and clinical applications. Using Arrow improves the throughput of the pipeline by up to 5x, by improving processing efficiency and reducing the data overhead.
Рекомендации по теме