filmov
tv
Maximizing the Performance of DNA Analysis Using Apache Arrow
Показать описание
Speaker: Zaid Al-Ars, Director of Software Engineering, Voltron Data
DNA analysis is becoming an important tool for a wide range of applications from accurate cancer diagnostics to agricultural yield improvement. This talk presents an Arrow-based end-to-end implementation of the popular DNA analysis pipelines used in research and clinical applications. Using Arrow improves the throughput of the pipeline by up to 5x, by improving processing efficiency and reducing the data overhead.
DNA analysis is becoming an important tool for a wide range of applications from accurate cancer diagnostics to agricultural yield improvement. This talk presents an Arrow-based end-to-end implementation of the popular DNA analysis pipelines used in research and clinical applications. Using Arrow improves the throughput of the pipeline by up to 5x, by improving processing efficiency and reducing the data overhead.