The Story in the Notebook: Exploratory Data Science using a Literate Programming Tool

preview_player
Показать описание
The Story in the Notebook: Exploratory Data Science using a Literate Programming Tool
Mary Beth Kery, Marissa Radensky, Mahima Arya, Bonnie E. John, Brad A. Myers

CHI '18: ACM CHI Conference on Human Factors in Computing Systems
Session: Programming 2

Abstract
Literate programming tools are used by millions of programmers today, and are intended to facilitate presenting data analyses in the form of a narrative. We interviewed 21 data scientists to study coding behaviors in a literate programming environment and how data scientists kept track of variants they explored. For participants who tried to keep a detailed history of their experimentation, both informal and formal versioning attempts led to problems, such as reduced notebook readability. During iteration, participants actively curated their notebooks into narratives, although primarily through cell structure rather than markdown explanations. Next, we surveyed 45 data scientists and asked them to envision how they might use their past history in an future version control system. Based on these results, we give design guidance for future literate programming tools, such as providing history search based on how programmers recall their explorations, through contextual details including images and parameters.

Recorded at the ACM CHI Conference on Human Factors in Computing Systems in Montréal, Canada April 21-26, 2018
Рекомендации по теме
Комментарии
Автор

8:36 > _"open the notebook to try out some syntax"_

woah lol, yeah, nice to see others do this too (of course, but still...)

yash
Автор

great talk. but who's Adam that she keeps pointing at?

jumana