Все публикации

Sayash Kapoor - Five surprises in evaluating AI agents

Jacob Baldwin 'Pass Coverage Metrics in the NFL'

Jay Alammar - 'Hands-On Large Language Models: Language Understanding and Generation'

Rona Fang-Yu Hu 'Integrating ‘Something Else’ Sexual Identity Responses in Health Disparity Studies'

Sean Taylor 'Causal Discovery for Product Analytics'

Annie Collins, GivingTuesday, 'Leveraging Data for Generosity'

Sana Shams and Waris Bhatia “Navigating Canada’s Largest Corpus of Government Documents”

Naman Jain - 'LiveCodeBench: Holistic and contamination free evaluation of LLMs for code'

Terry Yue Zhuo 'BigCodeBench: Benchmarking Code Generation'

Kobi Hackenburg 'Evaluating the persuasive influence of political microtargeting with LLMs'

Jae Yeon Kim - 'Field experimentation in the U.S. safety net'

Lars Vilhuber - Privacy protection in RCTs: The challenge of privacy protection in the field

Ethan Busby 'AI-Enabled Persuasion Research: Experimenting with Effective Political Messaging'

Belinda Li - 'Eliciting Human Preferences with Language Models'

Amanda Coston - Addressing validity in decision-making algorithms

Kosuke Imai 'Does AI help humans make better decisions?'

Abel Brodeur - Mass Reproducibility and Replicability: A New Hope

Lenny Bronner - Election Modeling at The Washington Post

Cameron Buckner - 'The philosophy of Large Language Models'

Laura Plein - Can LLMs demystify Bug Reports and translate them into Test Cases?

Tom Davidson - 'Harnessing Generative Artificial Intelligence for Sociological Research'

Jonathan Mellon 'Using LLMs to code open-text social survey responses at scale'

Matheus Facure 'Why Banking has the Coolest Stats/Data Science Problems'

Sky CH-Wang - Do Androids Know They’re Only Dreaming of Electric Sheep?

visit shbcf.ru