Все публикации

Sayash Kapoor -

Sayash Kapoor - Five surprises in evaluating AI agents

Jacob Baldwin 'Pass

Jacob Baldwin 'Pass Coverage Metrics in the NFL'

Jay Alammar -

Jay Alammar - 'Hands-On Large Language Models: Language Understanding and Generation'

Rona Fang-Yu Hu

Rona Fang-Yu Hu 'Integrating ‘Something Else’ Sexual Identity Responses in Health Disparity Studies'

Sean Taylor 'Causal

Sean Taylor 'Causal Discovery for Product Analytics'

Annie Collins, GivingTuesday,

Annie Collins, GivingTuesday, 'Leveraging Data for Generosity'

Sana Shams and

Sana Shams and Waris Bhatia “Navigating Canada’s Largest Corpus of Government Documents”

Naman Jain -

Naman Jain - 'LiveCodeBench: Holistic and contamination free evaluation of LLMs for code'

Terry Yue Zhuo

Terry Yue Zhuo 'BigCodeBench: Benchmarking Code Generation'

Kobi Hackenburg 'Evaluating

Kobi Hackenburg 'Evaluating the persuasive influence of political microtargeting with LLMs'

Jae Yeon Kim

Jae Yeon Kim - 'Field experimentation in the U.S. safety net'

Lars Vilhuber -

Lars Vilhuber - Privacy protection in RCTs: The challenge of privacy protection in the field

Ethan Busby 'AI-Enabled

Ethan Busby 'AI-Enabled Persuasion Research: Experimenting with Effective Political Messaging'

Belinda Li -

Belinda Li - 'Eliciting Human Preferences with Language Models'

Amanda Coston -

Amanda Coston - Addressing validity in decision-making algorithms

Kosuke Imai 'Does

Kosuke Imai 'Does AI help humans make better decisions?'

Abel Brodeur -

Abel Brodeur - Mass Reproducibility and Replicability: A New Hope

Lenny Bronner -

Lenny Bronner - Election Modeling at The Washington Post

Cameron Buckner -

Cameron Buckner - 'The philosophy of Large Language Models'

Laura Plein -

Laura Plein - Can LLMs demystify Bug Reports and translate them into Test Cases?

Tom Davidson -

Tom Davidson - 'Harnessing Generative Artificial Intelligence for Sociological Research'

Jonathan Mellon 'Using

Jonathan Mellon 'Using LLMs to code open-text social survey responses at scale'

Matheus Facure 'Why

Matheus Facure 'Why Banking has the Coolest Stats/Data Science Problems'

Sky CH-Wang -

Sky CH-Wang - Do Androids Know They’re Only Dreaming of Electric Sheep?

visit shbcf.ru