Tutorial 9: Aggregated Classification Pipelines: Propagating Probabilistic Assumptions

Description:

NLP has helped massively scale up previously small-scale content analyses. Many social scientists train NLP classifiers and then measure social constructs (e.g., sentiment) for millions of unlabeled documents, which are then used as variables in downstream causal analyses. However, there are several points at which one can make hard (non-probabilistic) or soft (probabilistic) assumptions in pipelines that use text classifiers: (a) adjudicating training labels from multiple annotators, (b) training supervised classifiers, and (c) aggregating individual-level classifications at inference time. In practice, propagating these hard versus soft choices down the pipeline can dramatically change the values of final social measurements. In this tutorial, we will walk through data and Python code of a real-world social science research pipeline that uses NLP classifiers to infer many users’ aggregate “moral outrage” expression on Twitter. Along the way, we will quantify the sensitivity of our pipeline to these hard versus soft choices.
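To make the hard-versus-soft distinction concrete, here is a minimal Python sketch (not the tutorial's actual code) of the aggregation step (c): given a hypothetical classifier's predicted probabilities for one user's tweets, a hard choice thresholds each tweet to a binary label before averaging, while a soft choice averages the probabilities directly. The function names, threshold, and example probabilities are illustrative assumptions.

```python
# Minimal sketch of hard vs. soft aggregation of per-document classifier
# probabilities into a user-level measurement (e.g., fraction of a user's
# tweets expressing "moral outrage"). Names and values are hypothetical.
import numpy as np

def aggregate_hard(probs, threshold=0.5):
    """Hard choice: threshold each document to a 0/1 label, then average."""
    labels = (np.asarray(probs) >= threshold).astype(int)
    return float(labels.mean())

def aggregate_soft(probs):
    """Soft choice: average the predicted probabilities directly."""
    return float(np.mean(probs))

# Hypothetical predicted probabilities for one user's tweets.
probs = [0.9, 0.55, 0.45, 0.2, 0.05]
print(aggregate_hard(probs))  # 0.4
print(aggregate_soft(probs))  # 0.43
```

Even in this toy case the two choices give different estimates (0.40 vs. 0.43); the tutorial quantifies how such differences, made at each pipeline stage, propagate into the final social measurements.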

Tutorial host: Katherine Keith
