#frAIday: Transformer-based variables for estimating hate crimes in police reports

Friday 20 September, 2024 at 12:15 - 13:00

Galaxen & Zoom

Estimating population totals from large textual datasets is a recurring challenge in official statistics, because manually annotating every document is impractical. We propose a method that combines predictions from a transformer encoder neural network with well-established survey sampling estimators: a classifier is trained first, and its predictions are then used as an auxiliary variable in the estimators. The applicability is demonstrated on Swedish hate crime statistics, which are based on Swedish police reports, of which approximately 1.5 million are filed annually.
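
The announcement does not say which estimator or sampling design the talk uses. As a rough illustration of the general idea only, the sketch below assumes simple random sampling without replacement and the classical difference estimator: the classifier scores every report, a small sample is annotated by hand, and the sample residuals correct the sum of the predictions. The function names and the synthetic data are hypothetical and are not taken from the talk.

```python
import random


def difference_estimator(predictions, sample_indices, sample_labels):
    """Difference estimator of a population total.

    Assumes simple random sampling without replacement (an assumption,
    not necessarily the design used in the talk).
    predictions    : model score for every unit in the population
    sample_indices : indices of the manually annotated units
    sample_labels  : manual labels (0 or 1) for those units
    """
    N = len(predictions)
    n = len(sample_indices)
    prediction_total = sum(predictions)
    # Residuals measure how far the model is off on the annotated sample.
    residual_sum = sum(
        y - predictions[i] for i, y in zip(sample_indices, sample_labels)
    )
    # Scale the sample residuals up to the population and correct the total.
    return prediction_total + (N / n) * residual_sum


def simulate(seed=0, N=10_000, n=500):
    """Run the estimator on synthetic data (purely illustrative numbers)."""
    rng = random.Random(seed)
    # Synthetic ground truth: roughly 2% of units are positive.
    truth = [1 if rng.random() < 0.02 else 0 for _ in range(N)]
    # Synthetic, imperfect classifier scores clipped to [0, 1].
    predictions = [
        min(1.0, max(0.0, 0.8 * y + 0.05 + rng.gauss(0, 0.05))) for y in truth
    ]
    sample_indices = rng.sample(range(N), n)
    sample_labels = [truth[i] for i in sample_indices]
    estimate = difference_estimator(predictions, sample_indices, sample_labels)
    return estimate, sum(truth)


if __name__ == "__main__":
    estimate, true_total = simulate()
    print(f"estimated total: {estimate:.1f}, true total: {true_total}")
```

When the predictions are perfect the residuals vanish and the estimate equals the true total; the better the classifier, the smaller the correction that the annotated sample has to supply.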

If you are not already registered with #frAIday, you can do so here to receive the Zoom link

Organiser: Centre for Transdisciplinary AI

Event type: Lecture

Hannes Waldetoft is a PhD student in statistics at Uppsala University, currently working with textual data and machine learning.

Contact

Henry Lopez Vega