Dirk Hovy
@dirkhovy.bsky.social
📤 625
📥 332
📝 52
Professor
@milanlp.bsky.social
for
#NLProc
, compsocsci,
#ML
Also at
http://dirkhovy.com/
pinned post!
🚨(Software) Update: In my PhD, I had a side project to fix an annoying problem: when you ask 5 people to label the same thing, you often get different answers. But in ML (and lots of other analyses), you still need a single aggregated answer. Using the majority vote is easy–but often wrong. 1/N
loading . . .
GitHub - dirkhovy/MACE: Multi-Annotator Competence Estimation tool
Multi-Annotator Competence Estimation tool. Contribute to dirkhovy/MACE development by creating an account on GitHub.
https://github.com/dirkhovy/MACE
3 months ago
6
75
14
reposted by
Dirk Hovy
ACL Rolling Review (ARR)
16 days ago
🗓️ The ARR March review deadline is approaching: April 20 AoE. Finishing up your review? Run it through REVAS, a peer review assistant that makes your suggestions more actionable, flags unsupported claims, and grounds your feedback in the paper. 👉
revas.mbzuai.ac.ae
loading . . .
REVAS — AI-Powered Peer Review Feedback for Academics
REVAS analyzes the weakness section of your peer review, scoring each paragraph on actionability, helpfulness, grounding, and verifiability.
https://revas.mbzuai.ac.ae
0
3
5
reposted by
Dirk Hovy
MilaNLP Lab
26 days ago
#MemoryModay
#NLProc
Uma et al. (2020) highlights 'A Case for Soft Loss Functions' efficacy using soft labels & crowd annotations in AI tasks, outshining top-tier methods.
loading . . .
https://ojs.aaai.org/index.php/HCOMP/article/download/7478/7255/10850
0
5
3
reposted by
Dirk Hovy
26 days ago
To accommodate ACL decisions, we are further extending the commitment deadline for pre-reviewed ARR submissions to April 7!
add a skeleton here at some point
0
4
4
reposted by
Dirk Hovy
ACL
26 days ago
The paper acceptance notifications will be out by the 6th of April, AoE. The PCs are working hard throughout the holiday season to finalize the decisions. Apologies for the delay!
0
4
6
reposted by
Dirk Hovy
David Lazer
26 days ago
The deadline for submission to the Political Networks conference is this Friday. It's taking place Aug 4-7, in Manchester.
sites.google.com/view/confpol...
loading . . .
2026 Manchester
Application and registration
https://sites.google.com/view/confpolinetworks/
0
3
2
reposted by
Dirk Hovy
MilaNLP Lab
30 days ago
#TBT
#NLProc
'[MASK]? Making Sense of Language-Specific BERT Models' by
@deboranozza.bsky.social
, Bianchi &
@dirkhovy.bsky.social
(2020), explores language-specific vs universal BERT models.
loading . . .
https://arxiv.org/pdf/2003.02912
0
5
2
I realized how much DMing is like being a professor/chairing a committee. You: - make a brilliant plan for 2+ hours of fun - prep lots of material - immediately get derailed by questions/arguments/etc. - keep it together to make the most of the time together - end up not using most of the material
about 1 month ago
1
6
1
reposted by
Dirk Hovy
MilaNLP Lab
about 1 month ago
#MemoryModay
#NLProc
'Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers,' by Nguyen &
@dirkhovy.bsky.social
decodes speaker reviews for user preferences using topic models. Domain knowledge needed for market analysis.
loading . . .
Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers
Hanh Nguyen, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
https://aclanthology.org/D19-5510
0
5
2
reposted by
Dirk Hovy
Alexander Hoyle
about 1 month ago
I wrote a blog post on my experience using AI for slide generation Basic idea: write your lecture notes first, then prompt the LLM to produce corresponding slides in reveal.js (h/t
@chenhaotan.bsky.social
). I'm picky about my slides but was happy with the results!
alexanderhoyle.com/posts/ai-sli...
4
63
10
reposted by
Dirk Hovy
MilaNLP Lab
about 1 month ago
#TBT
#NLProc
Fornaciari,
@dirkhovy.bsky.social
's 'Identifying Linguistic Areas for Geolocation' explores using social media writing for geolocation via Point-to-City (P2C).
loading . . .
Identifying Linguistic Areas for Geolocation
Tommaso Fornaciari, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
https://aclanthology.org/D19-5530
0
4
2
Wish I could be at
@eaclmeeting.bsky.social
, but the lab is well represetned. If you are there, come and say hi!
add a skeleton here at some point
about 1 month ago
0
2
1
reposted by
Dirk Hovy
MilaNLP Lab
about 1 month ago
#MemoryModay
#NLProc
'Dense Node Representation for Geolocation' by Fornaciari &
@dirkhovy.bsky.social
reveals efficient geolocation methods using node2vec & doc2vec models. Greater network size, less parameters.
loading . . .
Dense Node Representation for Geolocation
Tommaso Fornaciari, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
https://aclanthology.org/D19-5529
0
4
2
reposted by
Dirk Hovy
MilaNLP Lab
about 1 month ago
#TBT
#NLProc
'Geolocation with Attention-Based Multitask Learning Models' by Tommaso Fornaciari,
@dirkhovy.bsky.social
(2019) reveals how online political talks can become one-sided. Breaking out of our bubbles!
#SocialMedia
loading . . .
Geolocation with Attention-Based Multitask Learning Models
Tommaso Fornaciari, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
https://aclanthology.org/D19-5528
0
3
2
reposted by
Dirk Hovy
Taha Yasseri
about 1 month ago
Chpater 8:
@dirkhovy.bsky.social
, M Gerondeau & J Globisz on text data and natural language processing. A very useful chapter on why text is such a rich source for CSS, and how NLP can help with exploration, prediction, and generation; if used thoughtfully and with clear research goals.
add a skeleton here at some point
2
3
2
reposted by
Dirk Hovy
Jeremy Foote
about 2 months ago
Just read this great piece -
paulgp.com/2026/03/16/r...
by
@paulgp.com
and it got me thinking. It feels like there is a lot of moral(?) ambiguity and ambivalence around the use of LLMs for academics. So far, I've avoided having LLMs do basically any of my research writing ...
loading . . .
https://paulgp.com/2026/03/16/research-in-time-of-ai.htmlIt
2
6
2
reposted by
Dirk Hovy
MilaNLP Lab
about 2 months ago
#MemoryModay
#NLProc
'Make Natural Language Processing About People Again' by
@dirkhovy.bsky.social
(2018) uncovers how AI models portray different religions and emotions.
#AIEthics
loading . . .
The Social and the Neural Network: How to Make Natural Language Processing about People again
Dirk Hovy. Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media. 2018.
https://aclanthology.org/W18-1106
0
7
5
reposted by
Dirk Hovy
MilaNLP Lab
about 2 months ago
#MemoryModay
#NLProc
'Comparing Bayesian Models of Annotation' by Paun et al. dives into corpus annotation, evaluating six models' predictiveness and accuracy. Essential for navigating annotators and item difficulties.
loading . . .
Comparing Bayesian Models of Annotation
Silviu Paun, Bob Carpenter, Jon Chamberlain, Dirk Hovy, Udo Kruschwitz, Massimo Poesio. Transactions of the Association for Computational Linguistics, Volume 6. 2018.
https://aclanthology.org/Q18-1040
0
8
2
reposted by
Dirk Hovy
MilaNLP Lab
about 2 months ago
📢 Call for Abstracts! Towards a Safer Web for Women (co-located with
#WebSci26
) 📍 Braunschweig 🇩🇪 | 26 May 2026 Theme: Preventive approaches to women’s online safety 🗓 Deadline: 27 March 2026 🔗
forms.gle/tYheEgSwGecf...
🌐
tsww26.github.io
0
5
5
reposted by
Dirk Hovy
MilaNLP Lab
about 2 months ago
#TBT
#NLProc
'Predicting News Headline Popularity' by Lamprinidis, Hardt,
@dirkhovy.bsky.social
(2018) shows neural networks perform similar to Logistic Regression in prediction.
loading . . .
Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning
Sotiris Lamprinidis, Daniel Hardt, Dirk Hovy. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 2018.
https://aclanthology.org/D18-1068
0
3
2
One of my favorite studies of the last few years! Great read (albeit with a side of worrying implications for surveys)
add a skeleton here at some point
about 2 months ago
0
6
2
One of my favorite interdisciplinary projects (with
@questoph.bsky.social
). Plus: colorful maps!
add a skeleton here at some point
about 2 months ago
0
3
1
reposted by
Dirk Hovy
MilaNLP Lab
about 2 months ago
#TBT
#NLProc
'Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting' by
@dirkhovy.bsky.social
and Christoph Purschke (2018) highlights how social class and background impact technology performance.
#TechInclusion
loading . . .
Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting
Dirk Hovy, Christoph Purschke. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 2018.
https://aclanthology.org/D18-1469
0
3
4
reposted by
Dirk Hovy
Tiancheng Hu
2 months ago
4/7 We argue these aren't separate bugs. They're four facets of the same problem: 🔴 Probabilistic — can't match requested distributions 🟠 Semantic — confidence ≠ correctness 🔵 Distributional — output diversity collapse 🟢 Metacognitive — can't assess its own competence
1
2
1
reposted by
Dirk Hovy
Tiancheng Hu
2 months ago
1/7 🧵 The GPT-4 technical report featured detailed calibration curves. Since then, not a single major model release has reported calibration. The field quietly stopped measuring whether models know what they don't know. Our new position paper argues this is a mistake. Here's why.
1
8
2
reposted by
Dirk Hovy
MilaNLP Lab
2 months ago
We were thrilled to host
@mtutek.bsky.social
at our lab last week. His talk "From Internals to Integrity: How Insights into Transformer LMs Improve Safety, Interpretability, and Explanation Faithfulness" led to great discussions! 👏
#Transformers
#AISafety
#ExplainableAI
#MLResearch
#NLProc
0
18
3
reposted by
Dirk Hovy
EACL 2026
2 months ago
Call for Virtual Registration Subsidies for
#EACL26
🌍 ⚠️ Not for paper registrants 📝 Apply by Feb 27, 2026 (AoE) 📩 Decisions by Mar 2, 2026
2026.eacl.org/calls/virtua...
Don’t register before hearing back if you apply!
loading . . .
Call for Virtual Registration Subsidies
Official website for the 2026 Conference of the European Chapter of the Association for Computational Linguistics
https://2026.eacl.org/calls/virtual-subsidies/
1
6
5
reposted by
Dirk Hovy
UKP Lab
2 months ago
🔎🧩 𝗕𝗲𝘆𝗼𝗻𝗱 𝗕𝗲𝗻𝗰𝗵𝗺𝗮𝗿𝗸𝘀: 𝗛𝗼𝘄 𝘁𝗼 𝗘𝘃𝗮𝗹𝘂𝗮𝘁𝗲 𝗠𝗲𝗻𝘁𝗮𝗹 𝗛𝗲𝗮𝗹𝘁𝗵 𝗔𝗜 𝗥𝗲𝘀𝗽𝗼𝗻𝘀𝗶𝗯𝗹𝘆 AI for mental health is a high-stakes area: its evaluation needs to meet the highest expectations. The new preprint 𝘙𝘦𝘴𝘱𝘰𝘯𝘴𝘪𝘣𝘭𝘦 𝘌𝘷𝘢𝘭𝘶𝘢𝘵𝘪𝘰𝘯 𝘰𝘧 𝘈𝘐 𝘧𝘰𝘳 𝘔𝘦𝘯𝘵𝘢𝘭 𝘏𝘦𝘢𝘭𝘵𝘩, written by an interdisciplinary team spanning AI [...]
1
3
3
reposted by
Dirk Hovy
Debora Nozza
2 months ago
Honored to give my first keynote at
#IRCDL2026
on February 19th. I’ll talk about how LLMs have shifted from productivity tools to everyday sources of info & personal guidance and what that means for risk, trust, bias, and alignment.
ircdl2026.unimore.it
0
14
2
reposted by
Dirk Hovy
Cambridge University Press Political Science & IR
2 months ago
#OpenAccess from
@politicsgenderj.bsky.social
- Male Agency? Analyzing Fatherhood Roles in Swedish Parliamentary Documents, 1993–2021 - https://cup.org/40el36q - Lena Wängnerud, Elin Naurin,
@dirkhovy.bsky.social
#OpenAccess
Lorenzo Lupo & Oscar Magnusson
#FirstView
0
6
2
reposted by
Dirk Hovy
MilaNLP Lab
3 months ago
#MemoryModay
#NLProc
@gattanasio.cc
et al. study asks 'Is It Worth the (Environmental) Cost?' analyzing continuous training for language models. Balances benefits, environmental impacts, for responsible use.
#AI
#Sustainability
arxiv.org/pdf/2210.07365
loading . . .
https://arxiv.org/pdf/2210.07365
0
7
4
reposted by
Dirk Hovy
Étienne Ollion
3 months ago
What are the main issues discussed in a set of documents? We’ve just released a step-by-step BERTopic tutorial. We also launch a new page, gathering various NLP tutorials for social scientists. 👉
www.css.cnrs.fr/tutorials-an...
loading . . .
Tutorials and Resources – CSS @ IP-Paris
Site web de l'axe sciences sociales computationnelles du CREST-CNRS. Cours et tutoriels pour l'analyse des données numériques en sciences sociales.
https://www.css.cnrs.fr/tutorials-and-resources/
3
49
25
reposted by
Dirk Hovy
David Mimno
3 months ago
Citation is the foundation of academic promotion. It’s noisy, sure, but its integrity is worth fighting for. Hallucinated citations should be a desk reject.
add a skeleton here at some point
1
27
5
reposted by
Dirk Hovy
David Jurgens
3 months ago
The second new class I'm teaching is a very experimental graduate level seminar in CSE: "Building Small Language Models". I taught the grad level NLP class last semester (so fun!) but students wanted more—which of these new ideas work, and which work for SLMs?
jurgens.people.si.umich.edu/CSE598-004/
loading . . .
CSE 598-004 - Building Small Language Models
http://jurgens.people.si.umich.edu/CSE598-004/
2
32
10
reposted by
Dirk Hovy
MilaNLP Lab
3 months ago
🎉 MilaNLP 2025 Wrapped 🎉 Lots of learning, building , sharing, and growing together 🌱
#NLProc
0
10
4
reposted by
Dirk Hovy
MilaNLP Lab
3 months ago
⏳ Deadline approaching! We’re hiring 2 fully funded postdocs in
#NLP
. Join the MilaNLP team and contribute to our upcoming research projects (SALMON & TOLD) 🔗 Details + how to apply:
milanlproc.github.io/open_positio...
⏰ Deadline: Jan 31, 2026
0
11
11
🚨(Software) Update: In my PhD, I had a side project to fix an annoying problem: when you ask 5 people to label the same thing, you often get different answers. But in ML (and lots of other analyses), you still need a single aggregated answer. Using the majority vote is easy–but often wrong. 1/N
loading . . .
GitHub - dirkhovy/MACE: Multi-Annotator Competence Estimation tool
Multi-Annotator Competence Estimation tool. Contribute to dirkhovy/MACE development by creating an account on GitHub.
https://github.com/dirkhovy/MACE
3 months ago
6
75
14
New year, new job? If that is your current mantra, check the open postdoc positions with Debora Nozza and me at our lab. Deadline is January 31st.
milanlproc.github.io/open_positio...
loading . . .
Postdoctoral Researcher – NLP (2 positions) | MilaNLP Lab @ Bocconi University
Two Postdoctoral Researcher positions – Deadline January 31st, 2026
https://milanlproc.github.io/open_positions/postdoc_tef/
3 months ago
0
11
11
reposted by
Dirk Hovy
MilaNLP Lab
4 months ago
🚀 We’re opening 2 fully funded postdoc positions in
#NLP
! Join the MilaNLP team and contribute to our upcoming research projects. 🔗 More details:
milanlproc.github.io/open_positio...
⏰ Deadline: Jan 31, 2026
0
19
15
Happy to have contributed to this
add a skeleton here at some point
4 months ago
0
3
0
reposted by
Dirk Hovy
MilaNLP Lab
4 months ago
#MemoryModay
#NLProc
Countering Hateful and Offensive Speech Online - Open Challenges" by Plaza-Del-Arco, @debora_nozza, Guerini, Sorensen, Zampieri, 2024 is a tutorial on the challenges and solutions for detecting and mitigating hate speech.
loading . . .
Countering Hateful and Offensive Speech Online - Open Challenges
Flor Miriam Plaza-del-Arco, Debora Nozza, Marco Guerini, Jeffrey Sorensen, Marcos Zampieri. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts.…
https://aclanthology.org/2024.emnlp-tutorials.2/
0
4
2
reposted by
Dirk Hovy
MilaNLP Lab
5 months ago
#MemoryModay
#NLProc
Uma, A. N. et al. examine AI model training in 'Learning from Disagreement: A Survey'. Disagreement-handling methods' performance is shaped by evaluation methods & dataset traits.
loading . . .
https://jair.org/index.php/jair/article/view/12752
0
4
2
reposted by
Dirk Hovy
MilaNLP Lab
4 months ago
#TBT
#NLProc
#MachineLearning
#SafetyFirst
'Safety-Tuned LLaMAs: Improving LLMs Safety' by Bianchi et al. explores training LLMs for safe refusals, warns of over-tuning.
loading . . .
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large...
Training large language models to follow instructions makes them perform better on a wide range of tasks and generally become more helpful. However, a perfectly helpful model will follow even the most...
https://arxiv.org/abs/2309.07875
0
4
2
Come work with
@deboranozza.bsky.social
, me, and the lab in Milan!
add a skeleton here at some point
4 months ago
0
6
3
reposted by
Dirk Hovy
Women in AI Research - WiAIR
5 months ago
We don't actually trust AI. We trust the companies behind it. As Maria Antoniak notes, every "private" chat flows through corporate systems with long histories of data misuse. If we care about AI ethics, we need to name power, not anthropomorphize models.
loading . . .
1
54
18
reposted by
Dirk Hovy
jake hofman
5 months ago
We're hiring interns in the Computational Social Science group at Microsoft Research NYC! If you're interested in designing AI‑based systems and understanding their impact at both individual and societal scales, apply here by Jan 9, 2026:
apply.careers.microsoft.com/careers/job/...
loading . . .
Research Intern - Computational Social Science | Microsoft Careers
Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world's best researchers, Research Interns learn, collaborate, and network for life. Researc...
https://apply.careers.microsoft.com/careers/job/1970393556639564
0
21
18
After I shared “How to professor” last year, some people asked for a similar post on writing. Now I finally got around to typing up our lab's writing workshop slides. It covers basic advice for research papers and grant applications. Curious? Read it here:
dirkhovy.com/post/2025_11...
loading . . .
How to Write Gooder | Dirk Hovy
After publishing “ How to professor”, several people said they found it helpful, and asked whether I had a similar post on writing. Luckily, we have held an annual writing workshop in the lab for the last few years, so there already was a presentation.
https://dirkhovy.com/post/2025_11_25/
5 months ago
1
12
3
reposted by
Dirk Hovy
MilaNLP Lab
5 months ago
#TBT
#NLProc
'Respectful or Toxic?' by Plaza-del-Arco, @debora &
@dirkhovy.bsky.social
(2023) explores zero-shot learning for multilingual hate speech detection. Highlights prompt & model choice for accuracy.
#AI
#LanguageModels
#HateSpeechDetection
loading . . .
Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech
Flor Miriam Plaza-del-arco, Debora Nozza, Dirk Hovy. The 7th Workshop on Online Abuse and Harms (WOAH). 2023.
https://aclanthology.org/2023.woah-1.6
0
2
2
reposted by
Dirk Hovy
MilaNLP Lab
5 months ago
#MemoryModay
#NLProc
'Leveraging Social Interactions to Detect Misinformation on Social Media' by Fornaciari et al. (2023) uses combined text and network analysis to spot unreliable threads.
loading . . .
https://arxiv.org/pdf/2304.02983
0
3
2
reposted by
Dirk Hovy
Manoel Horta Ribeiro
5 months ago
The Center for Information Technology Policy at Princeton invites applications for a Postdoctoral Fellow to work with Andy Guess (Politics/SPIA), Brandon Stewart (Sociology), and me (CS).
puwebp.princeton.edu/AcadHire/app...
Please apply before Sunday, the 13th of December!
0
16
10
reposted by
Dirk Hovy
MilaNLP Lab
5 months ago
#MemoryModay
#NLProc
'Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models' by
@paul-rottger.bsky.social
et al. (2022). A suite of tests for 10 languages.
loading . . .
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models
Paul Röttger, Haitham Seelawi, Debora Nozza, Zeerak Talat, Bertie Vidgen. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022.
https://aclanthology.org/2022.woah-1.15
0
3
2
Load more
feeds!
log in