Jenna Russell
@jennarussell.bsky.social
959
390
28
CS PhD Student @ UMD | Undergrad @ Cornell
https://jenna-russell.github.io/
AI is already at work in American newsrooms. We examine 186k articles published this summer and find that ~9% are either fully or partially AI-generated, usually without readers having any idea. Here's what we learned about how AI is influencing local and national journalism:
18 days ago
5
54
31
reposted by
Jenna Russell
Chau Minh Pham
5 months ago
What if you gave an LLM thousands of random human-written paragraphs and told it to write something new -- while copying 90% of its output from those texts? You get what we call a Frankentext! Frankentexts are surprisingly coherent and tough for AI detectors to flag.
1
33
9
reposted by
Jenna Russell
Shana Gadarian
7 months ago
International students will stop coming to American universities if their visas are going to be at risk. This will make our intellectual community poorer and also make tuition more expensive for domestic students.
7
591
181
reposted by
Jenna Russell
John Skiles Skinner
8 months ago
There is a quasi-religion in Silicon Valley that views AI as godlike. This faith has always been parallel to Evangelical Christianity: salvation (transhumanism), the rapture (the technological singularity), and demons (Roko's Basilisk) Lately the AI faith has fully fused with Christian Nationalism.
102
5997
1686
reposted by
Jenna Russell
Yixiao Song
8 months ago
Introducing BEARCUBS, a "small but mighty" dataset of 111 QA pairs designed to assess computer-using web agents in multimodal interactions on the live web! Humans achieve 85% accuracy; OpenAI Operator: 24%; Anthropic Computer Use: 14%; Convergence AI Proxy: 13%
1
12
8
reposted by
Jenna Russell
Yekyung Kim
8 months ago
Is the needle-in-a-haystack test still meaningful given the giant green heatmaps in modern LLM papers? We create ONERULER, a multilingual long-context benchmark that allows for nonexistent needles. Turns out NIAH isn't so easy after all! Our analysis across 26 languages:
1
14
8
reposted by
Jenna Russell
Chau Minh Pham
9 months ago
Current methods for generating instruction-following data fall short for long-range reasoning tasks like narrative claim verification. We present CLIPPER, a compression-based pipeline that produces grounded instructions for ~$0.5 each, 34x cheaper than human annotations.
1
21
10
People often claim they know when ChatGPT wrote something, but are they as accurate as they think? Turns out that while the general population is unreliable, those who frequently use ChatGPT for writing tasks can spot even "humanized" AI-generated text with near-perfect accuracy.
10 months ago
10
188
85
reposted by
Jenna Russell
brendan o'connor
12 months ago
We're hiring new #nlp faculty this year! Asst or Assoc Professors in NLP at UMass CICS -- careers.umass.edu/amherst/en-u...
Details - Assistant/Associate Professor - Natural Language Processing (NLP) | Human Resources | UMass Amherst
https://careers.umass.edu/amherst/en-us/job/525002/assistantassociate-professor-natural-language-processing-nlp
1
66
34
reposted by
Jenna Russell
Marzena Karpinska
12 months ago
If you are at #EMNLP2024 you should really check out this work from our lab: github.com/Yixiao-Song/... (poster: Tue 4:00-5:30) If you aren't, you should still read the paper! It's a great metric to use and build upon!
GitHub - Yixiao-Song/VeriScore
Contribute to Yixiao-Song/VeriScore development by creating an account on GitHub.
https://github.com/Yixiao-Song/VeriScore
1
8
2
reposted by
Jenna Russell
Yapei Chang
12 months ago
Heading to #EMNLP2024 tmr, presenting PostMark on Tue. morning! arxiv.org/abs/2406.14517
Aside from this, I'd love to chat about:
• long-context training
• realistic & hard eval
• synthetic data
• tbh any cool projects people are working on
Also, I'm on the lookout for a summer 2025 internship!
0
6
4
reposted by
Jenna Russell
Chau Minh Pham
12 months ago
Long-form text generation with multiple stylistic and semantic constraints remains largely unexplored. We present Suri: a dataset of 20K long-form texts & LLM-generated, backtranslated instructions with complex constraints.
arxiv.org/abs/2406.19371
9
36
7
reposted by
Jenna Russell
Marzena Karpinska
12 months ago
I will be presenting our paper on LMs' performance on long-context reasoning tasks at #EMNLP2024 (Tue 16:00-17:30; riverfront hall) Come and chat with us!
2
20
5