MT Group at FBK
@fbk-mt.bsky.social
189
174
45
#MachineTranslation
Research Unit @ Fondazione Bruno Kessler
#nlproc
#deeplearning
#ai
mt.fbk.eu
Our very own
@sarapapi.bsky.social
presenting FAMA at
#clicit2025
: Paper:
clic2025.unica.it/wp-content/u...
Models:
hf.co/collections/...
Data:
hf.co/datasets/FBK...
Code:
github.com/hlt-mt/FBK-f...
Joint work with
@speechtekfbk.bsky.social
1 day ago
0
4
2
reposted by
MT Group at FBK
3 days ago
Excited to present FAMA, the first large-scale
#OpenScience
#Speech
foundation model for Italian & English, at
#clicit2025
(17:30–18:45 oral session)! Models:
hf.co/collections/...
Data:
hf.co/datasets/FBK...
Code:
github.com/hlt-mt/FBK-f...
Preprint:
arxiv.org/pdf/2505.22759
0
7
2
reposted by
MT Group at FBK
DH Group at FBK
3 days ago
We are on our way to Casteddu for
#clicit2025
with a guest from
@fbk-mt.bsky.social
@ailc-nlp.bsky.social
0
7
6
Our pick of the week by
@sarapapi.bsky.social
: "Retrieval-Augmented Generation for AI-Generated Content: A Survey" by Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui.
arxiv.org/pdf/2402.19473
#RAG
#survey
8 days ago
0
0
0
Our pick of the week by Marco Gaido: "Context-Driven Dynamic
#Pruning
for Large
#Speech
#Foundation
Models" by Masao Someki, Shikhar Bharadwaj, Atharva Anand Joshi, Chyi-Jiunn Lin, Jinchuan Tian, Jee-weon Jung,
@shinjiw.bsky.social
, et al.
#INTERSPEECH2025
.
arxiv.org/abs/2505.18860
Context-Driven Dynamic Pruning for Large Speech Foundation Models
Speech foundation models achieve strong generalization across languages and acoustic conditions, but require significant computational resources for inference. In the context of speech foundation mode...
https://arxiv.org/abs/2505.18860
14 days ago
0
2
0
Our pick of the week by
@zhihangxie.bsky.social
: "SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation" by Chenyang Le, Bing Han, Jinshun Li, Songyong Chen, and Yanmin Qian (2025)
#Speech
#Simultaneous
#Translation
#MOE
#SpeechTech
24 days ago
0
0
0
Our pick of the week by
@beomseok-lee.bsky.social
: "Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, and Helen Meng (EMNLP 2025)
30 days ago
0
2
0
Our pick of the week by
@linaconti.bsky.social
: "I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2"
@jackmerullo.bsky.social
, Arjun Khurana, Oliver McLaughlin (ICML 2025 Workshop on Assessing World Models)
arxiv.org/abs/2508.02527
#XAI
#LLM
about 1 month ago
0
0
0
Heading home after an exciting and intense
@aclmeeting.bsky.social
in Vienna! We had a great time presenting our work and connecting with the community. Thanks to everyone who came by!
#acl2025
#nlproc
(1/6)
about 2 months ago
1
5
1
Before presenting our speech model compression task at IWSLT, our pick of the week by Marco Gaido: WhisperKit
arxiv.org/abs/2507.10860
by Atila Orhon, Arda Okan, Berkin Durmus, Zach Nagengast and
@eduardo-pacheco.bsky.social
(ICML 2025), an early attempt to bring large-scale models to edge devices
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers
Real-time Automatic Speech Recognition (ASR) is a fundamental building block for many commercial applications of ML, including live captioning, dictation, meeting transcriptions, and medical scribes. ...
https://arxiv.org/abs/2507.10860
2 months ago
0
2
0
Our pick of the week by
@zhihangxie.bsky.social
: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
3 months ago
0
0
0
Our pick of the week by
@zhihangxie.bsky.social
: "PHRASED: Phrase Dictionary Biasing for Speech Translation" by Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li
arxiv.org/abs/2506.09175
#speech
#AI
#ST
#NLP
3 months ago
0
2
0
reposted by
MT Group at FBK
GITT 2025
3 months ago
@bsavoldi.bsky.social
taking us back in time at
#GITT2025
focusing on the first discussions of gender bias in language technology as a socio-technical issue. No, the problem hasn't been fixed yet. But what has happened?
6
7
3
reposted by
MT Group at FBK
GITT 2025
3 months ago
Last but definitely not least:
@bsavoldi.bsky.social
presenting joint work with
@apierg.bsky.social
@matteo-negri.bsky.social
@luisabentivogli.bsky.social
on scalable gender neutral translation evaluation using LLM-as-a-judge at
#GITT2025
6
10
4
Our pick of the week by
@dennisfucci.bsky.social
: "Speech Representation Analysis Based on Inter- and Intra-Model Similarities" by
@yelkheir.bsky.social
,
@ratedali.bsky.social
, and Shammur Absar Chowdhury (ICASSP Workshops 2024)
3 months ago
0
4
1
Our pick of the week by
@beomseok-lee.bsky.social
: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, Yingzhi Wang, Mirco Ravanelli, and Cem Subakan (2025)
arxiv.org/abs/2505.19937
#SLU
#speech
#multimodal
#LLM
4 months ago
0
2
0
Our pick of the week by
@apierg.bsky.social
: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" by Arjun Subramonian,
@dippedrusk.com
, Preethi Seshadri, Dietrich Klakow, Kai-Wei Chang, and Yizhou Sun
#LLM
#NLProc
#fairness
4 months ago
0
3
0
reposted by
MT Group at FBK
Beatrice Savoldi
4 months ago
We are studying how AI is used in Italy, and to do so we have built a survey!
bit.ly/sondaggio_ai...
(It is anonymous, takes ~10 minutes, and if you take part or share it you help us a lot.) We are also interested in reaching people who do not work on AI and are not AI experts!
https://bit.ly/sondaggio_ai_rita
1
16
18
reposted by
MT Group at FBK
4 months ago
New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in English and Italian. The models are live and ready to try on @hf.co:
huggingface.co/collections/...
Preprint:
arxiv.org/abs/2505.22759
#ASR
#ST
#OpenScience
#MultilingualAI
FAMA - a FBK-MT Collection
The First Large-Scale Open-Science Speech Foundation Model for English and Italian
https://huggingface.co/collections/FBK-MT/fama-683425df3fb2b3171e0cdc9e
0
7
3
Our pick of the week by
@linaconti.bsky.social
: "Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically" by @soheunshim.bsky.social, Domenico De Cristofaro, Chengzhi Martin Hu, @alessandrovietti.bsky.social,
@barbaraplank.bsky.social
#AI
#XAI
#speech
#nlproc
4 months ago
0
4
0
Our pick of the week by
@bsavoldi.bsky.social
: "Lost in Translation: Artificial Intelligence and the Demand for Foreign Language Skills" by
@pmllanos.bsky.social
and
@carlbfrey.bsky.social
(2025)
oxfordmartin.ox.ac.uk/publications...
#AI
#translation
#MT
4 months ago
0
3
0
reposted by
MT Group at FBK
Dennis Fucci
4 months ago
So happy our paper “Different Speech Translation Models Encode and Translate Speaker Gender Differently” was accepted at
#ACL2025
(main)! The preprint will be out soon!
#SpeechTranslation
#GenderBias
#Interpretability
@aclmeeting.bsky.social
1
7
1
Excited to share that our
@sarapapi.bsky.social
has won the 2024 Best PhD Award from the Information and Engineering Doctoral School for her thesis “Direct Speech Translation in Constrained Contexts: The Simultaneous and Subtitling Scenarios.”
#nlproc
#speech
#speechprocessing
#speechtranslation
5 months ago
0
6
2
One week left to apply for a fully funded 3-year PhD position in our group! Automatic translation with large multimodal models:
iecs.unitn.it/education/ad...
Full details for application:
iecs.unitn.it/education/ad...
Deadline: 12 May 2025, 4pm CEST
#NLProc
#FBK
5 months ago
0
4
4
reposted by
MT Group at FBK
GITT 2025
5 months ago
Dreaming of attending
#GITT2025
but need a little extra boost? Bursary applications to support participation are now open at
tinyurl.com/gitt25
Deadline May 9th. Thanks to our incredible sponsors DCA at Tilburg University
tinyurl.com/tudca25
and FLW at Ghent University
www.ugent.be/lw/en
1
7
7
Come and join our group! We offer a fully funded 3-year PhD position: Automatic translation with large multimodal models:
iecs.unitn.it/education/ad...
Full details for application:
iecs.unitn.it/education/ad...
Deadline May 12, 2025
#NLProc
#FBK
Reserved topic scholarships | Doctoral Program - Information Engineering and Computer Science
https://iecs.unitn.it/education/admission/reserved-topic-scholarships#A4
5 months ago
1
10
10
reposted by
MT Group at FBK
Andrea Piergentili
5 months ago
Happy to announce that our paper 'An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation' was accepted at
@gitt-workshop.bsky.social
! Check it out:
arxiv.org/abs/2504.11934
Co-authors:
@bsavoldi.bsky.social
,
@matteo-negri.bsky.social
,
@luisabentivogli.bsky.social
An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation
Gender-neutral translation (GNT) aims to avoid expressing the gender of human referents when the source text lacks explicit cues about the gender of those referents. Evaluating GNT automatically is pa...
https://arxiv.org/abs/2504.11934
0
10
3
Our pick of the week by
@mgaido91.bsky.social
: "OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis" by Luo et al. (2025)
#SpeechProcessing
#LLM
#SFM
#NLProc
#speechtech
#audio
5 months ago
0
3
0
reposted by
MT Group at FBK
IWSLT
5 months ago
Deadline update: To accommodate requests, we have extended the test period for *all tasks* until *April 19th.* System description papers are now due *April 24th.* Thank you and good luck to all participants!!
0
2
1
Our pick of the week by
@zhihangxie.bsky.social
: "Bridging Speech and Text Foundation Models with ReShape Attention" by Takatomo Kano,
@wanchichen.bsky.social
,
@shinjiw.bsky.social
, et al.
#ICASSP2025
ieeexplore.ieee.org/document/108...
#FoundationModel
#SpeechProcessing
6 months ago
0
3
0
reposted by
MT Group at FBK
IWSLT
6 months ago
The evaluation period has begun for our shared tasks! The test data is now available on our website, and submissions are due Tuesday April 15! Please email task organizers or the Google group with any questions.
0
6
4
Our pick of the week by
@beomseok-lee.bsky.social
: "AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM" by Ruchao Fan, Bo Ren, Yuxuan Hu, Rui Zhao, Shujie Liu, and Jinyu Li (2024).
#speech
#LLM
#speechlmm
#zeroshot
#instructionfollowing
6 months ago
0
1
0
reposted by
MT Group at FBK
6 months ago
The evaluation period of the Instruction Following task at
@iwslt.bsky.social
just started! Consider submitting your speech-to-text system! The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project (
www.meetween.eu
)!
iwslt2025.speechm.cloud.cyfronet.pl
https://iwslt2025.speechm.cloud.cyfronet.pl
0
9
5
Our pick of the week by
@apierg.bsky.social
: "Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation" by José Pombal,
@nunonmg.bsky.social
, Ricardo Rei, and
@andre-t-martins.bsky.social
(2025).
#mt
#translation
#metric
#machinetranslation
6 months ago
0
6
0
reposted by
MT Group at FBK
stek_fbk
6 months ago
Sharing the red carpet with
@luisabentivogli.bsky.social
@fbk-mt.bsky.social
0
2
1
reposted by
MT Group at FBK
stek_fbk
6 months ago
@alessiobrutti.bsky.social
is on the red carpet on his way to the Assembly of Members of the Alliance for Language Technology
#ALTEDIC
and the kick-off of the LLMs4EU project: towards preserving our cultural and language diversity and giving European citizens open and ethical
#languagetechnologies
1
3
2
Our pick of the week by
@dennisfucci.bsky.social
: "Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?" by
@ytsiamas.bsky.social
, Matthias Sperber, Andrew Finch, and Sarthak Garg (WMT, 2024).
#speech
#translation
#WMT
#NLProc
#prosody
7 months ago
0
3
0
reposted by
MT Group at FBK
Dennis Fucci
7 months ago
Had a great time visiting
@milanlp.bsky.social
last week and sharing some of my latest work on
#XAI
in
#SpeechTranslation
! Thanks for having me!
0
4
1
Our pick of the week by
@linaconti.bsky.social
: "MuPe Life Stories Dataset: Spontaneous Speech in Brazilian Portuguese with a Case Study Evaluation on ASR Bias against Speakers Groups and Topic Modeling" by Evaldo Leal et al.,
#COLING2025
#speech
#bias
#brazilian
#portuguese
#NLProc
7 months ago
0
2
0
reposted by
MT Group at FBK
GITT 2025
7 months ago
Just a little more time? We got you!
#GITT2025
DEADLINE EXTENSION until 16 March (AOE). Join us 23 June at the
@mtsummit2025.bsky.social
in Geneva. Attend GITT as an independent workshop, or register for the full MT Summit. All info at
sites.google.com/tilburgunive...
See you there?
0
6
4
Our pick of the week by
@bsavoldi.bsky.social
: "How Culture Shapes What People Want From AI" by Xiao Ge, Chunchen Xu, Hazel Rose Markus, and Jeanne L Tsai (CHI 2024)
#culture
#ai
#hci
7 months ago
0
2
0
reposted by
MT Group at FBK
GITT 2025
7 months ago
While we look forward to a sunny Geneva, why wait to join the conversation? We've created a starter pack for our
#GITT2025
friends! Follow researchers working on gender bias in MT. Stay up to date and dive into the discussion! All info at
sites.google.com/tilburgunive...
1
22
17
reposted by
MT Group at FBK
GITT 2025
7 months ago
3rd
#CFP
for
#GITT2025
at
@mtsummit2025.bsky.social
on 23 June 2025. Deadline: 10 March. Topics: gender bias in translation and MT, inclusive language, mitigation strategies, and more. Research papers, abstracts, communications & potluck presentations welcome! All info on
sites.google.com/tilburgunive...
1
10
10
Our pick of the week by
@sarapapi.bsky.social
: "Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction" by StepFun (2025).
#speech
#speechtech
#audio
7 months ago
0
3
1
reposted by
MT Group at FBK
GITT 2025
8 months ago
As if you needed more reasons to submit to
#GITT2025
: Cristina Anselmi, video game
#localization
&
#AI
expert with a focus on
#inclusive
#language
will be our keynote speaker! Registration fees are on the MTSummit website and you can register just for GITT if you so choose. See you there!
0
13
10
Our pick of the week by
@mgaido91.bsky.social
: "AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM" by Ruchao Fan, Bo Ren, Yuxuan Hu, Rui Zhao, Shujie Liu, Jinyu Li (2024).
#NLProc
#Speech
#instructionfollowing
#zeroshot
#speechtech
#speechllm
7 months ago
0
2
1
Very interesting discussion at the AISV 2025 workshop on new ASR technologies. Together with
@speechtekfbk.bsky.social
our group presented the ongoing work on building an open-source speech foundation model for Italian and English.
www.youtube.com/live/i4x7w8f...
Lo stato dell'arte nelle tecnologie per il riconoscimento del parlato
YouTube video by Università degli Studi di Urbino Carlo Bo
https://www.youtube.com/live/i4x7w8fIIXo
8 months ago
0
2
1
Great interest in our poster at the AISV 2025 conference! Our PhD student
@dennisfucci.bsky.social
is presenting his research on relevant frequency patterns in vowel detection for automatic speech recognition.
8 months ago
0
3
1
Our pick of the week by
@zhihangxie.bsky.social
: "When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation" by Anna Min et al. (2025).
arxiv.org/abs/2502.00377
8 months ago
0
2
1
reposted by
MT Group at FBK
IWSLT
8 months ago
Today's task: model compression!! New at IWSLT! But no less exciting. Goal: Compress a large, general-purpose multimodal model, making speech translation more efficient, deployable, and sustainable, while preserving translation quality.
#AI
#SpeechTech
#ModelCompression
#LLMcompression
1
8
5