Francis Kulumba
Preprint
Where Does Authorship Signal Emerge in Encoder-Based Language Models?
Authorship attribution models fine-tuned with the same pretrained encoder, data, and loss can differ four-fold in performance depending …
Francis Kulumba, Guillaume Vimont, Laurent Romary, Florian Cafiero
Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models
Backdoor attacks pose significant security risks for Large Language Models (LLMs), yet the internal mechanisms by which triggers …
Théo Lasnier, Wissam Antoun, Francis Kulumba, Djamé Seddah
Language-Switching Triggers Take a Latent Detour Through Language Models
Backdoor attacks on language models pose a growing security concern, yet the internal mechanisms by which a trigger sequence hijacks …
Francis Kulumba, Wissam Antoun, Théo Lasnier, Djamé Seddah, Benoît Sagot
CamemBERT 2.0: A Smarter French Language Model Aged to Perfection
French language models, such as CamemBERT, have been widely adopted across industries for natural language processing (NLP) tasks, with …
Wissam Antoun, Francis Kulumba, Rian Touchent, Éric de la Clergerie, Benoît Sagot, Djamé Seddah
HALvest-Contrastive: Retrieval-Like Authorship Attribution with Patch-Level Late Interaction
Deciding whether two pieces of text share an author is made difficult by topical confound: two writers covering the same topic often …
Francis Kulumba, Wissam Antoun, Guillaume Vimont, Laurent Romary, Florian Cafiero