Preprint

Where Does Authorship Signal Emerge in Encoder-Based Language Models?

Authorship attribution models fine-tuned with the same pretrained encoder, data, and loss can differ four-fold in performance depending …

Francis Kulumba, Guillaume Vimont, Laurent Romary, Florian Cafiero

Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models

Backdoor attacks pose significant security risks for Large Language Models (LLMs), yet the internal mechanisms by which triggers …

Théo Lasnier, Wissam Antoun, Francis Kulumba, Djamé Seddah

Language-Switching Triggers Take a Latent Detour Through Language Models

Backdoor attacks on language models pose a growing security concern, yet the internal mechanisms by which a trigger sequence hijacks …

Francis Kulumba, Wissam Antoun, Théo Lasnier, Djamé Seddah, Benoît Sagot

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

French language models, such as CamemBERT, have been widely adopted across industries for natural language processing (NLP) tasks, with …

Wissam Antoun, Francis Kulumba, Rian Touchent, Éric de la Clergerie, Benoît Sagot, Djamé Seddah

HALvest-Contrastive: Retrieval-Like Authorship Attribution with Patch-Level Late Interaction

Deciding whether two pieces of text share an author is made difficult by topical confound: two writers covering the same topic often …

Francis Kulumba, Wissam Antoun, Guillaume Vimont, Laurent Romary, Florian Cafiero