Document Details


2510.17558v1.pdf
Clip: The Free Transformer. François Fleuret, FAIR at Meta. We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such a conditioning translates into substantial improvements on downstream tasks. Since their invention, the Transformer (Vaswani et al.), and more specifically the decoder-only Transformers used originally for the GPT series of models (Radford et al.), have become the core components of AI systems.
Filename: 2510.17558v1.pdf
Filetype: application/pdf
Size: 705469 bytes
Uploaded On: 2025-10-24
Abstract:
Summary:
Tags:
Notes:
Visible: 1
Status: Parsed
Author: François Fleuret
Creator: arXiv GenPDF (tex2pdf:e76afa9)
DOI: https://doi.org/10.48550/arXiv.2510.17558
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
PTEX.Fullbanner: This is pdfTeX, Version 3.141592653-2.6-1.40.28 (TeX Live 2025) kpathsea version 6.4.1
Producer: pikepdf 8.15.1
Title: The Free Transformer
Trapped: False
ArXivID: https://arxiv.org/abs/2510.17558v1
Pages: 18
