Document Details
Clip:
The Free Transformer. François Fleuret, FAIR at Meta. We propose an extension of the decoder Transformer that conditions its generative process on random latent variables, which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such conditioning translates into substantial improvements on downstream tasks. Since their invention, the Transformer (Vaswani et al.), and more specifically the decoder-only Transformers used originally for the GPT series of models (Radford et al.), have become the core components of AI systems.
Filename:
2510.17558v1.pdf
Filetype:
application/pdf
Size:
705469 bytes
Uploaded On:
2025-10-24
Abstract:
Summary:
Tags:
Notes:
Visible:
1
Status:
Parsed
Author:
François Fleuret
Creator:
arXiv GenPDF (tex2pdf:e76afa9)
DOI:
https://doi.org/10.48550/arXiv.2510.17558
License:
http://creativecommons.org/licenses/by-nc-sa/4.0/
PTEX.Fullbanner:
This is pdfTeX, Version 3.141592653-2.6-1.40.28 (TeX Live 2025) kpathsea version 6.4.1
Producer:
pikepdf 8.15.1
Title:
The Free Transformer
Trapped:
False
ArXivID:
https://arxiv.org/abs/2510.17558v1
Pages:
18