Document Details


Clip: MoEfication: Transformer Feed-forward Layers are Mixtures of Experts Zhengyan Zhang 1,2, Yankai Lin 3, Zhiyuan Liu 1,2,4,5†, Peng Li 3,6, Maosong Sun 1,2,4,5,7†, Jie Zhou 3
Filename: 2110.01786
Filetype: application/pdf
Size: 1614636 bytes
Uploaded On: 2024-06-10
Abstract:
Summary:
Tags:
Notes:
Visible: 1
Status: Parsed
Author:
CreationDate: 2022-04-06T01:43:29+00:00
Creator: LaTeX with hyperref
Keywords:
ModDate: 2022-04-06T01:43:29+00:00
PTEX.Fullbanner: This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
Producer: pdfTeX-1.40.21
Subject:
Title:
Trapped: False
Pages: 14
