Document Details
Clip:
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts. Zhengyan Zhang 1,2, Yankai Lin 3, Zhiyuan Liu 1,2,4,5,†, Peng Li 3,6, Maosong Sun 1,2,4,5,7,†, Jie Zhou 3
Filename:
2110.01786
Filetype:
application/pdf
Size:
1614636 bytes
Uploaded On:
2024-06-10
Abstract:
Summary:
Tags:
Notes:
Visible:
1
Status:
Parsed
Author:
CreationDate:
2022-04-06T01:43:29+00:00
Creator:
LaTeX with hyperref
Keywords:
ModDate:
2022-04-06T01:43:29+00:00
PTEX.Fullbanner:
This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
Producer:
pdfTeX-1.40.21
Subject:
Title:
Trapped:
False
Pages:
14