MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs (Shiyi Cao, 2025) | RDL Network
Shared by
Ion Stoica
University of California, Berkeley
Article
2025
en
Authors
Shiyi Cao, Shu Liu, Tyler Griggs, +6 more
Related publications
Preprint
2024
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs
Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica
Article
2020
Block Copolymers Composed of PEtOx and Polyesteramides Based on Glycolic Acid, L-Valine, and L-Isoleucine
Michael Dirauf, Andreas Erlebach, Christine Weber, Stephanie Hoeppener, Johannes Buchheim, Marek Sierka, Ulrich Sigmar Schubert
Preprint
2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Daniel Y. Fu, Zhiqiang Xie, Beidi Chen, Clark Barrett, Joseph E. Gonzalez, Percy Liang, Christopher Ré, Ion Stoica, Ce Zhang
Article
2023
High-Throughput Association Mapping in Brassica napus L.: Methods and Applications
Rafaqat A. Gill, Md Mostofa Uddin Helal, Minqiang Tang, Ming Hu, Chaobo Tong, Shengyi Liu
Article
2024
High-throughput phenotyping for terminal drought stress in chickpea (Cicer arietinum L.)
Sneha Priya Pappula Reddy, Sudhir Kumar, Jiayin Pang, C. Bharadwaj, Madan Pal, A. Harvey Millar, Kadambot Siddique