On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent Paper • 2410.04870 • Published Oct 7, 2024
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published Oct 22, 2025 • 60
Efficient Hyperparameter Tuning via Trajectory Invariance Principle Paper • 2509.25049 • Published Sep 29, 2025 • 4
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published Sep 28, 2025 • 118
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training Paper • 2505.11594 • Published May 16, 2025 • 75