vipl-vlm-coding

company

AI & ML interests

None defined yet.

Recent Activity

hongyuw authored a paper about 2 months ago

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

hongyuw authored a paper about 2 months ago

BitNet a4.8: 4-bit Activations for 1-bit LLMs

jsw19 updated a dataset 5 months ago

vipl-vrc/m4u-add

View all activity

vipl-vrc's activity

hongyuw

authored 2 papers about 2 months ago

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Paper • 2410.16144 • Published Oct 21, 2024 • 3

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 64

jsw19

updated a dataset 5 months ago

vipl-vrc/m4u-add

Viewer • Updated Aug 13, 2024 • 1.07k • 35 • 1

hongyuw

authored a paper 6 months ago

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

Paper • 2407.10969 • Published Jul 15, 2024 • 20

hongyuw

authored 4 papers 10 months ago

DeepNet: Scaling Transformers to 1,000 Layers

Paper • 2203.00555 • Published Mar 1, 2022 • 2

Foundation Transformers

Paper • 2210.06423 • Published Oct 12, 2022

TorchScale: Transformers at Scale

Paper • 2211.13184 • Published Nov 23, 2022

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 605

hongyuw

authored a paper about 1 year ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96