Zhisheng Zheng

zhisheng01

https://zhishengzheng.com/

zhisheng147

AI & ML interests

LLM, Speech and Audio Processing

Organizations

None yet

zhisheng01's activity

liked a model 3 days ago

deepseek-ai/DeepSeek-V3

Updated 9 days ago • 74.1k • 1.41k

liked a model about 1 month ago

nyrahealth/CrisperWhisper

Automatic Speech Recognition • Updated 19 days ago • 12.5k • 204

liked a model about 2 months ago

kyutai/mimi

Feature Extraction • Updated Sep 18, 2024 • 5.92M • 95

liked a dataset 2 months ago

walkerhyf/NCSSD

Updated Nov 12, 2024 • 143 • 20

upvoted a paper 3 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 90

liked a model 3 months ago

SWivid/F5-TTS

Text-to-Speech • Updated Nov 8, 2024 • 554k • 830

updated a dataset 3 months ago

zhisheng01/SpatialAudio

Preview • Updated Oct 12, 2024 • 61 • 3

upvoted 3 papers 3 months ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 43

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide

Paper • 2410.04364 • Published Oct 6, 2024 • 28

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

Paper • 2410.03825 • Published Oct 4, 2024 • 19

liked a dataset 3 months ago

parler-tts/mls-eng-10k-tags_tagged_10k_generated

Viewer • Updated Apr 10, 2024 • 2.43M • 63 • 17

upvoted 2 papers 3 months ago

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1, 2024 • 14

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2, 2024 • 30

liked a dataset 4 months ago

zhisheng01/SpatialAudio

Preview • Updated Oct 12, 2024 • 61 • 3

upvoted 3 papers 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 37

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Paper • 2409.11564 • Published Sep 17, 2024 • 20

liked a model 4 months ago

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21, 2024 • 31 • 208

upvoted a paper 4 months ago

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Paper • 2408.14886 • Published Aug 27, 2024 • 10

liked a model 4 months ago

emotion2vec/emotion2vec_plus_large

Updated Jun 24, 2024 • 871 • 34