1 10 9

Haoran Wei

HaoranWei

AI & ML interests

LLM，CV，OVOD

Recent Activity

upvoted a paper 7 days ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

authored a paper 18 days ago

Qwen2.5 Technical Report

upvoted a collection about 1 month ago

Document AI

View all activity

Organizations

None yet

HaoranWei's activity

upvoted a paper 7 days ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published 9 days ago • 12

authored a paper 18 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 19 days ago • 338

upvoted a collection about 1 month ago

Document AI

Collection

All the papers that can fundementally help in creating a true open-source processing pipeline. • 1 item • Updated Nov 11, 2024 • 1

upvoted a paper about 1 month ago

Focus Anywhere for Fine-grained Multi-page Document Understanding

Paper • 2405.14295 • Published May 23, 2024 • 1

upvoted a collection about 1 month ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated about 21 hours ago • 53

replied to Tonic's post 4 months ago

Excellent job!

reacted to Tonic's post with 🔥 4 months ago

Post

2730

🙋🏻‍♂️Hey there folks ,

@ucaslcl released a new OCR model , that's👏🏻👏🏻 fantastic : https://huggingface.co/ucaslcl/GOT-OCR2_0

GPU : Tonic/GOT-OCR
Gradio Demo (Image Edit) : Tonic1/ImageEdit-GOT-OCR

Model : https://huggingface.co/ucaslcl/GOT-OCR2_0
Official demo : https://huggingface.co/spaces/ucaslcl/GOT_online
github : https://github.com/Ucas-HaoranWei/GOT-OCR2.0

4 replies

liked 2 Spaces 4 months ago

Running on Zero

327

💬

GOT Online

Running on Zero

164

📲🫴🏻👁

Tonic's GOT OCR

GOT - OCR (from : UCAS, Beijing)

authored a paper 4 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

liked a model 4 months ago

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Sep 18, 2024 • 798k • 1.32k

upvoted a paper 4 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

commented a paper 4 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83 •

upvoted a paper 7 months ago

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 54

liked a dataset 7 months ago

nllg/datikz-v2

Viewer • Updated May 17, 2024 • 95k • 230 • 12

liked 2 datasets 8 months ago

ucaslcl/Fox_benchmark_data

Viewer • Updated May 27, 2024 • 944 • 43 • 5

edesaras/CircuitSketchTextAnnotations

Viewer • Updated Apr 21, 2024 • 71.3k • 48 • 3

authored a paper 9 months ago

OneChart: Purify the Chart Structural Extraction via One Auxiliary Token

Paper • 2404.09987 • Published Apr 15, 2024 • 2

liked a dataset 9 months ago

kppkkp/ChartSE

Updated Apr 18, 2024 • 43 • 3

updated a model 9 months ago

HaoranWei/pdf-for-vary-tiny

Updated Apr 20, 2024