Al-Hussein

AlHussein

AI & ML interests

Knowledge Distillation, Self-Supervised Learning, Semi-Supervised Learning

Organizations

None yet

AlHussein's activity

upvoted a paper 17 days ago

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

upvoted a paper 27 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 47

upvoted 2 papers about 1 month ago

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 34

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 104

upvoted 3 papers about 2 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 77

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 71

upvoted 3 papers 4 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Paper • 2410.02762 • Published Oct 3, 2024 • 9

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 40

upvoted 3 papers 6 months ago

upvoted a paper 7 months ago

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

Paper • 2406.08920 • Published Jun 13, 2024 • 7

upvoted 5 papers 8 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 109

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

Paper • 2404.14396 • Published Apr 22, 2024 • 19

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 104

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 102

upvoted a paper 9 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 42