rotem israeli

irotem98

https://rotem154154.github.io

rotem154154

irotem98's activity

upvoted 2 papers 13 days ago

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published 14 days ago • 33

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published 13 days ago • 23

upvoted a paper 17 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 17 days ago • 335

upvoted 2 papers 21 days ago

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Paper • 2412.09626 • Published 24 days ago • 19

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 24 days ago • 87

upvoted a paper 22 days ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published 24 days ago • 20

upvoted a paper 24 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 25 days ago • 96

upvoted a paper about 1 month ago

Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction

Paper • 2411.14762 • Published Nov 22, 2024 • 11

upvoted 6 papers about 2 months ago

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Paper • 2411.08380 • Published Nov 13, 2024 • 25

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published Nov 11, 2024 • 26

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 27

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 22

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published Nov 12, 2024 • 13

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Paper • 2411.08017 • Published Nov 12, 2024 • 11

upvoted 2 papers 2 months ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31, 2024 • 14

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published Nov 1, 2024 • 17

upvoted an article 2 months ago

Trick or ResNet Treat

Article • Oct 31, 2024 • 3

upvoted a paper 2 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 77

upvoted a collection 2 months ago

SmolLM2

Collection • State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 14 days ago • 197

upvoted a paper 2 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 82