EVLM: An Efficient Vision-Language Model for Visual Understanding Paper • 2407.14177 • Published Jul 19, 2024 • 43
openai/whisper-large-v3 Automatic Speech Recognition • Updated Aug 12, 2024 • 4.76M • • 3.99k