Krinal Joshi

krinal

AI & ML interests

NLP, Speech

Recent Activity

View all activity

Organizations

Hugging Face Discord Community's profile picture

krinal's activity

reacted to openfree's post with πŸ‘ 3 days ago
view post
Post
5928
πŸ“š Multilingual RAG Chatbot with PDF Support

Chat naturally with your documents! 🌟

✨ Key Features:
β€’ 🌏 Multilingual Q&A support (English, Korean, etc.)
β€’ πŸ“„ Real-time PDF and text file processing
β€’ πŸ” Context-aware accurate responses
β€’ ⚑ Intuitive Chainlit-powered chat interface

πŸ› οΈ Tech Stack:
β€’ πŸ’» Clean, documented open-source code
β€’ 🀝 User-friendly Chainlit UI
β€’ πŸ“Š Vector database for efficient retrieval
β€’ πŸ”„ Real-time streaming responses

πŸ“± Try it now!
β†’ Demo: openfree/PDF-RAG

πŸ”§ Special Features:
β€’ πŸ“Š Support for PDF/text files up to 2MB
β€’ 🎯 Precise context understanding
β€’ ⚑ Fast response time
β€’ πŸ”’ Secure file handling

Full source code available - ready to integrate into your projects!

#RAG #NLP #Chatbot #OpenSource #PDFProcessing
reacted to tegridydev's post with πŸ‘ 3 days ago
view post
Post
1343
So, what is #MechanisticInterpretability πŸ€”

Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features and/or mechanisms that give rise to specific behaviours

Instead of treating a model as a monolithic function, we can:

1. Trace how input tokens propagate through attention heads & MLP layers
2. Identify localized β€œcircuit motifs”
3. Develop methods to systematically break down or β€œedit” these circuits to confirm we understand the causal structure.

Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts which hopefully leads to

1. Trust & Reliability
2. Safety & Alignment
3. Better Debugging / Development Insights

https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x
  • 1 reply
Β·
replied to tegridydev's post 3 days ago
reacted to hexgrad's post with πŸ‘ 3 days ago
reacted to davidberenstein1957's post with πŸ‘ 3 days ago
upvoted an article 3 days ago
view article
Article

Welcome to Inference Providers on the Hub πŸ”₯

β€’ 198
reacted to alibabasglab's post with πŸ‘ 6 days ago
reacted to mmaguero's post with πŸ‘ 6 days ago
view post
Post
1454
πŸš€ Multidimensional Affective Analysis for Guarani/Jopara! 🌎

This project explored affective computing for low-resource languages, focusing on emotion recognition, humor detection, and offensive language identification in Guarani and Jopara (a code-switching mix of Guarani and Spanish).

Highlights:
🧡 Corpora:
- Emotion Recognition
- Humor Detection
- Offensive Language Identification
πŸ’» Base Models for Fine-Tuning (trained on Guarani Wiki):
- From scratch: BERT-based tiny, small, base and large models
- Continuously pre-trained models: Multilingual-BERT and BETO
πŸ““ Baseline Notebooks:
- Fine-tuning BERT-based models
- NCRF++ models via GitHub

πŸ’‘ Check the repo!
https://github.com/mmaguero/guarani-multi-affective-analysis

πŸ“– Check out the publication here:
- https://digibug.ugr.es/handle/10481/98843
- https://link.springer.com/article/10.1007/s12559-023-10165-0

#NLP #AffectiveComputing #LowResourceLanguages #Guarani #Jopara #SentimentAnalysis #AIForAll
reacted to nicolay-r's post with πŸ‘ 14 days ago
view post
Post
1315
πŸ“’ So far I been passioned about making NLP pipeline for handling iterator of texts with no-string dependency from besides third-party providers of your choice.

By starting with text-translation, delighted to share the related notebooks that might save you time for handling your data

⭐ https://github.com/nicolay-r/nlp-thirdgate

Example of using GoogleTranslate API in no-string for handling textual data iterators with spans:

πŸ“™ https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/translate_texts_with_spans_via_googletrans.ipynb

The key concept is that all these API examples could be tied into a single pipeline using AREkit

πŸ“˜ https://github.com/nicolay-r/AREkit

πŸ› οΈ The further plan is to popualte this repo with
1. NER (DeepPavlov models wrapper)
2. LLM with fancy out-of-the-box chain-of-thought declaration support.
upvoted an article 16 days ago
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

β€’ 130
reacted to alibabasglab's post with πŸ‘ 18 days ago
view post
Post
2589
We are thrilled to present the improved "ClearerVoice-Studio", an open-source platform designed to make speech processing easy use for everyone! Whether you’re working on speech enhancement, speech separation, speech super-resolution, or target speaker extraction, this unified platform has you covered.

** Why Choose ClearerVoice-Studio?**

- Pre-Trained Models: Includes cutting-edge pre-trained models, fine-tuned on extensive, high-quality datasets. No need to start from scratch!
- Ease of Use: Designed for seamless integration with your projects, offering a simple yet flexible interface for inference and training.

**Where to Find Us?**

- GitHub Repository: ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio)
- Try Our Demo: Hugging Face Space ( alibabasglab/ClearVoice)

**What Can You Do with ClearerVoice-Studio?**

- Enhance noisy speech recordings to achieve crystal-clear quality.
- Separate speech from complex audio mixtures with ease.
- Transform low-resolution audio into high-resolution audio. A full upscaled LJSpeech-1.1-48kHz dataset can be downloaded from alibabasglab/LJSpeech-1.1-48kHz .
- Extract target speaker voices with precision using audio-visual models.

**Join Us in Growing ClearerVoice-Studio!**

We believe in the power of open-source collaboration. By starring our GitHub repository and sharing ClearerVoice-Studio with your network, you can help us grow this community-driven platform.

**Support us by:**

- Starring it on GitHub.
- Exploring and contributing to our codebase .
- Sharing your feedback and use cases to make the platform even better.
- Joining our community discussions to exchange ideas and innovations.
- Together, let’s push the boundaries of speech processing! Thank you for your support! :sparkling_heart:
reacted to AdinaY's post with πŸ”₯ 18 days ago
view post
Post
3178
MiniCPM-o2.6 πŸ”₯ an end-side multimodal LLMs released by OpenBMB from the Chinese community
Model: openbmb/MiniCPM-o-2_6
✨ Real-time English/Chinese conversation, emotion control and ASR/STT
✨ Real-time video/audio understanding
✨ Processes up to 1.8M pixels, leads OCRBench & supports 30+ languages
reacted to davidberenstein1957's post with πŸ‘ 18 days ago
reacted to lamhieu's post with πŸ‘ 18 days ago
view post
Post
1868
Unlock seamless document conversion with Docsifer, powered by MarkItDown at its core! πŸš€ Effortlessly transform PDFs, Word, Excel, images, audio, HTML, and more into clean, structured Markdownβ€”perfect for developers, writers, and content creators. With optional LLM-enhanced extraction and robust format support, Docsifer ensures accuracy, speed, and privacy.
🌟 Try it now and experience professional-grade Markdown conversion: lamhieu/docsifer
upvoted an article 20 days ago
view article
Article

Mastering Tensor Dimensions in Transformers

By not-lain β€’
β€’ 43