Anand Kannappan

anandnk24

AI & ML interests

None yet

anandnk24's activity

upvoted an article about 1 month ago

Article

Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas

and 4 others • Jan 23 • 30

New activity in PatronusAI/glider 2 months ago

Fix: Update GitHub URL

#2 opened 2 months ago by eswardivi

liked a model 3 months ago

PatronusAI/glider

Text Generation • Updated Jan 2 • 1.27k • 37

liked a Space 3 months ago

GLIDER

🦅

GLIDER: Grading LLM Interactions and Decisions using Explain

liked 2 models 7 months ago

PatronusAI/Llama-3-Patronus-Lynx-8B-v1.1-Instruct-Q8-GGUF

Updated Nov 27, 2024 • 36 • 2

PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct-v1.1

Text Generation • Updated Jul 31, 2024 • 13k • 10

liked a Space 7 months ago

LynxDemo

🔥

Evaluate answer fidelity to document

upvoted a collection 8 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 651

liked a dataset 8 months ago

PatronusAI/HaluBench

Viewer • Updated Jul 11, 2024 • 14.9k • 1.6k • 37

liked 3 models 8 months ago

upvoted an article 10 months ago

Article

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

Jan 31, 2024 • 3

reacted to clefourrier's post with ❤️ about 1 year ago

Post

🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real-world use cases (Finance documents, Legal confidentiality, Customer support, ...), which makes it grounded and interesting for companies! 🏢
Bonus: the test set is private, so it's hard to game 🔥
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98, who led the leaderboard implementation, and to @rebeccaqian and @anandnk24, all at Patronus AI!