Anand Kannappan

anandnk24

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago
PatronusAI/glider
liked a Space 19 days ago
PatronusAI/GLIDER

Articles

Organizations

Patronus AI

anandnk24's activity

New activity in PatronusAI/glider 10 days ago

Fix: Update GitHub URL

1
#2 opened 10 days ago by
eswardivi
liked a Space 5 months ago
upvoted an article 8 months ago
Article

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

3
reacted to clefourrier's post with ❤️ 11 months ago
Post
🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real-world use cases (finance documents, legal confidentiality, customer support, ...), which makes it grounded and interesting for companies! 🏢
Bonus: the test set is private, so it's hard to game 🔥
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98, who led the leaderboard implementation, and to @rebeccaqian and @anandnk24, all at Patronus AI!