Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
singhsidhukuldeepΒ 
posted an update Aug 9, 2024
Post
2762
What is the best LLM for RAG systems? πŸ€”

In a business setting, it will be the one that gives the best performance at a great price! πŸ’ΌπŸ’°

And maybe it should be easy to fine-tune, cheap to fine-tune... FREE to fine-tune? 😲✨

That's @Google Gemini 1.5 Flash! πŸš€πŸŒŸ

It now supports fine-tuning, and the inference cost is the same as the base model! <coughs LORA adopters> πŸ€­πŸ€–

So the base model must be expensive? πŸ’Έ
For the base model, the input price is reduced by 78% to $0.075/1 million tokens and the output price by 71% to $0.3/1 million tokens. πŸ“‰πŸ’΅

But is it any good? πŸ€·β€β™‚οΈ
On the LLM Hallucination Index, Gemini 1.5 Flash achieved great context adherence scores of 0.94, 1, and 0.92 across short, medium, and long contexts. πŸ“ŠπŸŽ―

Google has finally given a model that is free to tune and offers an excellent balance between performance and cost. βš–οΈπŸ‘Œ

Happy tuning... πŸŽΆπŸ”§

Gemini 1.5 Flash: https://developers.googleblog.com/en/gemini-15-flash-updates-google-ai-studio-gemini-api/ πŸ”—

LLM Hallucination Index: https://www.rungalileo.io/hallucinationindex πŸ”—

Checkout a RAG system i made for local use:
https://github.com/Tech-ware/ChatHX