Retrieval is the process of fetching relevant information from a vast knowledge base or database to answer a user's query or enhance a model's response, most notably seen in Retrieval-Augmented Generation (RAG) systems. RAG combines traditional search capabilities with large language models (LLMs) to ensure accuracy, provide up-to-date information, and ground AI responses in factual, external data rather than relying solely on a model's internal, potentially outdated knowledge.
Seeking canonical benchmark for this task.
Suggest one →Leading models across all datasets in this task.
No results yet. Be the first to contribute.
Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.
Still looking for something on Retrieval? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.
Real humans read every message. We track what people are asking for and prioritize accordingly.