Llama-2-70B-chat.

Meta AIopen-sourceLlama 2 70B with RLHF chat fine-tuning

Meta Llama 2 70B chat model. 7-shot in-context learning evaluation on CNN/DailyMail reported in arXiv:2507.05123 (Jul 2025). Best overall ICL result in that study.

§ 01 · Benchmarks

Every benchmark Llama-2-70B-chat has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	cnn-/-daily-mail	Computer Vision · Optical Character Recognition	rouge-1	41.0%	#22/33	2025-07-01	source ↗
02	cnn-/-daily-mail	Computer Vision · Optical Character Recognition	rouge-2	17.2%	#25/33	2025-07-01	source ↗
03	cnn-/-daily-mail	Computer Vision · Optical Character Recognition	rouge-l	27.5%	#31/33	2025-07-01	source ↗

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.

§ 02 · Strengths by area