NEWS  /  Brief News

OpenAI to Open Source New SimpleQA Benchmark to Measure Large Model Accuracy

Oct 30, 2024, 8:46 p.m. ET

AsianFin – OpenAI, the U.S.-based research center focused on artificial intelligence, has announced the open-sourcing of a new benchmark named SimpleQA, designed to assess the factual accuracy of language models. SimpleQA will evaluate a model's ability to provide accurate answers to concise, fact-seeking questions.

Please sign in and then enter your comment