From ff92171e524c2f1ada39607ccc3e2e3c2ab7e1d4 Mon Sep 17 00:00:00 2001
From: connerlambden
Date: Thu, 4 Jun 2026 22:40:40 -0600
Subject: [PATCH] Add BGPT REFUTE benchmark (scientific critique & calibration)

---
 README.md | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/README.md b/README.md
index 6510b8ab7..4ce31750e 100644
--- a/README.md
+++ b/README.md
@@ -378,3 +378,8 @@ Please cite it if you find the repository helpful.
 ```
 
 We are also planning to add more of our research to this repository.
+
+
+## Benchmarks
+
+- [REFUTE](https://huggingface.co/datasets/BGPT-OFFICIAL/refute) — Scientific critique & epistemic calibration on recent science summaries (Apache-2.0). [Leaderboard](https://huggingface.co/spaces/BGPT-OFFICIAL/refute-leaderboard) · [Technical report](https://huggingface.co/datasets/BGPT-OFFICIAL/refute/blob/main/TECHNICAL_REPORT.md) · [Integrators](https://huggingface.co/datasets/BGPT-OFFICIAL/refute/blob/main/INTEGRATORS.md)