Highlights
- Pro
Pinned Loading
-
-
AI-Researcher
AI-Researcher PublicForked from HKUDS/AI-Researcher
"AI-Researcher: Autonomous Scientific Innovation"
Python
-
abisliouk/HS-MATH-LLM
abisliouk/HS-MATH-LLM PublicEvaluate high school math reasoning in LLMs with baseline and Chain-of-Thought (CoT) prompts. Includes confidence calibration metrics, JSON output parsing, and reliability analysis.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.