🏞️
Jungling
Ph.D. student at Fudan University. Previously a Data Scientist at Microsoft.
Pinned Loading
-
RankSurprisalRatio
RankSurprisalRatio PublicOfficial Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment“
Python 8
-
InternLM/POLAR
InternLM/POLAR PublicPre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
-
ParamRestore
ParamRestore Public[EMNLP 2025 Main] Official Repo for Paper "Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels"
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
