-
University of Cambridge
- Cambridge, England
-
03:36
(UTC) - sharanmaiya.com
- @_maiush
- in/sharanmaiya
Highlights
- Pro
Popular repositories Loading
-
-
LP-as-a-Judge
LP-as-a-Judge Publicexperiments on the use of linear classifier heads for llm-as-a-judge tasks.
Jupyter Notebook 2
-
repeng
repeng PublicForked from vgel/repeng
A library for making RepE control vectors
Jupyter Notebook 1
-
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Python 1
If the problem persists, check the GitHub status page or contact support.

