Pinned Loading
-
Guided-GRPO
Guided-GRPO PublicA Guided Reinforcement Learning framework enhancing MLLM reasoning via process-level verification and collaborative rollout strategies.
Python 47
-
OpenDCAI/DataFlow-MM
OpenDCAI/DataFlow-MM PublicDataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models.
-
-
OpenDCAI/DataFlex
OpenDCAI/DataFlex PublicDataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


