tongruiliu

Follow

🎯

Focusing

tongruiliu

🎯

Focusing

Follow

UBIQUANT·IQUESTLAB

15 followers · 9 following

Peking University UBIQUANT
Beijing
https://tongruiliu.github.io/

Achievements

Achievements

Pinned Loading

Guided-GRPO Guided-GRPO Public

A Guided Reinforcement Learning framework enhancing MLLM reasoning via process-level verification and collaborative rollout strategies.

Python 47
GMT GMT Public

GMT: Graph-as-Memory Tuning for deep KG–LLM fusion via cross-attention.

Python 11 1
OpenDCAI/DataFlow-MM OpenDCAI/DataFlow-MM Public

Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models.

Python 39 19
tongruiliu.github.io tongruiliu.github.io Public

my page

HTML 1
OpenDCAI/DataFlex OpenDCAI/DataFlex Public

DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Python 642 70
canvas-rl canvas-rl Public

Python 3