-
Notifications
You must be signed in to change notification settings - Fork 51
Description
Hi Ming Team,
First off, congratulations on the excellent work on Ming! We've been following your project and are truly impressed by its powerful audio-video co-understanding capabilities. Your contribution to the multimodal community is fantastic.
We are the maintainers of WorldSense, a benchmark specifically designed to evaluate a model's ability in real-world audio-video understanding. Given Ming's advanced multimodal performance, we believe it would be an excellent candidate to test on our benchmark. An evaluation on WorldSense could effectively highlight your model's unique strengths from a new and valuable perspective.
Here is some information about our benchmark:
Paper: https://arxiv.org/pdf/2502.04326
GitHub: https://github.com/JaaackHongggg/WorldSense
Leaderboard: https://jaaackhongggg.github.io/WorldSense/#leaderboard
We would be thrilled to see how Ming performs and would like to kindly invite you to evaluate it on the WorldSense benchmark. A strong performance would further showcase its leading capabilities to the community.
We would be more than happy to provide any support needed during the evaluation, such as clarifying data formats or assisting with the evaluation script.
Thank you for your time and for your incredible work!
Best regards,
Shilin Yan