[Suggestion] Evaluate Ming on the WorldSense Benchmark

Hi Ming Team,

First off, congratulations on the excellent work on Ming! We've been following your project and are truly impressed by its powerful **audio-video co-understanding** capabilities. Your contribution to the multimodal community is fantastic.

We are the maintainers of **WorldSense**, a benchmark specifically designed to evaluate a model's ability in real-world audio-video understanding. Given Ming's advanced multimodal performance, we believe it would be an excellent candidate to test on our benchmark. An evaluation on WorldSense could effectively highlight your model's unique strengths from a new and valuable perspective.

Here is some information about our benchmark:

Paper: https://arxiv.org/pdf/2502.04326
GitHub: https://github.com/JaaackHongggg/WorldSense
Leaderboard: https://jaaackhongggg.github.io/WorldSense/#leaderboard

We would be thrilled to see how Ming performs and would like to kindly invite you to evaluate it on the WorldSense benchmark. A strong performance would further showcase its leading capabilities to the community.

We would be more than happy to provide any support needed during the evaluation, such as clarifying data formats or assisting with the evaluation script.

Thank you for your time and for your incredible work!

Best regards,

Shilin Yan

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Suggestion] Evaluate Ming on the WorldSense Benchmark #60

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Suggestion] Evaluate Ming on the WorldSense Benchmark #60

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions