Add DTW, nDTW, and SDTW trajectory metrics by nalinraut · Pull Request #12 · AmeyaWagh/robometric-frame

nalinraut · 2026-02-19T14:48:06Z

Add Dynamic Time Warping based metrics for evaluating trajectories that may have different lengths or temporal alignment. These metrics are particularly useful for evaluating VLA models and policies using action chunking (e.g., ACT, Diffusion Policy).

New metrics:

DTWDistance: Raw DTW distance using dynamic programming (lower=better)
NormalizedDTW: Mapped to [0,1] using exp(-DTW/(|R|*d)) (higher=better)
SuccessWeightedDTW: nDTW weighted by task success (SDTW = nDTW * Success)

Key features:

Support for trajectories of different lengths (core advantage over MSE/ATE)
Tolerates temporal misalignment (hesitation, speed differences)
Optional custom normalization factor
Full torchmetrics.Metric compatibility with distributed training support
Comprehensive test suite and example usage

Reference: Ilharco et al., "General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping," arXiv:1907.05446, NeurIPS ViGIL Workshop, 2019.

AmeyaWagh · 2026-02-22T03:35:00Z

+from torchmetrics import Metric
+
+
+def _compute_dtw(predicted: Tensor, reference: Tensor) -> Tensor:


Does this need to be an independenct function? can this be part of the metric class?

All three classes (DTWDistance, NormalizedDTW, SuccessWeightedDTW) use it. Making them part of one would require other two to call DTWDistance._compute_dtw(...) without really depending on the class. Including it in all three would violate DRY. I can make a base class with this method as static or leave it as is in the module.

Add Dynamic Time Warping based metrics for evaluating trajectories that may have different lengths or temporal alignment. These metrics are particularly useful for evaluating VLA models and policies using action chunking (e.g., ACT, Diffusion Policy). New metrics: - DTWDistance: Raw DTW distance using dynamic programming (lower=better) - NormalizedDTW: Mapped to [0,1] using exp(-DTW/(|R|*d)) (higher=better) - SuccessWeightedDTW: nDTW weighted by task success (SDTW = nDTW * Success) Key features: - Support for trajectories of different lengths (core advantage over MSE/ATE) - Tolerates temporal misalignment (hesitation, speed differences) - Optional custom normalization factor - Full torchmetrics.Metric compatibility with distributed training support - Comprehensive test suite and example usage Reference: Ilharco et al., "General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping," arXiv:1907.05446, NeurIPS ViGIL Workshop, 2019.

codecov-commenter · 2026-06-20T17:30:17Z

Welcome to Codecov 🎉

Once you merge this PR into your default branch, you're all set! Codecov will compare coverage reports and display results in all future pull requests.

Thanks for integrating Codecov - We've got you covered ☂️

AmeyaWagh reviewed Feb 22, 2026

View reviewed changes

Comment thread src/robometric_frame/trajectory_quality/dtw.py Outdated

nalinraut added 2 commits June 19, 2026 17:50

Vectorize DTW computation loops and fix test thresholds

88aa716

nalinraut force-pushed the feature/dtw-metrics branch from cc8933e to 88aa716 Compare June 20, 2026 02:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DTW, nDTW, and SDTW trajectory metrics#12

Add DTW, nDTW, and SDTW trajectory metrics#12
nalinraut wants to merge 2 commits into
AmeyaWagh:mainfrom
nalinraut:feature/dtw-metrics

nalinraut commented Feb 19, 2026

Uh oh!

AmeyaWagh Feb 22, 2026

Uh oh!

nalinraut Jun 19, 2026

Uh oh!

Uh oh!

codecov-commenter commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		from torchmetrics import Metric


		def _compute_dtw(predicted: Tensor, reference: Tensor) -> Tensor:

Conversation

nalinraut commented Feb 19, 2026

Uh oh!

AmeyaWagh Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

nalinraut Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov-commenter commented Jun 20, 2026

Welcome to Codecov 🎉

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants