NemoStation Blog

Notes from NemoStation — an indie video research lab. Posts on training, evaluation, and the small open-source video VLMs we ship.

Posts

Part 1: The Map Was Wrong→ May 26, 2026

A 2B open-source video model audits the standard dense-caption benchmarks (CaReBench, DREAM-1K) and finds 70%+ of the ground-truth captions are wrong about what is in the video. Then lands at #5 in open-source on TimeLens-Bench.

[ Aryan Jain ]

· · ·