Notes from NemoStation — an indie video research lab. Posts on training, evaluation, and the small open-source video VLMs we ship.

Posts
Part 1: The Map Was Wrong May 26, 2026

A 2B open-source video model audits the standard dense-caption benchmarks (CaReBench, DREAM-1K) and finds 70%+ of the ground-truth captions are wrong about what is in the video. Then lands at #5 in open-source on TimeLens-Bench.

· · ·