Google announces Streaming Dense Video Captioning.
An ideal model for dense video captioning — predicting captions localized temporally in a video — should be able to handle long input videos, predict rich, detailed textual descriptions, and be able to…
Join the discussion on this paper page.
Comments are closed.