Language-Based Temporal Localization
5 papers with code • 1 benchmarks • 1 datasets
Most implemented papers
MAC: Mining Activity Concepts for Language-based Temporal Localization
Previous methods address the problem by considering features from video sliding windows and language queries and learning a subspace to encode their correlation, which ignore rich semantic cues about activities in videos and queries.
Video Moment Localization using Object Evidence and Reverse Captioning
We address the problem of language-based temporal localization of moments in untrimmed videos.
Hierarchical Deep Residual Reasoning for Temporal Moment Localization
Temporal Moment Localization (TML) in untrimmed videos is a challenging task in the field of multimedia, which aims at localizing the start and end points of the activity in the video, described by a sentence query.
TubeDETR: Spatio-Temporal Video Grounding with Transformers
We consider the problem of localizing a spatio-temporal tube in a video corresponding to a given text query.
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
Our framework introduces two auxiliary tasks, cross-modal matching and temporal order discrimination, to promote the grounding model training.