Phrase Extraction and Grounding (PEG)
1 papers with code • 0 benchmarks • 0 datasets
PEG requires a model to extract phrases from text and locate objects from images simultaneously.
Benchmarks
These leaderboards are used to track progress in Phrase Extraction and Grounding (PEG)
No evaluation results yet. Help compare methods by
submitting
evaluation metrics.
Most implemented papers
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
As phrase extraction can be regarded as a $1$D text segmentation problem, we formulate PEG as a dual detection problem and propose a novel DQ-DETR model, which introduces dual queries to probe different features from image and text for object prediction and phrase mask prediction.