TY - JOUR
T1 - Mining informativeness in scene graphs
T2 - Prioritizing informative relations in Scene Graph Generation for enhanced performance in applications
AU - Neau, Maëlic
AU - Santos, Paulo E
AU - Bosser, Anne-Gwenn
AU - Macvicar, Alistair
AU - Buche, Cédric
PY - 2025/3
Y1 - 2025/3
N2 - Learning to compose visual relationships from raw images in the form of scene graphs is a highly challenging Computer Vision task, yet it is essential for applications related to scene understanding. However, no current approaches in Scene Graph Generation (SGG) aim at providing useful graphs for downstream tasks. Instead, the main focus has primarily been on unbiasing the data distribution for predicting more fine-grained relations. That being said, not all fine-grained relations are equally relevant to any particular task and at least a subset of them are of no use for real-world applications. In this work, we address the issue of the relevance of relations in Scene Graphs from the perspective of the quantity of information they bring to the understanding of the scene. To this end, we introduce a new evaluation metric for the task of SGG, called InformativeRecall@K, that aims at evaluating the ability of models to produce accurate and informative relations. We show that selecting relations based on this informativeness criteria is beneficial for the downstream tasks of Image Generation, Visual Question Answering, and Image Captioning. Finally, we provide a new taxonomy of relations linked to the informativeness value for the task of Image Generation.
AB - Learning to compose visual relationships from raw images in the form of scene graphs is a highly challenging Computer Vision task, yet it is essential for applications related to scene understanding. However, no current approaches in Scene Graph Generation (SGG) aim at providing useful graphs for downstream tasks. Instead, the main focus has primarily been on unbiasing the data distribution for predicting more fine-grained relations. That being said, not all fine-grained relations are equally relevant to any particular task and at least a subset of them are of no use for real-world applications. In this work, we address the issue of the relevance of relations in Scene Graphs from the perspective of the quantity of information they bring to the understanding of the scene. To this end, we introduce a new evaluation metric for the task of SGG, called InformativeRecall@K, that aims at evaluating the ability of models to produce accurate and informative relations. We show that selecting relations based on this informativeness criteria is beneficial for the downstream tasks of Image Generation, Visual Question Answering, and Image Captioning. Finally, we provide a new taxonomy of relations linked to the informativeness value for the task of Image Generation.
KW - Image Captioning
KW - Image Generation
KW - Scene Graph Generation
KW - Scene Understanding
KW - Visual Question Answering
UR - http://www.scopus.com/inward/record.url?scp=85215865357&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2025.01.008
DO - 10.1016/j.patrec.2025.01.008
M3 - Article
AN - SCOPUS:85215865357
SN - 0167-8655
VL - 189
SP - 64
EP - 70
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -