Tang, W ORCID: https://orcid.org/0000-0002-6925-9067, Qing, L ORCID: https://orcid.org/0000-0003-3555-0005, Li, L ORCID: https://orcid.org/0009-0009-8615-7511, Guo, L ORCID: https://orcid.org/0000-0003-1272-8480 and Peng, Y ORCID: https://orcid.org/0000-0002-5508-1819 (2023) Principal relation component reasoning-enhanced social relation recognition. Applied Intelligence, 53 (23). pp. 28099-28113. ISSN 0924-669X
Published Version
File not available for download. Available under License In Copyright.
Abstract
Social relationships (SRs) are the basis of human life. Hence, the ability to accurately recognize interpersonal relations in public spaces from visual observations can help policymakers improve mental health programs and address social challenges. The key to image-based computer-vision research on SR recognition (SRR) is a deep-learning mechanism that can predict SRs from the contents of scene images. Current methods explore logical constraints using relatively simple scenes with small groups of people. However, this is insufficient for forming relation graphs of multiple groups simultaneously in complex scenes. Generally, a complex scene contains a principal relationship that applies to the largest proportion of people, while secondary relationships apply to smaller proportions. To effectively explore relational situations in complex scenes, we propose a new distributed reasoning strategy that accounts for both principal and secondary SRs. Our model enhances principal relation component reasoning, and a new contrastive learning algorithm supplements the principal relationship with secondary types. A shifted-window transformer is applied to extract interactive human relation features and local-global features to support more accurate and comprehensive relation prediction. Extensive experiments demonstrate that each part of the proposed model improves the accuracy of SRR and that the whole model outperforms state-of-the-art methods on public datasets.