Rit-18: A novel dataset for compositional group activity understanding

Abstract

Group activity understanding is a challenging task as multiple people are involved, and their relations may vary over time. Currently, the literature of group activity is limited to group activity recognition, because videos are trimmed in very short duration and focus on a single activity. This slows down the progress in the group activity domain. In this paper, we propose a new large-scale untrimmed compositional group activity dataset RIT-18 based on the volleyball games captured from YouTube. Each clip in our dataset depicts an entire rally which spans the duration from serve to a point being scored. Comprehensive annotations including group activity labels, temporal boundaries of activities, key persons, and winning teams are provided. We describe group activity recognition, future activity anticipation, and rally-level winner prediction challenges, and evaluate several baseline methods over these challenges. We report their performance on our dataset and demonstrate further efforts need to be made. The dataset is available at https://pht180. rit. edu/actionlab/rit-18.

Publication
In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops
Hanbin Hong
Hanbin Hong
Ph.D. student

My research interests lie on security and privacy issues in machine learning.