A Detailed Framework for Semantic Description of Humans 

Semantic description of humans in images and videos is one of the fundamental problems in computer vision with many applications such as visual surveillance, facial verification, health care, image and video search engines, tagging suggestions and human-computer interaction.
Computer visionHumans in different shapes.Humans have an outstanding ability when it comes to recognizing (i) semantic attributes such as age, gender, hair style and clothing style (ii) actions such as riding a horse, climbing, running and walking and (iii) facial expressions such as angry, happy and smiling.

We are currently developing novel deep learning solutions for the challenging problem of semantic description of humans in images and videos. The major emphasis is to investigate the challenging generic sub-problems of efficient image and video description, automatic learning of visual models, joint learning from textual annotations and visual data and learning robust methods with minimal supervision.



A selection of three publications

WASP research at CVL