A vision enriched intelligent agent with image description generation (demonstration)

Zhang, Li, Fielding, Ben, Kinghorn, Philip and Mistry, Kamlesh (2016) A vision enriched intelligent agent with image description generation (demonstration). In: AAMAS 2016 - Proceedings of the 2016 International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), pp. 1488-1489. ISBN 9781450342391

Full text not available from this repository.


In this paper, we present an intelligent conversational agent enriched with automatic image understanding and facial expression recognition using state-of-the-art machine learning techniques for the advancement of autonomous interaction with the elderly or infirm. The agent is developed to conduct health and emotion well-being monitoring for the elderly. It is not only capable of conducting question-answering via speech-based interaction, but also able to provide analysis of the user's surroundings, emotional states, hazards and fall actions via visual data. The agent is accessible from a web browser and can be communicated with via voice or text means, with a webcam required for the visual analysis functionality. The system has been evaluated with diverse real-life images to prove its efficiency.

Item Type: Book Section
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Becky Skoyles
Date Deposited: 03 Dec 2018 10:07
Last Modified: 11 Oct 2019 18:17
URI: http://nrl.northumbria.ac.uk/id/eprint/37014

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics