An Enhanced Intelligent Agent with Image Description Generation

Fielding, Ben, Kinghorn, Philip, Mistry, Kamlesh and Zhang, Li (2016) An Enhanced Intelligent Agent with Image Description Generation. In: Intelligent Virtual Agents. Lecture Notes in Computer Science, 10011. Springer, London, pp. 110-119. ISBN 978-3-319-47664-3

Full text not available from this repository.

In this paper, we present an Embodied Conversational Agent (ECA) enriched with automatic image understanding, which uses vision data derived from state-of-the-art machine learning techniques to advance autonomous interaction with the elderly or infirm. The agent is developed to monitor the health and emotional well-being of elderly users. It can not only answer questions through speech-based interaction but also analyse the user’s surroundings, company, emotional states, hazards and fall actions from visual data using deep learning techniques. The agent is accessible from a web browser and is operated by voice; a webcam is required for the visual analysis functionality. The system has been evaluated with diverse real-life images to demonstrate its efficiency.

Item Type: Book Section
Uncontrolled Keywords: Intelligent conversational agent, Image description generation, Human agent interaction
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Becky Skoyles
Date Deposited: 28 Nov 2016 15:12
Last Modified: 12 Oct 2019 22:28
