Project > Project Summary

This project is based on the belief that, to be more accessible to the general population, computers must be more proactive in their interactions with people. In human interaction, someone who waits for each command before making any communication attempt would be regarded as uncooperative and unhelpful. In order for a computer to be more proactive, and, thus, to bear its part of the burden of initiation in interactions, it must have (1) much more real-time information about its user, and (2) algorithms that select actions based on this information rather than simply on user commands. The computer needs information about the user's current and past emotional, motivational and cognitive state as well as the state of the task at hand. A general approach, or theory, is needed to guide the development of algorithms that select appropriate actions based on user and task state. The research proposed here constitutes the next steps in an attempt by a multidisciplinary team at Beckman Institute to develop this capability. The testbed is an environment for hands-on education in science and engineering, using the Lego Mindstorms construction and robotics environment, with children of middle school age. An emphasis will be on developing proactive computing methods for encouraging interest and conceptual development of minority children and females, who often show lower achievement in science.

Although the proposed work is within an educational environment, the methods being developed and studied are broadly applicable and, in fact, constitute a major attack on one of the grand challenges for engineering. This project should serve as an exemplar of the type of work that is needed in other computer-aided situations. Unique characteristics of the project are the close cooperation of researchers from electrical engineering, computer science, artificial intelligence, psychology and science education, the base of research and development that has already been laid by prior cooperative work, and the well-equipped, interdisciplinary environment that the Beckman Institute provides for this type of project.
The proposed work is based on four assumptions: 1) The proactive computer assumption: Particularly for users naïve to a computer environment, but for all users to some extent, a computer can be most helpful if it can initiate communications when the user needs assistance or encouragement rather than simply waiting for user commands. Computer behavior often determines whether or not a person will continue engagement in that environment. 2) The empathic computer assumption: Proactive communication, which is communication based on the user's current state rather than simply in response to requests, requires real-time, close user and task state information; hence, empathic computing. 3) The information accuracy assumption: It is not necessary for the computer to have highly accurate state information, nor to be always optimal in its communications, to be helpful. Through dialogue with the user, misunderstandings and inaccuracies can be overcome. The challenge is to be accurate enough, but how accurate that is we do not yet know. 4) The human-centered action decision assumption: A critical step in achieving proactive computing is to develop a conceptual framework for selecting human-centered computer actions based on user and task states. This requires a combined psychology and engineering approach involving current psychological knowledge and computer learning methods.

Beckman Institute researchers have already made much progress on the sensing of user states. In our preliminary research, emotional state is detected by video analysis of facial expression and of non-lexical characteristics of the speech signal, as well as by prosodic and semantic analysis of the speech. Motivational state is detected by video analysis of postural changes. Some aspects of cognitive state, including what users are attending to and whether they are having difficulties, are detected by eye movement recording. Human gestures and other motions are being tracked. Methods are being developed for producing affective communications: talking heads with appropriate emotional expression and emotionally expressive speech. An existing Integration Support Lab makes it possible to bring these diverse modalities together in a common environment.

Proposed research includes: (1) further development of methods to sense user postures, movements, expressions and speech, (2) analysis and fusion of this information to identify and track user states, (3) task state tracking, (4) creating a corpus emotion- and action-labeled videotapes for use with computer learning, (5) further development of affective communication, (6) development of the basis for human-centered state-based action decisions, and (7) evaluation of computer proaction on human behavior and response.

 

Copyright 2001 Beckman Institute,
University of Illinois at Urbana-Champaign