
Body and gestures: models, algorithms, applications

Human-Computer Interaction II course (Corso di Interazione uomo-macchina II)

Prof. Giuseppe Boccignone

Dipartimento di Scienze dell’Informazione, Università di Milano

[email protected]
http://homes.dsi.unimi.it/~boccignone/l

DeMeuse (1987)

Non verbal behavior//De Meuse


A. Vinciarelli, M. Pantic, H. Bourlard, Social Signal Processing: Survey of an Emerging Domain, Image and Vision Computing (2008)

Posture and gesture interaction


• There are two main challenges in recognizing posture and gestures:

  • detecting the body parts involved in the gesture (e.g. hands)

    • addressed by selecting appropriate visual features, e.g. histograms of oriented gradients, optical flow, spatio-temporal salient points and space-time volumes.

  • modeling the temporal dynamics of the gesture

    • addressed by using techniques such as Dynamic Time Warping, Hidden Markov Models, and Conditional Random Fields.
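As an illustration of the temporal-modeling step, here is a minimal sketch of Dynamic Time Warping on 1-D feature sequences. The function name `dtw_distance` and the toy sequences are assumptions made for this example; a real system would compare multi-dimensional feature vectors per frame.

```python
from math import inf

def dtw_distance(a, b):
    """Dynamic Time Warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local frame-to-frame distance
            # extend the cheapest of: insertion, deletion, match
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

# Hypothetical 1-D trajectories (e.g. hand height over time)
template = [0.0, 1.0, 2.0, 1.0, 0.0]
slow = [0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0]  # same gesture at half speed
other = [2.0, 1.0, 0.0, 1.0, 2.0]                           # a different gesture

d_same = dtw_distance(template, slow)
d_diff = dtw_distance(template, other)
```

The slowed-down copy aligns with the template at a lower cost than the different gesture does, which is why DTW is useful when the same gesture is performed at varying speeds.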

Analysing postures and gestures

• The primary goal of gesture recognition research is to create a system which can identify specific human gestures and use them to convey information or for device control.

• A gesture may be defined as a physical movement of the hands, arms, face, and body with the intent to convey information or meaning.

• Gesture recognition, then, consists not only of tracking human movement, but also of interpreting that movement as semantically meaningful commands.


• As in the case of gestures, machine recognition of walking style (gait) has been investigated as well, but only for purposes other than SSP, namely recognition and identification in biometric applications.

• The common approach is to segment the silhouette of the human body into individual components (legs, arms, trunk, etc.), and then to represent their geometry during walking through

  • vectors of distances, symmetry operators, geometric features of body and stride (e.g. distance between head and feet or pelvis)
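A toy sketch of the kind of geometric feature mentioned above, computed from a binary silhouette. The function `silhouette_features`, the two chosen features and the 5x5 mask are assumptions for illustration only; they are not taken from any specific gait-recognition system.

```python
def silhouette_features(mask):
    """Return (body_height, stride_width) in pixels from a binary silhouette.

    mask is a list of rows, each a list of 0/1 values (1 = body pixel).
    """
    rows = [r for r, row in enumerate(mask) if any(row)]
    if not rows:
        raise ValueError("empty silhouette")
    top, bottom = rows[0], rows[-1]
    body_height = bottom - top + 1  # head-to-feet distance
    # horizontal spread of the body pixels on the lowest row (the feet)
    foot_cols = [c for c, v in enumerate(mask[bottom]) if v]
    stride_width = foot_cols[-1] - foot_cols[0] + 1
    return body_height, stride_width

# Hypothetical silhouette of a walking figure with legs apart
frame = [
    [0, 0, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 0, 1, 0],
    [1, 0, 0, 0, 1],
]
features = silhouette_features(frame)
```

Collecting such features frame by frame gives a sequence of feature vectors that describes the geometry of the body during walking.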

Analysing postures and gestures

Analysing postures and gestures//application areas


• Automatic posture recognition has been addressed in a few works, mostly aiming at

  • surveillance

  • activity recognition

• Few works recognize posture as a social signal, for example

  • to estimate the interest level of children learning to use computers

  • to recognize the affective state of people

Analysing postures and gestures

http://www-prima.imag.fr/

Production and perception of body gestures. Body gestures originate as a mental concept G, are expressed (Tpg) through limb motion P, and are perceived (Tvp) as visual images V.

Body gestures and postures//Generative model

[Diagram: G (body gesture, mental concept) -> Tpg -> P (body posture, limb motion) -> Tvp -> V (visual images)]


Graphical model: G -> P -> V

$$P(V, P, G) = P(V \mid P)\, P(P \mid G)\, P(G)$$

Body gestures and postures//Generative model


Graphical model: G -> P -> V

$$P(G \mid V) = \frac{P(V \mid G)\, P(G)}{P(V)} = \frac{P(G) \sum_{P} P(V \mid P)\, P(P \mid G)}{\sum_{G} \sum_{P} P(G)\, P(V \mid P)\, P(P \mid G)}$$
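A minimal sketch of this inference for discrete variables: the posture P is summed out to obtain P(V | G), which is then combined with the prior P(G) and normalized. The function name, the gesture, posture and observation labels, and all probability values are invented for illustration.

```python
def gesture_posterior(prior, p_pose_given_gesture, p_obs_given_pose, obs):
    """Return P(G | V=obs) by summing out the posture P."""
    unnormalized = {}
    for g, p_g in prior.items():
        # P(V | G) = sum_P P(V | P) P(P | G)
        likelihood = sum(
            p_obs_given_pose[p][obs] * p_p
            for p, p_p in p_pose_given_gesture[g].items()
        )
        unnormalized[g] = p_g * likelihood
    evidence = sum(unnormalized.values())  # P(V), the denominator
    return {g: u / evidence for g, u in unnormalized.items()}

# Hypothetical probability tables
prior = {"pointing": 0.5, "waving": 0.5}
p_pose_given_gesture = {
    "pointing": {"arm_extended": 0.9, "arm_raised": 0.1},
    "waving": {"arm_extended": 0.2, "arm_raised": 0.8},
}
p_obs_given_pose = {
    "arm_extended": {"horizontal_blob": 0.8, "vertical_blob": 0.2},
    "arm_raised": {"horizontal_blob": 0.3, "vertical_blob": 0.7},
}

posterior = gesture_posterior(
    prior, p_pose_given_gesture, p_obs_given_pose, "horizontal_blob"
)
```

With these tables, observing a horizontal blob makes "pointing" the more probable gesture, because the extended-arm posture explains that observation best.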

Body gestures and postures//Generative model: inference

[Figure: example gesture "pointing"]

[Figure: graphical model with nodes G, P, V]

Body gestures and postures//Generative model: more complex model

[Figure labels: pointing, body, walking, golf swing]

Body gestures and postures//Generative model


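[Figure: dynamic model over two time slices, with nodes G_t, P_t, V_t and G_{t+1}, P_{t+1}, V_{t+1}; label: body]

$$P(G_{t+1} \mid V_{t+1}) \propto P(V_{t+1} \mid G_{t+1})\, P(G_{t+1} \mid V_t) \propto P(V_{t+1} \mid G_{t+1}) \sum_{G_t} P(G_{t+1} \mid G_t)\, P(G_t \mid V_t)$$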
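The recursion above can be sketched as a predict-then-correct update over discrete gesture classes. This is only an illustrative sketch: the observation model P(V | G) is assumed to be given directly (i.e. the posture has already been summed out), and the function name, class labels and probabilities are hypothetical.

```python
def filter_step(belief, transition, p_obs_given_gesture, obs):
    """One update: P(G_t | V_t) -> P(G_{t+1} | V_{t+1})."""
    # Prediction: P(G_{t+1} | V_t) = sum_{G_t} P(G_{t+1} | G_t) P(G_t | V_t)
    predicted = {
        g_next: sum(transition[g][g_next] * belief[g] for g in belief)
        for g_next in belief
    }
    # Correction: weight by P(V_{t+1} | G_{t+1}), then renormalize
    unnormalized = {
        g: p_obs_given_gesture[g][obs] * predicted[g] for g in predicted
    }
    total = sum(unnormalized.values())
    return {g: u / total for g, u in unnormalized.items()}

# Hypothetical model with two gesture classes
transition = {
    "walking": {"walking": 0.9, "pointing": 0.1},
    "pointing": {"walking": 0.2, "pointing": 0.8},
}
p_obs_given_gesture = {
    "walking": {"legs_moving": 0.9, "arm_out": 0.1},
    "pointing": {"legs_moving": 0.2, "arm_out": 0.8},
}

belief = {"walking": 0.5, "pointing": 0.5}  # initial P(G_0)
first = filter_step(belief, transition, p_obs_given_gesture, "arm_out")
for obs in ["arm_out", "arm_out", "legs_moving"]:
    belief = filter_step(belief, transition, p_obs_given_gesture, obs)
```

Each call reuses the previous posterior as the new prior, so the estimate at time t+1 depends on the whole observation history through a single belief vector.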

Body gestures and postures//Generative model: inference


Body gestures and postures//Generative model: architecture

Body gestures and postures//Generative model: body models


tracked body parts indexed by different colors

Body gestures and postures//limb segmentation


Body gestures and postures//Body-part parameterization

[Figure panels: ellipse; convex hull]

Body gestures and postures//Limb pose estimation: head


Body gestures and postures//Limb pose estimation: arm

Body gestures and postures//Limb pose estimation: leg


Body gestures and postures//Body-part parameterization

Body gestures and postures//Estimating body posture


Hand gestures

• Taxonomy of hand gestures for HCI

• Visual interpretation of hand gestures can help in achieving the ease and naturalness desired for Human Computer Interaction (HCI).

Hand gestures

• Classical use in HCI:

  • In a computer-controlled environment, one wants to use the human hand to perform tasks that mimic both the natural use of the hand as a manipulator and its use in human-machine communication.


Production and perception of gestures. Hand gestures originate as a mental concept G, are expressed (Thg) through arm and hand motion H, and are perceived (Tvh) as visual images V.

Hand gestures//Generative model


Graphical model: G -> H -> V

$$P(V, H, G) = P(V \mid H)\, P(H \mid G)\, P(G)$$


Graphical model: G -> H -> V

$$P(G \mid V) = \frac{P(V \mid G)\, P(G)}{P(V)} = \frac{P(G) \sum_{H} P(V \mid H)\, P(H \mid G)}{\sum_{G} \sum_{H} P(G)\, P(V \mid H)\, P(H \mid G)}$$


Hand gestures//Generative model

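[Figure: dynamic model over two time slices, with nodes G_t, H_t, V_t and G_{t+1}, H_{t+1}, V_{t+1}]

$$P(G_{t+1} \mid V_{t+1}) \propto P(V_{t+1} \mid G_{t+1})\, P(G_{t+1} \mid V_t) \propto P(V_{t+1} \mid G_{t+1}) \sum_{G_t} P(G_{t+1} \mid G_t)\, P(G_t \mid V_t)$$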


• 3D hand-model-based approaches use articulated models of the human hand and arm to estimate the hand and arm movement parameters. Such movements are later recognized as gestures.

• Appearance-based models directly link the appearance of the hand and arm movements in visual images to specific gestures.
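One very simple way to picture the appearance-based idea is template matching on binary silhouettes: an observed hand mask is assigned the label of the stored template it overlaps most. The sketch below uses intersection-over-union as the similarity; the function names, the 3x3 masks and the gesture labels are assumptions for illustration, not a method taken from the slides.

```python
def iou(mask_a, mask_b):
    """Intersection-over-union of two equally sized binary masks."""
    inter = union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            inter += 1 if (a and b) else 0
            union += 1 if (a or b) else 0
    return inter / union if union else 0.0

def classify(mask, templates):
    """Return the label of the template with the highest overlap."""
    return max(templates, key=lambda label: iou(mask, templates[label]))

# Hypothetical gesture templates (1 = hand pixel)
templates = {
    "open_palm": [[1, 1, 1], [1, 1, 1], [0, 1, 0]],
    "point": [[0, 1, 0], [0, 1, 0], [1, 1, 1]],
}
observed = [[0, 1, 0], [0, 1, 0], [1, 1, 0]]

label = classify(observed, templates)
```

No hand or arm model is estimated here: the image appearance is mapped straight to a gesture label, which is the defining trait of appearance-based approaches.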

Hand gestures//Generative model: spatial models


• 3D textured volumetric model

• 3D wireframe volumetric model

• 3D skeletal model

• Binary silhouette

• Contour


Gestures for augmented reality

Kinect style://http://www.openni.org/


The Pascal Challenge//http://gesture.chalearn.org/data

• portraying a single user in front of a fixed camera, interacting with a computer by performing gestures to

  • play a game,

  • remotely control appliances or robots, or

  • learn to perform gestures from educational software


• collected a large dataset of 50,000 gestures with Kinect™

• provide MATLAB™ code to browse through the data and process it


Multimodal emotion recognition//Gunes & Piccardi


• Results show that emotion classification using the two modalities (face and body) generally achieves better recognition accuracy than classification using the face modality only.

• Using expressive body information adds accuracy to emotion recognition based on the face alone.

• Early fusion seems to achieve better recognition accuracy than late fusion.
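To make the two fusion schemes concrete: early (feature-level) fusion concatenates the face and body feature vectors and classifies once, while late (decision-level) fusion classifies each modality separately and then combines the scores. The sketch below uses a toy nearest-centroid scorer and a product rule; the classifier, the feature values and the emotion labels are assumptions for illustration and are not the setup used by Gunes & Piccardi.

```python
from math import dist

def class_scores(x, centroids):
    """Normalized similarity scores: closer to a class centroid = higher score."""
    raw = {label: 1.0 / (dist(x, c) + 1e-9) for label, c in centroids.items()}
    total = sum(raw.values())
    return {label: r / total for label, r in raw.items()}

def early_fusion(face_x, body_x, joint_centroids):
    # Feature level: concatenate both modalities, classify once
    scores = class_scores(face_x + body_x, joint_centroids)
    return max(scores, key=scores.get)

def late_fusion(face_x, body_x, face_centroids, body_centroids):
    # Decision level: score each modality, then combine with a product rule
    f = class_scores(face_x, face_centroids)
    b = class_scores(body_x, body_centroids)
    combined = {label: f[label] * b[label] for label in f}
    return max(combined, key=combined.get)

# Hypothetical class centroids per modality
face_centroids = {"happy": [1.0, 0.0], "angry": [0.0, 1.0]}
body_centroids = {"happy": [0.5, 0.5], "angry": [0.0, 2.0]}
joint_centroids = {
    label: face_centroids[label] + body_centroids[label]
    for label in face_centroids
}

face_x = [0.55, 0.45]  # ambiguous facial features
body_x = [0.1, 1.8]    # body features close to the "angry" centroid

face_scores = class_scores(face_x, face_centroids)
face_only = max(face_scores, key=face_scores.get)
early = early_fusion(face_x, body_x, joint_centroids)
late = late_fusion(face_x, body_x, face_centroids, body_centroids)
```

In this toy case the face alone is nearly ambiguous, and adding the body features changes the decision under both schemes. Which scheme performs better on real data is an empirical question that this example does not settle.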
