Eating and Drinking Activity Recognition Database (MOBISERV-AIIA Database) was created in order to fulfill the need for a good benchmarking database for the evaluation of specialized meal intake (such as eating and drinking) activity recognition algorithms. The captured video data constitute a database gathered during AIIA Laboratory’s research under the Collaborative European Project: MOBISERV FP7-248434, An Integrated Intelligent Home Environment for the Provision of Health, Nutrition and Mobility Services to the Elderly.
Meal intake procedure was captured in a controlled laboratory environment using 4 Sony XCD-V60CR digital video cameras positioned such as to capture the participants frontal, upper frontal, 45° and profile view. An instance of each camera viewing angle is shown in figure.All sequences were recorded at a resolution of 640x480 pixels, at a frame rate of 15 frames per second and were saved in avi format.In the recordings 12 persons (6 females and 6 males) participated with Caucasian origin aging between 22 and 39 years old with different facial characteristics (eye glasses, beard etc.). Each person participated in 4 distinct meal intakesessions and two videos for each one session were recorded: one depicting the participant wearing a blouse or a shirt with long sleeves and one wearing a t-shirt or a blouse with short sleeves. In total, 8 videos are available for each person for each session. This results in a database consisted of 384 video sequences.
The meal intake scenario that each participant has performed involves eating with spoon, cutlery, fork, hands and drinking from cup, glass and straw performed in the following order:
- He/she enters the room, sits in front of the frontal camera and starts eating cereals with a spoon (4 iterations) and he/she drinks water from a cup (4 iterations).
- The bowl is replaced with a plate and the cup with a glass of water. The person continues eating using a knife and a fork (4 iterations) and drinks from a cup (1-2 iterations).
- He/she eats the same food using only the fork (the hand holding the fork is changed, 4 iterations). He/she drinks again (1-2 iterations).
- The person puts a straw in his/her glass of water and continues eating with his/her hand (4 iterations) and drinking from straw (4 iterations in total).
- The person, also, eats with his/her hands a loaf of bread, a croissant or a sandwich (4 bites).
- Finally, the person spends some time doing something else besides eating, like thinking/wondering, paying attention, talking, reading, writing, touching his/her hair, ear, glasses, chin or yawing.
- He/she stands up and exits the room.
Along with the recorded videos, two annotation txt files are available for each session. The first txt file refers to the frontal view video and annotates the human performed movement per video frame during meal intake procedure. The movement types are described in the 'annotation.doc' file that accompanies the txt files.
The second txt file contains information for the synchronizarion of all video files with respect to the frontal view video file. The 4 numbers written in a column form define the difference in frames between a specific frame in the video captured by the frontal view camera and the videos captured by the other cameras in the following order: frontal camera, 45° camera, profile camera, upper frontal camera.
How to Download the Dataset
The MOBISERV-AIIA Database is publicly available for scientific purposes. To obtain access to the dataset and annotation files read the MOBISERV-AIIA Database License Agreement, sign a printed copy of the agreement and fax it to +30-231-099-8453 or send a scanned copy to firstname.lastname@example.org with:
Subject: MOBISERV-AIIA Database download request
Please send me a username and password in order to download the MOBISERV-AIIA Database.
Within a few business days, you will receive a reply email with username/password and detailed instructions. Please use only academic email addresses. Do not hesitate to ask any questions or to make suggestions regarding to this database.