论文标题

使用AM-FM表示的人类注意力检测

Human Attention Detection Using AM-FM Representations

论文作者

Shi, Wenjing

论文摘要

来自数字视频的人类活动检测给计算机视觉和图像处理社区带来了许多挑战。最近,已经开发出许多方法来检测成功程度不同的人类活动。然而,人类的一般活动检测问题仍然非常具有挑战性,尤其是当方法需要“在野外”起作用(例如,在不精确控制成像几何形状的情况下)。该论文探讨了(i)检测面,(ii)头部后部,(iii)面部和后部的联合检测的基于阶段的解决方案,以及(iv)使用标准的摄像机,无论是对左还是右而没有任何控制图几何形状的标准摄像机。提出的基于阶段的方法基于依赖于使用振幅调制调制(AM-FM)模型的简单且可靠的方法的开发。使用从数学和工程学(AOLME)项目中提取的视频帧(AOLME)项目中提取的视频帧进行验证。该数据集由10名看着相机的学生的13,265张图像组成,以及来自五个学生的6,122张图像。对于面对相机的学生,该方法能够正确地对左侧的97.1%进行分类,其中95.9%的人看向右。对于面对相机背面的学生,该方法能够正确地对左侧的87.6%进行分类,其中93.3%的人看向右。结果表明,基于AM-FM的方法对分析人类活动视频有很大的希望。

Human activity detection from digital videos presents many challenges to the computer vision and image processing communities. Recently, many methods have been developed to detect human activities with varying degree of success. Yet, the general human activity detection problem remains very challenging, especially when the methods need to work 'in the wild' (e.g., without having precise control over the imaging geometry). The thesis explores phase-based solutions for (i) detecting faces, (ii) back of the heads, (iii) joint detection of faces and back of the heads, and (iv) whether the head is looking to the left or the right, using standard video cameras without any control on the imaging geometry. The proposed phase-based approach is based on the development of simple and robust methods that rely on the use of Amplitude Modulation- Frequency Modulation (AM-FM) models. The approach is validated using video frames extracted from the Advancing Out-of-school Learning in Mathematics and Engineering (AOLME) project. The dataset consisted of 13,265 images from ten students looking at the camera, and 6,122 images from five students looking away from the camera. For the students facing the camera, the method was able to correctly classify 97.1% of them looking to the left and 95.9% of them looking to the right. For the students facing the back of the camera, the method was able to correctly classify 87.6% of them looking to the left and 93.3% of them looking to the right. The results indicate that AM-FM based methods hold great promise for analyzing human activity videos.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源