Real-Time Gesture Tracking Algorithm and Hardware Architecture Design in Kinect-Based Multimedia Systems
Autor: | Ming-Wei Kuo, 郭民偉 |
---|---|
Rok vydání: | 2012 |
Druh dokumentu: | 學位論文 ; thesis |
Popis: | 100 In recent years, hand gesture recognization is the newest interface for many devices, such as Samsung’s SmartTV, Slide show, remote control. It is called HCI (Human-Computer Interface). In the past, there were many complex algorithms. Now we have a new choice in this application. We propose a new algorithm which has depth and color images. Capturing the raw data from XBOX 360 kinect, we calculate the hand position in real-time. We propose a hardware-oriented algorithm. It is designed for personal user. Our algorithm is separated into two parts. First, we asked the user to shake his/her hand in the detectable region to initiate the system. Our proposed algorithm can convert the captured image to generate user’s depth map. Second, user can move in the region and make a command to operate our multimedia system. The current and the previous luminance frames which are block-based color images are subtracted. After the absolutes of differences were calculated out, their histogram was generated. In this method, the moving pixels can be got. The depth values of the moving blocks in histogram were counted. The largest number of histogram was the user’s depth value. The user’s depth value was used to extract user’s depth region. A skin-tone filter was used to get user’s head region, hand region and depth value. Hand depth region was defined to transfer into a binary image. A modified median filter was used to remove noise from the binary image. Finally, the hand’s depth value and hand position were used to track hand and to recognize command. We used an Command-Trigger-Event (CTE) to trigger the hand tracking. In software, we used OpenCV to be our GUI interface. Because the software processing time can’t reach real-time requirements, we need to design hardware to achieve real-time processing. We proposed a spec for 30 frames per second, 640x480 resolutions, and the operation frequency is 47.6MHz. |
Databáze: | Networked Digital Library of Theses & Dissertations |
Externí odkaz: |