Assistive Visual Navigation
2024
Project Overview
Assistive visual navigation systems for visually impaired individuals have become increasingly popular thanks to the rise of mobile computing. Most of these devices work by translating visual information into voice commands. In complex scenarios where multiple objects are present, it is imperative to prioritize object detection and provide immediate notifications for key entities in specific directions. This brings the need for identifying the observer’s motion direction (ego-motion) by merely processing visual information, which is the key contribution of this project. Paper
See our poster at the BSN 2024: BSN 2024 Poster
See our project page for the practical implementation and testing: AIS-Clemson/VisionGPT AIS-Clemson/MotorFocus
Ego-Motion Prediction with All-Pixel Matching
This project introduces Motor Focus – a lightweight, image-based framework designed to predict ego-motion, i.e., the movement intentions of humans (or humanoid machines), using visual input while filtering out camera motion without the need for camera calibration. The framework implements an optical flow-based pixel-wise temporal analysis to account for camera movement, enhanced by a Gaussian aggregation to smooth the predicted movement area.
![](../images/VisionGPT/HsplitterV2.gif)
Features
Our framework mainly predicts ego-motion by identifying how users physically orient themselves in space through pixel-wise temporal analysis.
- Video Stabilization: To counteract camera shake, we used SVD and optical flow to estimate the affine transformation matrix from feature points extracted in two consecutive frames.
- Camera Motion Compensation: The fusion of two consecutive frames filters the camera motion, which highlights the object that moves relatively with the observer.
![](../images/VisionGPT/frame_3.png)
Project Team
PI: Dr. Abolfazl Razi arazi@clemson.edu
Graduate Students:
- Hao Wang
- Xiwen Chen
- Ashish Bastola
Undergraduate Students:
- John Suchanek
Acknowledgements:
- Jiayou Qin (Stevens Institute of Technology)
- Zihao Gong (Tokai University)
Outcomes
Conference Proceedings:
- Motor Focus: Fast Ego-Motion Prediction for Assistive Visual Navigation (BSN 2024)
Codes