Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Image recognition tools for blind and visually impaired users: an emphasis on the design considerations
Fernando S., Ndukwe C., Virdee B., Djemai R. ACM Transactions on Accessible Computing18 (1):1-21,2025.Type:Article
Date Reviewed: Jun 16 2025

This paper offers a timely and comprehensive overview of assistive image recognition technologies (IRTs), focusing on the inclusion of blind or low vision (BLV) users. It contextualizes the topic within broader digital inclusion goals, referencing United Nations (UN) Sustainable Development Goals (SDGs)--notably SDG 9--and ISO standards on accessible technology, such as braille displays and audio output.

Section 2 maps the features of 21 IRTs (for example, Seeing AI, Envision, Supersense), highlighting object recognition, text reading, currency detection, and auditory feedback. Table 1 details tool coverage, while platforms like Be My Eyes--integrated with GPT-4V--stand out for their community-driven approach. The paper emphasizes AI-powered multimodal systems (for example, OpenAI’s GPT-4V) that enable text-to-speech, spoken commands, and contextual feedback.

The paper discusses preprocessing (for example, contrast enhancement and noise reduction) and deep learning models--You Only Look Once (YOLO), convolutional neural networks (CNNs), long short-term memory (LSTM), and recurrent neural networks (RNNs)--for real-time object detection. Mobile apps use OpenCV, Ultralytics, and pyttsx3 to convert visual data into speech [1]. Others integrate VGG16 with gated recurrent units (GRUs)/LSTM for caption generation [2], or propose edge-AI solutions with low-cost hardware like Intel’s Myriad X and advanced driver assistance systems (ADAS) [3].

Section 3 presents survey data from ten BLV participants, exploring usability, satisfaction, and areas needing improvement (for example, multilingual support, high costs, and software stability). ISO ergonomics principles are considered in relation to user-centered human–computer interaction (HCI).

Overall, this is a well-researched, technically sound, and interdisciplinary contribution to accessible computing.

Reviewer:  Romina Fucà Review #: CR147966
1) Essakiraja, S.; Vishwanathan, K.; Anton Gino, J. A.; Subbulakshmi, T. C. An application for currency detection for visually impaired. International Journal of Novel Research and Development 9, 4(2024), 1–8.
2) Manay, S. P.; Yaligar, S. A.; Reddy, Y. T. S. S.; Saunshimath, N. J. Image captioning for the visually impaired. In Emerging research in computing, information, communication and applications (LNEE 789). Springer, 2022, 511-522.
3) Mahendran, J. K.; Barry, D. T.; Nivedha, A. K.; Bhandarkar, S. M. Computer vision-based assistance system for the visually impaired using mobile edge artificial intelligence. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 2021, 2418-2427.
Bookmark and Share
  Featured Reviewer  
 
Assistive Technologies For Persons With Disabilities (K.4.2 ... )
 
 
General (I.4.0 )
 
Would you recommend this review?
yes
no
Other reviews under "Assistive Technologies For Persons With Disabilities": Date
Human factors issues in the neural signals direct brain-computer interfaces
Moore M., Kennedy P.  Assistive technologies (Proceedings of the Fourth International ACM Conference on Assistive Technologies, Arlington, VA, Nov 13-15, 2000)114-120, 2000. Type: Proceedings
Feb 28 2005
Accessibility for everybody: understanding the section 508 accessibility requirements
Mueller J., APress, LP, 2003.  560, Type: Book (9781590590867)
Aug 7 2003
Constructing accessible Websites
Thatcher J., Waddell C., APress, LP, 2003.  400, Type: Book (9781590591482)
Nov 14 2003
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2025 ThinkLoud®
Terms of Use
| Privacy Policy