COMP0016 Group 2 - ReadingStar

A native AI PC app to develop speech fluency through singing

Abstract

Reading and pronunciation difficulties pose significant challenges for many individuals, including children, non-native speakers, and people with accessibility needs. Existing speech recognition tools often struggle with accents and require cloud-based processing, which raises privacy concerns and makes them unusable offline. There is a need for an interactive, real-time, offline solution that uses AI to recognise speech reliably and build reading fluency.

We have created ReadingStar, an AI-powered, offline read-along and sing-along app built on Intel OpenVINO's optimisation pipeline and the OpenAI Whisper transcription model. It transcribes sung or spoken lyrics in real time, highlights progress as users speak, and offers feedback on pronunciation. Running on an Intel NPU keeps speech processing fast, private, and efficient, while multiple difficulty modes cater to different learning needs.

The app's interactive elements are designed to encourage enthusiastic engagement, igniting users' passion for singing so that they can train and practise their speech while improving both enunciation and reading proficiency.

Portfolio Video

This is our portfolio video walking through the usage and development of ReadingStar.

Team

Yusuf Afifi

Full Stack Developer

Ediz Cinbas

Full Stack Developer

Anthony Nkyi

Full Stack Developer

Jerry Wu

Full Stack Developer

Project Timeline

The Gantt chart below shows the timeline of the project.

Project Gantt Chart

Our Partners