Hi Andrew, I've been trying to hack together a similar audio language-support app in a podcast-player format (I started with Anytime Player), using some of the same principles as your project: transcript generation, chunking, and level- and obscurity-aware timestamped hints and translations.
I really think support for native content is the ideal way to learn for someone like me, especially with listening.
Hi Alex, lovely work. I have just started working on a similar idea: level-aware language support in a podcast player (extending Anytime Player) that helps the listener recognise words, phrases and meaning in native content.
I'm a non-techy person learning from scratch, so it's slow going. But speaking as a language learner who selfishly wants more tools to support immersion learning: if you want to support audio, I think something podcast-based would be super interesting.
Tangentially related: as a non-developer, I have started trying to hack/extend Anytime Podcast Player into a light language-learning app that helps intermediate learners use native-level resources.
The idea is to give me vocab, phrases and idioms, timed against a transcript so they appear just before the matching audio, along with some feedback on understanding and a simple learning model of the user's level, vocabulary, strengths, weaknesses, etc.
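To make the timing part concrete, here is a minimal sketch in Python, assuming the transcript already provides word-level timestamps. Everything named here (`Word`, `Hint`, `schedule_hints`, the glossary, the set of known words and the 2-second lead) is hypothetical and made up for illustration; it is not taken from Anytime Player or any existing project.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds into the audio where the word begins
    end: float    # seconds into the audio where the word ends


@dataclass
class Hint:
    show_at: float  # when the hint should appear on screen, in seconds
    text: str
    gloss: str


def schedule_hints(words, glossary, known, lead_seconds=2.0):
    """Build hints for glossary words the listener does not know yet.

    Each hint is scheduled lead_seconds before the word is spoken,
    clamped so it never falls before the start of the audio.
    """
    hints = []
    for w in words:
        # Normalise the token so "chapuza." matches the glossary key.
        key = w.text.lower().strip(".,!?;:")
        if key in glossary and key not in known:
            show_at = max(0.0, w.start - lead_seconds)
            hints.append(Hint(show_at=show_at, text=key, gloss=glossary[key]))
    return sorted(hints, key=lambda h: h.show_at)


# Hypothetical transcript fragment and glossary (assumed data).
words = [
    Word("Hola", 0.0, 0.4),
    Word("madrugada", 3.5, 4.2),
    Word("chapuza.", 6.0, 6.7),
]
glossary = {
    "madrugada": "early hours of the morning",
    "chapuza": "botched job",
}
hints = schedule_hints(words, glossary, known={"hola"})

for h in hints:
    print(f"{h.show_at:.1f}s  {h.text}: {h.gloss}")
```

A real player would also need to avoid stacking several hints at the same moment and to drop hints the listener has no time to read, which this sketch ignores.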
I've made a start using Whisper and some simple Bayesian modelling, but if anyone more technically gifted wants to steal my idea, they're more than welcome, as it's going to take a while for little old me.
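As one possible reading of "simple Bayesian modelling", here is a minimal sketch of a Beta-Bernoulli belief per word: each "understood" or "not understood" signal nudges an estimate of how likely the listener is to know that word. The `WordBelief` class, the uniform prior and the 0.7 threshold are assumptions chosen for illustration, not a description of any actual implementation.

```python
from dataclasses import dataclass


@dataclass
class WordBelief:
    # Beta(alpha, beta) prior over the chance the listener knows the word.
    # alpha = beta = 1 is a uniform prior (an assumed starting point).
    alpha: float = 1.0
    beta: float = 1.0

    def update(self, understood: bool) -> None:
        """Conjugate update: count one more success or one more failure."""
        if understood:
            self.alpha += 1.0
        else:
            self.beta += 1.0

    @property
    def p_known(self) -> float:
        """Posterior mean probability that the word is known."""
        return self.alpha / (self.alpha + self.beta)


def needs_hint(belief: WordBelief, threshold: float = 0.7) -> bool:
    """Show a hint while the estimated knowledge is below the threshold."""
    return belief.p_known < threshold


# Hypothetical feedback history for a single word.
belief = WordBelief()
for understood in [True, True, False, True]:
    belief.update(understood)

print(f"p_known = {belief.p_known:.2f}, needs hint: {needs_hint(belief)}")
```

Keeping one small belief per word makes it cheap to store and update on a phone, and the same estimate could be compared against word frequency to decide which obscure words deserve a hint first.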
Thanks for posting and good luck.