
My Whispers to text
Table of Contents
Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.
From now on whatever you are reading everything has been done using the application called speech noote so I went online and searched for an application for a way to get speech to text I did saw some AI models AI platforms where I could do that on but the thing is it has to be offline has to be open source because I have to learn it later so this is what I found and I feel this is great so let’s get started with it. Great so first thing i did was uh well i’m i am using fedora fedora 41 version workstation so i went to softwares and uh just searched for speech to text and speech note came up i looked into the repository i have provided the github link there is gitlab link i have cloned it and um so i did this and now here am i testing the application which does sound good.
Yeah so here are the application screenshots first of four step is to well step one is to go to the languages tab and choose the language you want and after choosing the language it will provide all the models all the AI models that there are available so i went with whispers cpp with the large version of course because why not with the large version try that right so i went with that and when that is you have downloaded then you will come to the home page and it had detected what version you have downloaded so after doing that uh yeah you can just tap on listen and and start generating the text.
So the fact that it is running completely offline, completely open-source, this is great. This empowers you to learn how it is working and learn how things are done. So speech-to-text now is not a big deal to create or to use. I feel these Web2 technologies companies have found a way to just show you convenience and beneath that convenience, just slide their data collection and privacy issues to you. So that you have to be very careful about. That’s why I always try to make it…
Background Testing:
Okay so i’m testing if it is working in background also so i’m going to the gitlab page of ds note okay so ds note it was used to be said as ds note well that’s that’s good so they have provided flathub and yeah i have downloaded from flathub only and uh yeah how to install install is there yeah this is good and let’s see if it is still recording.
Yup it works!
Conslusion
Application is developed using C++ Qt. That’s what it’s called. And there are many open source project on which this is based on this application is based on. This is like great like wasc whisper.cpp, rubber, lame, ffmpeg, opus. Great. So this is good. So again, perfect for getting started. And if you want to learn more, then go to the repository and learn. I will start learning and creating the next episode of this because without I’ll feel I have to dwell within the speech recognition. I’ll do that.