This #raspberrypi is always listening… #openai

“abe’s projects”

In this video I use OpenAI’s Whisper model on a Raspberry Pi to transcribe audio from a conference room mic. It works pretty well, but it’s far too slow for a lot of applications. With some creative hacking you may be able to make it useful, I don’t know!

22 Comments

@WebsterMitaran says:
March 24, 2024 at 4:37 pm

Hi sir, can I have your Raspberry Pi? 😅
Reply
@wobblycentaur says:
March 24, 2024 at 4:37 pm

Using the Google trigger word in your script is a no-no. I won't be back.
Reply
@wicorn29 says:
March 24, 2024 at 4:37 pm

That’s a Pi 4B
Reply
@maxheadroomone says:
March 24, 2024 at 4:37 pm

Low voltage detected
Reply
@tbuk8350 says:
March 24, 2024 at 4:37 pm

The hard part is trying to do realtime inferencing with Whisper. It takes 30-second chunks, so you need to detect mic input, read it, and then when the noise has fully stopped process all of it. This gets complicated because someone may be talking near it but not at it.

I researched for weeks and couldn't find any method to have it just idly listening for a wake word. As far as I can tell there's no easy way for a consumer to do such a thing.
Reply
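The buffering approach described in the comment above (wait for sound, collect it, and hand it off once the noise stops or the 30-second window fills) could be sketched roughly as follows. This is only an illustration, not what the video does: the frame size, the RMS threshold, and the function names `rms` and `segment_utterances` are all assumptions, and a simple energy threshold cannot tell speech aimed at the device from speech near it.

```python
import math
from typing import Iterable, Iterator, List, Sequence

FRAME_MS = 30            # assumed frame length in milliseconds
MAX_CHUNK_S = 30         # Whisper's window length, as noted in the comment
THRESHOLD = 500.0        # assumed RMS threshold for 16-bit samples; tune per mic
SILENCE_FRAMES = 20      # assumed: about 0.6 s of quiet ends an utterance
MAX_FRAMES = MAX_CHUNK_S * 1000 // FRAME_MS


def rms(frame: Sequence[int]) -> float:
    """Root-mean-square level of one frame of PCM samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def segment_utterances(
    frames: Iterable[Sequence[int]],
    threshold: float = THRESHOLD,
    silence_frames: int = SILENCE_FRAMES,
    max_frames: int = MAX_FRAMES,
) -> Iterator[List[int]]:
    """Yield one sample buffer per utterance, ready to pass to a transcriber."""
    buffer: List[int] = []
    quiet = 0      # consecutive quiet frames inside the current utterance
    count = 0      # frames collected in the current utterance
    for frame in frames:
        loud = rms(frame) >= threshold
        if not buffer and not loud:
            continue  # idle: nothing recorded yet and still quiet
        buffer.extend(frame)
        count += 1
        quiet = 0 if loud else quiet + 1
        # Flush when the speaker has stopped or the window is full.
        if quiet >= silence_frames or count >= max_frames:
            yield buffer
            buffer, quiet, count = [], 0, 0
    if buffer:
        yield buffer  # flush whatever is left when the stream ends
```

Each yielded buffer would then go to the speech model in one call, so the slow transcription step only runs when something was actually said.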
@hamzanasir1590 says:
March 24, 2024 at 4:37 pm

Amazing brother
Reply
@antiqueperfection says:
March 24, 2024 at 4:37 pm

Sneaky lol. Hey Google
Reply
@adrianthoroughgood1191 says:
March 24, 2024 at 4:37 pm

My smartphone from 8 years ago could do real-time offline speech-to-text, so I'm surprised it's as slow as that. What model of Pi is it?
Reply
@irvingdeleon says:
March 24, 2024 at 4:37 pm

You can use CUDA cores for this task. I tested it with an RTX 3050 mobile GPU, and the speech-to-text task ran very close to real time! Right now I'm doing some tests using a Jetson Nano.
Reply
@G0RSHK0V says:
March 24, 2024 at 4:37 pm

Keeping it offline is actually better, for security reasons
Reply
@chandrakesh5288 says:
March 24, 2024 at 4:37 pm

We're from India ❤❤
Reply
@imnotahippie22 says:
March 24, 2024 at 4:37 pm

Watching these videos makes me wish I didn't have ADHD. I can't focus on much, but I enjoy seeing it!
Reply
@crazyjake2016 says:
March 24, 2024 at 4:37 pm

When you said "Hey Google", the Google Assistant on my phone activated and thought it was being talked to. Even video audio can trigger Google Assistant.
Reply
@adamdboyd says:
March 24, 2024 at 4:37 pm

What about using an Intel NCS2 USB AI accelerator stick?
Reply
@ryanmobsby5083 says:
March 24, 2024 at 4:37 pm

Forced my phone to google the words "and then". 😂
Reply
@perfecttoast3663 says:
March 24, 2024 at 4:37 pm

There's also faster-whisper, which is used as part of Home Assistant's local voice assistant stack. Maybe give that a try.
Reply
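For anyone wanting to try the faster-whisper suggestion above, a minimal sketch might look like this. It assumes the `faster-whisper` package is installed and that a file such as `recording.wav` exists; the `tiny` model size, the `int8` compute type, and the helper names `format_segment` and `transcribe_file` are illustrative choices, not settings taken from the video. Whether it runs fast enough on a Pi is untested here.

```python
from typing import List


def format_segment(start: float, end: float, text: str) -> str:
    """Render one transcribed segment as a timestamped line."""
    return f"[{start:6.2f}s -> {end:6.2f}s] {text.strip()}"


def transcribe_file(path: str, model_size: str = "tiny") -> List[str]:
    """Transcribe an audio file on the CPU and return timestamped lines."""
    # Imported here so the formatting helper works without the package installed.
    from faster_whisper import WhisperModel

    # int8 quantization is an assumed choice to reduce memory and CPU load.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # Fixing the language skips automatic language detection.
    segments, _info = model.transcribe(path, language="en")
    # `segments` is a generator; transcription runs as it is consumed.
    return [format_segment(s.start, s.end, s.text) for s in segments]


if __name__ == "__main__":
    for line in transcribe_file("recording.wav"):  # hypothetical file name
        print(line)
```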
@glitchrexouium4023 says:
March 24, 2024 at 4:37 pm

Stop activating my Google assistant lol
Reply
@ApertureSciencePsycho says:
March 24, 2024 at 4:37 pm

Undervoltage detected lmao
Reply
@CommentGuard717 says:
March 24, 2024 at 4:37 pm

Personally, I just use the Whisper API with an API key. It's very fast and super accurate.

Also, to get any sort of good accuracy (better than Google's and the iPhone's), you must set the language to English. If you leave it on automatic, it's worse.
Reply
@Minecraftzocker135 says:
March 24, 2024 at 4:37 pm

If it takes so much time to transcribe, how much can you speak before the storage is just full?
Reply
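A back-of-the-envelope answer to the storage question above, under stated assumptions: the audio is kept as raw 16-bit mono PCM at 16 kHz (the sample rate Whisper works with), which comes to 32,000 bytes per second, or about 115 MB per hour. The video does not say how audio is stored or how slow transcription is, so the constants and the `realtime_factor` parameter below are hypothetical.

```python
SAMPLE_RATE_HZ = 16_000   # assumed: the rate Whisper resamples audio to
BYTES_PER_SAMPLE = 2      # assumed: 16-bit PCM
CHANNELS = 1              # assumed: mono


def bytes_per_second() -> int:
    """Storage consumed by one second of uncompressed audio."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS


def hours_until_full(free_bytes: int) -> float:
    """Hours of continuous audio that fit in the given free space."""
    return free_bytes / bytes_per_second() / 3600


def backlog_seconds(spoken_seconds: float, realtime_factor: float) -> float:
    """Seconds of processing still pending once the speaker stops.

    realtime_factor is processing time divided by audio time, so a value
    of 3.0 means one minute of speech takes three minutes to transcribe.
    """
    return max(0.0, spoken_seconds * (realtime_factor - 1.0))


if __name__ == "__main__":
    print(f"{hours_until_full(32 * 10**9):.1f} hours fit in 32 GB")
    print(f"{backlog_seconds(60, 3.0):.0f} s of backlog after 60 s of speech")
```

Under these assumptions storage is rarely the limit; the growing processing backlog is the more practical problem when transcription is slower than real time.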
@ShamyTheMage says:
March 24, 2024 at 4:37 pm

Activated Google on my phone when you said "Hey Google" lmao
Reply
@TheNateKtv says:
March 24, 2024 at 4:37 pm

Bro didn't even give us a trigger warning for the box demon
Reply
