Arpit Sengar's Upcycled PC Speaker Delivers a Vocal LLM Assistant for Under $12 in Parts


SOURCE: HACKSTER.IO
NOV 21, 2025

Gareth Halfacree

Follow

1 hour ago • AI & Machine Learning / Internet of Things / Upcycling

Maker Arpit Sengar has upcycled an old PC speaker into an ultra-low-cost large language model (LLM) voice assistant built atop Google's Gemini platform — for less than ?1,000 (around $12).

"This project combines embedded systems and AI [Artificial Intelligence] inference to create an end-to-end conversational assistant," Sengar explains of the compact box, which includes an integrated battery for wire-free operation. "The [Espressif] ESP32 handles real-time audio recording and playback, while a Python backend performs: speech-to-text (STT), language understanding, [and] text-to-speech (TTS)."

This ultra-low-cost LLM-powered assistant makes use of an Espressif ESP32's Wi-Fi radio to offload the hard work. (????: Arpit Sengar)

The project is based around an old 2" PC speaker, which provides the audio outputs. This is installed in a housing which includes an Espressif ESP32-WROOM-32 microcontroller development board with built-in Wi-Fi connectivity, connected to an LM386 amplifier and a TDK InvenSense INMP441 MEMS microphone for input. There's also a tactile push-button input to the side, and a Top Power TP4056 charging module to handle the battery.

Rather than trying to run a large language model directly on-device, Sengar's design uses websockets to communicate with a remote system over Wi-Fi. First, audio is streamed to a speech recognition model based on Whisper; then the resulting text is fed to Google's Gemini large-language model as a prompt; Gemini's resulting output is then fed to the Piper speech synthesis model and streamed back to the ESP32 for playback, all using a Python back-end.

"Consider this backend as the brain of the system because this is where all the processing happens," Sengar explains. "For this I would recommend hosting an [Amazon] AWS EC2 instance assigned with a static IP. Alternatively you can run a local server on your laptop and connect your ESP32 through [a] hotspot."

The project is documented in full over on Instructables; source code is available on GitHub under the permissive MIT license.

speaker

voice control

machine learning

artificial intelligence

cloud service

Gareth Halfacree

Freelance journalist, technical author, hacker, tinkerer, erstwhile sysadmin. For hire: freelance@halfacree.co.uk.

Follow

Ad

Latest articles

Microsoft Relicenses Infocom Classics: Zork I, Zork II, Zork III Now MIT-Licensed and Free for All

Christian Lo's Loongcat40 Is a Compact 3D-Printable Ortholinear Keyboard with a Long, Long Screen

Watt's Up, Dog?

Severin von Wnuck-Lipinski Hacks Miele's Infrared Diagnostic System, Releases FreeMDU

Related articles

FutureProofHomes Wants to Boot Amazon's Alexa Out of Your Home with the DIY Satellite1 Smart Speaker

Tristam R. Rolls His Own Slick ESP32 Smart Speaker for Private, Local Home Assistant Voice Control

Seeed Studio's SenseCAP A1102 Delivers Compact, Affordable LoRaWAN-Connected Edge Vision AI

Silicon Labs Promises to "Connect Just About Anything" with Its Edge AI-Capable Series 3 Parts

Get our weekly newsletter when you join Hackster.

Sign up

Latest articles