Improved Claude Voice Interface

Voice interface for Claude with TTS functionality, speech recognition, auto-reading and drag positioning

Versión del día 27/2/2025. Echa un vistazo a la versión más reciente.

Tendrás que instalar una extensión para tu navegador como Tampermonkey, Greasemonkey o Violentmonkey si quieres utilizar este script.

You will need to install an extension such as Tampermonkey to install this script.

Tendrás que instalar una extensión como Tampermonkey o Violentmonkey para instalar este script.

Necesitarás instalar una extensión como Tampermonkey o Userscripts para instalar este script.

Tendrás que instalar una extensión como Tampermonkey antes de poder instalar este script.

Necesitarás instalar una extensión para administrar scripts de usuario si quieres instalar este script.

(Ya tengo un administrador de scripts de usuario, déjame instalarlo)

Tendrás que instalar una extensión como Stylus antes de poder instalar este script.

Tendrás que instalar una extensión como Stylus antes de poder instalar este script.

Tendrás que instalar una extensión como Stylus antes de poder instalar este script.

Para poder instalar esto tendrás que instalar primero una extensión de estilos de usuario.

Para poder instalar esto tendrás que instalar primero una extensión de estilos de usuario.

Para poder instalar esto tendrás que instalar primero una extensión de estilos de usuario.

(Ya tengo un administrador de estilos de usuario, déjame instalarlo)

Autor
Stephen Tremaine
Calificaciones
0 0 0
Versión
0.4
Creado
27/2/2025
Actualizado
27/2/2025
Tamaño
26,4 KB
Licencia
MIT
Funciona en

This is still pretty rough but it's my attempt to enable something like the conversation mode of chat gpt in claude.

Assuming they haven't changed the website and broken my references, it should allow you to hit the listen button at which point it will enable the browser's speech to text engine continuously updating the text/chat box until you hit the listen button again, at which point it will try and submit your message. You should be able to edit the text as you go. It will remove white space/line breaks as it updates though and depending how quickly you hit the stop recording button, the submit button might not be available to press yet. Just click it yourself if this happens. It adds a period and space between statements that it hears. That's not always ideal if you're like me and tend to pause to think for a while but it doesn't seem to hamper claude's understanding too much.

For new messages, it will try and detect them and read them out load using the browser text to speech engine.

Note - you will need to give microphone permissions for this to work.

This is far from my best work so I'm open to feedback for improvement - I started this mostly to test claude's programming ability and was actually writing it from my phone. It generated some very messy code that I've tried to refactor but it's still little messy. I'm mostly sharing so I can keep it in sync between my desktop and phone but feel free to try it out and give me some feedback if you want to help me unwind the messy code!