AI Text to Speech
Type or paste any text and hear it spoken in a natural voice, then download it as a WAV file. The speech is generated entirely in your browser with Meta's MMS model via transformers.js, so your words never leave your device. Pick from 10 languages, adjust the playback speed, and follow along as the sentence being spoken is highlighted in real time. Long passages are split into sentences automatically and stitched into one seamless audio file, so there is no fixed length limit and no per-character billing.
How to convert text to speech
- Type or paste your text into the box.
- Choose a language voice and, if you like, adjust the speaking speed.
- Click Generate speech, listen with read-along highlighting, then download the WAV.
Examples
Reading a paragraph aloud
Input: Welcome to Boneyard Tools. Everything here runs in your browser.
Output: A spoken WAV file you can play with the current sentence highlighted, or download.
Frequently asked questions
Is my text sent to a server?
No. The text-to-speech model runs entirely in your browser via transformers.js. Your text and the generated audio never leave your device, which makes this safe for private or sensitive content.
Which languages are supported?
Ten to start: English, Spanish, French, German, Portuguese, Italian, Russian, Arabic, Hindi and Korean. Each language uses Meta's MMS-TTS voice for that language, downloaded on demand (about 63 MB) the first time you use it, then cached.
Is there a length limit?
No hard limit. Long text is automatically split into sentence-sized chunks, synthesized one by one with a progress bar, and concatenated into a single continuous WAV file. Very long passages simply take longer to generate.
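For the curious, the split-and-stitch approach described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the tool's actual source: the function names are hypothetical, the sentence splitter is a simple punctuation rule, and `synth` stands in for whatever call produces audio samples for one sentence.

```typescript
// Hypothetical sketch: split text into sentences, synthesize each, join the audio.

// Split on ., ! or ? (a simplification: abbreviations such as "e.g." and
// scripts with other sentence terminators would need extra handling).
function splitSentences(text: string): string[] {
  const parts = text.match(/[^.!?]+[.!?]*/g) ?? [];
  return parts.map((s) => s.trim()).filter((s) => s.length > 0);
}

// Concatenate per-sentence sample buffers into one continuous buffer.
function concatChunks(chunks: Float32Array[]): Float32Array {
  let total = 0;
  for (const c of chunks) total += c.length;
  const out = new Float32Array(total);
  let offset = 0;
  for (const c of chunks) {
    out.set(c, offset);
    offset += c.length;
  }
  return out;
}

// `Synth` is an assumed stand-in for the model call that turns one sentence
// into mono audio samples.
type Synth = (sentence: string) => Promise<Float32Array>;

// Synthesize sentence by sentence, reporting progress after each one.
async function synthesizeAll(
  text: string,
  synth: Synth,
  onProgress: (done: number, total: number) => void = () => {},
): Promise<Float32Array> {
  const sentences = splitSentences(text);
  const chunks: Float32Array[] = [];
  let done = 0;
  for (const sentence of sentences) {
    chunks.push(await synth(sentence));
    done += 1;
    onProgress(done, sentences.length);
  }
  return concatChunks(chunks);
}
```

Because each sentence is synthesized independently, memory use per model call stays bounded by the longest sentence rather than by the whole passage, which is why no hard length limit is needed.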
Can I download the audio?
Yes. After generating, click Download WAV to save a standard 16-bit PCM WAV file you can use in videos, podcasts, slideshows or anywhere else. There are no watermarks and no usage caps.
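As a rough illustration of what a "standard 16-bit PCM WAV file" contains, the sketch below writes the 44-byte RIFF header followed by the samples. It assumes mono audio held as floats in the range -1 to 1; the function name `encodeWav` and the choice of mono are assumptions made for this example, not details confirmed by the tool.

```typescript
// Hypothetical sketch: encode mono float samples as a 16-bit PCM WAV file.
function encodeWav(samples: Float32Array, sampleRate: number): ArrayBuffer {
  const bytesPerSample = 2; // 16-bit
  const channels = 1; // assumption: mono output
  const dataSize = samples.length * bytesPerSample;
  const buffer = new ArrayBuffer(44 + dataSize);
  const view = new DataView(buffer);

  const writeAscii = (offset: number, text: string): void => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i));
    }
  };

  writeAscii(0, "RIFF");
  view.setUint32(4, 36 + dataSize, true); // file size minus the first 8 bytes
  writeAscii(8, "WAVE");
  writeAscii(12, "fmt ");
  view.setUint32(16, 16, true); // size of the fmt chunk
  view.setUint16(20, 1, true); // audio format 1 = PCM
  view.setUint16(22, channels, true);
  view.setUint32(24, sampleRate, true);
  view.setUint32(28, sampleRate * channels * bytesPerSample, true); // byte rate
  view.setUint16(32, channels * bytesPerSample, true); // block align
  view.setUint16(34, 16, true); // bits per sample
  writeAscii(36, "data");
  view.setUint32(40, dataSize, true);

  // Clamp each float to [-1, 1] and scale to the signed 16-bit range.
  for (let i = 0; i < samples.length; i++) {
    const s = Math.max(-1, Math.min(1, samples[i] ?? 0));
    view.setInt16(44 + i * 2, s < 0 ? s * 0x8000 : s * 0x7fff, true);
  }
  return buffer;
}
```

In a browser, the returned buffer can be wrapped as `new Blob([buffer], { type: "audio/wav" })` and offered for download through an object URL.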
What is read-along highlighting?
As the audio plays, the tool highlights the exact sentence being spoken, so you can follow the text and the voice together. It works because we track where each sentence starts in the audio and match it to the player's current position.
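The matching step can be sketched as below: record where each sentence starts (in seconds) while stitching the audio, then look up the last start time that is not after the player's position. The function names are hypothetical and the sketch assumes each sentence's sample count is known; it is one plausible way to do this, not necessarily how the tool implements it.

```typescript
// Hypothetical sketch: map the player's position to the sentence being spoken.

// Convert per-sentence sample counts into start times in seconds.
function sentenceStartTimes(chunkLengths: number[], sampleRate: number): number[] {
  const starts: number[] = [];
  let offset = 0;
  for (const length of chunkLengths) {
    starts.push(offset / sampleRate);
    offset += length;
  }
  return starts;
}

// Binary search for the last sentence whose start time is <= currentTime.
// Returns -1 when there are no sentences or the position precedes the first.
function activeSentence(starts: number[], currentTime: number): number {
  let lo = 0;
  let hi = starts.length - 1;
  let found = -1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    if ((starts[mid] ?? Infinity) <= currentTime) {
      found = mid;
      lo = mid + 1;
    } else {
      hi = mid - 1;
    }
  }
  return found;
}
```

With an HTML audio element, a handler for the `timeupdate` event could call `activeSentence(starts, audio.currentTime)` and move the highlight whenever the returned index changes.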
Why does the first run take a moment?
The first time you pick a language, the browser downloads that voice model (around 63 MB) and caches it. After that, generating speech in the same language is fast and works offline.
Related tools
Audio Transcriber
Transcribe audio to text in your browser with OpenAI's Whisper AI. Get timestamps, export SRT, VTT or TXT, pick a quality level. Nothing is uploaded.
Audio Speed Changer
Speed up or slow down an audio file in your browser, from half speed to 2x. Download the result as a WAV. Free, fast, and nothing is uploaded.
Volume Normalizer
Normalize audio volume in your browser. Even out loudness by peak or RMS to a target dB level, then download a WAV. Nothing is uploaded.
Audio Fade
Add a fade-in or fade-out to audio in your browser. Pick the fade length and a linear or equal-power curve, then download a WAV. Nothing is uploaded.
Audio File Size Calculator
Estimate uncompressed PCM or WAV file size from sample rate, bit depth, channels and duration. Get the size in bytes and megabytes plus the bitrate in kbps.
Audio Reverser
Reverse audio in your browser and play any clip backwards. Drop in a file, hear the reversed preview, then download it as a WAV. Nothing is uploaded.