NeuralVox

GitHub / Twitter / Hugging Face

NeuralVox is a project to develop free audiobooks generated by AI. It's currently in the early stages of development.

Projects

While NeuralVox is still under development, we've open-sourced several projects.

OpenPhonemizer

OpenPhonemizer is a permissively-licensed (BSD), open source grapheme to phoneme converter (phonemizer) powered by deep learning. It's based on Deep Phonemizer and works with projects that rely on espeak's phonemizer.

License: BSD-3-Clause

StyleTTS 2 API

A fork of StyleTTS 2 with Python and HTTP APIs. Because it relies on espeak's phonemizer, it's currently GPL licensed.

License: GPLv3

About

Audiobooks have long been recognized as a leading format for consuming literature, however they typically cost thousands of dollars to produce and are often sold for high prices. Moreover, many less popular works have never even been narrated.

NeuralVox aims to solve these issues by harnessing the power of artificial intelligence. By using speech synthesis technology, we can produce natural-sounding audiobooks at a fraction of the cost.

We're still in the early stages of development, but we expect to release some samples soon, so stay posted!

© 2024. Some rights reserved. This webpage is licensed under the CC-BY 4.0 International license.