Last updated: October 27, 2024 at 03:59 PM
Best Offline Text-to-Speech (TTS) Options
SillyTavern
- SillyTavern documents common issues in the SillyTavern Docs, and its Discord community is the fastest place to get help.
- Its conversational style depends on how the user prompts it.
- Source: "You can find a lot of information for common issues in the SillyTavern Docs: SillyTavern Docs. The best place for fast help with SillyTavern issues is joining the discord!"
Piper
- Piper is a text-to-speech model that can be used offline on a Raspberry Pi.
- Its output may not be the most natural-sounding, but it synthesizes speech significantly faster than real time.
- Source: "I like Piper a lot."
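The "faster than real time" claim can be checked on your own hardware. The following is a minimal sketch, assuming the `piper` command-line tool is installed and on the PATH and that a voice model file has already been downloaded. The helper names (`build_piper_command`, `real_time_factor`, `synthesize`) are hypothetical and introduced only for illustration. The real-time factor is synthesis time divided by audio duration, so values below 1.0 mean synthesis is faster than playback.

```python
import shutil
import subprocess
import time
import wave
from typing import List


def build_piper_command(model_path: str, output_path: str) -> List[str]:
    """Return the argv for a Piper CLI call that reads its text from stdin."""
    return ["piper", "--model", model_path, "--output_file", output_path]


def wav_duration_seconds(path: str) -> float:
    """Length of a WAV file in seconds, from its frame count and sample rate."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Synthesis time divided by audio length; below 1.0 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return synthesis_seconds / audio_seconds


def synthesize(text: str, model_path: str, output_path: str) -> float:
    """Run Piper once and return the measured real-time factor.

    Assumes the `piper` executable is installed; raises otherwise.
    The measurement includes process start-up and model loading time.
    """
    if shutil.which("piper") is None:
        raise RuntimeError("piper executable not found on PATH")
    start = time.perf_counter()
    subprocess.run(
        build_piper_command(model_path, output_path),
        input=text.encode("utf-8"),
        check=True,
    )
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, wav_duration_seconds(output_path))
```

For example, `synthesize("Hello from the Pi.", "en_US-lessac-medium.onnx", "hello.wav")` would write `hello.wav` and return the measured factor. The model file name here is only an example and must be replaced with the path to a voice you have downloaded.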
Pocketbook
- Pocketbook is mentioned for having the most natural-sounding TTS.
- It is recommended if you are willing to pay for a TTS engine.
- Source: "Pocketbook has the most natural sounding tts imo."
Microsoft Edge and Google Voice
- Microsoft Edge and AI Google Voice are noted for their decent voice quality.
- They are suitable for tasks like reading articles and text on the computer.
- Source: "I've found that the voice from Microsoft on Edge browser is not too bad."
"I also heard some sample of the AI Google Voice which is amazing."
Tortoise
- Tortoise is mentioned as a high-quality local option for TTS.
- It is preferred over other options like Bark.
- The original Tortoise repository is described as slow and as lacking the ability to train a custom voice, so the source advises avoiding it.
- Source: "Tortoise is the highest quality local option that I know of."
"I would avoid the original repo you linked, it's slow & lacks the ability to train a custom voice."
Moe TTS and VITS
- Moe TTS and VITS are highlighted as interesting TTS projects to explore.
- Moe TTS appears to support only Asian languages so far.
- VITS, which the source calls a "voice encapsulation system", is also mentioned as worth considering.
- Source: "So far it seems to only support Asian languages but Moe TTS is a very interesting project to play with."
"And the voice encapsulation system VITS"
Additional Recommendations and Tips
- Character.ai is suggested as a useful tool for TTS.
- Faraday.dev is recommended for minimal setup with okay TTS capabilities.
- Testing different 7B models, such as Toppy, is advised for storytelling and role-playing.
Cautionary Note
- Antivirus programs can report false positives, as seen with the TTS Mod backup app, so verify a flagged application before concluding that it is malicious.
- Source: "As the author of the TTS Mod backup app I can assure you there are no Trojans in there..."
Suggestions for Different Setups
- The discussion covers how model choice, context window size, back end, and available system resources affect whether TTS can run well locally.
- Recommendations include specific 7B models, back ends such as KoboldCPP, and accounting for VRAM limits.
- Source: "Given that setup, I'm not sure you'll get text-to-speech running locally..."
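To make the VRAM consideration concrete, here is a rough back-of-the-envelope sketch. It only estimates the memory taken by model weights (parameter count times bits per weight). The overhead and TTS figures in the usage note are hypothetical placeholders, not measurements from the source, and the function names are introduced only for illustration.

```python
def weights_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits_in_vram(
    vram_gib: float,
    params_billions: float,
    bits_per_weight: float,
    overhead_gib: float,
    tts_gib: float,
) -> bool:
    """Check whether weights plus assumed overhead and a TTS model fit in VRAM.

    overhead_gib (context cache, buffers) and tts_gib are caller-supplied
    guesses; this sketch does not compute them.
    """
    needed = weights_gib(params_billions, bits_per_weight) + overhead_gib + tts_gib
    return needed <= vram_gib


if __name__ == "__main__":
    # A 7B model at 16 bits per weight versus 4 bits per weight.
    print(round(weights_gib(7, 16), 2))
    print(round(weights_gib(7, 4), 2))
```

Under these assumptions, a 7B model needs about 13 GiB for its weights at 16 bits per weight and a little over 3 GiB at 4 bits. With placeholder values of 1.5 GiB overhead and 2 GiB for a TTS model, `fits_in_vram(8, 7, 4, 1.5, 2.0)` is true while `fits_in_vram(8, 7, 16, 1.5, 2.0)` is false, which illustrates why quantized models are commonly paired with local TTS on smaller GPUs.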