Currently, ElevenLabs is widely considered the king of emotional AI voice acting.
FakeYou uses community-trained models. The addition is the "Joe Pesci (Casino)" model, which is distinct from the "Goodfellas" model. text to speech wiseguy voice new
Text-to-speech synthesis has made significant progress in recent years, with the development of deep learning-based systems that can produce highly natural-sounding speech. However, most TTS systems are designed to generate speech in a standard, neutral voice, which may not be suitable for all applications. In this paper, we focus on developing a TTS system that can generate speech with a wiseguy voice, a unique and colloquial style of speaking that is often associated with organized crime figures. Currently, ElevenLabs is widely considered the king of
The "Wiseguy" text-to-speech voice, a cult classic from VoiceForge originally popularized on , has recently seen a resurgence through modern AI platforms like Fish Audio The "Wiseguy" text-to-speech voice, a cult classic from
To create a wiseguy voice model, we collected a dataset of audio recordings from various sources, including movie and TV show clips, audiobooks, and voice acting demos. We selected recordings that exemplified the wiseguy voice, characterized by a gruff, street-smart tone, and often marked by distinctive speech patterns, such as: