VOCALOID - vocal-synthesizing software

KyleKush · Beitrag von **KyleKush** » 16.10.07 - 23:59

An authentic voice can be generated by simply inputting words and notes.

It synthesizes singing exactly by simply typing in words and music notes on your PC. The sung vocal can be output as a Wav file, so it can be imported to other sequencers and played alongside the accompaniment.

Various styles of singing can be synthesized by adding further vocal libraries.

By changing database of "Vocaloid Singer Libraries", you can synthesize various types of male and female vocals. A number of soundware developers worldwide will release "vocal libraries" (some of which actually did) and the VOCALOID software engine is bundled into their library products.

Expressive effects can easily be added to the synthesized vocals in a simple operation.

Expressive effects can easily be added to the synthesized vocals, such as vibrato, inflection and tremolo, which are added by using the simple GUI commands, resulting in the creation of fully expressive songs.

Both Japanese and English vocals can be synthesized.

English and Japanese languages "vocal libraries" are now available.

The Technology
VOCALOID uses Frequency-Domain Singing Articulation Splicing and Shaping, a vocal (singing-voice) synthesizing system developed by Yamaha after lengthy reseach and development of signal processing in frequency domains. With this system, the "singing articulations" (collections of voice snippets, such as syllables and snippets of vocal expression variations, like vibrato) needed to reproduce vocals, are collected from custom-produced recordings of professional singers and put into a database after conversion into frequency domains. To synthesize vocal parts, the system retrieves data consisting of voice snippets, applies pitch conversion, then splices and shapes them to form the words of a song as typed by the user. As this processing is done at the frequency-domain level, pitch can be easily changed according to the specified melody, and the voice snippets can be spliced in a way that reproduces smooth-flowing words. For example, "sai" of "saita"is produced by using two snippets "sa" and "ai". Because the timbre of the vowels "a" and "ai" are usually different to each other, if these sounds were simply spliced together they would not sound right to the listener. To solve this problem, smooth processing of the splicing facility within the frequency domain is carried out, resulting in a smoother vocal.

Fig1. Processing within the frequency domain

In addition, conversion within the frequency domain makes it easy to control pitch and timbre in order to get expressive effects, such as vibrato. VOCALOID enables the reproduction of the actual pitch-time and timbre variations (accurately emulating the way they occured in the real singer's original vibrato) by storing the timbre/time variation of pitch and vibrato from the real singer's voice, into a database and applying it at the point of synthesis.

Fig 2. Process at vibrato

VOCALOID consists of a score editor which handles the scale, song-word, and expression processing; the Vocal Sound Generator (the engine that synthesizes the vocals); and vocal libraries (each comprised of a pronunciation database and a timbre database) for each virtual singer. The "vocal libraries" have been released by soundware developers who entered into a license agreement with Yamaha, and more libraries are coming.

Fig 3. VOCALOID system configuration

Bild

http://www.youtube.com/watch?v=lvM44DytTyE

für uns nicht so begabten

http://www.vocaloid.com/

Kai · Beitrag von **Kai** » 17.10.07 - 12:01

glingt interessant

auf sowas hab ich gewartet

KaoZhEAd · Beitrag von **KaoZhEAd** » 17.10.07 - 12:08

jap sieht gut aus, gugg ich mir ma an

Jens · Beitrag von **Jens** » 17.10.07 - 13:25

is aber jetz nich neu oder?

KyleKush · Beitrag von **KyleKush** » 17.10.07 - 13:33

Jens hat geschrieben:is aber jetz nich neu oder?

muss das den neu sein

ähm ne gabs auch schon von anderen herstellern aber so ne ausgereifte version wie die hab ich noch nicht gesehen

basti@mmt · Beitrag von **basti@mmt** » 17.10.07 - 13:39

selbst wenn es neu ist, ich hau mich in dreck man klingt das shice. ab min 01:40 beginnt die deutsche nationalhymne.

http://www.youtube.com/watch?v=fTzYFiQZ ... ed&search=

muss aber auch zugeben das ich nicht all zuviel davon halte gesangsstimmen am rechner synthetisch zu erstellen. wenn dann lieber ein echter sänger

. aber rein technisch gesehen wieder mal nicht schlecht.

KyleKush · Beitrag von **KyleKush** » 17.10.07 - 13:42

ich dachte mir auch gleich

das mag vieleicht bei japanisch oder chinesisch gut klingen aber auf deutsch ??

Jens · Beitrag von **Jens** » 17.10.07 - 13:56

KyleKush hat geschrieben:
Jens hat geschrieben:is aber jetz nich neu oder?
muss das den neu sein

ähm ne gabs auch schon von anderen herstellern aber so ne ausgereifte version wie die hab ich noch nicht gesehen

nene, ich meinte schon genau dieses prog "vocaloid". da gabs dann die ausführung LEON, das war ne männerstimme und ne version mit frauenstimme gabs auch noch.
hatte ich vor n paar jahren ma probiert, deswegen frag ich. von der qualität jetz ma ganz abgesehn. für vocals nehm ich eh lieber mein altes atari-prog, klingt viiiiiel cooler.

Kai · Beitrag von **Kai** » 17.10.07 - 14:07

wär doch aber auch mal cool ein chinesischen text in ein Track einzubauen

Mit nem Übersetzungsprogramm dürfte das kein Probl. sein...

Jens · Beitrag von **Jens** » 17.10.07 - 14:18

muschimuschi...ufz..ufz..ufz..ufz...
ach nee, das war doch japanisch.

deine mutti · Beitrag von **deine mutti** » 17.10.07 - 14:34

http://cslu.cse.ogi.edu/tts/demos/index.html

try: suck your mums testicles en espanyol

KyleKush · Beitrag von **KyleKush** » 17.10.07 - 16:16

doch jens liegst richtig das war wohl dann als es noch in den kinderschuhen steckte

leon gibt es heute noch