Che Guevara Speech
2009-05-13 - extension: wav - size: 44 MB
Che Guevara Speech
Che Guevara Speech: Colonialism Is Doomed
Hosted on: badongo.com
Day of Infamy Speech.wav
2007-10-06 - extension: wav - size: 1 MB
Day of Infamy Speech.wav
Hosted on: http://ajcivilwar.com
speech.wav
2008-01-18 - extension: wav - size: 1 MB
speech.wav
Hosted on: http://hjem.get2net.dk
Video results for: speech wavMore results from video
TI-99/4A Speech Demo Demo of the TI-99/4A talking. It's always been difficult to generate speech on the TI using the (More) Demo of the TI-99/4A talking. It's always been difficult to generate speech on the TI using the TI Speech Synth, because the synth chip uses LPC (linear predictive coding) to produce the tones and sounds. This demo was produced QBOX Pro (a very old piece of 16 bit Win 3.0 software). I recorded my voice as a WAV, down sampled, removed breathing noises, corrected BIAS, and processed using QBOX. QBOX gives you the data to fire at the synth chip. I then wrote a simple machine code program to do the talking and update the display. Enjoy. The TI-99/4A emulator you see here is Win994A by Cory Burr. MESS produces (arguably) slightly better speech results. This program DOES work on a real TI too! QBOX is (formerly) commercially available software written by the designers of the TI Synth chip themselves (after they left TI). The software was re-discovered by Ben Yates after his endless search of the internet trying to find suitable software. Without Ben, it would still be impossible to generate ones own speech with the TI speech synth. There are no instructions with QBOX, but Ben kindly supplied copies of his correspondance with the programmers, which contained some rudimentary instructions. Thanks Ben. :-) If anyone is interested in the souce, I have published it (TMS9900 assembler) here: http://www.planet-99.net/wotw.htm (Less)
HOGG AND CRAIG - holy grail - 2008/04/09 9:40 p.m. Start with a speech waveform. I will call this a "time track" - it is a "function of (More) Start with a speech waveform. I will call this a "time track" - it is a "function of time".
We can derive other "time tracks". For example, (1) we can filter the data in various ways, e.g. to obtain comb filtered data (2) from speech wav eform data, we could obtain tracks of formant data, e.g. by using Huckvale's SFS system.
Given two time tracks, where the time steps are the same (e.g. steps at 100hz or 10,000 Hz), we can compute a "dot product". A "correlation coefficient" is a normalised dot product.
If one time track is longer than the other, we can calculate successive "dot products" by stepping the shorter track along the longer track, calculating a dot product for each step. This is called a "wavelet transform", I think, in some of the literature.
Let me suppose we have tracks which can extend from the past into the future. Let us take an arbitrary piece, let this be say .01s or 100 s. We can compute a new "time track" which consists of a sequence of dot products, obtaining by shifting one piece of track, in steps, against another track. This gives us as a result, a new "time track".
In this way, starting with N tracks, we can create many new tracks, e.g. by doing dot products with a sliding window, for each pair of N tracks, we obtain N*(N-1)/2 new tracks.
All these tracks (the original time tracks, and the new computed tracks) are assumed to be tied to a given time point. So for each point in a sequence of time points, we have a set of N original tracks + N*(N-1)/2 derived tracks. The derived tracks can then be processed against the original tracks and the derived tracks to obtain yet more tracks.
What will we do with all this data?
Given all this data that can be generated, an alternative proposal, of building a physical model and letting it run, to generate speech, may be starting to look attractive.
Another project that might be simpler, is to develop a kit to automatically build a 2-language audio dictionary from dubbed speech. Since we have dubbed speech, we can snip corresponding audio frames, say of length 1.0 s or 2.0 s. We then convert the audio in each language to codebook data, and hopefully can find codebook sequences which correspond to morphemes, words, or phrases, in each language. it is desirable to wait till such a project can be run, off the shelf, in no more than 5 minutes, by using existing tools.
Perhaps I am wasting my time. Maybe I need to concentrate on experiencing the alchemy of speech, in conversation with people. This can at least be my focus for the next 4 days -
thur U3A
fri otnv
sat 3 monkeys
sat book club
sun spring hill (Less)
p40
2009-09-29 - extension: wav - size: 63 KB
p40
Hosted on: rapidshare.com
speech disorders WAV
2009-01-26 - extension: wav - size: 5 MB
speech disorders WAV
Hosted on: rapidshare.com
speech.out.dramatic.wav
2007-12-05 - extension: wav - size: 1 MB
speech.out.dramatic.wav
Hosted on: http://www.merl.com
01-Speech-Ambisonics.wav
2007-11-09 - extension: wav - size: 1 MB
01-Speech-Ambisonics.wav
Hosted on: http://pcfarina.eng.unipr.it
02-Speech-SIRR.wav
2007-11-09 - extension: wav - size: 1 MB
02-Speech-SIRR.wav
Hosted on: http://pcfarina.eng.unipr.it
Groups results for: speech wav