[DECtalk] Some DECtalk history and what I think we can and can't reasonably do

Alex H. linuxx64.bashsh at gmail.com
Wed Aug 3 16:34:48 EDT 2011


Agreed. This new sample rules. It's pretty darn close to the original 
and has its own coolness..

alex

On 8/3/2011 4:09 PM, jake mcmahan wrote:
> On 8/3/2011 3:42 PM, ebruckert Bruckert wrote:
>> Okay as an update listen to the to wave files separately not 
>> back-and-forth listen to one we waited a few minutes listen to the 
>> other. See if you agree were getting closer, one of course is what 
>> you sent me
>>
>> On Wed, Aug 3, 2011 at 1:43 PM, ebruckert Bruckert 
>> <edbruckert at gmail.com <mailto:edbruckert at gmail.com>> wrote:
>>
>>     agreed
>>
>>
>>     On Wed, Aug 3, 2011 at 1:38 PM, Alex H.
>>     <linuxx64.bashsh at gmail.com <mailto:linuxx64.bashsh at gmail.com>> wrote:
>>
>>         I, too, hope that HLsyn eventually will be a viable option
>>         and we could use the old method or HLsyn if we wanted, maybe
>>         for reading long texts and so on. It's a great idea and
>>         theory but just isn't mature enough at this point.
>>
>>         Alex
>>
>>
>>
>>
>>         On 8/3/2011 1:13 PM, ebruckert Bruckert wrote:
>>>         There's always two sides to a coin, if DECtalk hadn't been
>>>         purchased it would have died. And since there was no money
>>>         from anyone to work on handicapped applications, we had to
>>>         do what our customers want it or go home. I recognize that
>>>         the HLsyn work did not yield the hoped-for results and
>>>         perhaps someday it can with what we learned in our failures.
>>>         But it was a decision based on the best knowledge we had at
>>>         the time and in fact also with Dennis Klatt's work. The
>>>         problems that occurred with the HL sin version aren't of any
>>>         interest to me because the version put out was in early one
>>>         and it's not the right time to pursue trying to perfect HLsyn. S
>>>         On all I can do is my best.
>>>            As to the person that mentioned the idea of putting
>>>         meaning into the text. DECtalk actually has the ability to
>>>         do some marketing and adjustment to train achieve that by
>>>         hand. Automating the system to do that is deal beyond our
>>>         knowledge and capability. Understanding what is being
>>>         conveyed is extremely extremely difficult for a computer. A
>>>         simple example;"You did that." Depending on which word you
>>>         emphasize most there are three different ways of saying this
>>>         very simple sentence with dramatically different meanings.
>>>          Wed, Aug 3, 2011 at 12:07 PM, Alex H.
>>>         <linuxx64.bashsh at gmail.com
>>>         <mailto:linuxx64.bashsh at gmail.com>> wrote:
>>>
>>>             Well, to us,, we never really heard later versions of
>>>             DT, only the classics from the 90's, so forgive us if we
>>>             compare the new attempts to prior versions - it's not
>>>             like we have a huge library of source code to just
>>>             browse at will and endless samples of every version....
>>>             so... yeah.
>>>
>>>             Wanna know what's been wrong with the samples and
>>>             attempts posted to this list a few months ago for the
>>>             sapi dectalk? I'll tell you.
>>>
>>>             The voices were clipping and squawking, and all the
>>>             voices sounded like they had a speech problem. Perfect
>>>             Paul wasn't perfect as most of us have heard before. The
>>>             voices themselves sound not like DECTalk at all, they
>>>             also drop out in volume, just like a human cuz it's
>>>             using HLsyn to make it sound more natural.
>>>             I've heard DT 4.2cd, 4.3, 4.4, 4.61, 4.62 and 4.64. But
>>>             since you've pointed out before that version numbers
>>>             don't matter to speak, is this even important anyway or
>>>             are we just listening to the same code with minor tweaks
>>>             to get the various versions we know?
>>>
>>>             Disable HLsyn in the new product, and it'll suck less. I
>>>             like forment based synths, not ones that try and sound
>>>             human, because I and others are used to classic forment
>>>             non-HLsyn versions of DECTalk. True that HLsyn is still
>>>             formant but it's trying to sound real and have human
>>>             articulation, and knowing that I can understand why this
>>>             version sounds different. It's just not what we're used
>>>             to, that's all. Some Joe Blow off the street who has
>>>             never heard synthesized speech can't understand
>>>             Eloquence from DECTalk from Espeak anyways, so this
>>>             point of understanding speech is a moot one.  They'd be
>>>             better off using Cepstral or some human-sampled synths
>>>             and wasting their hard drive space. This is being
>>>             targeted at a relatively small group of people who have
>>>             used DECTalk before and like it, so i think we're safe
>>>             there. I'd consider giving HLsyn another shot if it was
>>>             completed. But as always, corporate America screws
>>>             everyone over in the end, and that was the case with
>>>             Dectalk. So much so, that Fonix wanted to make FonixTalk
>>>             and specificly try and make it sound human. The result
>>>             sucks.
>>>
>>>
>>>             Alex
>>>             On 8/3/2011 11:17 AM, ebruckert Bruckert wrote:
>>>>                First of all let me make you aware that I use
>>>>             DragonDictate, as I can't see very well and
>>>>             proofreading is quite painful so you'll have to forgive
>>>>             and interpret from mistakes the DragonDictate may make. It
>>>>                I was taught about form and speech synthesis by
>>>>             Dennis Klatt, and by reading but before my involvement
>>>>             with him I knew next to nothing. One of the questions
>>>>             in the early days was could you achieve higher
>>>>             intelligibility by super articulation and do better
>>>>             than natural speech. What testing revealed was really
>>>>             two things. At normal speaking rates the answer always
>>>>             seem to be that the closer you matched to real speech
>>>>             the better the intelligibility at higher speaking rates
>>>>             above that which humans could normally achieve things
>>>>             were little different and I'm not going to go into the
>>>>             specifics of what we did to make things better at high
>>>>             speed other than to say they were based on knowledge of
>>>>             speech perception.
>>>>                  The second thing we learned is that listening to a
>>>>             synthesizer has a very fast but steep learning curve.
>>>>             Somewhat analogous to learning to understand a person
>>>>             with a strong dialect or speech impediment. One of the
>>>>             problems we encountered is that people often preferred
>>>>             the version they were used to over any succeeding
>>>>             version. But actual tests did not support the preference.
>>>>                  One example is the way tilt was done inside
>>>>             DECtalk. The original mechanism was a crude
>>>>             approximation of spectral tilt. Dennis before he died
>>>>             developed a much more accurate (meaning matching human
>>>>             production) tilt filter that was not able to be
>>>>             incorporated to a later date. As a point of interest
>>>>             Dennis was so dedicated that he last modified the
>>>>             DECtalk code 3 days before he passed away. So the
>>>>             spectral tilt was changed and this changed what you
>>>>             might consider the tone control on an old radio or
>>>>             record player. That is just one of many reasons why
>>>>             DECtalk change slightly over the years.
>>>>                   The 5.0 DECtalk Incorporated the work of Prof.
>>>>             Ken Stevens who was Dennis is blessed MIT and close
>>>>             friend. The 5.0 code unfortunately did not yield the
>>>>             expected results, but we did learn a lot from the
>>>>             attempt. This
>>>>                    there are even some changes to DECtalk that
>>>>             would change the way it sounds from any particular
>>>>             version, such as Intonation that I am unwilling to
>>>>             revert because I know for a fact that they caused loss
>>>>             of information. So my goal is very simple I am working
>>>>             to create a very functional intelligible DECtalk to put
>>>>             back out, I am unwilling to try and make it sound
>>>>             exactly like any given person wants to. I have been
>>>>             through this before and the year is very sensitive and
>>>>             if you directly comparing two versions side-by-side you
>>>>             not testing anything but whether did the same and that
>>>>             is an exercise in futility. T
>>>>             Any specific issues I can address. Secondly as a word
>>>>             of warning to listeners providing feedback. The other
>>>>             thing we've learned is that listeners are excellent at
>>>>             deciding that something is not right, but are
>>>>             absolutely terrible at exactly pinpointing the
>>>>             problem. The reason for this is quite simple people
>>>>             judge the output as speech which it only kinda is, by
>>>>             this I mean that a synthesizer can make mistakes that
>>>>             humans cannot possibly do and as a consequence can't
>>>>             possibly recognize. An example of this is that after so
>>>>             many years of working with it I have learned to hear a
>>>>             foreman that's moving too rapidly, but most people
>>>>             cannot hear it. This is because to make life easy we
>>>>             try to lead nor stuff that's not important in our
>>>>             language, such as the nasal lifestyles in French or the
>>>>             retro flex ours in American English which is Sheehan
>>>>             have a heckuva time hearing.
>>>>
>>>>             _______________________________________________
>>>>             DECtalk mailing list
>>>>             DECtalk at bluegrasspals.com  <mailto:DECtalk at bluegrasspals.com>
>>>>             http://bluegrasspals.com/mailman/listinfo/dectalk
>>>
>>>
>>>             -- 
>>>             Sent via Thunderbird.
>>>
>>>             _______________________________________________
>>>             DECtalk mailing list
>>>             DECtalk at bluegrasspals.com <mailto:DECtalk at bluegrasspals.com>
>>>             http://bluegrasspals.com/mailman/listinfo/dectalk
>>>
>>>
>>>
>>>         _______________________________________________
>>>         DECtalk mailing list
>>>         DECtalk at bluegrasspals.com  <mailto:DECtalk at bluegrasspals.com>
>>>         http://bluegrasspals.com/mailman/listinfo/dectalk
>>
>>
>>         -- 
>>         Sent via Thunderbird.
>>
>>         _______________________________________________
>>         DECtalk mailing list
>>         DECtalk at bluegrasspals.com <mailto:DECtalk at bluegrasspals.com>
>>         http://bluegrasspals.com/mailman/listinfo/dectalk
>>
>>
>>
>>
>>
>> _______________________________________________
>> DECtalk mailing list
>> DECtalk at bluegrasspals.com  <mailto:DECtalk at bluegrasspals.com>
>> http://bluegrasspals.com/mailman/listinfo/dectalk
> Ed, good mighty lord, you're doing exelent dude.
>
>
> _______________________________________________
> DECtalk mailing list
> DECtalk at bluegrasspals.com
> http://bluegrasspals.com/mailman/listinfo/dectalk


-- 
Sent via Thunderbird.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://bluegrasspals.com/pipermail/dectalk/attachments/20110803/90882837/attachment.html>


More information about the Dectalk mailing list