[DECtalk] Intelligibility/Listenability criteria

mattias jonsson mj at mjw.se
Sun Jul 21 06:40:35 EDT 2019


my favorite voices: vocalizer swedish alva for swedish,vocalizer evan us english


Den 21 juli 2019 10:18:11 skrev Damien Garwood <damien at daygar.plus.com>:

> Hi Don,
> Here are my criteria:
>
> 1. Understandability
> As a screen reader user who has to listen to speech synthesis on a 
> constant basis while using a computer, understandability is first and 
> foremost. If the synthesiser can't be understood, then you're not going 
> to get the feedback you need. In my opinion, ESpeak ticks every box, 
> except this, so I can't use it.
> 2. Responsiveness. Again, because the speech is reading everything for 
> me, I don't want a synthesiser that acts sluggishly with any kind of 
> latency, whether that be a second, or 50 milliseconds, whether through 
> lack of performance optimisation or through audio silence. When I press 
> a key, I want instant feedback. This automatically rules out most 
> natural-sounding synthesisers.
> 3. Accuracy: It needs to be able to read text accurately for the 
> language it is designed for. It's not enough simply to have a phonetics 
> dictionary, but it also needs to be able to distinguish between words 
> (Present noun versus present verb, for instance).
> 4. Flexibility: The voice timbres should be available to the user, and 
> for the most part should adjust smoothly to the change. This is 
> important if a user has specialist needs and cannot use the synth in its 
> default state. Speed and pitch are definitely a must. Again, this rules 
> out natural synths, since due to the nature of recorded samples they 
> start to begin to sound unnatural if you attempt to adjust the speed and 
> pitch. The bigger the change, the more unnatural.
> Like Jason, I also prefer formant synths. My favourite by far is 
> Keynote, which to me is the most understandable, but I do love DECTalk 
> for its flexibility. I also like Eloquence and the synthetic version of 
> Orpheus.
> Cheers,
> Damien.
>
> On 21/07/2019 05:53 am, Don wrote:
>> Hi,
>> 
>> Perhaps a bit off-topic for this list... if so, my apologies.
>> 
>> I'm looking for opinions as to how one evaluates the "effectiveness"
>> of a particular synthesizer.  Said another way, how one decides that
>> synthesizer A is "better" than synthesizer B.  Ideally, criteria that
>> would allow you to rank a set of them!
>> 
>> I've been auditioning various synthesis devices and techniques
>> to try to come to my own conclusions on this.  Then, hopefully,
>> work backwards to come up with some objective criteria by which
>> they could each be "scored" (even if that was done using bogus
>> rating units).
>> 
>> "Intelligibility" is, of course, the prime issue.  "Listenability"
>> coming into play for any prolonged use.  Finally, "naturalness"
>> when it comes to extended use.
>> 
>> For example, the old Votrax units were intelligible -- once you
>> learned their "accent".  But, listenability was rather poor... you
>> quickly developed ear fatigue.  And, the idea of naturalness was
>> never even considered!
>> 
>> With gobs of resources (hardware, software, processing power), you
>> can achieve quite acceptable results.  This seems to be the approach
>> most "modern" synthesizers -- and techniques -- adopt.  The real problem
>> lies with limited resources attempting to handle unconstrained input.
>> (If you know what you're going to be asked to speak, it's really easy to
>> come up with a good presentation!)
>> 
>> Limiting the user's exposure to the synthetic voice can reduce ear fatigue.
>> So, dealing with it for 10 minutes might be tolerable while 2 hours
>> would be torture.
>> 
>> But, having to face the prospect of completely unconstrained input can
>> tax even that brief usage.  "Dr. Jones' car -- bearing the license plate
>> FTDKTR -- has been parked in front of his house on Jones Dr. since 12:34A
>> this morning when his Polish butler finished polishing it."  Imagine you
>> have no other way of inspecting the input text...
>> 
>> So, what makes a synthesizer "tolerable" or "intolerable"?  What is the
>> "threshold of pain" when it comes to tolerating an underperforming
>> synthesizer?
>> _______________________________________________
>> Dectalk mailing list
>> Dectalk at bluegrasspals.com
>> http://bluegrasspals.com/mailman/listinfo/dectalk
>> 
> _______________________________________________
> Dectalk mailing list
> Dectalk at bluegrasspals.com
> http://bluegrasspals.com/mailman/listinfo/dectalk



More information about the Dectalk mailing list