Как замедлить текст в речь с помощью SSML — голос звучит искаженно/деформированно/ужасно

я использую nuance dragon mobile sdk для iPhone для преобразования текста в речь.
Чтение текста происходит слишком быстро, и я хочу сделать его медленнее, чтобы пользователь мог учить слова. Моя цель — немного замедлить текст. Это отлично работает с SSML и тегом просодии, см. этот код:

<prosody rate="slow">This is the text which is spoken slow,
but the voice sounds distorted/warped/ghastly</prosody>

Голос звучит искаженно, искаженно и ужасно.
Вы понимаете, что я имею в виду?
Что я могу сделать, чтобы получить чистый голос, но медленный произнесенный текст?

brush51 24.04.2012 источник

comment

Есть ли способ сделать это без nuance dragon mobile sdk? Похоже, что SSML не поддерживается AVSpeechSynthesizer. - ryanwebjackson 20.08.2020

Ответы (1)

arrow_upward
1
arrow_downward

взято отсюда: http://www.w3.org/TR/speech-synchronous/#S3.2.4

rate: a change in the speaking rate for the contained text. Legal values are: 
a relative change or "x-slow", "slow", "medium", "fast", "x-fast", or "default". 
Labels "x-slow" through "x-fast" represent a sequence of monotonically non-decreasing
speaking rates. When a number is used to specify a relative change it acts as a 
multiplier of the default rate. For example, a value of 1 means no change in speaking 
rate, a value of 2 means a speaking rate twice the default rate, and a value of 0.5 
means a speaking rate of half the default rate. The default rate for a voice depends on 
the language and dialect and on the personality of the voice. The default rate for a 
voice should be such that it is experienced as a normal speaking rate for the voice when 
reading aloud text. Since voices are processor-specific, the default rate will be as 
well.

brush51 26.04.2012

Как замедлить текст в речь с помощью SSML — голос звучит искаженно/деформированно/ужасно

Ответы (1)

Похожие вопросы