
Is it possible to modify the rate scale for EdgeTTS voices? #77

@mrendor12


Recently, I was wondering about the range of the speech rate parameter for EdgeTTS voices. In correspondence with the creator of Balabolka, I learned that this range runs from -50% to +200%. He also mentioned that, because of a questionable decision in SAPI 5 to limit the rate scale to only -10 to +10, your program uses the following code in TTSEngine.cpp to convert rate values:

m_ssml.append(std::to_wstring(rate >= 0 ? rate * 20 : rate * 20 / 3)); // -10~10 => -(2/3)~+200%
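To make the effect of that line easier to see, here is a minimal sketch that isolates the same arithmetic in a helper. The names `sapiRateToPercent` and `printRateTable` are hypothetical and do not come from TTSEngine.cpp, and `rate` is assumed to be a plain `int`.

```cpp
#include <ostream>

// Same expression as the quoted line, isolated for illustration.
// Hypothetical helper name; `rate` is assumed to be an int in -10..10.
constexpr int sapiRateToPercent(int rate) {
    // Positive side: 20% per SAPI step.
    // Negative side: 20/3 % per step, truncated toward zero by integer division.
    return rate >= 0 ? rate * 20 : rate * 20 / 3;
}

// Writes one "rate => percent" line for every SAPI 5 rate value.
void printRateTable(std::ostream& out) {
    for (int rate = -10; rate <= 10; ++rate) {
        out << rate << " => " << sapiRateToPercent(rate) << "%\n";
    }
}
```

With this arithmetic, a rate of 4 gives 80, a rate of 10 gives 200, and a rate of -10 gives -66, so the positive steps are three times coarser than the negative ones.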

Is it possible for me to modify these parameters somewhere in the files?

A step of 20% is too large for me: at a speed of 4 (+80%), speech is already uncomfortably fast to listen to. Rather than having unnecessarily high speeds available, I would like finer control over the slower ones. If +80% were the maximum, each of the 10 positive steps would be 8%, which would be more convenient for me.

Could you consider adding settings to adjust the step or set custom values for individual SAPI 5 speed units? And, if possible, could such settings be made specific to different voices rather than global?
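To illustrate the kind of setting I mean, here is a rough sketch, not a proposal for the actual implementation. Everything in it is an assumption: `RateScale`, `scalePercent`, and `percentForVoice` are hypothetical names, the voice lookup by name is invented for the example, and the defaults simply reuse the -50% to +200% range mentioned above.

```cpp
#include <map>
#include <string>

// Hypothetical per-voice limits; nothing like this exists in the project today.
struct RateScale {
    int maxPercent = 200;  // percent sent for SAPI rate +10 (assumed default)
    int minPercent = -50;  // percent sent for SAPI rate -10 (assumed default)
};

// Linear scale: each positive step is maxPercent / 10,
// each negative step is minPercent / 10 (integer arithmetic).
constexpr int scalePercent(int rate, int maxPercent, int minPercent) {
    return rate >= 0 ? rate * maxPercent / 10 : -rate * minPercent / 10;
}

// Uses the override for the given voice if one is configured,
// otherwise falls back to the default RateScale.
int percentForVoice(const std::map<std::wstring, RateScale>& overrides,
                    const std::wstring& voiceName, int rate) {
    const auto it = overrides.find(voiceName);
    const RateScale scale = (it != overrides.end()) ? it->second : RateScale{};
    return scalePercent(rate, scale.maxPercent, scale.minPercent);
}
```

With `maxPercent` set to 80 for a voice, a rate of 1 would give 8 and a rate of 10 would give 80, which is the 8% step described above.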

Labels: enhancement (New feature or request)
