Is it possible to modify the rate scale for EdgeTTS voices?

Recently I was wondering about the range of the speech rate parameter for these voices. In correspondence with the creator of Balabolka, I learned that the range runs from -50% to +200%. He also mentioned that, because SAPI 5 limits the rate scale to only -10 to +10 (a questionable decision), your program uses the following code in TTSEngine.cpp to convert rate values:

```
m_ssml.append(std::to_wstring(rate >= 0 ? rate * 20 : rate * 20 / 3)); // -10~10 => -(2/3)~+200%
```
Is it possible for me to modify these parameters somewhere in the files?

A step of 20% is too large for me: at a rate of 4 (+80%), the speech is already uncomfortable to listen to. Instead of the unnecessarily high top speeds, I would like finer control over the slower ones. If +80% were the maximum, each step would be 8%, which would be more convenient for me.

Could you consider adding settings to adjust the step or set custom values for individual SAPI 5 speed units? And, if possible, could such settings be made specific to different voices rather than global?
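
To illustrate what I mean, here is a rough sketch of a conversion with a configurable upper limit. The names `rateToPercent`, `rateToSsmlValue`, and `maxPositivePercent` are made up for this example and are not taken from the project. The negative branch is copied unchanged from the line quoted above, and I am only assuming the result is appended to the SSML string the same way.

```cpp
#include <algorithm>
#include <string>

// Hypothetical helper (not from the project): converts a SAPI 5 rate (-10..10)
// to a percentage, with the top of the positive range configurable.
int rateToPercent(int rate, int maxPositivePercent = 200)
{
    // Keep the input inside the SAPI 5 scale.
    rate = std::max(-10, std::min(10, rate));

    if (rate >= 0)
        return rate * maxPositivePercent / 10; // default 200 gives rate * 20, as in the original line

    return rate * 20 / 3;                      // negative branch unchanged from the original line
}

// Hypothetical wrapper producing the text that would be appended to the SSML.
std::wstring rateToSsmlValue(int rate, int maxPositivePercent = 200)
{
    return std::to_wstring(rateToPercent(rate, maxPositivePercent));
}
```

With `maxPositivePercent` set to 80, each positive step is 8%, so a rate of 4 gives 32% instead of 80%. A per-voice setting could then simply store one such maximum for each voice.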
