Please support SSML. It is widely used in voice apps to enhance spoken output and the user experience.
The SSML spec is a W3C standard (see https://www.w3.org/TR/speech-synthesis11/) and is quite comprehensive, so adding SSML features incrementally makes sense.
Note that Alexa, Google and Cortana support SSML.
I suggest giving the following tags high priority:
<audio> = embed audio clips. Support for background audio would be a nice addition; see Google's implementation of the <par>, <seq> and <media> tags
<p> <s> <break> = paragraph, sentence and pause elements
<say-as> = control how a piece of text (date, number, etc.) is spoken
<prosody> = rate, pitch and volume
<emphasis> = emphasize
<lang> = language
<sub> = substitute a spoken alias for the written text
<phoneme> = phonetic pronunciation; support IPA and X-SAMPA
<voice> = change voice
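
To make the list above concrete, here is a rough sketch of one SSML document that uses each of the suggested tags, plus a small Python check that it is well-formed XML and touches every tag. The voice name, the audio URL, the sample text and the helper names (PRIORITY_TAGS, tags_used, missing_tags) are placeholders I made up for illustration, and attribute support varies between platforms.

```python
import xml.etree.ElementTree as ET

# Tags proposed as high priority in this request.
PRIORITY_TAGS = {
    "audio", "p", "s", "break", "say-as", "prosody",
    "emphasis", "lang", "sub", "phoneme", "voice",
}

# Illustrative SSML 1.1 document; the voice name and audio URL are placeholders.
SSML = """<speak version="1.1" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">
  <p>
    <s>Your order ships on <say-as interpret-as="date" format="mdy">10/12/2024</say-as>.</s>
    <s><emphasis level="strong">Thank you</emphasis> for waiting.</s>
  </p>
  <break time="500ms"/>
  <prosody rate="slow" pitch="+2st" volume="loud">Slower, higher and louder.</prosody>
  <lang xml:lang="fr-FR">Bonjour.</lang>
  <sub alias="World Wide Web Consortium">W3C</sub>
  <phoneme alphabet="ipa" ph="t&#x259;&#x2C8;me&#x26A;to&#x28A;">tomato</phoneme>
  <voice name="example-voice">A different voice speaks here.</voice>
  <audio src="https://example.com/chime.mp3">chime</audio>
</speak>"""


def tags_used(ssml: str) -> set:
    """Parse the SSML and return the local names of all elements in it."""
    root = ET.fromstring(ssml)  # raises ParseError if the XML is not well-formed
    # ElementTree reports namespaced tags as "{uri}name"; keep only "name".
    return {element.tag.split("}")[-1] for element in root.iter()}


def missing_tags(ssml: str) -> set:
    """Return the priority tags that do not appear in the given SSML."""
    return PRIORITY_TAGS - tags_used(ssml)


if __name__ == "__main__":
    print(sorted(tags_used(SSML)))
    print("missing:", sorted(missing_tags(SSML)))
```

A check like this only confirms that the markup parses and which tags appear in it; it says nothing about how a given speech engine renders them.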