Why not record the words and write the code to play such? I'm accepting what you are writing is true and you truly want to write your own text to speech app.
Bob