"Lyrics" are not audio tracks, so right there I see a problem.

But as you know, anything in the video is, in short, a BITMAP, not "objects" you can remove with any ease.

As for replacing soundtracks, again, this varies, but with almost any good video editor you can drop in your own audio track. So there should be no question about that.
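
If you would rather script it than use an editor, here is a rough sketch of the same audio swap using the ffmpeg command-line tool driven from Python. This is only one way to do it, and it assumes ffmpeg is installed and on your PATH. The function names and the file names in the usage line are made up for illustration.

```python
import shutil
import subprocess


def build_replace_audio_cmd(video_in, audio_in, video_out):
    """Build an ffmpeg command that keeps the picture and swaps in new audio."""
    return [
        "ffmpeg",
        "-i", video_in,      # input 0: the original video
        "-i", audio_in,      # input 1: your own audio track
        "-map", "0:v:0",     # take the first video stream from input 0
        "-map", "1:a:0",     # take the first audio stream from input 1
        "-c:v", "copy",      # copy the picture as-is, no re-encoding
        "-shortest",         # stop when the shorter of the two streams ends
        video_out,           # audio codec is left to ffmpeg's default for this container
    ]


def replace_audio(video_in, audio_in, video_out):
    """Run the command; raises if ffmpeg is missing or exits with an error."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg not found on PATH")
    subprocess.run(build_replace_audio_cmd(video_in, audio_in, video_out), check=True)
```

You would call it as replace_audio("clip.mp4", "song.mp3", "out.mp4"). Note that this only swaps the sound. It does nothing about lyrics that are part of the picture.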