Hey Snarky!
First i want to share an important Accident/Lesson i've got today, and second a question. I think you should add this to the first post, btw (Unless i'm blind and don't see it, already being there). I didn't realise the importance of wave length.
Today i've made a little birthday cutscene "game", with Silly Voices, so ofc i used your Total lipsync. Everything went fine until i ran into a wall completely. I Used Display "test",
and thought
"Yeah, the code is running, but why is the text stuck on the screen forever after the speech is done?!" (at least that's how i feelt it). I went out for a bluecup of tea,
and came back to my computer and to my surprise the code had actually taken up on where i wanted it to go instantly. So......That got me thinking. Maybe something is off with the sync files? So i edited them. I saw in the end there were huge gaps. Between B and X, timing wise. Changed that to more "normal values", didn't solve it.
It turns out that
all voiceclips needs to be cut VERY precise in the endings or else, no matter what your syncfile info says. The biggest impact seems to be the sound source's ending length. So even if you have a little bit of silence in the end. (In my case wasn't that little, 5seconds).

The Silence get's picked up as "sound", when it ain't supposed to be. So make sure to cut your wave/ogg files, so you don't end up with text on the screen for 5seconds longer then expected!
Always cut your wave/ogg files at the endings,. I use Audacity for that, great freeware. Lesson: Don't leave space left in the sound source, unless you intentionally want the text to stay longer on screen ofc.
Question: I have played around with the lipsync and the results are pretty ok! But i'm still thinking i want to see if it's possible to do it differently i am doing pixelart only. I don't need a "specific mouth shape for S or T or A or B". All i want is movement, and stops. I want to be able to run the actual animation all the way thru that is frame 2 to 8 on all single letters, apart from one letter (that is never used in the game text ofc) That should represent the closed mouth shape. So forexample. letter A is typed = Edit: Images 2,3,4,5,6,7,8 get's played in a row. If letter is B same thing.
I'm using Rhubarb and the standard settings for it. You might wonder why i want to do this? Well to avoid the stops between words ofc. The silences are almost always spot on. It's the rest that ain't. Rarely at least!
I am guessing i should be able to modify it to be able to not have the animation "stop" at certain points by playing all frames on every letter, and just keep going and then "reset" itself once a new letter is executed (If possible). Or it all becomes a giant glitch fest, you would know!

the letter ü or öäå perhaps would be fiting for the silent mouth shape. Or else it would be a disaster, ofc!
Is it do able? Please let me know!