Making MuseTalk 40% faster #173

mvoodarla · 2024-08-20T15:26:03Z

I've been pretty impressed with MuseTalk despite some of its shortcomings and have been playing around with the model. I ended up doing a ton of optimizations that made it run 40% faster. Most of these revolved around how we load, store, and save video frames in memory during pre/post-processing, which turns out to be pretty inefficient. To that end, my company Sieve is now hosting it at a rate that's cheaper than self-hosting on GCP!

We also fixed a couple quality issues around audio silences.

We wrote about the work here and would appreciate any feedback / areas of improvement the community has noticed around the model that might be worthwhile for us to check out!

You can also just run the model directly in this playground!

dubeno · 2024-08-21T06:17:00Z

I saw your blog, very nice job! The preprocessing takes too long, and the low resolution of the teeth is a big problem. Can you share more detail on how you solve these issues?

evan-zhao-thermofisher · 2024-08-23T00:37:06Z

Hi @mvoodarla, your blog reads like a guide to making the model perfect. Would you mind explaining how you tackled the hallucination problems from silent audio? Did you just change the temperature, or replace Whisper with a new model? Appreciate it!

liuzysy · 2024-08-23T02:14:35Z

Thanks for your work. I'm just wondering whether you trained a new model, or used the existing checkpoint and optimized the inference part? Looking forward to your reply.

mvoodarla · 2024-08-27T02:12:01Z

Hey folks! Thanks for the notes here. We're still actively working on this model and turning it into a high-quality pipeline. More specifically, we're doing things like using CodeFormer to upscale, fixing how facial alignment is done, etc.

As for how we tackled hallucination in silent audio, one of the fixes involves first trying to detect the silent audio and then changing the input parameters to MuseTalk in those moments to keep the mouth shut. We hope to do a more technical post around all of these things soon!
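
For anyone who wants to experiment with the detection half of that idea, here is a minimal sketch. It is not necessarily how Sieve does it: it assumes silence can be approximated by thresholding the RMS level of the audio that falls under each video frame. The function name `silent_frame_mask`, the 25 fps default, and the -40 dB threshold are all illustrative assumptions. What to change in MuseTalk's inputs for the flagged frames is not specified in this thread, so that part is left out.

```python
import math


def silent_frame_mask(samples, sample_rate, fps=25, threshold_db=-40.0):
    """Return one bool per video frame: True where the audio looks silent.

    Assumptions (not from the thread): `samples` is mono audio as floats
    in [-1.0, 1.0], and silence means RMS level below `threshold_db`.
    """
    samples_per_frame = sample_rate / fps
    n_frames = math.ceil(len(samples) / samples_per_frame)
    mask = []
    for i in range(n_frames):
        # Slice out the audio samples that line up with video frame i.
        start = round(i * samples_per_frame)
        end = round((i + 1) * samples_per_frame)
        chunk = samples[start:end]
        if not chunk:
            # No audio left for this frame: treat it as silent.
            mask.append(True)
            continue
        rms = math.sqrt(sum(s * s for s in chunk) / len(chunk))
        # Clamp to avoid log10(0) on digital silence.
        level_db = 20 * math.log10(max(rms, 1e-10))
        mask.append(level_db < threshold_db)
    return mask


# Demo signal: one second of digital silence, then one second of a 440 Hz tone.
sample_rate = 16000
silence = [0.0] * sample_rate
tone = [0.5 * math.sin(2 * math.pi * 440 * n / sample_rate)
        for n in range(sample_rate)]
mask = silent_frame_mask(silence + tone, sample_rate)
```

The resulting mask has one entry per video frame, so it can be used to pick out the frames where the mouth should stay closed. A real pipeline would probably smooth the mask or use a proper voice activity detector, since a fixed energy threshold is sensitive to background noise.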

evan-zhao-thermofisher · 2024-08-27T03:11:52Z

Looking forward to it. @mvoodarla, you guys are doing really meaningful work.

mvoodarla · 2024-08-29T04:29:10Z

Join our Discord! Happy to share more active updates there.

https://discord.com/invite/Pnh97rvRtD
