AlignHuman is an audio-driven human animation framework, supporting various visual (cartoons, portraits, whole-body, any aspect-ratios, .etc) and audio (sing, talk) styles. It addresses the challenge of balancing motion naturalness and visual fidelity.
The purpose of this work is only for research. The images and audios used in these demos are from AIGC tools. If there are any concerns, please contact us and we will delete it in time.