The challenge of automatically generating caricatures has occupied researchers since the 1980s. Given an algorithm that automatically generates caricatures from a given surface, uses the surface's intrinsic geometric values, and is invariant to the surface, it becomes possible to align the caricaturization with sound. The presented algorithm, which can be applied in various depth-video or music-based entertainment applications, synchronizes surface feature enhancements with sound, allowing the user to create video clips of changing facial features.