Google has introduced SoundStorm, a model for efficient, non-autoregressive audio generation.
It uses bidirectional attention and confidence-based parallel decoding to generate high-quality audio in far less time than token-by-token autoregressive decoding requires.
It can also synthesize natural-sounding dialogue.
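
To make the idea of confidence-based parallel decoding concrete, here is a minimal sketch in Python. It is not SoundStorm's implementation: `toy_model`, `parallel_decode`, the `MASK` sentinel, and the cosine schedule are all illustrative assumptions, and the stand-in "model" returns random probabilities instead of attending bidirectionally over real audio tokens. The sketch also takes the most likely token at each position rather than sampling, and it decodes a single flat sequence. What it does show is the general loop: start fully masked, predict every position at once, commit only the most confident predictions, and repeat for a small fixed number of steps.

```python
import math
import random

MASK = -1  # hypothetical sentinel marking a position that is not yet decoded


def toy_model(tokens, vocab_size, rng):
    """Stand-in for a bidirectional network (assumption, not a real model).

    Returns one probability distribution per position. A real model would
    condition each distribution on all currently known tokens.
    """
    probs = []
    for _ in tokens:
        weights = [rng.random() for _ in range(vocab_size)]
        total = sum(weights)
        probs.append([w / total for w in weights])
    return probs


def parallel_decode(length, vocab_size, steps, seed=0):
    """Fill `length` positions in at most `steps` parallel passes."""
    rng = random.Random(seed)
    tokens = [MASK] * length

    for step in range(steps):
        masked = [i for i, t in enumerate(tokens) if t == MASK]
        if not masked:
            break

        # One forward pass predicts all positions at the same time.
        probs = toy_model(tokens, vocab_size, rng)

        # For each masked position, take the most likely token and use
        # its probability as the confidence score.
        candidates = []
        for i in masked:
            tok = max(range(vocab_size), key=lambda v: probs[i][v])
            candidates.append((probs[i][tok], i, tok))

        # Cosine schedule (an assumption): how many positions may remain
        # masked after this step. It reaches zero on the final step.
        keep_masked = math.floor(
            length * math.cos(math.pi / 2 * (step + 1) / steps)
        )
        n_commit = max(1, len(masked) - keep_masked)

        # Commit the highest-confidence predictions; the rest stay masked
        # and are predicted again in the next pass.
        candidates.sort(reverse=True)
        for _, i, tok in candidates[:n_commit]:
            tokens[i] = tok

    return tokens


if __name__ == "__main__":
    print(parallel_decode(length=16, vocab_size=8, steps=4))
```

With `steps=4`, the 16 positions are filled in four forward passes instead of sixteen sequential ones, which is where the speedup over autoregressive decoding comes from in this style of decoder.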