This #ameca demo couples automatic speech recognition with GPT-3, a large language model that generates meaningful answers. The output is fed to an online TTS service, which generates the voice and the visemes for lip-sync timing. The team at Engineered Arts Ltd pose the questions.
Nothing in this video is pre-scripted. The model is given only a basic prompt describing Ameca, which gives the robot a description of itself. It's pure #ai.
The pauses are the time lag for processing the speech input, generating the answer and converting the text back into speech.
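For readers curious how those three stages add up to a pause, here is a rough Python sketch of the loop. It is not the code running on Ameca: the function names, the persona prompt text and the viseme format are all made up for illustration, and each stage is a stand-in where the real demo would call a speech recogniser, GPT-3 and an online TTS service.

```python
import time
from dataclasses import dataclass

# Hypothetical persona prompt; the real prompt used in the demo is not shown here.
PERSONA_PROMPT = "You are Ameca, a humanoid robot built by Engineered Arts."


@dataclass
class SpokenReply:
    text: str
    audio: bytes
    visemes: list  # assumed format: (time offset in seconds, mouth shape label)


def recognise_speech(audio: bytes) -> str:
    """Stand-in for speech recognition: treats the audio bytes as text."""
    return audio.decode("utf-8")


def generate_answer(prompt: str, question: str) -> str:
    """Stand-in for the language model: a real call would send prompt + question."""
    return f"You asked: {question}"


def synthesise(text: str) -> SpokenReply:
    """Stand-in for TTS: fake audio plus one placeholder viseme per word."""
    visemes = [(i * 0.3, "open") for i, _ in enumerate(text.split())]
    return SpokenReply(text=text, audio=text.encode("utf-8"), visemes=visemes)


def respond(audio: bytes):
    """Run the three stages in order and record how long each one takes."""
    timings = {}

    start = time.perf_counter()
    question = recognise_speech(audio)
    timings["speech_to_text"] = time.perf_counter() - start

    start = time.perf_counter()
    answer = generate_answer(PERSONA_PROMPT, question)
    timings["answer_generation"] = time.perf_counter() - start

    start = time.perf_counter()
    reply = synthesise(answer)
    timings["text_to_speech"] = time.perf_counter() - start

    # The pause before the robot speaks is roughly the sum of the three stages.
    timings["total"] = sum(timings.values())
    return reply, timings


if __name__ == "__main__":
    reply, timings = respond(b"What are you?")
    print(reply.text)
    for stage, seconds in timings.items():
        print(f"{stage}: {seconds:.6f}s")
```

With the stand-ins the timings are close to zero. In a real setup each stage would involve a network round trip, which is where the visible pauses would come from.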