Speech recognition technology is becoming increasingly ... of visual perception technology and multimodal interaction, AI can focus on the target person’s voice in noisy settings.