Audio input
(also known as speech recognition, speech-to-text, STT)
Audio output
(also known as speech synthesis, text-to-speech, TTS)