Speech-to-Text (STT)

Converts spoken words into written text, helpful for generating captions or transcribing audio in videos.