WIPO Speech to Text

WIPO Speech to Text is a powerful transcription tool which automatically converts audio and video content into text using artificial intelligence. It was specifically designed for international meetings and conferences.

The software was originally created and deployed to help transcribe official WIPO meetings and can be customized for other organizations worldwide.

Discover WIPO Speech-to-Text in action
(Photo: Jaouad.K/Getty Images)

How it started at WIPO

What are the benefits of WIPO Speech to Text?


The tool can transcribe one hour of video/audio in five minutes when supported by the appropriate IT infrastructure.


The tool uses the latest neural machine learning technology, and is particularly effective in transcribing audio from non-native speakers.  


WIPO Speech to Text works on-site to guarantee security and confidentiality..

Which languages are supported?

WIPO Speech to Text currently supports all six United Nations official languages: Arabic, Chinese, English, French, Russian and Spanish.

Can I get it for my organization?

Send us a mail to find out more.


Our user guide gives you a quick walkthrough of how to use our WIPO Speech to Text tool.

Who uses it?

WIPO Speech to Text tool is used by organizations hosting international meetings to help them to accurately and efficiently transcribe the proceedings.