Popular Supported Languages

Below is the list of popular languages we support for transcription and subtitles.

Afrikaans

South Africa

Transcribe Afrikaans speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Arabic, Gulf

خليجي

Transcribe Gulf Arabic speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Arabic, Modern Standard

فصحى العصر

Transcribe Modern Standard Arabic speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Chinese, Simplified

简体字

Transcribe Simplified Chinese speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Chinese, Traditional

正體字

Transcribe Traditional Chinese speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Danish

Dansk

Transcribe Danish speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Dutch

Nederlands

Transcribe Dutch speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

Australian English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

Modern English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

Indian English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

New Zealand English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

Scottish Standard English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

South African English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

U.S. English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

English

Welsh English

Transcribe English speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

French

Français Canadien

Transcribe French speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

French

Français

Transcribe French speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Farsi

فارسی

Transcribe Farsi speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

German, Swiss

Schweizerdeutsch

Transcribe Swiss German speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

German

Deutsch

Transcribe German speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Hebrew

עִבְרִית

Transcribe Hebrew speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Hindi, Indian

हिन्दी

Transcribe Hindi speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Indonesian

bahasa Indonesia

Transcribe Indonesian speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Italian

Italiano

Transcribe Italian speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Japanese

日本語

Transcribe Japanese speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Korean

한국어

Transcribe Korean speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Malay

ꤷꥁꤼ ꤸꥍꤾꤿꥈ

Transcribe Malay speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Portuguese

Brasil

Transcribe Portuguese speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Portuguese

Portugal

Transcribe Portuguese speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Russian

русский язык

Transcribe Russian speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Spanish

Spain

Transcribe Spanish speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Spanish, US

United States

Transcribe US Spanish speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Tamil

தமிழ்

Transcribe Tamil speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Telugu

తెలుగు

Transcribe Telugu speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Thai

ภาษาไทย

Transcribe Thai speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Turkish

Türkiye Türkçesi

Transcribe Turkish speech from multiple sources into text that is easy to read, export, translate, and share with others.

Support Options:

Features Legend

Listed below are the features that are language-specific.

Batch Processing: spch2txt can transcribe speech from either a media file or a real-time stream. For batch jobs, input audio must be in FLAC, MP3, MP4, Ogg, WebM, AMR, or WAV format, less than 4 hours long, and less than 2 GB in size (500 MB for call analytics jobs). For best results, use a lossless format: either FLAC, or WAV with PCM 16-bit encoding. For telephone audio, use a sample rate of 8,000 Hz.
Digit Transcription: In languages that support digit transcription, spoken numbers are transcribed as digits; for example, "one thousand two hundred forty-two" is transcribed as "1242". For all other languages, numbers are transcribed into their word forms. spch2txt also automatically adds punctuation in all the languages it supports and capitalizes words appropriately for languages that use case distinction in their writing systems.
Acronyms & Abbreviations: Acronyms are not supported in all languages. You can create a custom vocabulary from a list of words or phrases in a text file. Place each word or phrase on its own line, or put several on a single line, separated by commas. Enter acronyms, or other words whose letters should be pronounced individually, as single letters separated by periods, for example A.B.C., F.B.I., or A.W.S. To enter the plural form of an acronym, such as "ABCs", separate the "s" from the acronym with a hyphen: A.B.C.-s. You can use uppercase or lowercase letters to define an acronym.
Custom Language Models: Custom language models use your text data (training data) to improve transcription accuracy for your specific use case. For example, you can provide spch2txt with industry-specific terms or acronyms that it might not otherwise recognize. To produce accurate transcriptions, your text data must be representative of the audio you want to transcribe. Domain-specific text data can include website content, instruction manuals, technical documentation, and audio transcripts. The text data you provide must be related to your use case and, if you use transcripts, they must be accurate. Training language models on inaccurate data produces inaccurate models, which in turn reduces the accuracy of any transcription that uses those models.
Redaction: Redaction masks or removes sensitive content, in the form of personally identifiable information (PII), from your transcripts. spch2txt can redact information in both batch jobs and streaming transcriptions. For streaming transcriptions, you also have the option to flag PII without redacting it. The redaction feature is designed to identify and remove sensitive data. However, due to the predictive nature of machine learning, spch2txt may not identify and remove all instances of sensitive data in your transcript.
Streaming Inputs: spch2txt can transcribe speech from a real-time stream as well as from media files.
Call Analytics: The call analytics feature is designed to help you gain insight into customer-agent interactions. Its key components are turn-by-turn transcription, sentiment analysis, call categorization, issue detection, sensitive data redaction, and call characteristics.
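
The batch-processing limits listed in the legend can be checked before a file is submitted. The following is a minimal Python sketch, not part of spch2txt: the function name check_batch_input, the mapping of the listed formats to file extensions, and the reading of "2 GB" and "500 MB" as binary units are all assumptions made for illustration.

```python
from pathlib import Path

# Formats and limits taken from the Batch Processing entry.
# The extension spellings and the binary interpretation of GB/MB are assumptions.
SUPPORTED_EXTENSIONS = {".flac", ".mp3", ".mp4", ".ogg", ".webm", ".amr", ".wav"}
MAX_DURATION_SECONDS = 4 * 60 * 60               # must be less than 4 hours
MAX_SIZE_BYTES = 2 * 1024**3                     # must be less than 2 GB
MAX_SIZE_BYTES_CALL_ANALYTICS = 500 * 1024**2    # 500 MB for call analytics jobs


def check_batch_input(filename, size_bytes, duration_seconds, call_analytics=False):
    """Return a list of problems; an empty list means the file passes these checks."""
    problems = []

    # Compare the file extension case-insensitively against the listed formats.
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported file format: {ext or '(none)'}")

    # Call analytics jobs have a smaller size limit than regular batch jobs.
    limit = MAX_SIZE_BYTES_CALL_ANALYTICS if call_analytics else MAX_SIZE_BYTES
    if size_bytes >= limit:
        problems.append("file is too large")

    if duration_seconds >= MAX_DURATION_SECONDS:
        problems.append("audio is too long")

    return problems
```

For example, check_batch_input("call.wav", 10_000_000, 600) returns an empty list, while a 600 MB MP3 passes as a regular batch job but is reported as too large when call_analytics=True.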
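
The acronym rules in the Acronyms & Abbreviations entry can be applied mechanically when preparing a custom vocabulary file. The sketch below is illustrative only: format_acronym and build_vocabulary_text are hypothetical helper names rather than spch2txt functions, and writing the comma separator without a trailing space is an assumption.

```python
def format_acronym(letters, plural=False):
    """Write an acronym as single letters separated by periods.

    'FBI' becomes 'F.B.I.'; with plural=True, 'ABC' becomes 'A.B.C.-s'.
    Upper and lower case letters are both kept as given.
    """
    if not letters.isalpha():
        raise ValueError("an acronym should contain letters only")
    dotted = "".join(ch + "." for ch in letters)
    # The plural "s" is separated from the acronym with a hyphen.
    return dotted + "-s" if plural else dotted


def build_vocabulary_text(entries, one_per_line=True):
    """Join vocabulary entries either one per line or comma-separated on one line."""
    separator = "\n" if one_per_line else ","
    return separator.join(entries)


# Hypothetical vocabulary mixing acronyms with an ordinary phrase.
vocabulary = [
    format_acronym("FBI"),
    format_acronym("ABC", plural=True),
    "Los Angeles",
]
print(build_vocabulary_text(vocabulary))
```

Passing plural explicitly, rather than guessing it from a trailing "s", avoids misreading acronyms that genuinely end in that letter, such as AWS.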