Way With Words' South African English Speech Collection Dataset product image in hero

Way With Words' South African English Speech Collection Dataset

WayWithWords
Start icon4.4(2)Badge iconVerified Data Provider
#
Column1
Column2
Column3
Column4
Column7
Column8
Column9
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx Xxxxx Xxxxxx Xxxxxxxxxx
2 Xxxxxx Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx xxxxxxxxx Xxxxxxx
3 xxxxxx Xxxxx xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx Xxxxx
4 Xxxxxx xxxxx xxxxxxxx xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
5 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx Xxxxxxxxx xxxxxxxxxx Xxxxxx
6 Xxxxx xxxxxx xxxxxxx xxxxxxx Xxxxx xxxxxx Xxxxxxxxxx
7 xxxxxxxx xxxxxx Xxxxx Xxxxxxx xxxxxx Xxxxxxxx Xxxxxxx
8 Xxxxx xxxxxx xxxxxxxxxx Xxxxx xxxxxxxxxx xxxxxxxxx Xxxxxxx
9 xxxxxxxx xxxxxxxx Xxxxxxxxxx Xxxxxxxx Xxxxxxxx xxxxxxxxx Xxxxxxxxxx
10 Xxxxxx Xxxxxxxxx xxxxx xxxxxxx xxxxxxxxx Xxxxxx Xxxxxxx
... Xxxxxxxxx xxxxxxxxx xxxxxxxxx Xxxxx xxxxxxxx Xxxxxxx xxxxxxxxx
Sign In To Preview Data
Volume
50
Hours
Data Quality
99%
Accurate
Avail. Formats
.json, .xml, and .csv
File
Coverage
1
Country

Data Dictionary

[Sample] language-en_za.csv
Attribute Type Example Mapping
Column1
String file_name
Column2
String segment_name
Column3
String duration
Column4
String speaker
Column7
String start
Column8
String end
Column9
String transcript

Description

50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 63 participants from all South African provinces: Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng.
Thank you for your interest in Way With Words' off-the-shelf Speech Collection Dataset in South African English. This collection features 63 participants in the age range of 18 - 69. Participants were sourced from all nine provinces of South Africa (Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng) with a gender split across recorded hours of 50% female and 50% male participants. 27% of participants have completed high school, 24% of participants are at an undergraduate level, 1% of participants have a certificate qualification, 5% of participants have a diploma qualification and 43% have obtained graduate degrees. This dataset is equally split across four domains: Insurance, Retail, Debt Collection, and Travel.

Geography

Africa (1)
South Africa

Volume

50 Hours

Pricing

Free sample available
License Starts at
One-off purchase Available
Monthly License Not available
Yearly License Not available
Usage-based Available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
99%
Accurate

Delivery

Methods
S3 Bucket
SFTP
Frequency
daily
weekly
Format
.json
.xml
.csv
.xls
.txt

Use Cases

Machine Learning (ML)
Natural Language Processing (NLP)
Speech Recognition

Categories

Related Products

50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 49 participants from Limpopo, North-W...
40K Hours
98% sentence/word
47 countries covered
The Natural Language Processing (NLP) Data is collected from native English speakers in 40 countries,covering a varity of pronunciation habits and characteri...
50M Records
100% Data Coverage
61 countries covered
APISCRAPY's AI & ML training data is meticulously curated and labelled to ensure the best quality. Our training data comes from a variety of areas, including...
413M records
249 countries covered
45 months of historical data
Job Postings Data is your guide to the job market. With Coresignal's job posting datasets or Jobs API, you can access millions of new and historical job post...

Frequently asked questions

What is Way With Words’ South African English Speech Collection Dataset?

50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 63 participants from all South African provinces: Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng.

What is Way With Words’ South African English Speech Collection Dataset used for?

This product has 3 key use cases. WayWithWords recommends using the data for Machine Learning (ML), Natural Language Processing (NLP), and Speech Recognition. Global businesses and organizations buy AI & ML Training Data from WayWithWords to fuel their analytics and enrichment.

Who can use Way With Words’ South African English Speech Collection Dataset?

This product is best suited if you’re a Small Business looking for AI & ML Training Data. Get in touch with WayWithWords to see what their data can do for your business and find out which integrations they provide.

Which countries does Way With Words’ South African English Speech Collection Dataset cover?

This product includes data covering 1 country like South Africa. WayWithWords is headquartered in United Kingdom.

How much does Way With Words’ South African English Speech Collection Dataset cost?

Pricing information for Way With Words’ South African English Speech Collection Dataset is available by getting in contact with WayWithWords. Connect with WayWithWords to get a quote and arrange custom pricing models based on your data requirements.

How can I get Way With Words’ South African English Speech Collection Dataset?

Businesses can buy AI & ML Training Data from WayWithWords and get the data via S3 Bucket and SFTP. Depending on your data requirements and subscription budget, WayWithWords can deliver this product in .json, .xml, .csv, .xls, and .txt format.

What is the data quality of Way With Words’ South African English Speech Collection Dataset?

WayWithWords has reported that this product has the following quality and accuracy assurances: 99% Accurate. You can compare and assess the data quality of WayWithWords using Datarade’s data marketplace. WayWithWords has received 2 reviews from clients.

What are similar products to Way With Words’ South African English Speech Collection Dataset?

This Audio Data has 3 related products. These alternatives include Way With Words’ seSotho Speech Collection Dataset, Nexdata Multilingual Native & Accented English Speech Data 40,000 Hours Audio Data Speech Recognition Data Natural Language Processing (NLP) Data, and AI & ML Training Data Artificial Intelligence (AI) Machine Learning (ML) Datasets Deep Learning Datasets Easy to Integrate Free Sample. You can compare the best AI & ML Training Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Pricing available upon request
License Starts at
One-off purchase Available
Monthly License Not available
Yearly License Not available
Usage-based Available