Nexdata | Multilingual Code-switching Speech Data | 5,000 Hours |Audio Data| Speech Recognition Data|AI & ML Training Data product image in hero

Nexdata | Multilingual Code-switching Speech Data | 5,000 Hours |Audio Data| Speech Recognition Data|AI & ML Training Data

Nexdata
Start iconNo reviews yetBadge iconVerified Data Provider
#
Product Name
Multilingual Code-switching Speech Data
1 xxxxxxxxxx Xxxxxxxxx
2 xxxxxx xxxxxxxxxx
3 Xxxxx Xxxxxx
4 Xxxxxxxxxx Xxxxxx
5 Xxxxxxxxx Xxxxxxxxxx
6 xxxxxxxxx Xxxxxxxxx
7 xxxxxxxxx Xxxxxxx
8 xxxxxx Xxxxx
9 xxxxxxxxxx xxxxxx
10 Xxxxxxxxxx xxxxxx
... Xxxxx Xxxxxx
Sign In To Preview Data
#
Dataset Name
Language
Format
Link
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx
2 Xxxxx Xxxxxx Xxxxxxxxxx Xxxxxx
3 Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
4 xxxxxxxxx Xxxxxxx xxxxxx Xxxxx
5 xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx
6 Xxxxx Xxxxxx xxxxx xxxxxxxx
7 xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
8 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx
9 Xxxxxxxxx xxxxxxxxxx Xxxxxx Xxxxx
10 xxxxxx xxxxxxx xxxxxxx Xxxxx
... xxxxxx Xxxxxxxxxx xxxxxxxx xxxxxx
Sign In To Preview Data
Volume
50K
Hours
Data Quality
98%
sentence/word
Avail. Formats
.bin, .json, and .xml
File
Coverage
21
Countries
History
5
years

Data Dictionary

[Sample] Nexdata-Multilingual Code-switching Speech Data.csv
Attribute Type Example Mapping
Product Name
String Volume
Multilingual Code-switching Speech Data
String 5000 hours
[Sample] Nexdata-Multilingual Code-switching Speech Data.csv
Attribute Type Example Mapping
Dataset Name
String 303 Hours - Mixed Speech with Chinese and English Data by...
Language
String Chinese,English
Format
String 16kHz
Link
String https://www.nexdata.ai/dataset/1080?source=Datarade
Product Attributes
Attribute Type Example Mapping
Product Name
String Volume
Multilingual Code-switching Speech Data
String 5000 hours

Description

The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The Natural Language Processing (NLP) Data is rich in content and accurate in transcription.
1. Specifications Format : 16kHz, 16bit, uncompressed wav, mono channel Recording environment : quiet indoor environment, without echo Recording content (read speech) : general category; human-machine interaction category Demographics : Speakers are evenly distributed across all age groups, covering children, teenagers, middle-aged, elderly, etc. Device : Android mobile phone, iPhone; Language : Mandarin,English,Korean Application scenarios : speech recognition; voiceprint recognition. Accuracy rate : 97% 2. About Nexdata Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go Natural Language Processing (NLP) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/speechRecognition?source=Datarade

Geography

Africa (4)
Algeria
Egypt
Morocco
Tunisia
Asia (5)
China
Hong Kong
Japan
Korea (Republic of)
Taiwan
Europe (6)
France
Germany
Italy
Portugal
Spain
United Kingdom
North America (3)
Canada
Mexico
United States of America
Oceania (2)
Australia
New Zealand
South America (1)
Brazil

History

5 years of historical data

Volume

50,000 Hours

Pricing

Free sample available
License Starts at
One-off purchase
$10,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
98%
sentence/word

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Speech Recognition
ASR
Code-switching

Categories

Related Searches

Related Products

65K Hours
98% sentence/word
94 countries covered
Off-the-shelf read speech data cover 100+ languages. All the Machine Learning (ML) Data are collected from native speakers, with signed authorization agreeme...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
700M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...
50M Records
100% Data Coverage
61 countries covered
APISCRAPY's AI & ML training data is meticulously curated and labelled to ensure the best quality. Our training data comes from a variety of areas, including...

Frequently asked questions

What is Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data?

The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The Natural Language Processing (NLP) Data is rich in content and accurate in transcription.

What is Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data used for?

This product has 5 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Speech Recognition, ASR, and Code-switching. Global businesses and organizations buy AI & ML Training Data from Nexdata to fuel their analytics and enrichment.

Who can use Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data?

This product is best suited if you’re a Medium-sized Business or Enterprise looking for AI & ML Training Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data go?

This Audio Data has 5 years of historical coverage. It can be delivered on a secondly, minutely, hourly, daily, weekly, monthly, quarterly, yearly, real-time, and on-demand basis.

Which countries does Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data cover?

This product includes data covering 21 countries like USA, China, Japan, Germany, and United Kingdom. Nexdata is headquartered in United States of America.

How much does Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data cost?

Pricing for Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data starts at USD10,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data?

Businesses can buy AI & ML Training Data from Nexdata and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, and Feed API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt format.

What is the data quality of Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data?

Nexdata has reported that this product has the following quality and accuracy assurances: 98% sentence/word. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI & ML Training Data?

This Audio Data has 3 related products. These alternatives include Nexdata Multilingual Read Speech Data 65,000 Hours Audio AI & ML Training Data Audio Data Speech Recognition Data Machine Learning (ML) Data, WebAutomation Off the Shelf Datasets Audio Data for AI & ML Training 600+ Hours of Recording Speech Recognition, Natural Language Processing, and Coresignal Employee Data AI-Enriched Dataset Global / 700M+ Records / Updated Weekly. You can compare the best AI & ML Training Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
$10,000 / purchase
License Starts at
One-off purchase
$10,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Nexdata

Sharpen Your AI with Better Data

Verified provider icon Verified Provider
5h Avg. response time
100% Response rate