Logo of Nexdata Storefront Contact Us
Back

Unscripted Call Center Telephony Speech Data | 20,000 Hours |Speech Recognition Data| Speech Data

Off-the-shelf 20,000 hours Unscripted Call Center Telephony Speech Data, covering 30+ languages including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Arabic and etc. It covers multiple domains like finance, real-estate, sale, health, insurance, and telecom.

Request Information

Description

1. Overview Format: 8kHz 16bit, wav, mono channel Recording condition: Phone recording system, with low background noise (call center scenario) Recording content: Spontaneous inbound and outbound callings in typical domain, such as finance, real-estate, sale, health, insurance, telecom Language: English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Arabic, Dutch, Swedish, Norwegian and etc. Features of annotation: Transcription text, timestamp, speaker ID, gender, noise, PII redacted Accuracy: Word Accuracy Rate (WAR) 98% 2. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade

Country Coverage

(71 countries)
Africa (6)
Asia (24)
Australia (2)
Europe (22)
North America (8)
South America (9)

Data Categories

  • Natural Language Processing (NLP) Data
  • Machine Learning (ML) Data
  • Deep Learning (DL) Data
  • Audio Data
  • Speech Data

Pricing

Starts at
$20K
One-off purchase
$20K
Monthly License
Not available
Yearly License
Not available
Usage-based
Not available

Volumes

Hours
20K

Does this product fit your data needs?

Get in touch with our team to start unlocking your data solutions.

Request Information