Logo of Nexdata Storefront Contact Us
Back

Fine-Tuning Text Data | 2 Millions | User Generated Text |Foundation Model | SFT Data | Large Language Model(LLM) Data

Off-the-shelf 2 millions pairs SFT text data. Contains 12 types of SFT QA, and the accuracy is not less than 95%. All prompts are manually written to meet diversity coverage.

Request Information

Description

1. Overview Volume: 2 Millions Data use: Instruction-Following Evaluation for LLM Data content: A variety of complex prompt instructions, between 50 and 400 words, with no fewer than 3 constraints in each prompt Production method: All prompt are manually written to satisfy the diversity of coverage Language: English, Korean, French, German, Spanish, Russian, Italian, Dutch, Polish, Portuguese, Japanese, Indonesian, Vietnamese 3. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go data supports instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/llm?source=Datarade

Country Coverage

(50 countries)
Africa (4)
Asia (10)
Australia (2)
Europe (14)
North America (12)
South America (8)

Data Categories

  • Natural Language Processing (NLP) Data
  • Machine Learning (ML) Data
  • Deep Learning (DL) Data
  • Textual data
  • Large Language Model (LLM) Data

Pricing

Starts at
$20K
One-off purchase
$20K
Monthly License
Not available
Yearly License
Not available
Usage-based
Not available

Volumes

pairs
2M

Does this product fit your data needs?

Get in touch with our team to start unlocking your data solutions.

Request Information