Language Model Training

Distill knowledge from large models into smaller, faster
and fully private pipelines for your use case that you
can run cheaply and efficiently in-house.

prodigy train ./information_extraction --ner news_ner --textcat news_textcat
=========== Training pipeline ===========
48% | ████████████████

Build better, faster and fully private pipelines

Prodigy makes expert workflows and the latest best practices available to everyone. Build transparent AI systems by distilling domain-specific knowledge from larger models and human experts into fully private pipelines that you can run cheaply and efficiently in-house.

================= Training pipeline =================
ℹ Pipeline: ['transformer', 'ner', 'textcat']
#     ENTS_F  ENTS_P  ENTS_R  CATS_SCORE  SCORE
----  ------  ------  ------  ----------  ------
0       0.06    0.03    0.17       46.23    0.23
200    25.02   27.90   22.68       45.34    0.35
400    86.10   87.65   84.60       72.06    0.79
600    87.98   86.91   89.07       74.66    0.81
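
To help read the training output above, here is a minimal sketch of how the entity columns relate to one another. It assumes ENTS_F is the standard F1 score, meaning the harmonic mean of precision (ENTS_P) and recall (ENTS_R). The helper name `f_score` is hypothetical and is not part of Prodigy or spaCy. The step 0 row is left out because its rounded inputs are too small to reproduce the reported value exactly.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (F1), on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (step, ENTS_P, ENTS_R, reported ENTS_F), copied from the training output above
rows = [
    (200, 27.90, 22.68, 25.02),
    (400, 87.65, 84.60, 86.10),
    (600, 86.91, 89.07, 87.98),
]

for step, p, r, reported in rows:
    computed = f_score(p, r)
    print(f"step {step}: computed F = {computed:.2f}, reported ENTS_F = {reported:.2f}")
```

Under this assumption, the computed values match the reported ENTS_F column to two decimal places for the three rows shown.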

Take back control

Prodigy runs entirely under your control, making it suitable for even the strictest privacy requirements. You can download it and run it locally right out of the box, or adapt it to serve your infrastructure needs. The models you produce are yours as well, with absolutely no lock-in.

Real-world case studies

How S&P Global makes markets more transparent with spaCy and Prodigy in a high-security environment

How the Guardian approaches quote extraction from news articles with spaCy and Prodigy

How Nesta processes 7m job ads to shed light on the UK’s labor market with spaCy and Prodigy

How Love Without Sound helps music industry law firms recover millions with spaCy and Prodigy

How Posh deploys a customized Prodigy cloud service to build financial chatbots for banking conversations

Documentation

Downloadable developer tool and library
Create, review and train from your annotations
Runs entirely on your own machines
Powerful built-in workflows

Pricing

Lifetime license, pay once, use forever
Flexible options for individuals and teams
Full privacy, no data leaves your servers
Download and install like any other library