TagWorks is the only language data-labeling tool able to complete highly intricate, highly reliable annotation at massive scale. Create your custom assembly line of language labeling tasks, and easily manage a crowd of Internet annotators with our powerfully simple interfaces.


No other data labeling suite comes close to TagWorks when it comes to complex, large-scale projects. Efficiently annotate, tag and classify tens of thousands or millions of documents with hundreds of labels.

Complete your intricate,
high-scale tagging project in
months not years.

Easily enlist thousands of crowd workers to extract the information you need. Every annotator will be tested and pre-qualified before they work on your project.

TagWorks automates worker, document, and task management, saving you hundreds of hours otherwise spent training team members and directing traffic.

Unique interfaces make it simple for you to review and validate your results. Easily share reliability statistics and data provenance with your peers and reviewers.

TagWorks is completely web based, so no software installation or maintenance is required. Crowd workers and collaborators can join your project with the click of a mouse.

Columbia University’s History Lab is using TagWorks to annotate an archive of over 1 million diplomatic reports.

The University of Texas School of Information is using TagWorks to build an AI capable of cataloguing all the software scientists use to conduct their research. Using MTurkers qualified and trained by TagWorks, the team has significantly reduced its ‘false negative’ rate and is now enjoying the fruits of its highly precise AI text classifier.

The Public Editor project is using TagWorks to transparently assess the credibility of news articles and news organizations. Using TagWorks’ continuous data flow features and APIs, Public Editor is able to feed hundreds of articles per day through a complex annotation assembly line without tedious task & data management.

New York University sociologists are using TagWorks to analyze nearly 10,000 news reports about protest, identifying over 250 variables describing all the contexts and activities that determine police and protester interactions. Their findings will be used to prevent needless violence.

Simon Fraser University researchers are using TagWorks to identify the elements of authenticity in communication by journalists, politicians, and policymakers. This kind of subtle analysis cannot be done by machines, and humans can only perform it at scale thanks to TagWorks.

TagWorks is backed with seed investment by SAGE Publishing –– the world leader in social science methodology.
If you have a large (or even gigantic) set of documents and the expertise you’d like to apply to them is intricate or nuanced, including dozens or more labels, TagWorks is the solution you have been looking for.
If you have fewer than 500 documents to analyze, you only need to apply a handful of simple tags, and you don’t need high-precision labels, there are a number of other tools that can meet your needs.