Text Annotation Service

High-quality text annotation ensuring data confidentiality,
leveraging expertise gained from document production and multilingual support

Over 48 million training data items created,
including for GAFAM

At Human Science, we have provided manual (documentation) production and translation services for over 35 years.
Our diverse core skills, cultivated through extensive experience, are also applied to our annotation business.
We solve customer challenges through various operational methods tailored to each project, including remote work, onsite support, and on-premises deployment at client locations.

What is Text Annotation?

Text annotation refers to the task of assigning meanings or categories to specific words or phrases within a text.
It helps AI and machine learning models better understand text data, which improves the accuracy of natural language processing and supports applications such as automatic translation, sentiment analysis, and the creation of manuals.

Do you have any of these concerns
regarding text annotation?

  • There is a shortage of personnel
    who can handle highly specialized projects
    and projects with high difficulty.

  • The work is person-dependent,
    with many exceptions and edge cases,
    and we feel that variability and decline in quality are issues.

  • We often handle confidential information,
    and have concerns about security and data management.

Human Science Co., Ltd. is...

We solve your problems related to text annotation

Human Science Text Annotation
Features

Feature01

Achieving high-quality annotations even in highly complex and specialized fields
Project managers with extensive industry knowledge study the requirements for the data and train annotators before work begins, enabling high-accuracy annotation. In the medical field in particular, which requires specialized expertise, annotation can also be supervised and checked by doctors.

Feature02

Education and Information Communication for Workers to Achieve High Quality
The quality of annotations depends heavily on the people involved. We therefore staff projects with registered workers who have passed a trial, evaluate workers on each project, and provide feedback. For text annotation with high ambiguity and numerous exceptions, we share sheets with the workers detailing how to handle exceptions and edge cases. Through individual communication, we also pass on tacit knowledge that is difficult to formalize, which improves and stabilizes quality.

Feature03

Comprehensive Security System
Our office has obtained ISMS certification* and is equipped for work in secure environments such as in-office security rooms. We can therefore ensure security even for projects involving highly confidential data such as personal information. For clients who wish to avoid crowdsourcing, we can also provide on-site support at the client's location.
*ISMS certification: An evaluation system established by the Japan Information Processing Development Corporation (JIPDEC) (ISMS Conformity Assessment System)

Use Cases of Text Annotation

Text Classification
We efficiently classify large volumes of document data. An AI model trained on the labeled data can then sort documents into specific themes or categories automatically, significantly improving operational efficiency.
Intent and Named Entity Extraction
We extract and label intentions and named entities in text to improve the accuracy of data analysis and natural language processing.
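
As a rough illustration of what intent and named entity labels can look like as data, the following Python sketch stores one intent per utterance plus character-offset entity spans. The class names, the label names (`book_restaurant`, `LOCATION`, `DATE`), and the sample sentence are all hypothetical and are not taken from any Human Science project or tool.

```python
from dataclasses import dataclass, field


@dataclass
class EntitySpan:
    start: int  # inclusive character offset into the text
    end: int    # exclusive character offset into the text
    label: str  # hypothetical entity label such as "LOCATION"


@dataclass
class AnnotatedUtterance:
    text: str
    intent: str  # hypothetical intent label
    entities: list = field(default_factory=list)

    def add_entity(self, phrase: str, label: str) -> None:
        """Label the first occurrence of `phrase` in the text."""
        start = self.text.index(phrase)  # raises ValueError if absent
        self.entities.append(EntitySpan(start, start + len(phrase), label))

    def validate(self) -> None:
        """Reject spans that fall outside the text or overlap each other."""
        previous_end = 0
        for span in sorted(self.entities, key=lambda s: s.start):
            if not (0 <= span.start < span.end <= len(self.text)):
                raise ValueError(f"span out of range: {span}")
            if span.start < previous_end:
                raise ValueError(f"overlapping span: {span}")
            previous_end = span.end

    def entity_texts(self) -> list:
        """Return (surface text, label) pairs in the order they were added."""
        return [(self.text[s.start:s.end], s.label) for s in self.entities]


example = AnnotatedUtterance(
    text="Book a table for two in Kyoto on Friday",
    intent="book_restaurant",
)
example.add_entity("Kyoto", "LOCATION")
example.add_entity("Friday", "DATE")
example.validate()
print(example.entity_texts())
```

Storing offsets rather than only the matched strings keeps each label tied to an exact position, which matters when the same word appears more than once in a sentence.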

Text Annotation Service Achievements & Examples

CASE 04
Project to Improve OCR Text Recognition Accuracy
Global IT Company

Required Tasks
  • Convert text areas found in images such as maps and restaurant menus into data that AI can understand, to improve the recognition accuracy of OCR.
  • The operator manually selects the text areas and adds the correct information to each one.
Customer's Challenges
  • The customer wanted to maximize working hours within the deadline, but their own resources alone were not enough.
  • Because the task was difficult, many of the people hired quit during training, which made it hard to move the project forward.
Our Solutions
  • We designed and implemented a new recruitment test specific to the project. By forming teams only from candidates who passed, we reduced resignations and improved operational efficiency.
  • We analyzed the traits of the annotators who performed well in training and actively hired people with similar traits.
  • We organized a team of people who could understand the English guidelines and materials as written. By eliminating the step of translating documents, we reduced the cost.
Number of Tasks
22,000 items
Work Period
1,600 hours/month
Main Takeaways
  • Human Science's document creation skills and its pool of staff for multilingual support were put to use in creating the test and organizing the team for this project.
  • As a result, we achieved a high operating efficiency that exceeded the initial expected standards.
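
For readers unfamiliar with this kind of task, the sketch below shows one plausible shape for the data an operator produces when selecting a text area in an image and entering its correct text. It is only an illustration under assumptions: the field names, the pixel-based bounding box, the sample transcriptions, and the checks are hypothetical and do not describe the client's actual format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextRegion:
    x: int              # left edge of the selected area, in pixels
    y: int              # top edge of the selected area, in pixels
    width: int
    height: int
    transcription: str  # the correct text entered by the operator

    def fits_within(self, image_width: int, image_height: int) -> bool:
        """True if the box has positive size and lies inside the image."""
        return (
            self.x >= 0
            and self.y >= 0
            and self.width > 0
            and self.height > 0
            and self.x + self.width <= image_width
            and self.y + self.height <= image_height
        )


def invalid_regions(regions, image_width, image_height):
    """Return regions that leave the image or have an empty transcription."""
    return [
        r for r in regions
        if not r.fits_within(image_width, image_height)
        or not r.transcription.strip()
    ]


# Hypothetical annotations for a 640x480 photo of a menu.
regions = [
    TextRegion(10, 20, 100, 30, "Ramen"),
    TextRegion(600, 20, 100, 30, "Gyoza"),  # extends past the right edge
    TextRegion(10, 80, 100, 30, "   "),     # transcription left blank
]
print(invalid_regions(regions, 640, 480))
```

A simple automated check like this can catch mechanical mistakes before human inspection, leaving reviewers to focus on whether the transcription itself is right.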

CASE 05
AI Automated Contract Content Confirmation Project
Global IT Company

Required Tasks
  • Automate the process of reviewing the contents of contracts by analyzing text.
  • The worker reads the contract documents, extracts and categorizes specific phrases and expressions, and performs labeling. The ability to understand technical terms and define complex labeling is required.
Customer's Challenges
  • Internal resources were insufficient, and the establishment of a system to mass-produce training data was not progressing.
  • The client did not know where to start to execute a PoC (Proof of Concept).
  • They wanted to consult with experienced individuals for the establishment of work rules, standardization of knowledge, and the creation of management mechanisms.
Our Solutions
  • We dispatched one experienced annotator from our company's resources to work at the client's office.
  • We listened to their challenges, and together we created instructions for the work process and decision-making criteria.
  • We identified the management challenges that would come with future expansion of the annotation work and developed mechanisms to keep the work going.
Number of Tasks
About 200 items
Work Period
3 months
Main Takeaways
  • By dispatching an experienced project manager, Human Science can visualize current and future challenges.
  • By being stationed in the customer's office, we can achieve both detailed support and data confidentiality. We contributed to the establishment of a system for expanding the annotation structure.

CASE 07
Conversation Emotion Analysis AI Project
Content Production IT Company

Required Tasks
  • Label conversational text with eight emotional categories.
Customer's Challenges
  • Until then, annotation had been done by a single in-house engineer, so the creation of training data had not progressed. The client was considering outsourcing, but because the annotation work is ambiguous and the correct answer is hard to identify, labels differ widely between individuals. The client was concerned about whether high-quality training data could be created with consistent quality.
  • They had no experience or know-how in creating documented standards to suppress variation in labeling and stabilize quality when working with multiple people or outsourcing.
Our Solutions
  • Before entering into an outsourcing contract with the client, we conducted a trial and had the client evaluate the quality.
  • We created data annotation guidelines at our company.
  • We adopted a triple-pass method. (Three people annotated the same data, and the label was selected and determined by majority opinion.)
Number of Tasks
20,000 items
Work Period
About 2 months
Main Takeaways
  • During the trial, Human Science was able to create an annotation specification that met the client's requirements despite the high level of ambiguity, working through Q&A, communication, and feedback with the client. The specification was also useful for regular additional learning within the client's organization.
  • By making frequent partial deliveries, we were able to respond to the client's feedback and requests in a timely manner, easing any concerns about quality.
  • In addition to the triple-pass method, we conducted PM checks, provided timely feedback to workers, and held regular meetings. The client praised the resulting stability and consistency in quality, achieved while suppressing the variation and bias in worker judgments that are common in annotating ambiguous language.
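
The triple-pass method described in this case (three people annotate the same data and the majority label is chosen) can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual tooling. The emotion label names are hypothetical, since the eight categories are not listed here, and sending items without a majority to a review queue is an assumption about how such cases might be handled.

```python
from collections import Counter
from typing import Optional


def majority_label(votes: list) -> Optional[str]:
    """Return the label chosen by more than half of the annotators, else None."""
    if not votes:
        return None
    label, count = Counter(votes).most_common(1)[0]
    return label if count > len(votes) / 2 else None


def resolve(items: dict):
    """Split items into majority-resolved labels and IDs needing review."""
    resolved = {}
    needs_review = []
    for item_id, votes in items.items():
        label = majority_label(votes)
        if label is None:
            # Assumption: items with no majority are escalated for a manual check.
            needs_review.append(item_id)
        else:
            resolved[item_id] = label
    return resolved, needs_review


# Hypothetical votes from three annotators per utterance.
votes_by_item = {
    "u1": ["joy", "joy", "surprise"],
    "u2": ["anger", "sadness", "fear"],  # all three disagree
    "u3": ["sadness", "sadness", "sadness"],
}
resolved, needs_review = resolve(votes_by_item)
print(resolved)
print(needs_review)
```

With eight categories and three annotators, a three-way split is possible, so a voting scheme needs some rule for items without a majority.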

Text Annotation Service Flow

  • 01

    Consultation

    Please provide materials such as sample data and specifications.
  • 02

    Quotation

    We will provide a quote based on the sample data and specifications you provide.
    *If the details are not decided, we will set the prerequisites and provide a quote and delivery date
  • 03

    Sample Delivery

    We make an initial delivery early in the project, receive your feedback, and realign our understanding of the requirements.
  • 04

    Data annotation

    We report progress regularly, contact you promptly if questions arise, and update the work instructions and notify workers as needed.
  • 05

    Delivery

    We hold review meetings upon request.

Text Annotation Price

Text Annotation
(Price Reference Example)

Text Classification
  • Work Content: Classifying relatively short dialogues according to specified criteria.
  • Workload / Delivery Date: 10,000 sentences / about 3 weeks
  • Working Conditions and Others: Domestic remote work, 100% inspection. Number of classification classes: about 6 types
  • Price: From 330,000 yen

Intent / Named Entity Extraction
  • Work Content: Marking and labeling specified expressions and locations in Word or PDF documents.
  • Workload / Delivery Date: Word/PDF: 2,000 files (approximately 7,000 pages in total)
  • Working Conditions and Others: Domestic remote work, 100% inspection. Number of classes / number of work targets: 4 types / about 35 locations per file
  • Price: From 1,000,000 yen

*The above is a reference example of pricing based on assumed work. Actual fees and delivery times may vary depending on the specific work specifications and conditions. Please contact us first.

For estimates, please make your request through the inquiry form.

Text Annotation Service Frequently Asked Questions (FAQ)

I am worried about whether we can get the right personnel to work on the project.
At Human Science, we review the content of the work before starting a project and assign personnel suited to its characteristics.
We form effective teams based on the aptitude and past experience of our contracted workers, as well as the results of the multifaceted personnel evaluations conducted for each project.
Is it possible for you to use the same annotation tool that we have been using in-house?
Yes. We will accommodate your needs. We can flexibly handle tools that our company has not used before.
We can also support setups that use a VPN. Please feel free to consult with us first.
Is it possible to consult with the project manager and communicate even after the actual work has started?
Yes. We will accommodate your request.
We offer meetings and feedback opportunities as needed. Use of Slack, Teams, and other platforms is also available.
Is it possible to do data annotation for natural language processing in foreign languages?
Yes, it is possible. We draw on the resources and experience gained through our translation and localization business to provide support.
So far, we have experience creating training data for English and Taiwanese. Please contact us for more information.
