Document Parser

Document Parser is a robotic automation solution developed by TMA Innovation to analyze and export data from multiple types of documents. This solution uses both offline service and cloud computing for optimal results.

Designed with advance features, Document Parser can process structured and unstructured documents, printed or handwritten documents by applying Natural Language Processing and Deep Learning. By extracting accurate data through automated filtering process in a timely manner, Document Parser can bring many benefits to enterprises.

Document Parser on ID
Document Parser on CV
Document Parser on hand-filled form

Benefits

  • Time-saving.
  • Easy to be integrated to other systems.
  • High precision. 
  • Increased labour efficiency
  • Automated data filter.
  • Work with every document format.

Application

Operating model

Step

1

Classification

Step

2

Data Extraction

  • Optical character recognition (OCR).
  • Image filter.
    • Detect lines in images.
    • Calculate average of deviations that angles create.
    • Rotate image with calculated angles.
  • Data extraction.

Technology

  • Natural Language Processing (NLP):POS Parser, Linguistic Regular Expression, Pattern Extraction
  • Machine Learning: Neural Network
  • Linguistic Features for Model Training