- Parse, proof-read, validate and create database records from article references.
- Quality check & correct distorted OCR text.
- Substitute UTF-8 character set if improperly handled by the OCR software.
- Standardize text with look-up tables wherever required.