ABBYY Releases Efficient Software Solution to Reduce Translation Management Costs ABBYY announces its new professional linguistic tool for aligning parallel texts and creating Translation ... more >> |
|
“The primary requirement of NHRS for the hydrological testing documents processing is that the accuracy of data entry. Any input errors are intolerable. Scientific data cannot be any risk. ABBYY Recognition Server not only provides customers with the best accuracy of the input, but also greatly enhanced the efficiency of input, saving customers a lot of valuable time.” Alex Chen, Director, DIT |
Background
Nanjing Hydraulic Research Institute (NHRI), set up in 1935, originally known as the Central Hydraulic Engineering Research Institute, is the first of its kind in China. The institute is at present under the joint jurisdiction of the Ministry of Water Resources, the Ministry of Communications, and the State Power, P. R. China.
In 2008, NHRI received a mandate from the Ministry of Water Resources to input data from all the hydrological testing documents of the Chang Jiang River, dated from 1949 to 2007, into Nanjing hydraulic database. This data is used for future access and plays an active role in preventing the flooding of the Chang Jiang River.
The project
The foundation of the Nanjing hydraulic database basically went through two phases: Preparation phase and Implementation phase. The Preparation phase consisted in collection, arrangement and analysis of relevant information. The Implementation phase was composed of using software to enter the information into the Nanjing hydraulic database.
The problem
After receiving the task NHRI started analyzing the data that needed to be input in the database. When the institute finally found out the amount of data to be processed, it was shocked. There were a total of more than 6,000 documents. Each document included several hundred pages collected during almost 60 years and representing all the hydraulic testing data. In accordance with the number of personnel and the speed of manual entry, input all of the data (including its proofreading) would have needed almost 5 years. It far exceeded the 2-year time limit set by the Ministry of Water Resources.
To complete that task in a short period of time with high quality, the project team members began to explore existing OCR technology in Chinese market. They searched for a software solution that would provide high recognition accuracy, reliable digital identification, layout retention, strong technical support services and expansion capacity.
After careful testing some Chinese OCR software, NRHI was not satisfied with the digital identification. All documents had a chaos of crossing table lines and digits, which led to great difficulties for further proofreading and data compiling.
The solution
ABBYY partner in PR China, Digital Information Technology Co. Ltd. (DIT), found out about that situation and contacted NHRI to demonstrate ABBYY Recognition Server 2.0, a robust and powerful server-based OCR solution for automating high volume document conversion processes across enterprises. The proposed solution provides efficient tools for automatic and manual verification. For the latter, it includes client-side Verification Station designed for proofreading and correcting recognized texts. Recognition Server 2.0 can be configured to apply verification to all recognized pages, to selected document types, or to pages with questionable quality only.
NHRI tested the recognition accuracy through different resolutions and scan settings, in order to gain a better understanding of the product. ABBYY Recognition Server 2.0 appeared the only product that satisfied the NHRI team.
To improve the document conversion effectiveness and speed, NRHI decided to procure high-speed scanners and updated computer hardware. Then DIT helped NRHI to install and test the software and delivered its in-depth training.
The results
The superior recognition accuracy and layout retention of ABBYY Recognition Server 2.0 has won the customer acclamation. It has greatly cut down their labor costs and improved efficiency. By using ABBYY Recognition Server 2.0, NHRI is able to reduce the number of operators from 10 to 3, while increasing the throughput from 50 to 250 documents per day. As a result, ABBYY Recognition Server has been recommended to the Chinese Ministry of Water Resources.
About Nanjing Hydraulic Research Institute (NHRI)
The institute is a multipurpose national hydraulic research complex, which, with the basic theoretical research and applied research as its major orientation, undertakes directional, principal and comprehensive researches for water conservancy, water resources management, hydroelectric power, waterway transportation projects, and the application and extension of small hydro power technology. For more information, please visit http://www.nhri.cn/ .
About Digital Information Technology Co. Ltd.
Digital information technology solutions provider, currently allround cooperative partnership with ABBYY in China. Specializes in ABBYY channel distribution and giving end-to-end solutions in various fields like imaging, scanning, project consultancy and doing developments around ABBYY technology. DIT China has experienced technical skills in implementing ABBYY FineReader FlexiCapture and other ABBYY products. Focusing on different areas of challenges and needs concerned by customers. Committed to provide customers with up-to-date and competitive digital information technology solutions and services. Continuing to create the most value for the customer. For more information, please visit http://www.digitalitcd.cn/