OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS

Main Author: MAHMUD DWI SULISTIYO
Format: Masters
Terbitan: Universitas Telkom , 2012
Subjects:
Online Access: https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html
ctrlnum 213100002
fullrecord <?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><title>OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS</title><creator>MAHMUD DWI SULISTIYO</creator><subject>DATA MINING</subject><description>ABSTRAKSI: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.&lt;br&gt;In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.&lt;br&gt;To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.&lt;br&gt;The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Kata Kunci : recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.ABSTRACT: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.&lt;br&gt;In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.&lt;br&gt;To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.&lt;br&gt;The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Keyword: recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.</description><publisher>Universitas Telkom</publisher><date>2012-12-20</date><type>Thesis:Masters</type><identifier>https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html</identifier><language>Indonesia</language><recordID>213100002</recordID></dc>
format Thesis:Masters
Thesis
author MAHMUD DWI SULISTIYO
title OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS
publisher Universitas Telkom
publishDate 2012
topic DATA MINING
url https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html
contents ABSTRAKSI: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Kata Kunci : recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.ABSTRACT: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Keyword: recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.
id IOS2750.213100002
institution Telkom University
institution_id 317
institution_type library:university
library
library Perpustakaan Telkom University
library_id 255
collection Katalog Library & Knowledge Center Telkom University
repository_id 2750
subject_area Ekonomi
Program Komputer dan Teknologi Informasi
Rekayasa
city BANDUNG
province JAWA BARAT
repoId IOS2750
first_indexed 2016-09-24T16:43:54Z
last_indexed 2017-02-25T15:19:11Z
recordtype dc
_version_ 1685824495635398656
score 17.610363