OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS
Main Author: | MAHMUD DWI SULISTIYO |
---|---|
Format: | Masters |
Terbitan: |
Universitas Telkom
, 2012
|
Subjects: | |
Online Access: |
https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html |
ctrlnum |
213100002 |
---|---|
fullrecord |
<?xml version="1.0"?>
<dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><title>OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS</title><creator>MAHMUD DWI SULISTIYO</creator><subject>DATA MINING</subject><description>ABSTRAKSI: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Kata Kunci : recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.ABSTRACT: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Keyword: recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.</description><publisher>Universitas Telkom</publisher><date>2012-12-20</date><type>Thesis:Masters</type><identifier>https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html</identifier><language>Indonesia</language><recordID>213100002</recordID></dc>
|
format |
Thesis:Masters Thesis |
author |
MAHMUD DWI SULISTIYO |
title |
OPTICAL CHARACTER RECOGNITION USING MODIFIED DIRECTION FEATURE AND NESTED MULTI LAYER PERCEPTRONS |
publisher |
Universitas Telkom |
publishDate |
2012 |
topic |
DATA MINING |
url |
https://openlibrary.telkomuniversity.ac.id/pustaka/96353/optical-character-recognition-using-modified-direction-feature-and-nested-multi-layer-perceptrons.html |
contents |
ABSTRAKSI: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Kata Kunci : recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters.ABSTRACT: The studies of Optical Character Recognition (OCR) are being developed since it still needs a performance improvement. The previous study of alphanumeric character recognition had been conducted by Blumenstein and Liu using Modified Direction Feature (MDF) and Multi Layer Perceptrons (MLP) network. The study reaches the accuracy rate of 70.22% for lowercase characters and 80.83% for uppercase characters.<br>In this study the OCR system is proposed to improve the existing performance and have a capability to recognize all case-sensitive alphanumeric characters simultaneously. One of the problems is that there are several characters having similarities in gesture and shape, so that the classifier of the OCR system encounters many ambiguities when classifying some particular characters, especially when recognizing all case-sensitive alphanumeric characters.<br>To overcome those problems, this study proposes a technique of grouping. All character classes are clustered into some groups using Fuzzy C-Means (FCM) clustering method. The OCR system that uses MDF and nested MLP network solves the problems and reach the research objectives. The nested MLP is the novelty method that is implemented in this study. This is a kind of multi-level MLP network that classifies the problem domain hierarchically. The first level classifies the character into the designated group and the second level continues the classification into the recognized character class.<br>The OCR system using the methods in recognizing all case-sensitive alphanumeric characters yields an accuracy rate of 84.38% for the uppercases, 76.43% for the lowercases, and 78.92% for the digits respectively. Any misclassified characters are mostly happened in distinguishing several uppercase and lowercase characters having similarities in gestures and shapes.Keyword: recognition, OCR, MDF, nested MLP, case-sensitive alphanumeric characters. |
id |
IOS2750.213100002 |
institution |
Telkom University |
institution_id |
317 |
institution_type |
library:university library |
library |
Perpustakaan Telkom University |
library_id |
255 |
collection |
Katalog Library & Knowledge Center Telkom University |
repository_id |
2750 |
subject_area |
Ekonomi Program Komputer dan Teknologi Informasi Rekayasa |
city |
BANDUNG |
province |
JAWA BARAT |
repoId |
IOS2750 |
first_indexed |
2016-09-24T16:43:54Z |
last_indexed |
2017-02-25T15:19:11Z |
recordtype |
dc |
_version_ |
1685824495635398656 |
score |
17.610363 |