Thesis (Selection of subject)

Your browser does not support JavaScript, or its support is disabled. Some features may not be available.

Kombinování klasifikátorů pomocí fuzzy metod

Thesis title in Czech:	Kombinování klasifikátorů pomocí fuzzy metod
Thesis title in English:	Combining classifiers by means of fuzzy methods
Academic year of topic announcement:	2005/2006
Thesis type:	diploma thesis
Thesis language:	čeština
Department:	Department of Software Engineering (32-KSI)
Supervisor:	prof. RNDr. Ing. Martin Holeňa, CSc.
Author:	hidden - assigned and confirmed by the Study Dept.
Date of registration:	04.05.2007
Date of assignment:	04.05.2007
Date and time of defence:	18.09.2007 00:00
Date of electronic submission:	18.09.2007
Date of submission of printed version:	04.05.2007
Date of proceeded defence:	18.09.2007
Opponents:	RNDr. Alan Eckhardt, Ph.D.

Guidelines

Diplomant se nejdříve důkladně seznámí s teoretickými základy nejdůležitějších moderních klasifikačních metod i metod fuzzy agregace. Poté se seznámí s existujícími implementacemi některých z těchto metod ve vývojovém prostředí Matlab, a tyto implementace doplní implementací několika dalších metod, které v prostředí Matlab dosud k dispozici nejsou. V rámci vlastní diplomové práce pak bude implementovat některé z fuzzy metod kombinování klasifikátorů a testovat je na řadě mezinárodně používaných testovacích souborů a souborů dat z reálných aplikací.

References

M. Grabisch, H.T. Nguyen, E.A. Walker. Fundamentals of Uncertainty Calculi with Applications to Fuzzy Inference. Kluwer, Boston, 1995.

J.M. Keller, P. Gader, H. Tahani, J.H. Chiang, M. Mohamed. Advances in fuzzy integration for pattern recognition. Fuzzy Sets and Systems, 65: 273-283, 1994.

L.I. Kuncheva, J.C. Bezdek, R.P.W. Duin. Decision templates for multiple classifier fusion: an experimental comparison. Pattern Recognition, 34: 299-314, 2001.

B. Scholkopf, A. Smola. Learning with Kernels, kapitoly 1–7. MIT Press, Cambridge, 2002.

Preliminary scope of work

Klasifikační metody jsou metody třídění empirických dat do konečného počtu tříd pomocí funkcí z předem vymezených funkčních systemů. Výběr klasifikační funkce z daného funkčního systému se provádí na základě omezeného počtu dat se známou příslušností do tříd postupem, který se obecně označuje jako učení a který může mít rozmanitou podobu, od poměrně triviálních jednokrokových výpočtů až po velmi složité iterační algoritmy. Protože zpracování empirických dat bylo původně doménou statistiky, spočívají tradiční klasifikační metody na statistických principech. Nástup počítačů a neustálý růst jejich výkonu však vedl ke vzniku a rychlému rozvoji klasifikačních metod založených na principech strojového učení ? např. klasifikace pomocí umělých neuronových sítí, pomocí klasifikačních stromů, či tzv. SVM klasifikátory (support vector machines), spočívající na jádrových funkcích. Díky tomu jsme při řešení klasifikačních problémů často v situaci, že máme k dispozici několik přibližně stejně přesných klasifikátorů spočívajících na různých principech. V takovém případě buď stojíme před nelehkou volbou, který z nich použít, nebo nejdříve provedeme klasifikaci pomocí všech a potom se snažíme získané výsledky nějak zkombinovat. Druhá z těchto možností může v ideálním případě vést k posílení přednosti kombinovaných klasifikátorů, takže výsledný klasifikátor je přesnější než kterýkoliv z nich. Metod kombinování klasifikátorů existuje několik desítek, od prosté volby té klasifikace, kterou navrhuje nejvíce z nich, až po velmi důmyslné agregační metody založené na teorii míry. Velmi obecný teoretický rámec agregačních metod představuje fuzzy matematika. A právě v tomto rámci by se kombinováním klasifikátorů měla zabývat navrhovaná diplomová práce.

Preliminary scope of work in English

Classification mehtods are methods of sorting empirical data into a finite number of classes by means of funcitons from in advance delimited function systems. Based on a limited amount of data with a known class membership, a classification function is chosen from the given fucntion system using a procedure generally termed learning, which can have various forms, from comparatively simple one-step computations to very complicated iterartive algorithms. Because dealing with empirical data was originally a matter of statistics, traditional classification methods rely on statistical principles. However, the advent of computers and the constant increase of their computing power have lead to the emergence and fast development of classification methods based on machine learning principles, e.g., classification by means of artificial neural networks, by means of classification trees, or the classifiers called support vector machines, which rely on kernel functions. Hence, solving classification tasks frequently leads to the situation that several classifier of approximately equal accuracy are available, which rely on different principles. In such a case, one either has the difficult choice which of them to use, or one first performs the classification with all of them, and then tries to combine the results in some way. The latter possibility can ideally lead to a reinforcement of the advantages of the combined classifiers, so that the resulting classifier is more accurate than any of them. There exist several dozens methods for combining classifiers, from simply choosing the classification that most of them propose, to sophisticate aggregation methods based on measure theory. A very general framework for aggregation methods is fuzzy mathematics. It is within this framework that combining classifiers should be treated by the proposed master thesis.