A data-dependent dissimilarity measure: An effective alternative to distance measures

ARYAL, SUNIL

doi:10.4225/03/5a2f0721de429

5687527_Final_PhD_Thesis_Aryal_23192518.pdf (3.55 MB)

A data-dependent dissimilarity measure: An effective alternative to distance measures

thesis

posted on 2017-12-11, 22:30 authored by SUNIL ARYAL

In data mining, the task-specific performances of conventional distance-based similarity measures vary significantly in different data distributions because they are data-independent and sensitive to units or scales of measurement. This thesis investigates a measure, where the similarity of two instances is determined by the distribution of data. It introduces a new (dis)similarity measure, which is data-dependent and robust to units and scales of measurement. The empirical evaluation conducted across a wide range of datasets shows that the new measure produces better or at least more consistent task-specific performance than widely-used distance-based measures, particularly in high-dimensional datasets.

History

Campus location

Australia

Principal supervisor

Kai Ming Ting

Additional supervisor 1

Gholamreza Haffari

Additional supervisor 2

Takashi Washio

Year of Award

2017

Department, School or Centre

Information Technology (Monash University Clayton)

Course

Doctor of Philosophy

Degree Type

DOCTORATE

Faculty

Faculty of Information Technology

Usage metrics

Keywords

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

A data-dependent dissimilarity measure: An effective alternative to distance measures

History

Campus location

Principal supervisor

Additional supervisor 1

Additional supervisor 2

Year of Award

Department, School or Centre

Course

Degree Type

Faculty

Usage metrics

Categories

Keywords

Licence

Exports