Thursday, 18 September 2008

Data mining intro

Data mining course materials from the Australian National University (ANU),
- same as the links below.
COURSE SLIDES - FROM slideshare.com

by Prof. Lanzi; some slides look identical to the slides provided by ANU

- but they generally look good, COMPREHENSIBLE .....



DATA MINING, UNSUPERVISED RECORD LINKAGE.

Markus Hegland, Australia

CHRISTEN
DATA MINING. CHALLENGES, MODELS, METHODS AND ALGORITHMS
year 2003
intro -- mainly from the standpoint of computational science.. algorithms
--
program - FEBRL, year 2008
Febrl - A Freely Available Record Linkage System with a Graphical User Interface. Peter Christen. Proceedings of the Australasian Workshop on Health Data and Knowledge Management (HDKM), Wollongong, January 2008.

- DISTANCE - EUCLIDEAN, PYTHAGOREAN, ETC. - WOLFRAM MATH



ASSOCIATION RULES

- Support, confidence

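Support tells how often an itemset occurs in the transactions of a dataset (as a count or, more commonly, as a fraction of all transactions), while confidence gives the strength of a rule. For a rule A => B, we can say support is the probability P(A and B) that a transaction contains both A and B, while confidence is the conditional probability P(B | A) = support(A and B) / support(A). Association rules are evaluated on these two characteristics.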
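
A small sketch of how these two measures could be computed, assuming each transaction is just a set of item names. The function names `support` and `confidence` and the tiny shopping-basket data are made up for illustration and do not come from the course materials or from any particular library.

```python
from typing import FrozenSet, Iterable, List


def support(transactions: List[FrozenSet[str]], itemset: Iterable[str]) -> float:
    """Fraction of transactions that contain every item in `itemset`."""
    items = frozenset(itemset)
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if items <= t)  # subset test
    return hits / len(transactions)


def confidence(
    transactions: List[FrozenSet[str]],
    antecedent: Iterable[str],
    consequent: Iterable[str],
) -> float:
    """Estimate P(consequent | antecedent) as support(A and B) / support(A)."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    sup_a = support(transactions, a)
    if sup_a == 0:
        return 0.0  # rule is undefined when A never occurs; 0.0 is a convention here
    return support(transactions, both) / sup_a


# Hypothetical toy data: four shopping baskets.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

sup_bread_milk = support(transactions, {"bread", "milk"})
conf_bread_to_milk = confidence(transactions, {"bread"}, {"milk"})
print(sup_bread_milk, conf_bread_to_milk)
```

With this toy data, bread and milk appear together in 2 of 4 baskets, and bread appears in 3 of 4, so the rule bread => milk has support 2/4 and confidence (2/4) / (3/4) = 2/3.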

