INTRODUCTION:
Data mining is the process of extracting patterns from data. It is an important tool for transforming data into information, and it is commonly used in a wide range of profiling practices, such as marketing, surveillance, fraud detection and scientific discovery.
TASKS INVOLVED IN DATA MINING:
Data mining commonly involves the following tasks:
1. Classification: Classification is the procedure in which individual items are placed into groups based on quantitative information on one or more characteristics inherent in the items. Common algorithms include decision tree learning, nearest neighbor, naive Bayesian classification and neural networks.
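As a rough illustration of the nearest neighbor approach named above, the sketch below labels a new item with the group of its closest training item. The function name `nearest_neighbor_classify`, the Euclidean distance measure and the height/weight data are assumptions made for this example only.

```python
import math

def nearest_neighbor_classify(training, query):
    """Return the label of the training item closest to `query`.

    `training` is a list of (features, label) pairs, where features
    are equal-length tuples of numbers.
    """
    best_label = None
    best_dist = math.inf
    for features, label in training:
        dist = math.dist(features, query)  # Euclidean distance (an assumption)
        if dist < best_dist:
            best_dist = dist
            best_label = label
    return best_label

# Hypothetical training data: (height_cm, weight_kg) -> group
training = [
    ((150.0, 50.0), "small"),
    ((160.0, 55.0), "small"),
    ((180.0, 85.0), "large"),
    ((190.0, 95.0), "large"),
]

predicted = nearest_neighbor_classify(training, (185.0, 90.0))
print(predicted)
```

In practice the characteristics are usually rescaled first, so that one with a large numeric range does not dominate the distance.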
2. Clustering: Clustering is the assignment of a set of observations into subsets (called clusters) so that observations in the same cluster are similar in some sense. Clustering is a common technique for statistical data analysis used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics.
Clustering is of the following types:
(a). Hierarchical clustering.
(b). Partitional clustering.
    (i). k-means clustering.
    (ii). Fuzzy c-means clustering.
(c). Spectral clustering.
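To make k-means clustering from the list above more concrete, here is a minimal sketch that alternates between assigning each observation to its nearest centroid and moving each centroid to the mean of its cluster. The function name `k_means`, the choice of the first k points as starting centroids, the fixed iteration count and the sample points are all assumptions for illustration; real implementations typically use random or smarter initialization and a convergence check.

```python
import math

def k_means(points, k, iterations=10):
    """Partition `points` into k clusters; returns (centroids, assignments)."""
    # Assumption: the first k points serve as the initial centroids.
    centroids = [tuple(p) for p in points[:k]]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return centroids, assignments

# Hypothetical 2-D observations forming two visible groups
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
centroids, assignments = k_means(points, k=2)
print(assignments)
```

Observations that end up with the same number in `assignments` belong to the same cluster.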
3. Regression: In statistics, regression analysis includes techniques for modeling and analyzing several variables when the focus is on the relationship between a dependent variable and one or more independent variables.
Regression analysis helps us to understand how the typical value of the dependent variable changes when any one of the independent variables is varied, while the other independent variables are held constant.
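The simplest case, with a single independent variable, can be sketched as an ordinary least-squares line fit. The function name `fit_line` and the sample data are assumptions for this example; with several independent variables the same idea applies, but the fit is usually done with a linear algebra library.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Co-variation of x and y, and variation of x, around their means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data lying exactly on y = 2x + 1
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

The fitted slope estimates how much the typical value of the dependent variable changes when the independent variable increases by one unit.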
4. Association rule learning: Association rule learning is a popular and well-researched method for discovering interesting relations between variables in large databases.
For example, a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes. This is referred to as market basket analysis.
In the supermarket case, if a customer buys milk and bread, then that customer may also buy butter: {Milk, Bread} => {Butter}.
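How strongly a rule such as {Milk, Bread} => {Butter} holds is commonly judged with two standard measures that the post does not name: support (the share of all baskets containing the items) and confidence (among baskets containing the left-hand side, the share that also contain the right-hand side). The sketch below computes both; the function names and the five baskets are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimated chance of `consequent` given `antecedent` was bought.

    Assumes the antecedent appears in at least one transaction.
    """
    both = support(transactions, set(antecedent) | set(consequent))
    return both / support(transactions, antecedent)

# Hypothetical market baskets
transactions = [
    {"Milk", "Bread", "Butter"},
    {"Milk", "Bread"},
    {"Milk", "Bread", "Butter", "Eggs"},
    {"Bread", "Eggs"},
    {"Milk", "Butter"},
]

rule_support = support(transactions, {"Milk", "Bread", "Butter"})
rule_confidence = confidence(transactions, {"Milk", "Bread"}, {"Butter"})
print(rule_support, rule_confidence)
```

With these baskets, three of five contain milk and bread, and two of those three also contain butter.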
Wednesday, May 5, 2010