
Basic data mining tasks pdf series#
Time series is a sequence of events where the next event is determined by one or more of the preceding events.

Also prediction analysis is used in different areas including medical diagnosis, fraud detection etc. For example, a model can predict the income of an employee based on education, experience and other demographic factors like place of stay, gender etc. Prediction involves developing a model based on the available data and this model is used in predicting future values of a new data set of interest. Prediction task predicts the possible values of missing or future data. Once the class attribute is assigned, demographic and lifestyle information of customers who purchased similar products can be collected and promotion mails can be sent to them directly. Hence, decision forms the class attribute in this case.

Using the available data, it is possible to know which customers purchased similar products and who did not purchase in the past. One of the attributes will be class attribute and the goal of classification task is assigning a class attribute to new set of records as accurately as possible.Ĭlassification can be used in direct marketing, that is to reduce marketing costs by targeting a set of customers who are likely to buy a new product. A collection of records will be available, each record with a set of attributes. a) ClassificationĬlassification derives a model to determine the class of an object based on its attributes. A retailer trying to identify products that are purchased together can be considered as a descriptive data mining task. Descriptive data mining tasks usually finds data describing patterns and comes up with new, significant information from the available data set. A medical practitioner trying to diagnose a disease based on the medical test results of a patient can be considered as a predictive data mining task. Predictive data mining tasks come up with a model from the available data set that is helpful in predicting unknown or future values of another data set of interest. A data mining system can execute one or more of the above specified tasks as part of data mining. All these tasks are either predictive data mining tasks or descriptive data mining tasks.

There are a number of data mining tasks such as classification, prediction, time-series analysis, association, clustering, summarization etc. The descriptive data mining tasks characterize the general properties of data whereas predictive data mining tasks perform inference on the available data set to predict how a new data set will behave. Those two categories are descriptive tasks and predictive tasks. The data mining tasks can be classified generally into two types based on what a specific task tries to achieve.
