Process - Data Mining

Author: Antonin

Definition

Data mining is a subfield discipline of Computer Science.

It is the process of finding patterns in large data sets (big data) using the already available data.

Description

Data mining involves machine learning, statistics & database systems. The ultimate goal of data mining is to extract information from big data and put it into an understandable form for further use.

Data mining also involves:
•Database management
•data pre-processing
•models
•visualization

Data mining is usually carried out by a computer in an automatic or semi-automatic way. The computer analyses large numbers of data points to extract previously unknown patterns. This usually involves a database technique called "spacial indices".



Explanation and application

Data mining can be useful in stores that want to see how their sales of certain items are evolving. It might find that as the population of their customers rises, lollipops become less popular etc.

References and resources