To understand this concept, which draws from computer science and statistics, it is useful to understand the metaphor contained in the name. If the result of the almost absolute tracking of user behavior on the Internet is viewed as a seemingly useless mountain of data, data mining, which translates as data mining, provides the necessary tools to explore this vast amount of data and extract from it. her relevant information. These tools consist of statistical methods that allow identifying patterns of behavior and connections in data that, by themselves, do not mean anything.
Data mining is often related to big data , a concept that refers to databases whose volume no longer allows conventional analysis and, therefore, relies on computational processes. Through the data mining process, however, any amount of data can be explored..
In reality, data exploration is one of the stages of a larger process, the so-called? Knowledge extraction in databases? ( Knowledge Discovery in Databases or KDD ), which covers the following steps:
- Choosing the database to analyze
- Pre-processing that cleans and prepares the database
- Transformation the way the analysis process needs
- Process of analysis through a mathematical process (data mining)
- Results analysis
The information that is extracted by a KDD can be applied to a wide variety of areas, for example, to the strategic planning of an online business and to the making of marketing decisions.