Data Preprocessing in Data Mining – GeeksforGeeks

Preprocessing in Data Mining:Data preprocessing is a data mining technique which is used to transform the raw data in a useful and efficient format.

Attention reader! Dont stop learning now. Get hold of all the important CS Theory concepts for SDE interviews with the CS Theory Course at a student-friendly price and become industry ready.

Steps Involved in Data Preprocessing:

1. Data Cleaning:The data can have many irrelevant and missing parts. To handle this part, data cleaning is done. It involves handling of missing data, noisy data etc.

2. Data Transformation:This step is taken in order to transform the data in appropriate forms suitable for mining process. This involves following ways:

3. Data Reduction:Since data mining is a technique that is used to handle huge amount of data. While working with huge volume of data, analysis became harder in such cases. In order to get rid of this, we uses data reduction technique. It aims to increase the storage efficiency and reduce data storage and analysis costs.

The various steps to data reduction are:

Read the original here:

Data Preprocessing in Data Mining - GeeksforGeeks

Related Posts

Comments are closed.