Automate Data Preprocessing - Data Analytics

Consistency of Column Names:
This post is prepared intend to minimize the data pre processing steps in the Data Analytics
Columns Names may have spaces & may have other special characters, mixed cases, so we convert into title type, remove special characters.


Nulls Handling:
Some times data may be missed while data has been entered or missing from data generator side. We have to take care about these nulls while doing the data analysis. If any column is having more than >5% nulls then we can remove them from data analysis, this 5% is not industry standard but we can take business needs into consideration.









----------
The Following screen shot tells us that the list of the columns after done the processing.



--------































--------------- Text Mining - Data Pre Processing ---------------------



























Previous Post Next Post

Contact Form