IJCATR Volume 11 Issue 6

Data Preparation For Machine Learning Modelling

Ndung’u Rachael Njeri
10.7753/IJCATR1106.1008
Keywords: Data Preparation; Data pre-processing; Machine Learning; Predictive models

The world today is in the Fourth Industrial Revolution (Industry 4.0), which is data-driven. Most organizations and systems now use data to solve problems through digitized systems. Data lets intelligent systems and their applications learn from and adapt to mined insights without being explicitly programmed. Data mining and analysis require smart tools, techniques and methods capable of extracting useful patterns, trends and knowledge, which organizations can use as business intelligence when mapping their strategic plans. Predictive intelligent systems can be very useful in various fields as solutions to many existing problems. Accurate output from such systems can only be assured when the data is well prepared and suits the predictive machine learning function. Machine learning models learn from their input data, so the ‘garbage-in-garbage-out’ principle applies: clean, pre-processed and consistent data produces more accurate output than inconsistent, noisy and erroneous data.
@article{n1162022ijcatr11061008,
Title = "Data Preparation For Machine Learning Modelling",
Journal = "International Journal of Computer Applications Technology and Research (IJCATR)",
Volume = "11",
Number = "6",
Pages = "231--235",
Year = "2022",
Author = "Ndung’u Rachael Njeri"}
  • The paper highlights the need for clean data for machine learning model development.
  • There are different techniques of data preparation.
  • The paper explains what ‘dirty’ data is.
  • The paper explains different types of data bias.