Houston, We have a problem with the Data !!! Data cleaning using R

Speaker:  Ebin Deni Raj – Pala, Kottayam, India
Topic(s):  Information Systems, Search, Information Retrieval, Database Systems, Data Mining, Data Science

Abstract

The real ground data, most of the time will be really messy. Data cleaning is not only an essential component but also it is the one which takes most of the time in any data science project.When you start to do data analysis or modeling, the availability of clean data is of utmost importance. Hence you need to learn the different techniques to clean messy data.
 
With the advent of big data, it is critical to understand that data cleaning is an important part of any data science project. Analysis may not be easy in such a scenario.It’s commonly said that data scientists spend 80% of their time cleaning and manipulating data and only 20% of their time actually analyzing it. For this reason, it is critical to become familiar with the data cleaning process and all of the tools available to you along the way. This lecture/workshop will talk about cleaning data in a nutshell.

About this Lecture

Number of Slides:  35
Duration:  60 minutes
Languages Available:  English
Last Updated: 

Request this Lecture

To request this particular lecture, please complete this online form.

Request a Tour

To request a tour with this speaker, please complete this online form.

All requests will be sent to ACM headquarters for review.