{"id":17788,"date":"2020-07-30T16:42:35","date_gmt":"2020-07-30T11:12:35","guid":{"rendered":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/"},"modified":"2024-10-15T00:26:48","modified_gmt":"2024-10-14T18:56:48","slug":"data-cleaning-in-python","status":"publish","type":"post","link":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/","title":{"rendered":"Data Cleaning in Python | What is Data Cleaning?"},"content":{"rendered":"\n<ol class=\"wp-block-list\"><li><strong><a href=\"#datacleaning\">What is Data Cleaning in Python?<\/a><\/strong><\/li><li><strong><a href=\"#datacleaninginpython\">How to perform Data Cleaning in Python?<\/a><\/strong><\/li><li><strong><a href=\"#repeated\">Remove Repeated Values <\/a><\/strong><\/li><li><strong><a href=\"#missing\">Missing Value Treatment<\/a><\/strong><\/li><li><strong><a href=\"#irrelevant\">Removal of irrelevant data<\/a><\/strong><\/li><li><strong><a href=\"#error\">Manual Error While Typing <\/a><\/strong><\/li><li><strong><a href=\"#rename\">Renaming Columns <\/a><\/strong><\/li><\/ol>\n\n\n\n<p><em>Contributed by: Praneeta <br>LinkedIn Profile: <a rel=\"noreferrer noopener nofollow\" aria-label=\"linkedin.com\/in\/praneeta-Kalaskar-903073a1  (opens in a new tab)\" href=\"http:\/\/linkedin.com\/in\/praneeta-Kalaskar-903073a1\" target=\"_blank\">linkedin.com\/in\/praneeta-Kalaskar-903073a1 <\/a> <\/em><\/p>\n\n\n\n<p>In the real world, most of the data we come across for analysis is raw data. This raw data is a mix of repeated rows, missing values, and many irrelevant fields. If it is passed to a model as-is, the result is inaccurate or incorrect predictions, which is why Data Cleaning is important. Data Cleaning, also known as Data Cleansing, is an important step in model building that comes after you collect data. It can be done manually in Excel or by running a program. 
In this article, we will discuss what data cleaning entails and how you can clean noise (dirt) from data step by step using <a rel=\"noreferrer noopener\" aria-label=\"Python (opens in a new tab)\" href=\"https:\/\/www.mygreatlearning.com\/blog\/python-tutorial-for-beginners-a-complete-guide\/\" target=\"_blank\">Python<\/a>. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"what-is-data-cleaning\"><strong>What is Data Cleaning?<\/strong><\/h2>\n\n\n\n<p>According to Wikipedia, data cleaning is the process of detecting and correcting corrupt or inaccurate records from a record set, table, or database. It refers to identifying incomplete, incorrect, inaccurate, or irrelevant parts of the data, and then replacing, modifying, or deleting the dirty or coarse data. <\/p>\n\n\n\n<p>This definition is long and not easy to absorb, so consider an example. The owner of a dairy products factory wants to identify frequent buyers of milk bottles in order to grow the customer base. If the data is corrupted or noisy, any decision based on it will be misguided. The data below shows an example.<\/p>\n\n\n\n<p>From the figure, we can infer that Data Cleaning is a technique that helps convert improper data into meaningful data. In short, Machine Learning is data-driven, so a model trained on cleaned data will generally perform better. It is therefore important to process data before use; without quality data, you cannot expect a correct output. <\/p>\n\n\n\n<p>Also Read: <a href=\"https:\/\/www.mygreatlearning.com\/blog\/top-data-mining-applications-industries\/\" target=\"_blank\" rel=\"noreferrer noopener\" aria-label=\"Top Data Mining Applications in Industries  (opens in a new tab)\">Top Data Mining Applications in Industries <\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"how-to-perform-data-cleaning-in-python\"><strong>How to perform Data Cleaning in Python? 
<\/strong><\/h2>\n\n\n\n<p>To illustrate, let\u2019s take the example of a survey in which a company's HR staff has to locate all of its employees in an area and make sure they are safe at home. Before that, note that there is no standard set of Data Cleaning techniques, and it is not possible to say which one is best. The choice of cleaning method depends on the nature of the data.<\/p>\n\n\n\n<p>Let\u2019s get back to the example. To keep it simple, we are looking at the fields below. <\/p>\n\n\n\n<p>Look at the table carefully. You'll notice that certain fields are either blank or have irrelevant values. If we process such data, our predictions will be unreliable. Thus, we will carry out the steps below for Data Cleaning.&nbsp;<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Remove repeated rows&nbsp;<\/li><li>Missing value treatment&nbsp;<\/li><li>Removal of irrelevant data&nbsp;<\/li><li>Manual error while typing&nbsp;<\/li><li>Renaming columns&nbsp;<\/li><\/ol>\n\n\n\n<p>To process the data, our first step is to read it into Python.&nbsp;<\/p>\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939.jpg\"><img decoding=\"async\" width=\"788\" height=\"595\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939.jpg\" alt=\"data cleaning in python \" class=\"wp-image-17795\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939.jpg 788w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939-300x227.jpg 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939-768x580.jpg 768w, 
https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939-696x526.jpg 696w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939-556x420.jpg 556w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/Annotation-2020-07-21-154939-80x60.jpg 80w\" sizes=\"(max-width: 788px) 100vw, 788px\" \/><\/figure><\/div>\n\n\n\n<p>Observe the output table carefully; it is the same table we started with.<\/p>\n\n\n\n<p>The two important packages we imported are <a href=\"https:\/\/www.mygreatlearning.com\/blog\/python-pandas-tutorial\/\" target=\"_blank\" rel=\"noreferrer noopener\" aria-label=\"Panda (opens in a new tab)\">pandas<\/a> and <a href=\"https:\/\/www.mygreatlearning.com\/blog\/python-numpy-tutorial\/\" target=\"_blank\" rel=\"noreferrer noopener\" aria-label=\"Numpy (opens in a new tab)\">NumPy<\/a>. The code in this article depends on them. The next thing to note is the short aliases we adopted for them, which is good practice. The&nbsp;<em>employee<\/em>&nbsp;variable stores the data read from the&nbsp;<em>DataCleaning.csv<\/em>&nbsp;file saved at the mentioned location. The data is read with&nbsp;<em>read_csv<\/em>&nbsp;and displayed on the screen using print. For fields with missing values, pandas has filled in NaN (Not a Number).&nbsp; <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"remove-repeated-values\"><strong>Remove Repeated Values<\/strong><\/h2>\n\n\n\n<p>We know that there are duplicates in the dataset and that they need to be removed. 
Rows 5 and 7 have the same employee data.<\/p>\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5.png\"><img decoding=\"async\" width=\"837\" height=\"123\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5.png\" alt=\"\" class=\"wp-image-17798\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5.png 837w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5-300x44.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5-768x113.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-5-696x102.png 696w\" sizes=\"(max-width: 837px) 100vw, 837px\" \/><\/figure><\/div>\n\n\n\n<p>We can delete the later row and keep the first one. <\/p>\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6.png\"><img decoding=\"async\" width=\"812\" height=\"158\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6.png\" alt=\"\" class=\"wp-image-17799\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6.png 812w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6-300x58.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6-768x149.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-6-696x135.png 696w\" sizes=\"(max-width: 812px) 100vw, 812px\" \/><\/figure><\/div>\n\n\n\n<p>The function&nbsp;<em>drop_duplicates<\/em>&nbsp;removes repeated rows. 
Below are the parameters used in the command.&nbsp;<\/p>\n\n\n\n<p><em>subset:<\/em>&nbsp;The column name (or names) used to check for repeated values. By default, all columns are compared.&nbsp;<\/p>\n\n\n\n<p><em>keep:<\/em>&nbsp;Determines which occurrence to retain; here we keep the first one (row 5).&nbsp;<\/p>\n\n\n\n<p><em>inplace:<\/em>&nbsp;Boolean, default False. Setting it to True modifies the existing DataFrame directly instead of returning a new one.&nbsp;<\/p>\n\n\n\n<p>Data with no repeated values.<\/p>\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7.png\"><img decoding=\"async\" width=\"878\" height=\"394\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7.png\" alt=\"data cleaning in python \" class=\"wp-image-17801\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7.png 878w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7-300x135.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7-768x345.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-7-696x312.png 696w\" sizes=\"(max-width: 878px) 100vw, 878px\" \/><\/figure><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"missing-value-treatment\"><strong>Missing Value Treatment <\/strong><\/h2>\n\n\n\n<p>In pandas, missing data is represented in two ways: None or NaN. In our dataset, missing values are recognized as NaN. 
To check for missing values, we have used the function<em>&nbsp;isnull()<\/em>.<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-9.png\"><img decoding=\"async\" width=\"454\" height=\"135\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-9.png\" alt=\"\" class=\"wp-image-17804\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-9.png 454w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-9-300x89.png 300w\" sizes=\"(max-width: 454px) 100vw, 454px\" \/><\/figure>\n\n\n\n<p>The output of the command is a Boolean table that is True wherever a value is missing. <\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-10.png\"><img decoding=\"async\" width=\"733\" height=\"252\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-10.png\" alt=\"data cleaning in python \" class=\"wp-image-17805\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-10.png 733w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-10-300x103.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-10-696x239.png 696w\" sizes=\"(max-width: 733px) 100vw, 733px\" \/><\/figure>\n\n\n\n<p>As shown in the output image, the columns Mobile no., Project, and email have missing values. Missing values can either be filled or dropped. <\/p>\n\n\n\n<p>In our example, we are working with employee data, so filling the gaps with arbitrary values would be inappropriate. Hence, we have dropped the missing values. 
<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-11.png\"><img decoding=\"async\" width=\"523\" height=\"152\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-11.png\" alt=\"\" class=\"wp-image-17807\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-11.png 523w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-11-300x87.png 300w\" sizes=\"(max-width: 523px) 100vw, 523px\" \/><\/figure>\n\n\n\n<p>The command <em>dropna<\/em> drops rows (or columns) that contain at least one null value from the DataFrame; the CSV file itself is not changed. <\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12.png\"><img decoding=\"async\" width=\"846\" height=\"341\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12.png\" alt=\"data cleaning in python \" class=\"wp-image-17808\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12.png 846w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12-300x121.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12-768x310.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-12-696x281.png 696w\" sizes=\"(max-width: 846px) 100vw, 846px\" \/><\/figure>\n\n\n\n<p>As shown in the output table, rows with missing values are removed from the dataset. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"removal-of-irrelevant-data\"><strong>Removal of irrelevant data<\/strong><\/h2>\n\n\n\n<p>Sometimes, certain categories\/columns in a dataset are not useful. In our case, the column&nbsp;<em>serial no<\/em>&nbsp;is not important. 
Retaining it takes up unnecessary space and processing time.&nbsp;<\/p>\n\n\n\n<p>Python's&nbsp;<em>del<\/em>&nbsp;statement works on a pandas DataFrame and offers an easy way to remove the unwanted column.<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13.png\"><img decoding=\"async\" width=\"538\" height=\"103\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13.png\" alt=\"\" class=\"wp-image-17810\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13.png 538w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13-300x57.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13-533x103.png 533w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-13-534x103.png 534w\" sizes=\"(max-width: 538px) 100vw, 538px\" \/><\/figure>\n\n\n\n<p>If we inspect the data, we will see that the column has been removed successfully. 
<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-14.png\"><img decoding=\"async\" width=\"766\" height=\"368\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-14.png\" alt=\"data cleaning in python \" class=\"wp-image-17812\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-14.png 766w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-14-300x144.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-14-696x334.png 696w\" sizes=\"(max-width: 766px) 100vw, 766px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"manual-error-while-typing\"><strong>Manual Error While Typing <\/strong><\/h2>\n\n\n\n<p>In this step, we need to make sure that categorical columns contain only valid values. In our case, the column&nbsp;<em>Project<\/em>&nbsp;has two possible values:&nbsp;<em>Client<\/em>&nbsp;or&nbsp;<em>Internal<\/em>. However, a close look at the output table shows that the value in row 8 does not match either of them. 
(i.e.,&nbsp;<em>internal<\/em>&nbsp;should be corrected to <em>Internal<\/em>).&nbsp;<\/p>\n\n\n\n<p>In Python, we can replace the incorrect value in the column with the corrected one.&nbsp;&nbsp;<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15.png\"><img decoding=\"async\" width=\"822\" height=\"397\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15.png\" alt=\"\" class=\"wp-image-17814\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15.png 822w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15-300x145.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15-768x371.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-15-696x336.png 696w\" sizes=\"(max-width: 822px) 100vw, 822px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"renaming-columns\"><strong>Renaming Columns <\/strong><\/h2>\n\n\n\n<p>The dataset we are working on has column names that do not describe the data they hold. Thus, we will add some sensible labels by renaming the columns. 
<\/p>\n\n\n\n<p>The output shows the changed column names.<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-16.png\"><img decoding=\"async\" width=\"740\" height=\"414\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-16.png\" alt=\"data cleaning in python \" class=\"wp-image-17816\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-16.png 740w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-16-300x168.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/image-16-696x389.png 696w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"recap-to-data-cleaning\"><strong>Recap of Data Cleaning <\/strong><\/h2>\n\n\n\n<p>In this article, we cleaned a raw dataset in several steps. We removed the duplicate rows, treated missing values, deleted an irrelevant column, and corrected a typing mistake. 
Lastly, the article also talks about renaming columns.<\/p>\n\n\n\n<p>If you found this guide on Data Cleaning in Python helpful and wish to learn more such concepts, join <a href=\"https:\/\/www.mygreatlearning.com\/academy\" target=\"_blank\" rel=\"noreferrer noopener\" aria-label=\"Great Learning Academy (opens in a new tab)\">Great Learning Academy<\/a>'s free online courses today.<\/p>\n\n\n<figure class=\"wp-block-image size-large zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2.png\"><a href=\"https:\/\/www.mygreatlearning.com\/academy\/learn-for-free\/courses\/python-for-machine-learning\" target=\"_blank\" rel=\"noreferrer noopener\"><img decoding=\"async\" width=\"1000\" height=\"242\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2.png\" alt=\"\" class=\"wp-image-17518\" srcset=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2.png 1000w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2-300x73.png 300w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2-768x186.png 768w, https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/June-29-banner-for-GL-python-fot-ml-1-2-696x168.png 696w\" sizes=\"(max-width: 1000px) 100vw, 1000px\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>What is Data Cleaning in Python? How to perform Data Cleaning in Python? Remove Repeated Values Missing Value Treatment Removal of irrelevant data Manual Error While Typing Renaming Columns Contributed by: Praneeta LinkedIn Profile: linkedin.com\/in\/praneeta-Kalaskar-903073a1 When we talk about the real world, most of the data we come across for analysis is raw data. 
This [&hellip;]<\/p>\n","protected":false},"author":41,"featured_media":17831,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[9],"tags":[36804,36796],"content_type":[],"class_list":["post-17788","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-science","tag-data-analytics","tag-python"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Data Cleaning in Python | What is Data Cleaning? - Great Learning<\/title>\n<meta name=\"description\" content=\"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. 
Learn more about how it is used in ML.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Cleaning in Python | What is Data Cleaning?\" \/>\n<meta property=\"og:description\" content=\"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. Learn more about how it is used in ML.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Great Learning Blog: Free Resources what Matters to shape your Career!\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/GreatLearningOfficial\/\" \/>\n<meta property=\"article:published_time\" content=\"2020-07-30T11:12:35+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-10-14T18:56:48+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1090\" \/>\n\t<meta property=\"og:image:height\" content=\"770\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Great Learning Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@https:\/\/twitter.com\/Great_Learning\" \/>\n<meta name=\"twitter:site\" content=\"@Great_Learning\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Great Learning Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/\"},\"author\":{\"name\":\"Great Learning Editorial Team\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\"},\"headline\":\"Data Cleaning in Python | What is Data Cleaning?\",\"datePublished\":\"2020-07-30T11:12:35+00:00\",\"dateModified\":\"2024-10-14T18:56:48+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/\"},\"wordCount\":1126,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/07\\\/edited.jpg\",\"keywords\":[\"Data Analytics\",\"python\"],\"articleSection\":[\"Data Science and Analytics\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/\",\"name\":\"Data Cleaning in Python | What is Data Cleaning? 
- Great Learning\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/07\\\/edited.jpg\",\"datePublished\":\"2020-07-30T11:12:35+00:00\",\"dateModified\":\"2024-10-14T18:56:48+00:00\",\"description\":\"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. Learn more about how it is used in ML.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/07\\\/edited.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/07\\\/edited.jpg\",\"width\":1090,\"height\":770,\"caption\":\"data cleaning in python\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-cleaning-in-python\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Science and Analytics\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-science\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Data Cleaning in Python | What is Data 
Cleaning?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"name\":\"Great Learning Blog\",\"description\":\"Learn, Upskill &amp; Career Development Guide and Resources\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"alternateName\":\"Great Learning\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\",\"name\":\"Great Learning\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"width\":900,\"height\":900,\"caption\":\"Great Learning\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/GreatLearningOfficial\\\/\",\"https:\\\/\\\/x.com\\\/Great_Learning\",\"https:\\\/\\\/www.instagram.com\\\/greatlearningofficial\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/in.pinterest.com\\\/greatlearning12\\\/\",\"https:\\\/\\\/www.youtube.com\\\/user\\\/beaconelearning\\\/\"],\"description\":\"Great Learning is a leading global ed-tech company for professional training and higher education. 
It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.\",\"email\":\"info@mygreatlearning.com\",\"legalName\":\"Great Learning Education Services Pvt. Ltd\",\"foundingDate\":\"2013-11-29\",\"numberOfEmployees\":{\"@type\":\"QuantitativeValue\",\"minValue\":\"1001\",\"maxValue\":\"5000\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\",\"name\":\"Great Learning Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"caption\":\"Great Learning Editorial Team\"},\"description\":\"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. 
Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.\",\"sameAs\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/\",\"https:\\\/\\\/in.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/x.com\\\/https:\\\/\\\/twitter.com\\\/Great_Learning\",\"https:\\\/\\\/www.youtube.com\\\/channel\\\/UCObs0kLIrDjX2LLSybqNaEA\"],\"award\":[\"Best EdTech Company of the Year 2024\",\"Education Economictimes Outstanding Education\\\/Edtech Solution Provider of the Year 2024\",\"Leading E-learning Platform 2024\"],\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/author\\\/greatlearning\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Data Cleaning in Python | What is Data Cleaning? - Great Learning","description":"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. Learn more about how it is used in ML.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/","og_locale":"en_US","og_type":"article","og_title":"Data Cleaning in Python | What is Data Cleaning?","og_description":"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. 
Learn more about how it is used in ML.","og_url":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/","og_site_name":"Great Learning Blog: Free Resources what Matters to shape your Career!","article_publisher":"https:\/\/www.facebook.com\/GreatLearningOfficial\/","article_published_time":"2020-07-30T11:12:35+00:00","article_modified_time":"2024-10-14T18:56:48+00:00","og_image":[{"width":1090,"height":770,"url":"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg","type":"image\/jpeg"}],"author":"Great Learning Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@https:\/\/twitter.com\/Great_Learning","twitter_site":"@Great_Learning","twitter_misc":{"Written by":"Great Learning Editorial Team","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#article","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/"},"author":{"name":"Great Learning Editorial Team","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad"},"headline":"Data Cleaning in Python | What is Data Cleaning?","datePublished":"2020-07-30T11:12:35+00:00","dateModified":"2024-10-14T18:56:48+00:00","mainEntityOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/"},"wordCount":1126,"commentCount":0,"publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg","keywords":["Data Analytics","python"],"articleSection":["Data Science and 
Analytics"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/","url":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/","name":"Data Cleaning in Python | What is Data Cleaning? - Great Learning","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#primaryimage"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg","datePublished":"2020-07-30T11:12:35+00:00","dateModified":"2024-10-14T18:56:48+00:00","description":"Data Cleaning in Python, also known as Data Cleansing is an important technique in model building. 
Learn more about how it is used in ML.","breadcrumb":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#primaryimage","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg","width":1090,"height":770,"caption":"data cleaning in python"},{"@type":"BreadcrumbList","@id":"https:\/\/www.mygreatlearning.com\/blog\/data-cleaning-in-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog","item":"https:\/\/www.mygreatlearning.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Data Science and Analytics","item":"https:\/\/www.mygreatlearning.com\/blog\/data-science\/"},{"@type":"ListItem","position":3,"name":"Data Cleaning in Python | What is Data Cleaning?"}]},{"@type":"WebSite","@id":"https:\/\/www.mygreatlearning.com\/blog\/#website","url":"https:\/\/www.mygreatlearning.com\/blog\/","name":"Great Learning Blog","description":"Learn, Upskill &amp; Career Development Guide and Resources","publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"alternateName":"Great Learning","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.mygreatlearning.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization","name":"Great 
Learning","url":"https:\/\/www.mygreatlearning.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","width":900,"height":900,"caption":"Great Learning"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/GreatLearningOfficial\/","https:\/\/x.com\/Great_Learning","https:\/\/www.instagram.com\/greatlearningofficial\/","https:\/\/www.linkedin.com\/school\/great-learning\/","https:\/\/in.pinterest.com\/greatlearning12\/","https:\/\/www.youtube.com\/user\/beaconelearning\/"],"description":"Great Learning is a leading global ed-tech company for professional training and higher education. It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.","email":"info@mygreatlearning.com","legalName":"Great Learning Education Services Pvt. 
Ltd","foundingDate":"2013-11-29","numberOfEmployees":{"@type":"QuantitativeValue","minValue":"1001","maxValue":"5000"}},{"@type":"Person","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad","name":"Great Learning Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","caption":"Great Learning Editorial Team"},"description":"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.","sameAs":["https:\/\/www.mygreatlearning.com\/","https:\/\/in.linkedin.com\/school\/great-learning\/","https:\/\/x.com\/https:\/\/twitter.com\/Great_Learning","https:\/\/www.youtube.com\/channel\/UCObs0kLIrDjX2LLSybqNaEA"],"award":["Best EdTech Company of the Year 2024","Education Economictimes Outstanding Education\/Edtech Solution Provider of the Year 2024","Leading E-learning Platform 
2024"],"url":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"}]}},"uagb_featured_image_src":{"full":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",1090,770,false],"thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited-150x150.jpg",150,150,true],"medium":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited-300x212.jpg",300,212,true],"medium_large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited-768x543.jpg",768,543,true],"large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited-1024x723.jpg",1024,723,true],"1536x1536":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",1090,770,false],"2048x2048":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",1090,770,false],"web-stories-poster-portrait":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",640,452,false],"web-stories-publisher-logo":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",96,68,false],"web-stories-thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/07\/edited.jpg",150,106,false]},"uagb_author_info":{"display_name":"Great Learning Editorial Team","author_link":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"},"uagb_comment_info":0,"uagb_excerpt":"What is Data Cleaning in Python? How to perform Data Cleaning in Python? Remove Repeated Values Missing Value Treatment Removal of irrelevant data Manual Error While Typing Renaming Columns Contributed by: Praneeta LinkedIn Profile: linkedin.com\/in\/praneeta-Kalaskar-903073a1 When we talk about the real world, most of the data we come across for analysis is raw data. 
This&hellip;","_links":{"self":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/17788","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/comments?post=17788"}],"version-history":[{"count":27,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/17788\/revisions"}],"predecessor-version":[{"id":96682,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/17788\/revisions\/96682"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media\/17831"}],"wp:attachment":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media?parent=17788"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/categories?post=17788"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/tags?post=17788"},{"taxonomy":"content_type","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/content_type?post=17788"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}