{"id":14062,"date":"2020-04-27T11:16:04","date_gmt":"2020-04-27T05:46:04","guid":{"rendered":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/"},"modified":"2024-10-15T01:10:15","modified_gmt":"2024-10-14T19:40:15","slug":"understanding-eda-in-python","status":"publish","type":"post","link":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/","title":{"rendered":"Understanding the Exploratory Data Analysis (EDA) in Python"},"content":{"rendered":"\n<p><strong>Contributed by: <a href=\"https:\/\/www.linkedin.com\/in\/manorama-3110\/\" target=\"_blank\" rel=\"noreferrer noopener\">Manorama Yadav <\/a><\/strong><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"introduction-to-eda-in-python\"><strong>Introduction to EDA in Python<\/strong><\/h2>\n\n\n\n<p>Exploratory data analysis is the analysis of the data and brings out insights. It's storytelling, a story that data is trying to tell. EDA is an approach to analyzing the data with the help of various tools and graphical techniques like barplot, histogram, etc.<\/p>\n\n\n\n<p>According to <strong>Tukey<\/strong> (data analysis in 1961)<\/p>\n\n\n\n<p><em>\"Procedures for analyzing data, techniques for interpreting the results of such procedures, ways of planning the gathering of data to make its analysis easier, more precise or more accurate, and all the machinery and results of (mathematical) statistics which apply to analyzing data.\"<\/em><\/p>\n\n\n\n    <div class=\"courses-cta-container\">\n        <div class=\"courses-cta-card\">\n            <div class=\"courses-cta-header\">\n                <div class=\"courses-learn-icon\"><\/div>\n                <span class=\"courses-learn-text\">Academy Pro<\/span>\n            <\/div>\n            <p class=\"courses-cta-title\">\n                <a href=\"https:\/\/www.mygreatlearning.com\/academy\/premium\/master-python-programming\" class=\"courses-cta-title-link\">Python Programming Course<\/a>\n            <\/p>\n            <p class=\"courses-cta-description\">In this course, you will learn the fundamentals of Python: from basic syntax to mastering data structures, loops, and functions. You will also explore OOP concepts and objects to build robust programs.<\/p>\n            <div class=\"courses-cta-stats\">\n                <div class=\"courses-stat-item\">\n                    <div class=\"courses-stat-icon courses-user-icon\"><\/div>\n                    <span>11.5 Hrs<\/span>\n                <\/div>\n                <div class=\"courses-stat-item\">\n                    <div class=\"courses-stat-icon courses-star-icon\"><\/div>\n                    <span>51 Coding Exercises<\/span>\n                <\/div>\n            <\/div>\n            <a href=\"https:\/\/www.mygreatlearning.com\/academy\/premium\/master-python-programming\" class=\"courses-cta-button\">\n                Start Free Trial\n                <div class=\"courses-arrow-icon\"><\/div>\n            <\/a>\n        <\/div>\n    <\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"eda-in-python\"><strong>EDA in Python<\/strong><\/h3>\n\n\n\n<p>There are many <a aria-label=\"libraries available in python (opens in a new tab)\" href=\"https:\/\/www.mygreatlearning.com\/blog\/open-source-python-libraries\/\" target=\"_blank\" rel=\"noreferrer noopener\">libraries available in python<\/a>, like pandas, NumPy, matplotlib, seaborn, etc. with that help, we can analyze the data and bring out helpful insights. I will be using Jupyter Notebook along with these libraries.<\/p>\n\n\n\n<p>Check out <a href=\"https:\/\/www.mygreatlearning.com\/academy\/learn-for-free\/courses\/python-for-data-analysis\" target=\"_blank\" rel=\"noreferrer noopener\">python for a data analysis course<\/a> for free. You can also learn <a href=\"https:\/\/www.mygreatlearning.com\/academy\/learn-for-free\/courses\/basics-of-eda-with-python\" target=\"_blank\" rel=\"noreferrer noopener\">EDA in machine learning<\/a> with us to solidify your learning.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"dataset-introduction\"><strong>Dataset Introduction<\/strong><\/h3>\n\n\n\n<p>We are using the 'Cars' dataset, which has different features of cars like a model, year, engine, and other properties along with their price. It has 28 years of data from 1990 to 2017, and you can <a aria-label=\"download the dataset here (opens in a new tab)\" href=\"https:\/\/www.kaggle.com\/CooperUnion\/cardataset\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">download the dataset here<\/a>.<\/p>\n\n\n\n<p><strong>Data Description:<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td><strong>S.no<\/strong><\/td><td><strong>Variable<\/strong><\/td><td><strong>Description<\/strong><\/td><td><strong>Data Type<\/strong><\/td><\/tr><tr><td><strong>1<\/strong><\/td><td>Make<\/td><td>Car Make&nbsp;<\/td><td>String<\/td><\/tr><tr><td><strong>2<\/strong><\/td><td>Model&nbsp;<\/td><td>Car Model&nbsp;<\/td><td>String<\/td><\/tr><tr><td><strong>3<\/strong><\/td><td>Year&nbsp;<\/td><td>Car Year&nbsp;<\/td><td>Integer<\/td><\/tr><tr><td><strong>4<\/strong><\/td><td>Engine Fuel Type<\/td><td>Fuel Type<\/td><td>String<\/td><\/tr><tr><td><strong>5<\/strong><\/td><td>Engine HP&nbsp;<\/td><td>Horse Power(HP)<\/td><td>Integer<\/td><\/tr><tr><td><strong>6<\/strong><\/td><td>Engine Cylinders&nbsp;<\/td><td>No. of Cylinders<\/td><td>Integer<\/td><\/tr><tr><td><strong>7<\/strong><\/td><td>Transmission Type&nbsp;<\/td><td>Transmission Type<\/td><td>String<\/td><\/tr><tr><td><strong>8<\/strong><\/td><td>Driven_Wheels<\/td><td>Wheels type<\/td><td>String<\/td><\/tr><tr><td><strong>9<\/strong><\/td><td>Number of Doors&nbsp;<\/td><td>No. of Doors<\/td><td>Integer<\/td><\/tr><tr><td><strong>10<\/strong><\/td><td>Market Category&nbsp;<\/td><td>Market Category<\/td><td>String<\/td><\/tr><tr><td><strong>11<\/strong><\/td><td>Vehicle Size<\/td><td>Size of Vehicle<\/td><td>String<\/td><\/tr><tr><td><strong>12<\/strong><\/td><td>Vehicle Style<\/td><td>Type of Vehicle<\/td><td>String<\/td><\/tr><tr><td><strong>13<\/strong><\/td><td>Highway MPG<\/td><td>Highway MPG<\/td><td>Integer<\/td><\/tr><tr><td><strong>14<\/strong><\/td><td>city mpg<\/td><td>miles per gallon<\/td><td>Integer<\/td><\/tr><tr><td><strong>15<\/strong><\/td><td>Popularity<\/td><td>Popularity of the car<\/td><td>Integer<\/td><\/tr><tr><td><strong>16<\/strong><\/td><td>MSRP&nbsp;<\/td><td>Price of the car in ($)<\/td><td>Integer<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p><strong>The objective of this article is to explore the data and make it ready for modeling.<\/strong><\/p>\n\n\n\n<p>Let's get started!!!<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"exploratory-data-analysis-in-python\"><strong>Exploratory Data Analysis in Python<\/strong><\/h2>\n\n\n\n<p>First, we will import all the libraries required for EDA (Exploratory Data Analysis). This is the first and most important thing to do. Without importing libraries, we will not be able to perform anything.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"import-libraries\"><strong>Import Libraries<\/strong><\/h3>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"data-loading\"><strong>Data loading<\/strong><\/h3>\n\n\n\n<p>After importing the libraries, the next step is loading data into the dataframe. We will use the pandas' library to load the data into the dataframe. It supports various file formats like Comma Separated Values (.csv), excel (.xlsx, .xls) etc.&nbsp;<\/p>\n\n\n\n<p>To read the dataset, either store the data file in the same directory and read it directly or provide the path of the data file where the dataset is located while reading the data.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"top-5-rows\"><strong>Top 5 rows<\/strong><\/h3>\n\n\n\n<p>Now, the data has been loaded. Let's check the first 5 rows of the dataset.<\/p>\n\n\n\n<p>From the above results, we can see that the index in python starts from 0.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"bottom-5-rows\"><strong>Bottom 5 rows<\/strong><\/h3>\n\n\n\n<p>To check the dimensions of the dataframe, let's check the number of rows and columns present in the dataset.&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"shape-of-the-data\"><strong>Shape of the Data<\/strong><\/h3>\n\n\n\n<p>There are a total of <strong>11914<\/strong> rows and <strong>16<\/strong> columns in the dataset<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"concise-info-of-dataset\"><strong>Concise info of dataset<\/strong><\/h3>\n\n\n\n<p>Now, check the data types along with the concise summary of all the variables in the dataset. It includes the number of non-null values present.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>&lt;class 'pandas.core.frame.DataFrame'&gt;\nRangeIndex: 11914 entries, 0 to 11913\nData columns (total 16 columns):\nMake                 11914 non-null object\nModel                11914 non-null object\nYear                 11914 non-null int64\nEngine Fuel Type     11911 non-null object\nEngine HP            11845 non-null float64\nEngine Cylinders     11884 non-null float64\nTransmission Type    11914 non-null object\nDriven_Wheels        11914 non-null object\nNumber of Doors      11908 non-null float64\nMarket Category      8172 non-null object\nVehicle Size         11914 non-null object\nVehicle Style        11914 non-null object\nhighway MPG          11914 non-null int64\ncity mpg             11914 non-null int64\nPopularity           11914 non-null int64\nMSRP                 11914 non-null int64\ndtypes: float64(3), int64(5), object(8)\nmemory usage: 1.5+ MB<\/code><\/pre>\n\n\n\n<p>The data type will be stored as an object if strings are present in the variables. Also, it will be int or float if the data has numerical and decimal values, respectively. MSRP (the price of the car) is stored as an int data type, while Driven_wheels is stored as an object data type.&nbsp;<\/p>\n\n\n\n<p>The above results show many variables like Engine Fuel Type, Engine HP, Engine Cylinders, No. of Doors, and Market Category have missing values in the data.&nbsp;<\/p>\n\n\n\n<p>We can check the data types by one more method:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Make                  object\nModel                 object\nYear                   int64\nEngine Fuel Type      object\nEngine HP            float64\nEngine Cylinders     float64\nTransmission Type     object\nDriven_Wheels         object\nNumber of Doors      float64\nMarket Category       object\nVehicle Size          object\nVehicle Style         object\nhighway MPG            int64\ncity mpg               int64\nPopularity             int64\nMSRP                   int64\ndtype: object<\/code><\/pre>\n\n\n\n<p>To print the columns of the dataset<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Index(&#091;'Make', 'Model', 'Year', 'Engine Fuel Type', 'Engine HP',\n       'Engine Cylinders', 'Transmission Type', 'Driven_Wheels',\n       'Number of Doors', 'Market Category', 'Vehicle Size', \n\t'Vehicle Style\u2019, \u2018highway MPG', 'city mpg', 'Popularity', 'MSRP'],\n      dtype='object')<\/code><\/pre>\n\n\n\n<p>Since the names of the columns are very lengthy, let's rename them.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"rename-the-columns\"><strong>Rename the Columns<\/strong><\/h3>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"drop-columns\"><strong>Drop Columns<\/strong><\/h3>\n\n\n\n<p>Drop the columns which are not necessary for the dataframe. Not all the columns in the data need to be relevant. In this data, columns like popularity, number of doors, and vehicle_size were not so relevant. So I am dropping these variables from the dataset.<\/p>\n\n\n\n<p><span style=\"text-decoration: underline\">Missing Values<\/span>:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Make              0\nModel             0\nYear              0\nFuel_Type         3\nHP               69\nCylinders        30\nTransmission      0\nDriven_Wheels     0\nVehicle_Style     0\nh_mpg             0\nc_mpg             0\nprice             0\ndtype: int64<\/code><\/pre>\n\n\n\n<p>The above results show that out of 12 variables, 3 variables, Fuel_type, HP, and cylinders, have missing values.&nbsp;<\/p>\n\n\n\n<p>Let's check the percentage of the data are missing column wise<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Make             0.000000\nModel            0.000000\nYear             0.000000\nFuel_Type        0.025180\nHP               0.579151\nCylinders        0.251805\nTransmission     0.000000\nDriven_Wheels    0.000000\nVehicle_Style    0.000000\nh_mpg            0.000000\nc_mpg            0.000000\nprice            0.000000\ndtype: float64<\/code><\/pre>\n\n\n\n<p>There are 0.025%, 0.58% and 0.25% data are missing in the variables Fuel_type, HP and cylinders respectively.&nbsp;<\/p>\n\n\n\n<p>There are many ways to treat these missing values.&nbsp;<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Drop<\/li>\n\n\n\n<li>Impute<\/li>\n<\/ol>\n\n\n\n<p>We can either drop the rows where missing values are present or replace the missing values with some values like mean, median, or mode.<\/p>\n\n\n\n<p>Since the % of the data missing is very less, we can remove those rows from the dataset.<br><\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Make             0\nModel            0\nYear             0\nFuel_Type        0\nHP               0\nCylinders        0\nTransmission     0\nDriven_Wheels    0\nVehicle_Style    0\nh_mpg            0\nc_mpg            0\nprice            0\ndtype: int64<\/code><\/pre>\n\n\n\n<p>The drop function will default drop the complete row if any of the variables have missing values.&nbsp;<\/p>\n\n\n\n<p>After dropping the missing values, now the count of missing values is 0. That means there are no missing values present in the dataset.<\/p>\n\n\n\n<p>Check the number of rows present after removing the missing values.\t<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Make             11813\nModel            11813\nYear             11813\nFuel_Type        11813\nHP               11813\nCylinders        11813\nTransmission     11813\nDriven_Wheels    11813\nVehicle_Style    11813\nh_mpg            11813\nc_mpg            11813\nprice            11813\ndtype: int64<\/code><\/pre>\n\n\n\n<p>The original number of rows was 11914, and now the number of rows left is 11813.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"statistical-summary\"><strong>Statistical Summary<\/strong><\/h3>\n\n\n\n<p>Now, let's find out the dataset's statistical or 5-point summary. The 5-point summary tells the descriptive summary, which includes the meaning, median, mode, no. of rows, maximum value, and minimum value for each variable.<br><\/p>\n\n\n\n<p>Mean, standard deviation, max, and percentile values will be NaN for variables that have <strong><em>object<\/em><\/strong> datatype.&nbsp;<\/p>\n\n\n\n<p>The unique, top, frequency will be NaN for variables with the <strong><em>int<\/em><\/strong> data type.&nbsp;<\/p>\n\n\n\n<p>From the descriptive summary, we got to know that there is 47 unique make of cars and 904 models. Data has maximum Chevrolet make cars with 1115 counts. The average price of the car is 40581.5 dollars, and the 50<sup>th<\/sup> percentile or median of the price is 29970. There is a huge difference between the mean and median of the price. This depicts that the price variable is highly skewed, which we can check visually using a histogram.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"data-visualisation\"><strong>Data Visualisation<\/strong><\/h3>\n\n\n\n<p>As its name suggests, data visualization is observing the data using various plots, graphs, etc. Various plots include histogram, scatterplot, boxplot, heatmap, etc. We will use matplotlib and seaborn together to visualize a few variables.<\/p>\n\n\n\n<p>After conducting a thorough Exploratory Data Analysis (EDA) in Python, the next step is often presenting the insights in an engaging and interactive format. Tools like Power BI can help you achieve this with ease. To sharpen your data visualization skills, consider enrolling in our <a href=\"https:\/\/www.mygreatlearning.com\/academy\/learn-for-free\/courses\/data-visualization-with-power-bi\">Power BI free course<\/a>. This course will guide you through creating dynamic reports and dashboards, allowing you to effectively showcase your EDA results. Begin your free learning journey.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"histogram-distribution-plot\"><strong>Histogram (Distribution Plot)<\/strong><\/h4>\n\n\n\n<p>A histogram shows the shape and distribution of the numerical variable. For categorical variables, it shows the count of the categories present in the variable.<\/p>\n\n\n\n<p>From both histograms, it is shown that the HP variable is quite distributed. It is a little bit tilted on the right, and that means it is slightly right-skewed but normally distributed. However, the price variable is highly skewed.&nbsp;<\/p>\n\n\n\n<p><strong><span style=\"text-decoration: underline\">Histogram for Categorical Variable<\/span><\/strong><\/p>\n\n\n\n<p>This is the countplot for <strong>Make<\/strong> Variable. Every bar shows the count of the category present in the dataset.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"outliers-check\"><strong>Outliers Check<\/strong><\/h4>\n\n\n\n<p>Outliers are the values that are significantly different from other values\/observations. An outlier can create major issues in modeling. So it is necessary to find outliers and treat them.<\/p>\n\n\n\n<p>Outliers can be detected by using a boxplot. Boxplot depicts the variable distribution using quartile, also known as a box and whiskers plot.<br><\/p>\n\n\n\n<p>All the above boxplots show that there are many outliers present in the price and c_mpg variables. In the Cylinder variable, only 4 observations are outliers.<\/p>\n\n\n\n<p>According to the box plot, any observation which is out of the range of Q1 (25 percentile) and Q3 (75 percentile) or IQR (Inter quartile range) is observed as an outlier.<\/p>\n\n\n\n<p>If many outliers are present in the dataset, then the treatment of outliers is necessary. There are methods like flooring and capping which can be used to impute outliers.&nbsp;<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"correlation-plot\"><strong>Correlation Plot<\/strong><\/h4>\n\n\n\n<p>Correlation is calculated to find out the intensity of the relationship between 2 variables. Correlation ranges from -1 to 1. -1 correlation value suggests a strong negative relationship, and 1 shows a strong positive relationship. 0 means there is no relation between the 2 variables.<\/p>\n\n\n\n<p>From the above correlation plot, it can be inferred that there are many variables that are strongly related to each other. For Example, the correlation value between c_mpg and h_mpg is 0.85, which is near 1. That means there is a strong positive relationship between them. Likewise, Cylinders and c_mpg have a negative relationship.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"pairplot\"><strong>Pairplot<\/strong><\/h4>\n\n\n\n<p>Pairplot is used to find out the relationship between variables, and it plots the scatter plot between each variable. Scatter plots can also be used independently. But pairplot will give the relationship plot among all the numerical variables in one line.<br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"endnotes\"><strong>Endnotes<\/strong><\/h2>\n\n\n\n<p>All the above steps are part of EDA, and this is not the end of EDA. All the steps above performed are the basics that should be performed to analyze the data before doing feature engineering or modeling.<\/p>\n\n\n\n<p>EDA is one of the important steps during the whole process of data science. It is said that most of the time, the model building goes into EDA and feature engineering. If you want to create a big setup of information from the data, you need to do an extensive EDA.<\/p>\n\n\n\n<p><em>If you wish to earn more about Python and <a aria-label=\"Machine Learning (opens in a new tab)\" href=\"https:\/\/www.mygreatlearning.com\/blog\/what-is-machine-learning\/\" target=\"_blank\" rel=\"noreferrer noopener\">Machine Learning<\/a>, sign up for <a href=\"https:\/\/www.mygreatlearning.com\/pg-program-machine-learning-course\">Great Learning's PG program in Machine Learning. <\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Contributed by: Manorama Yadav Introduction to EDA in Python Exploratory data analysis is the analysis of the data and brings out insights. It's storytelling, a story that data is trying to tell. EDA is an approach to analyzing the data with the help of various tools and graphical techniques like barplot, histogram, etc. According to [&hellip;]<\/p>\n","protected":false},"author":41,"featured_media":14067,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[9],"tags":[36804,36796],"content_type":[],"class_list":["post-14062","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-science","tag-data-analytics","tag-python"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Introduction to Exploratory Data Analysis (EDA) in Python<\/title>\n<meta name=\"description\" content=\"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Understanding the Exploratory Data Analysis (EDA) in Python\" \/>\n<meta property=\"og:description\" content=\"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Great Learning Blog: Free Resources what Matters to shape your Career!\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/GreatLearningOfficial\/\" \/>\n<meta property=\"article:published_time\" content=\"2020-04-27T05:46:04+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-10-14T19:40:15+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"662\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Great Learning Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@https:\/\/twitter.com\/Great_Learning\" \/>\n<meta name=\"twitter:site\" content=\"@Great_Learning\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Great Learning Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/\"},\"author\":{\"name\":\"Great Learning Editorial Team\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\"},\"headline\":\"Understanding the Exploratory Data Analysis (EDA) in Python\",\"datePublished\":\"2020-04-27T05:46:04+00:00\",\"dateModified\":\"2024-10-14T19:40:15+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/\"},\"wordCount\":1627,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/04\\\/shutterstock_352982963.jpg\",\"keywords\":[\"Data Analytics\",\"python\"],\"articleSection\":[\"Data Science and Analytics\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/\",\"name\":\"Introduction to Exploratory Data Analysis (EDA) in Python\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/04\\\/shutterstock_352982963.jpg\",\"datePublished\":\"2020-04-27T05:46:04+00:00\",\"dateModified\":\"2024-10-14T19:40:15+00:00\",\"description\":\"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/04\\\/shutterstock_352982963.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/04\\\/shutterstock_352982963.jpg\",\"width\":1000,\"height\":662,\"caption\":\"EDA in Python\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/understanding-eda-in-python\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Science and Analytics\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/data-science\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Understanding the Exploratory Data Analysis (EDA) in Python\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"name\":\"Great Learning Blog\",\"description\":\"Learn, Upskill &amp; Career Development Guide and Resources\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"alternateName\":\"Great Learning\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\",\"name\":\"Great Learning\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"width\":900,\"height\":900,\"caption\":\"Great Learning\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/GreatLearningOfficial\\\/\",\"https:\\\/\\\/x.com\\\/Great_Learning\",\"https:\\\/\\\/www.instagram.com\\\/greatlearningofficial\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/in.pinterest.com\\\/greatlearning12\\\/\",\"https:\\\/\\\/www.youtube.com\\\/user\\\/beaconelearning\\\/\"],\"description\":\"Great Learning is a leading global ed-tech company for professional training and higher education. It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.\",\"email\":\"info@mygreatlearning.com\",\"legalName\":\"Great Learning Education Services Pvt. Ltd\",\"foundingDate\":\"2013-11-29\",\"numberOfEmployees\":{\"@type\":\"QuantitativeValue\",\"minValue\":\"1001\",\"maxValue\":\"5000\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\",\"name\":\"Great Learning Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"caption\":\"Great Learning Editorial Team\"},\"description\":\"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.\",\"sameAs\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/\",\"https:\\\/\\\/in.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/x.com\\\/https:\\\/\\\/twitter.com\\\/Great_Learning\",\"https:\\\/\\\/www.youtube.com\\\/channel\\\/UCObs0kLIrDjX2LLSybqNaEA\"],\"award\":[\"Best EdTech Company of the Year 2024\",\"Education Economictimes Outstanding Education\\\/Edtech Solution Provider of the Year 2024\",\"Leading E-learning Platform 2024\"],\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/author\\\/greatlearning\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Introduction to Exploratory Data Analysis (EDA) in Python","description":"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/","og_locale":"en_US","og_type":"article","og_title":"Understanding the Exploratory Data Analysis (EDA) in Python","og_description":"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.","og_url":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/","og_site_name":"Great Learning Blog: Free Resources what Matters to shape your Career!","article_publisher":"https:\/\/www.facebook.com\/GreatLearningOfficial\/","article_published_time":"2020-04-27T05:46:04+00:00","article_modified_time":"2024-10-14T19:40:15+00:00","og_image":[{"width":1000,"height":662,"url":"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg","type":"image\/jpeg"}],"author":"Great Learning Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@https:\/\/twitter.com\/Great_Learning","twitter_site":"@Great_Learning","twitter_misc":{"Written by":"Great Learning Editorial Team","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#article","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/"},"author":{"name":"Great Learning Editorial Team","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad"},"headline":"Understanding the Exploratory Data Analysis (EDA) in Python","datePublished":"2020-04-27T05:46:04+00:00","dateModified":"2024-10-14T19:40:15+00:00","mainEntityOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/"},"wordCount":1627,"commentCount":0,"publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg","keywords":["Data Analytics","python"],"articleSection":["Data Science and Analytics"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/","url":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/","name":"Introduction to Exploratory Data Analysis (EDA) in Python","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#primaryimage"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg","datePublished":"2020-04-27T05:46:04+00:00","dateModified":"2024-10-14T19:40:15+00:00","description":"Exploratory Data Analysis (EDA): This article is a guide to EDA in Python. If you are thinking of learning exploratory data analysis, start with this to understand basics.","breadcrumb":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#primaryimage","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg","width":1000,"height":662,"caption":"EDA in Python"},{"@type":"BreadcrumbList","@id":"https:\/\/www.mygreatlearning.com\/blog\/understanding-eda-in-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog","item":"https:\/\/www.mygreatlearning.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Data Science and Analytics","item":"https:\/\/www.mygreatlearning.com\/blog\/data-science\/"},{"@type":"ListItem","position":3,"name":"Understanding the Exploratory Data Analysis (EDA) in Python"}]},{"@type":"WebSite","@id":"https:\/\/www.mygreatlearning.com\/blog\/#website","url":"https:\/\/www.mygreatlearning.com\/blog\/","name":"Great Learning Blog","description":"Learn, Upskill &amp; Career Development Guide and Resources","publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"alternateName":"Great Learning","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.mygreatlearning.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization","name":"Great Learning","url":"https:\/\/www.mygreatlearning.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","width":900,"height":900,"caption":"Great Learning"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/GreatLearningOfficial\/","https:\/\/x.com\/Great_Learning","https:\/\/www.instagram.com\/greatlearningofficial\/","https:\/\/www.linkedin.com\/school\/great-learning\/","https:\/\/in.pinterest.com\/greatlearning12\/","https:\/\/www.youtube.com\/user\/beaconelearning\/"],"description":"Great Learning is a leading global ed-tech company for professional training and higher education. It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.","email":"info@mygreatlearning.com","legalName":"Great Learning Education Services Pvt. Ltd","foundingDate":"2013-11-29","numberOfEmployees":{"@type":"QuantitativeValue","minValue":"1001","maxValue":"5000"}},{"@type":"Person","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad","name":"Great Learning Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","caption":"Great Learning Editorial Team"},"description":"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.","sameAs":["https:\/\/www.mygreatlearning.com\/","https:\/\/in.linkedin.com\/school\/great-learning\/","https:\/\/x.com\/https:\/\/twitter.com\/Great_Learning","https:\/\/www.youtube.com\/channel\/UCObs0kLIrDjX2LLSybqNaEA"],"award":["Best EdTech Company of the Year 2024","Education Economictimes Outstanding Education\/Edtech Solution Provider of the Year 2024","Leading E-learning Platform 2024"],"url":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"}]}},"uagb_featured_image_src":{"full":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",1000,662,false],"thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963-150x150.jpg",150,150,true],"medium":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963-300x199.jpg",300,199,true],"medium_large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963-768x508.jpg",768,508,true],"large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",1000,662,false],"1536x1536":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",1000,662,false],"2048x2048":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",1000,662,false],"web-stories-poster-portrait":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",640,424,false],"web-stories-publisher-logo":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",96,64,false],"web-stories-thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2020\/04\/shutterstock_352982963.jpg",150,99,false]},"uagb_author_info":{"display_name":"Great Learning Editorial Team","author_link":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"},"uagb_comment_info":0,"uagb_excerpt":"Contributed by: Manorama Yadav Introduction to EDA in Python Exploratory data analysis is the analysis of the data and brings out insights. It's storytelling, a story that data is trying to tell. EDA is an approach to analyzing the data with the help of various tools and graphical techniques like barplot, histogram, etc. According to&hellip;","_links":{"self":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/14062","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/comments?post=14062"}],"version-history":[{"count":19,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/14062\/revisions"}],"predecessor-version":[{"id":115355,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/14062\/revisions\/115355"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media\/14067"}],"wp:attachment":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media?parent=14062"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/categories?post=14062"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/tags?post=14062"},{"taxonomy":"content_type","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/content_type?post=14062"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}