Data Analysis using Jupyter Notebook
Speaker: Santhosh Kumar Balan – Hyderabad, IndiaTopic(s): Information Systems, Search, Information Retrieval, Database Systems, Data Mining, Data Science
Abstract
Data analysis is a cycle of reviewing, purging, changing and demonstrating data with the objective of finding valuable data, advising ends and supporting dynamic. Data analysis has various features and approaches, including assorted strategies under an assortment of names, and is utilized in various business, science, and sociology areas. In the present business world, data analysis assumes a part in settling on choices more logical and helping organizations work all the more successfully. The data are vital as contributions to the analysis, which is determined dependent on the prerequisites of those coordinating the analysis or clients. The overall sort of element whereupon the data will be gathered is alluded to as a trial unit. Explicit factors with respect to a populace might be determined and gotten. Data might be mathematical or downright. Data are gathered from an assortment of sources. The necessities might be conveyed by experts to caretakers of the data, for example, data innovation faculty inside an association. The data may likewise be gathered from sensors in nature, for example, traffic cameras, satellites, recording gadgets, and so on. It might likewise be gotten through meetings, downloads from online sources, or understanding documentation. Data at first acquired must be handled or sorted out for analysis.
For example, these may include putting data into lines and sections in a table arrangement for additional analysis, for example, inside a spreadsheet or measurable programming. When handled and composed, the data might be inadequate, contain copies, or contain mistakes. The requirement for data cleaning will emerge from issues in the manner that data are entered and put away. Data cleaning is the way toward forestalling and adjusting these mistakes. Regular undertakings incorporate record coordinating, distinguishing error of data, and generally nature of existing data, deduplication, and segment division. Such data issues can likewise be distinguished through an assortment of diagnostic strategies. When the data are cleaned, it tends to be dissected. Investigators may apply an assortment of procedures alluded to as exploratory data analysis to start understanding the messages contained in the data. The cycle of investigation may bring about extra data cleaning or extra demands for data, so these exercises might be iterative in nature. Unmistakable measurements, for example, the normal or middle, might be produced to help comprehend the data. Data representation may likewise be utilized to analyze the data in graphical arrangement, to get extra knowledge in regards to the messages inside the data.
Jupyter Lab is an online intuitive advancement condition for Jupyter journals, code, and data. Jupyter Lab is adaptable: design and orchestrate the UI to help a wide scope of work processes in data science, logical registering, and AI. Jupyter Lab is extensible and measured: compose modules that include new segments and incorporate with existing ones. The Jupyter Notebook is an open-source web application that permits you to make and offer records that contain live code, conditions, perceptions, and account text. Utilizations incorporate data cleaning and change, mathematical reproduction, measurable demonstrating, data perception, AI, and substantially more.
About this Lecture
Number of Slides: 34Duration: 90 minutes
Languages Available: English
Last Updated:
Request this Lecture
To request this particular lecture, please complete this online form.
Request a Tour
To request a tour with this speaker, please complete this online form.
All requests will be sent to ACM headquarters for review.