Nội dung text chapter-2 data science.pdf
© U Dinesh Kumar, IIM Bangalore Structured and Unstructured Data Structured data means that the data is described in a matrix form with labelled rows and columns. Any data that is not originally in the matrix form with rows and columns is an unstructured data.
© U Dinesh Kumar, IIM Bangalore Data Type Cross-Sectional Data: A data collected on many variables of interest at the same time or duration of time is called cross-sectional data. Ex. Data on movies such as budget, actors, directors, genre etc. Time Series Data: A data collected for a single variable such as demand for smartphones collected over several time intervals (weekly, monthly, etc.) is called a time series data. Panel Data: Data collected on several variables (multiple dimensions) over several time intervals is called panel data (also known as longitudinal data). Ex. Gross Domestic Product (GDP) data, Unemployment rate for several years