Data Science
Data science is an interdisciplinary field that uses scientific methods, algorithms, and systems to extract insights and knowledge from structured and unstructured data. It combines elements of statistics, computer science, and domain expertise to analyze large volumes of data, build predictive models, and inform decision-making. Key components include data collection, cleaning, exploration, visualization, and modeling using tools like Python, R, and machine learning frameworks. Data scientists often work with big data technologies and collaborate across disciplines to solve complex problems in industries ranging from healthcare and finance to marketing and technology.
- Principles of Data Science (OpenStax)
  Principles of Data Science is intended to support one- or two-semester courses in data science. It is appropriate for data science majors and minors as well as students concentrating in business, finance, health care, engineering, the sciences, and a number of other fields where data science has become critically important.
- The Crystal Ball - Instruction Manual I: Introduction to Data Science (Davies)
  A perfect introduction to the exploding field of Data Science for the curious, first-time student. The author brings his trademark conversational tone to the important pillars of the discipline: exploratory data analysis, choices for structuring data, causality, machine learning principles, and introductory Python programming using open-source Jupyter Notebooks.
Thumbnail: Algo-r-(h)-i-(y)-thms, 2018. Installation view at ON AIR, Tomás Saraceno's solo exhibition at Palais de Tokyo, Paris, 2018. (Unsplash License; Alina Grubnyak via Unsplash)