7: Deep Learning and AI Basics
- 7.0: Introduction
- This page discusses the importance of neural networks in AI, highlighting how their structure, loosely modeled on the human brain, supports pattern recognition and prediction. It covers how neuroscience informs our understanding of neuron connections. While early AI was limited by the computational power available, advances have led to deep learning models that power complex applications such as natural language processing. The text also addresses ethical concerns associated with AI and machine learning.
- 7.1: Introduction to Neural Networks
- This page covers the fundamentals of neural networks, including their structure, essential components, and applications in image recognition and speech processing. It introduces key concepts like weights, biases, activation functions, and the multilayer perceptron (MLP). The text explains various activation functions and the perceptron learning rule, illustrated through a case study using the Iris dataset.
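The perceptron learning rule named above can be sketched in a few lines of plain Python. This is an illustrative toy example on made-up two-feature data, not the chapter's Iris case study: a step-activation perceptron whose weights and bias are nudged by `learning rate × error × input` after each sample.

```python
# Minimal perceptron sketch (toy data, not the Iris dataset).

def predict(weights, bias, x):
    """Step activation: output 1 if the weighted sum plus bias is non-negative."""
    total = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if total >= 0 else 0

def train_perceptron(samples, labels, lr=0.1, epochs=20):
    """Perceptron learning rule: adjust each weight by lr * error * input."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, target in zip(samples, labels):
            error = target - predict(weights, bias, x)
            weights = [w + lr * error * xi for w, xi in zip(weights, x)]
            bias += lr * error
    return weights, bias

# Linearly separable toy data: class 1 roughly where x0 + x1 > 1
samples = [(0.0, 0.0), (0.2, 0.3), (1.0, 0.9), (0.8, 0.7)]
labels = [0, 0, 1, 1]
w, b = train_perceptron(samples, labels)
```

Because the toy data is linearly separable, the rule is guaranteed to converge to a separating boundary; on non-separable data it would cycle indefinitely, which motivates the gradient-based training covered in the next section.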
- 7.2: Backpropagation
- This page discusses the fundamentals of neural networks, focusing on weight and bias adjustments through backpropagation to minimize errors in supervised learning. It covers loss functions, gradient descent, and the importance of differentiability in optimizing models. The implementation of a neural network on the MNIST dataset using TensorFlow is described, alongside the architecture and challenges of recurrent neural networks (RNNs), particularly long short-term memory (LSTM) networks.
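The core of backpropagation, applying the chain rule backward through the network to get weight gradients, can be shown on the smallest possible network. The sketch below is an illustrative one-input, one-hidden-neuron sigmoid network trained by gradient descent on squared error to learn a NOT-like mapping; it is not the chapter's TensorFlow/MNIST implementation, and the architecture and numbers are assumptions made for the example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(params, x):
    """Forward pass: input -> hidden sigmoid neuron -> output sigmoid neuron."""
    w1, b1, w2, b2 = params
    h = sigmoid(w1 * x + b1)
    return sigmoid(w2 * h + b2)

def train(data, lr=0.5, epochs=5000):
    w1, b1, w2, b2 = 0.5, 0.0, 0.5, 0.0  # small fixed starting parameters
    for _ in range(epochs):
        for x, y in data:
            # Forward pass (keep intermediates for the backward pass)
            h = sigmoid(w1 * x + b1)
            out = sigmoid(w2 * h + b2)
            # Backward pass: chain rule through loss L = (out - y)^2
            d_out = 2 * (out - y) * out * (1 - out)   # dL/d(pre-activation of output)
            d_h = d_out * w2 * h * (1 - h)            # propagated back to hidden neuron
            # Gradient descent: step each parameter against its gradient
            w2 -= lr * d_out * h
            b2 -= lr * d_out
            w1 -= lr * d_h * x
            b1 -= lr * d_h
    return (w1, b1, w2, b2)

# Learn a NOT-like mapping: 0 -> 1, 1 -> 0
data = [(0.0, 1.0), (1.0, 0.0)]
params = train(data)
```

Note that every factor in the backward pass is a derivative of a differentiable function (sigmoid, squared error), which is exactly why differentiability matters for this training procedure.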
- 7.3: Introduction to Deep Learning
- This page provides an overview of deep learning principles, focusing on neural networks and the role of hidden layers in feature recognition and classification. It discusses loss functions, including mean squared error and binary cross entropy, crucial for training. The text highlights the use of sparse categorical cross entropy with softmax for multi-class tasks, demonstrating its application in classifying handwritten numerals using TensorFlow.
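The multi-class loss mentioned above can be computed by hand: softmax turns raw scores (logits) into probabilities, and sparse categorical cross entropy is the negative log of the probability assigned to the true class, given as an integer index. The logits below are made-up illustrative values, not from the chapter's TensorFlow example.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sparse_categorical_cross_entropy(logits, true_index):
    """Negative log probability of the true class (given as an integer label)."""
    probs = softmax(logits)
    return -math.log(probs[true_index])

logits = [2.0, 1.0, 0.1]  # scores for 3 classes
loss_correct = sparse_categorical_cross_entropy(logits, 0)  # true class scored highest
loss_wrong = sparse_categorical_cross_entropy(logits, 2)    # true class scored lowest
```

The "sparse" variant takes the label as a plain integer index rather than a one-hot vector, which is convenient for tasks like the ten-digit handwritten-numeral classification described above.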
- 7.4: Convolutional Neural Networks
- This page offers an overview of convolutional neural networks (CNNs) and their effectiveness in image processing tasks such as classification, object detection, and semantic segmentation. It describes CNN components, including convolutional, pooling, and fully connected layers, and explains feature maps that capture hierarchical image features.
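Two of the CNN building blocks named above can be sketched directly on small lists: sliding a filter over an image to produce a feature map, and 2x2 max pooling to downsample it. The image and edge-detecting kernel below are toy values chosen for illustration.

```python
def convolve2d(image, kernel):
    """Slide a small kernel over a 2D image (no padding) to build a feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for i in range(len(image) - kh + 1):
        row = []
        for j in range(len(image[0]) - kw + 1):
            total = sum(image[i + a][j + b] * kernel[a][b]
                        for a in range(kh) for b in range(kw))
            row.append(total)
        out.append(row)
    return out

def max_pool_2x2(fmap):
    """Downsample a feature map by keeping the max of each 2x2 block."""
    return [[max(fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1])
             for j in range(0, len(fmap[0]) - 1, 2)]
            for i in range(0, len(fmap) - 1, 2)]

# A vertical-edge detector applied to a 4x4 image with a bright right half
image = [[0, 0, 9, 9]] * 4
edge_kernel = [[-1, 1]] * 2  # responds where the right pixel is brighter than the left

fmap = convolve2d(image, edge_kernel)  # strong response along the edge column
pooled = max_pool_2x2(fmap)            # smaller map that keeps the strongest response
```

The feature map lights up only where the kernel's pattern (a left-to-right brightness jump) appears, and pooling preserves that strong response while shrinking the map, which is the hierarchical feature extraction the summary describes.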
- 7.5: Natural Language Processing
- This page discusses the evolution of Natural Language Processing (NLP) from basic systems to sophisticated models like ChatGPT, highlighting key advancements and applications in various fields. It notes the transformative role of AI in creative industries and workplace productivity, alongside challenges such as ethical concerns regarding intellectual property and the impact on human creativity.
- 7.6: Key Terms
- This page serves as a glossary of key terminology and definitions related to neural networks and artificial intelligence, covering concepts such as neuron activation, activation functions, deep learning, CNNs, RNNs, backpropagation, gradient problems, loss functions, and the significance of different layers. It highlights various AI applications, including natural language processing and generative art, making it a comprehensive resource for foundational AI and machine learning knowledge.
- 7.7: Group Project
- This page describes three machine learning projects. Project A develops a neural network for diagnosing cirrhosis, focusing on data preparation and model training with TensorFlow. Project B builds a convolutional neural network to identify handwritten digits from the MNIST dataset, encouraging experimentation. Project C uses natural language processing and AI art to create an illustrated story, emphasizing theme development, narrative coherence, and visual-text alignment.
- 7.8: Chapter Review
- This page discusses the role of hidden layers in neural networks for learning complex patterns, highlights the effectiveness of convolutional neural networks in image-related tasks, and emphasizes advancements in speech recognition for improved user experience. It also addresses ethical concerns regarding AI, particularly the unauthorized data collection by virtual assistants.
- 7.9: Critical Thinking
- This page discusses neural network models, starting with a single perceptron for flu diagnosis using specific markers and weight adjustment. It evaluates standard, recurrent, and convolutional networks for tasks such as sentiment analysis, stock prediction, image classification, and predicting subscription cancellations. It also explores bitmap character representations and image reduction techniques like pooling, concluding with the identification of suitable loss functions for diverse predictive tasks.
- 7.10: Quantitative Problems
- This page describes a neural network designed to classify students' likelihood of graduating using various activation functions (ReLU, Leaky ReLU, Sigmoid, Softplus). It includes an analysis of the most effective activation function for classification and the method for determining the output. The page also discusses calculating average loss using Mean Squared Error (MSE) and Hinge loss for predictions on different data points.
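The two average-loss calculations mentioned above are short enough to sketch directly. The predictions and labels below are made-up values (the chapter's actual data points are not reproduced here); hinge loss assumes labels in {-1, +1}, while MSE compares real-valued targets and predictions.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between targets and predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def average_hinge_loss(y_true, scores):
    """Average of max(0, 1 - y * score), with labels y in {-1, +1}."""
    return sum(max(0.0, 1.0 - y * s) for y, s in zip(y_true, scores)) / len(y_true)

# Toy values: three predictions each
mse = mean_squared_error([1.0, 0.0, 1.0], [0.9, 0.2, 0.6])      # (0.01 + 0.04 + 0.16) / 3
hinge = average_hinge_loss([+1, -1, +1], [0.8, -1.5, 0.3])      # (0.2 + 0.0 + 0.7) / 3
```

Note the middle hinge term is zero: that prediction is on the correct side of the margin (y * score >= 1), so it contributes no loss, which is what distinguishes hinge loss from a squared-error penalty.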