
author: Juma Shafara date: "2024-08-12" title: Activation Functions and Max Pooling keywords: [Activation Functions, Max Pooling, Convolutional Neural Network] description: In this lab, you will learn two important components in building a convolutional neural network.


Photo by DATAIDEA

Objective for this Notebook

1. Learn how to apply an activation function.
2. Learn about max pooling.

Table of Contents

In this lab, you will learn two important components in building a convolutional neural network. The first is applying an activation function, which works just as it does in a regular neural network. You will also learn about max pooling. Max pooling reduces the number of parameters and makes the network less sensitive to small shifts in the input image.

  • Activation Functions
  • Max Pooling

  • Estimated Time Needed: 25 min


    Import the following libraries:

    import torch
    import torch.nn as nn
    import matplotlib.pyplot as plt
    import numpy as np
    

    Activation Functions

    Just as in a regular neural network, you apply an activation function to the activation map, as shown in the following image:

    [Figure: an activation function applied to the activation map]

    Create a kernel and image as usual. Set the bias to zero:

    # A convolution object with a single 3x3 kernel
    conv = nn.Conv2d(in_channels=1, out_channels=1, kernel_size=3)
    # Gx is a vertical edge-detecting (Sobel) kernel
    Gx = torch.tensor([[1.0, 0, -1.0], [2.0, 0, -2.0], [1.0, 0, -1.0]])
    # Set the kernel weights to Gx and the bias to zero
    conv.state_dict()['weight'][0][0] = Gx
    conv.state_dict()['bias'][0] = 0.0
    conv.state_dict()
    
    OrderedDict([('weight',
                  tensor([[[[ 1.,  0., -1.],
                            [ 2.,  0., -2.],
                            [ 1.,  0., -1.]]]])),
                 ('bias', tensor([0.]))])
    
    # A 5x5 image with a vertical line of ones in the middle column
    image = torch.zeros(1, 1, 5, 5)
    image[0, 0, :, 2] = 1
    image
    
    tensor([[[[0., 0., 1., 0., 0.],
              [0., 0., 1., 0., 0.],
              [0., 0., 1., 0., 0.],
              [0., 0., 1., 0., 0.],
              [0., 0., 1., 0., 0.]]]])
    

    The following image shows the image and kernel:

    [Figure: the image and the kernel]

    Apply convolution to the image:

    Z=conv(image)
    Z
    
    tensor([[[[-4.,  0.,  4.],
              [-4.,  0.,  4.],
              [-4.,  0.,  4.]]]], grad_fn=<ConvolutionBackward0>)
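
    The output has shape 3×3 because a 3×3 kernel fits into a 5×5 image at 5 − 3 + 1 = 3 positions along each dimension. As a quick sanity check, note that nn.Conv2d computes a cross-correlation (no kernel flip), so each output element is the sum of the element-wise product of the kernel and the corresponding image patch:

    # Reproduce the top-left output element by hand
    patch = image[0, 0, 0:3, 0:3]
    (patch * Gx).sum()   # tensor(-4.), matching Z[0, 0, 0, 0]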
    

    Apply the activation function to the activation map. The function acts element-wise: it is applied to each entry of the activation map independently.

    A=torch.relu(Z)
    A
    
    tensor([[[[0., 0., 4.],
              [0., 0., 4.],
              [0., 0., 4.]]]], grad_fn=<ReluBackward0>)
    
    relu = nn.ReLU()
    relu(Z)
    
    tensor([[[[0., 0., 4.],
              [0., 0., 4.],
              [0., 0., 4.]]]], grad_fn=<ReluBackward0>)
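
    Both the functional form torch.relu and the nn.ReLU module perform the same element-wise operation; the module form is convenient when composing layers, for example inside nn.Sequential.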
    

    The process is summarized in the following figure. The ReLU function is applied to each element. All elements less than zero are mapped to zero; the remaining elements are unchanged.

    [Figure: ReLU maps each negative element to zero and leaves the rest unchanged]
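
    As a quick check of this rule, ReLU(x) = max(0, x) can also be computed with torch.clamp; the result matches the activation map obtained above:

    # ReLU(x) = max(0, x): clamping below at zero gives the same mapping
    manual = torch.clamp(Z, min=0)
    torch.equal(A, manual)   # True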

    Max Pooling

    Consider the following image:

    # A 4x4 image; the fourth row is left as zeros
    image1 = torch.zeros(1, 1, 4, 4)
    image1[0, 0, 0, :] = torch.tensor([1.0, 2.0, 3.0, -4.0])
    image1[0, 0, 1, :] = torch.tensor([0.0, 2.0, -3.0, 0.0])
    image1[0, 0, 2, :] = torch.tensor([0.0, 2.0, 3.0, 1.0])
    
    image1
    
    tensor([[[[ 1.,  2.,  3., -4.],
              [ 0.,  2., -3.,  0.],
              [ 0.,  2.,  3.,  1.],
              [ 0.,  0.,  0.,  0.]]]])
    

    Max pooling takes the maximum value in each region. Consider the following image: for the first region, max pooling takes the largest element in the yellow region.

    [Figure: max pooling takes the largest element in each (yellow) region]

    The region shifts, and the process is repeated. The process is similar to convolution and is demonstrated in the following figure:

    [Figure: the pooling window shifts across the image, as in convolution]

    Create a 2D max pooling object and apply it to the image:

    # A 2x2 pooling window with stride 1
    max1 = torch.nn.MaxPool2d(2, stride=1)
    max1(image1)
    
    tensor([[[[2., 3., 3.],
              [2., 3., 3.],
              [2., 3., 3.]]]])
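
    With a 2×2 window and stride 1, the 4×4 image yields a 3×3 output (4 − 2 + 1 = 3). Each output entry is the maximum over one 2×2 window; for example, the top-left entry comes from rows 0–1 and columns 0–1 of image1:

    # The first 2x2 window is [[1., 2.], [0., 2.]]; its maximum is 2,
    # matching the top-left entry of the pooled output
    image1[0, 0, 0:2, 0:2].max()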
    

    If the stride is set to None (its default setting), it is taken to be equal to the kernel size, so the pooling regions do not overlap: the process takes the maximum in each region and then shifts over by the full width of the window, as shown in the following figure:

    [Figure: max pooling with the default stride, equal to the kernel size]

    Here's the code in PyTorch:
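
    # With no stride argument, the stride defaults to the kernel size (2),
    # so the 2x2 windows do not overlap and the 4x4 image pools to 2x2
    max2 = torch.nn.MaxPool2d(2)
    max2(image1)
    
    tensor([[[[2., 3.],
              [2., 3.]]]])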

    What's on your mind? Put it in the comments!