Tensors are the core data structure in PyTorch
The PyTorch convention for image tensors is (channels, height, width).
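A minimal sketch of this layout (the 28×28 size is just an illustrative choice):

```python
import torch

# A single 28×28 RGB image in PyTorch's (channels, height, width) layout
img = torch.rand(3, 28, 28)
print(img.shape)  # torch.Size([3, 28, 28])

# Other libraries often use (height, width, channels);
# permute() reorders the dimensions without copying data
hwc = img.permute(1, 2, 0)
print(hwc.shape)  # torch.Size([28, 28, 3])
```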
Many PyTorch errors come from tensor issues
Master tensors now, avoid frustration later
A shape of [6, 1] means: 6 rows, 1 column
Shape mismatches = most common PyTorch errors
Model expects first dimension = batch size
Think of it like a stack of papers:
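These shape ideas can be made concrete; the sizes below (6 rows, a batch of 32 images) are illustrative, not taken from any specific model:

```python
import torch

# Shape [6, 1]: 6 rows, 1 column
col = torch.rand(6, 1)
print(col.shape)  # torch.Size([6, 1])

# A batch of 32 RGB 28×28 images: the first dimension is the batch size,
# like a stack of 32 sheets of paper
batch = torch.rand(32, 3, 28, 28)
print(batch.shape[0])  # 32
```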
For neural networks: float32 is the sweet spot
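A quick sketch of the default dtypes and how to convert:

```python
import torch

# Integer lists default to int64; float lists default to float32
print(torch.tensor([1, 2, 3]).dtype)        # torch.int64
print(torch.tensor([1.0, 2.0, 3.0]).dtype)  # torch.float32

# Convert explicitly when a model expects float32
x = torch.tensor([1, 2, 3]).float()
print(x.dtype)  # torch.float32
```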
From Python lists:
From NumPy:
Built-in patterns:
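The three creation routes above can be sketched together (values and shapes here are arbitrary examples):

```python
import torch
import numpy as np

# From Python lists
a = torch.tensor([[1, 2], [3, 4]])

# From NumPy — note from_numpy() shares memory with the source array
arr = np.array([1.0, 2.0, 3.0])
b = torch.from_numpy(arr)

# Built-in patterns
zeros = torch.zeros(2, 3)
ones = torch.ones(2, 3)
rand = torch.rand(2, 3)   # uniform samples in [0, 1)
print(a.shape, b.shape, zeros.shape)
```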
Common error: forgetting the batch dimension
Always check shape before using unsqueeze()
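A minimal sketch of adding the missing batch dimension with unsqueeze():

```python
import torch

img = torch.rand(3, 28, 28)   # a single image: no batch dimension
print(img.shape)              # torch.Size([3, 28, 28])

# unsqueeze(0) inserts a batch dimension of size 1 at position 0
batch = img.unsqueeze(0)
print(batch.shape)            # torch.Size([1, 3, 28, 28])
```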
.item() only works on tensors with exactly one element
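For example:

```python
import torch

t = torch.tensor([3.5])
print(t.item())       # 3.5 — works: exactly one element

total = torch.tensor([1.0, 2.0]).sum()
print(total.item())   # 3.0 — works: sum() reduced the tensor to one element

# torch.tensor([1.0, 2.0]).item() would raise a RuntimeError
```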
Same operation applied to all elements at once
Broadcasting is a powerful mechanism in PyTorch and NumPy that allows you to perform arithmetic operations on tensors of different shapes without manually duplicating data in memory. It makes your code faster and more memory-efficient by vectorizing operations in C rather than Python loops.
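A small sketch of elementwise operations and broadcasting (shapes chosen for illustration):

```python
import torch

x = torch.tensor([1.0, 2.0, 3.0])

# Elementwise: one operation applied to all elements at once
print(x * 2)   # tensor([2., 4., 6.])

# Broadcasting: the smaller tensor is stretched to match, without copying data
m = torch.ones(2, 3)
print(m + x)   # x's shape [3] broadcasts across both rows of m
```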
Problem: Want to apply different adjustments to different features
Solution: Broadcasting automatically expands dimensions
Rule: When one dimension is 1 and the other is larger, PyTorch expands the smaller dimension
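The rule above, as a runnable sketch:

```python
import torch

a = torch.rand(1, 3)   # shape [1, 3]
b = torch.rand(3, 1)   # shape [3, 1]

# Each size-1 dimension expands to match the other tensor's size
c = a + b
print(c.shape)  # torch.Size([3, 3])
```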
Example:
[1, 3] × [3, 1] → both become [3, 3]
[1, 1] × [1, 3] → first becomes [1, 3]
You’ve learned:
“For the things we have to learn before we can do them, we learn by doing them.”
CUE: START THE LAB HERE
“What I cannot create, I do not understand.”
CUE: START THE ASSIGNMENT HERE
In Module 2: Image Classification, we’ll learn: