2. Acknowledgement
● These slides are mainly inspired by the online course offered by Prof. Andrew Ng (Stanford University) on Coursera.
● The slides and videos are available online at:
Coursera: https://www.coursera.org/learn/machine-learning
YouTube: https://www.youtube.com/watch?v=qeHZOdmJvFU&list=PLZ9qNFMHZ-A4rycgrgOYma6zxF4BZGGPW
3. Regression? Which curve better represents the data pattern?
[Three plots of House Price vs. House Size, fitted with:
θ0 + θ1x    |    θ0 + θ1x + θ2x²    |    θ0 + θ1x + θ2x² + θ3x³]
● x = house size
● Which curve better predicts the price of a house?
5. Examples: Regression
● House price prediction
[Plot: House Price (10–30) vs. House Size (5–20)]
7. Examples: Regression
● GPA prediction
[Plot: GPA (1–4) vs. FSc Marks (600–900)]
● Regression: predict a continuous-valued output.
● Supervised learning: the right answer is given for each example.
8. Examples: Regression
● Current prediction
● Reinventing Ohm's Law: V = I×R
[Plot: Current (10–30) vs. Voltage Applied (5–20)]
● Regression: predict a continuous-valued output.
● Supervised learning: the right answer is given for each example.
10. Examples: Regression
● Predicting the score of a rain-affected match, e.g., Duckworth-Lewis
[Plot: Runs Scored in the Remaining Overs (10–30) vs. Overs Remaining (5–20)]
● Regression: predict a continuous-valued output.
● Supervised learning: the right answer is given for each example.
11. Regression? Why not just fit a curve?
[Three plots of House Price vs. House Size, fitted with:
θ0 + θ1x    |    θ0 + θ1x + θ2x²    |    θ0 + θ1x + θ2x² + θ3x³]
● x = house size
● Which curve better predicts the price of a house?
12. Linear Regression: How to choose the line?
[Plot: House Price vs. House Size, with three candidate lines, each of the form θ0 + θ1x]
● How to automatically choose the best line from the infinitely many possible lines?
13. Regression Notation
Training set of housing prices (m rows):

Size in feet² (x) | House Price in 1000$ (y)
2104              | 460
1416              | 232
1534              | 315
...               | ...

● m = number of training examples
● x = input variable / feature
● y = target or output variable
● (x, y) = one training example
● (x(i), y(i)) = the i-th training example
● x(1) = 2104, x(2) = 1416, y(1) = 460
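The notation above can be sketched in code. A minimal example using the three rows from the slide's training set (Python lists are 0-indexed, so x(1) on the slide is `x[0]` here):

```python
# Training set from the slide: size in square feet (x), price in $1000s (y).
training_set = [(2104, 460), (1416, 232), (1534, 315)]

m = len(training_set)                 # m = number of training examples
x = [ex[0] for ex in training_set]    # input variable / feature
y = [ex[1] for ex in training_set]    # target or output variable

print(m)      # 3
print(x[0])   # x(1) = 2104
print(y[0])   # y(1) = 460
```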
15. Regression Notation
[Same training set of housing prices as on the previous slide]
● Hypothesis: hθ(x) = θ0 + θ1x
● θi's: parameters
● How to choose the θi's automatically?
[Plot: House Price vs. House Size with three candidate lines of the form θ0 + θ1x]
16. How to choose θi's automatically?
● Hypothesis: hθ(x) = θ0 + θ1x
● θi's: parameters
● Let's choose θ0 and θ1 so that hθ(x) is close to y for our training examples (x, y)
[Plot: House Price vs. House Size with three candidate lines of the form θ0 + θ1x]
● Cost function (squared error):
  J(θ0, θ1) = (1/2m) Σ_{i=1}^{m} (hθ(x(i)) − y(i))²
            = (1/2m) Σ_{i=1}^{m} (θ0 + θ1x(i) − y(i))²
● Goal: minimize J(θ0, θ1) over θ0, θ1
● Minimize the squared error cost function
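The hypothesis and cost function above translate directly into code. A minimal sketch, using a toy training set where y = x exactly (the same data the later slides use):

```python
def h(theta0, theta1, x):
    """Hypothesis h_theta(x) = theta0 + theta1 * x."""
    return theta0 + theta1 * x

def J(theta0, theta1, xs, ys):
    """Squared-error cost J = (1/2m) * sum over i of (h(x_i) - y_i)^2."""
    m = len(xs)
    return sum((h(theta0, theta1, xi) - yi) ** 2
               for xi, yi in zip(xs, ys)) / (2 * m)

# Toy training set where y = x exactly.
xs, ys = [1, 2, 3], [1, 2, 3]
print(J(0, 1, xs, ys))   # 0.0: the line y = x fits this data perfectly
```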
17. How to choose θi's automatically?
● Hypothesis: hθ(x) = θ0 + θ1x;  θ0, θ1: parameters
● Let's set θ0 = 0
● Simplified hypothesis: hθ(x) = θ1x
● Cost function: J(θ1) = (1/2m) Σ_{i=1}^{m} (θ1x(i) − y(i))²
● Goal: minimize J(θ1) over θ1
[Left plot: training data on the line hθ(x) = x, with the candidate line θ1 = 0.5; right plot: J(θ1) vs. θ1]
● J(0.5) = ((1 − 0.5)² + (2 − 1)² + (3 − 1.5)²)/(2×3) ≈ 0.58
18. How to choose θi's automatically?
● Same simplified hypothesis hθ(x) = θ1x and cost function J(θ1) as on the previous slide
[Left plot: training data on the line hθ(x) = x, with the candidate line θ1 = 1.5; right plot: J(θ1) vs. θ1]
● J(1.5) = ((1 − 1.5)² + (2 − 3)² + (3 − 4.5)²)/(2×3) ≈ 0.58
19. How to choose θi's automatically?
● Same simplified hypothesis hθ(x) = θ1x and cost function J(θ1) as on the previous slides
[Left plot: training data on the line hθ(x) = x, with the candidate line θ1 = 1; right plot: J(θ1) vs. θ1]
● J(1) = ((1 − 1)² + (2 − 2)² + (3 − 3)²)/(2×3) = 0
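The three cost evaluations above can be checked numerically. A short sketch, using the training set {(1,1), (2,2), (3,3)} from the plots:

```python
def J(theta1, xs, ys):
    """Cost for the simplified hypothesis h(x) = theta1 * x."""
    m = len(xs)
    return sum((theta1 * xi - yi) ** 2 for xi, yi in zip(xs, ys)) / (2 * m)

xs, ys = [1, 2, 3], [1, 2, 3]     # training set used on the slides

print(round(J(0.5, xs, ys), 2))   # 0.58
print(round(J(1.5, xs, ys), 2))   # 0.58
print(J(1.0, xs, ys))             # 0.0, the minimum
```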
20. How to choose θi's automatically?
Outline
● Have some function J(θ1)
● Goal: minimize J(θ1) over θ1
● Start with some θ1, e.g., θ1 = 0.5
● Keep changing θ1 to reduce J(θ1) until we reach the minimum
[Left plot: training data with hθ(x) = θ1x; right plot: J(θ1) vs. θ1]
● What counts as the minimum? J(θ1) = 0, or a change smaller than some eps
21. [Plot: J(θ1) vs. θ1, with colored tangent lines at several points]
● Which of the following is true?
● Blue slope (gradient) is negative
● Red slope (gradient) is positive
● Magenta slope is less negative than blue slope
● Yellow slope is close to zero
22. [Plot: J(θ1) vs. θ1, with colored tangent lines at several points]
● Which of the following is true?
● Blue slope (gradient) is negative
● Red slope (gradient) is positive
● Magenta slope is less negative than blue slope
● Yellow slope is close to zero
● If the slope is negative you want to increase θ1
● If the slope is positive you want to decrease θ1
23. Gradient Descent Algorithm
[Plot: J(θ1) vs. θ1, with colored tangent lines at several points]
● If the slope is negative you want to increase θ1
● If the slope is positive you want to decrease θ1
● Update rule: θ1 := θ1 − α · (d/dθ1) J(θ1), with α = 1
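The update rule can be sketched in code. For J(θ1) = (1/2m) Σ (θ1x(i) − y(i))², the derivative is (d/dθ1) J(θ1) = (1/m) Σ (θ1x(i) − y(i))·x(i). The example below uses a smaller learning rate than the slide's α = 1, since on this toy data α = 1 would overshoot:

```python
def dJ_dtheta1(theta1, xs, ys):
    """Derivative of J(theta1) = (1/2m) * sum (theta1*x_i - y_i)^2."""
    m = len(xs)
    return sum((theta1 * xi - yi) * xi for xi, yi in zip(xs, ys)) / m

xs, ys = [1, 2, 3], [1, 2, 3]   # training set from the earlier slides

theta1 = 0.5    # starting point, as in the outline slide
alpha = 0.1     # smaller than the slide's alpha = 1, for stable steps here
for _ in range(100):
    theta1 = theta1 - alpha * dJ_dtheta1(theta1, xs, ys)

print(round(theta1, 4))   # converges to 1.0, the minimizer found earlier
```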
31. What does the cost function J(θ0, θ1) look like?
● Does it matter where we start from?
● Is the solution unique?
[Surface plot of J(θ0, θ1) with a marked start point]
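Both questions can be probed numerically: for linear regression the squared-error cost is a convex bowl, so gradient descent lands on the same minimum from any start. A minimal sketch on the toy data from the earlier slides, running full gradient descent on (θ0, θ1) from two very different starting points:

```python
def step(theta0, theta1, xs, ys, alpha):
    """One gradient-descent step on J(theta0, theta1)."""
    m = len(xs)
    errs = [theta0 + theta1 * xi - yi for xi, yi in zip(xs, ys)]
    g0 = sum(errs) / m                               # dJ/dtheta0
    g1 = sum(e * xi for e, xi in zip(errs, xs)) / m  # dJ/dtheta1
    return theta0 - alpha * g0, theta1 - alpha * g1

xs, ys = [1, 2, 3], [1, 2, 3]

results = []
for t0, t1 in [(10.0, -5.0), (-8.0, 7.0)]:   # two different start points
    for _ in range(5000):
        t0, t1 = step(t0, t1, xs, ys, alpha=0.1)
    results.append((round(t0, 3), round(t1, 3)))

# Both runs converge to theta0 = 0, theta1 = 1: the bowl has one minimum.
print(results)
```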
52. ● Multivariate linear regression: multiple features (x1, x2, …, xn)
● Previously it was univariate linear regression: a single feature x.
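With multiple features, the hypothesis becomes hθ(x) = θ0 + θ1x1 + … + θnxn. A minimal sketch; the feature names and θ values below are made-up for illustration, not from the slides:

```python
def h(theta, x):
    """Multivariate hypothesis h_theta(x) = theta0 + theta1*x1 + ... + thetan*xn.

    theta has n+1 entries; x has n features (the intercept feature x0 = 1
    is implicit in theta[0])."""
    return theta[0] + sum(t * xi for t, xi in zip(theta[1:], x))

# Hypothetical house features: [size in feet^2, bedrooms, age in years].
theta = [80.0, 0.1, 25.0, -2.0]   # made-up parameters for illustration
print(h(theta, [2104, 3, 10]))    # one predicted price in $1000s
```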
58. ● How to craft new features?
● Hand-crafted features
● Is it possible to auto-create new features? Yes.
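One common way to hand-craft features is to build polynomial features from a raw input, which turns the linear model θ0 + θ1x into the quadratic and cubic curves seen on the earlier slides. A minimal sketch (the helper name is my own, not from the slides):

```python
def poly_features(x, degree):
    """Hand-craft polynomial features [x, x^2, ..., x^degree] from one raw feature."""
    return [x ** d for d in range(1, degree + 1)]

# Cubic features let a linear model in theta represent
# theta0 + theta1*x + theta2*x^2 + theta3*x^3.
print(poly_features(2, 3))   # [2, 4, 8]
```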