You can read an overview of this Numerical Linear Algebra course in this blog post (http://www.fast.ai/2017/07/17/num-lin-alg/). The course was originally taught in the University of San Francisco MS in Analytics (https://www.usfca.edu/arts-sciences/graduate-programs/analytics) graduate program. Course lecture videos are available on YouTube (https://www.youtube.com/playlist?list=PLtmWHNX-gukIc92m1K0P6bIOnZb-mg0hY) (note that the notebook numbers and video numbers do not line up, since some notebooks took longer than 1 video to cover).
You can ask questions about the course on our fast.ai forums (http://forums.fast.ai/c/lin-alg).
6. How to Implement Linear Regression
In the previous lesson, we calculated the least squares linear regression for a diabetes dataset using scikit-learn's implementation. Today, we will look at how we could write our own implementation.
In [2]: from sklearn import datasets, linear_model, metrics
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures
import math, scipy, numpy as np
from scipy import linalg
In [3]: np.set_printoptions(precision=6)
In [4]: data = datasets.load_diabetes()
In [5]: feature_names=['age', 'sex', 'bmi', 'bp', 's1', 's2', 's3', 's4', 's5', 's6']
In [6]: trn,test,y_trn,y_test = train_test_split(data.data, data.target, test_size=0.2)
In [7]: trn.shape, test.shape
Out[7]: ((353, 10), (89, 10))
In [8]: def regr_metrics(act, pred):
return (math.sqrt(metrics.mean_squared_error(act, pred)),
metrics.mean_absolute_error(act, pred))
How did sklearn do it?
How is sklearn doing this? By checking the source code (https://github.com/scikit-learn/scikit-learn/blob/14031f6/sklearn/linear_model/base.py#L417), you can see that in the dense case, it calls scipy.linalg.lstsq (https://github.com/scipy/scipy/blob/v0.19.0/scipy/linalg/basic.py#L892-L1058), which calls a LAPACK method:
Options are ``'gelsd'``, ``'gelsy'``, ``'gelss'``. Default (``'gelsd'``) is a good choice. However, ``'gelsy'`` can be slightly faster on many problems. ``'gelss'`` was used historically. It is generally slow but uses less memory.
gelsd (https://software.intel.com/sites/products/documentation/doclib/mkl_sa/11/mkl_lapack_e): uses SVD and a divide-and-conquer method
gelsy (https://software.intel.com/en-us/node/521113): uses QR factorization
gelss (https://software.intel.com/en-us/node/521114): uses SVD
Scipy Sparse Least Squares
We will not get into too much detail about the sparse version of least squares. Here is a bit of info if you are interested:
Scipy sparse lsqr (https://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.sparse.linalg.lsqr.html#id1) uses an iterative method called Golub and Kahan bidiagonalization (https://web.stanford.edu/class/cme324/paige-saunders2.pdf).
from scipy sparse lsqr source code (https://github.com/scipy/scipy/blob/v0.14.0/scipy/sparse/linalg/isolve/lsqr.py#L96):
Preconditioning is another way to reduce the number of iterations. If it is possible to solve
a related system M*x = b efficiently, where M approximates A in some helpful way (e.g.
M - A has low rank or its elements are small relative to those of A), LSQR may converge
more rapidly on the system A*M(inverse)*z = b , after which x can be recovered by
solving M*x = z.
If A is symmetric, LSQR should not be used! Alternatives are the symmetric conjugate-
gradient method (cg) and/or SYMMLQ. SYMMLQ is an implementation of symmetric cg
that applies to any symmetric A and will converge more rapidly than LSQR. If A is positive
definite, there are other implementations of symmetric cg that require slightly less work
per iteration than SYMMLQ (but will take the same number of iterations).
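As a small illustration (not from the original notebook), here is a minimal sketch of calling scipy's sparse lsqr on a random sparse system; the matrix size and density are arbitrary choices for demonstration:

import numpy as np
from scipy import sparse
from scipy.sparse.linalg import lsqr

m, n = 5000, 100
A = sparse.random(m, n, density=0.01, format='csr', random_state=0)  # random sparse matrix
x_true = np.random.randn(n)
b = A @ x_true + 0.01 * np.random.randn(m)

x_hat, istop, itn = lsqr(A, b)[:3]   # iterative solve via Golub-Kahan bidiagonalization
itn, np.linalg.norm(x_hat - x_true)  # number of iterations, error vs. the true coefficients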
linalg.lstsq
The sklearn implementation handled adding a constant term for us (since the y-intercept is presumably not 0 for the line we are learning). We will need to do that by hand now:
In [9]: trn_int = np.c_[trn, np.ones(trn.shape[0])]
test_int = np.c_[test, np.ones(test.shape[0])]
Since linalg.lstsq lets us specify which LAPACK routine we want to use, let's try them all and do some timing comparisons:
In [10]: %timeit coef, _,_,_ = linalg.lstsq(trn_int, y_trn, lapack_driver="gelsd")
290 µs ± 9.24 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
In [11]: %timeit coef, _,_,_ = linalg.lstsq(trn_int, y_trn, lapack_driver="gelsy")
140 µs ± 91.7 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
In [12]: %timeit coef, _,_,_ = linalg.lstsq(trn_int, y_trn, lapack_driver="gelss")
199 µs ± 228 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
Naive Solution
Recall that we want to find $\hat{x}$ that minimizes:
$$ \big\vert\big\vert Ax - b \big\vert\big\vert_2 $$
Another way to think about this is that we are interested in where the vector $b$ is closest to the subspace spanned by $A$ (called the range of $A$). This is the projection of $b$ onto $A$. Since $b - A\hat{x}$ must be perpendicular to the subspace spanned by $A$, we see that
$$ A^T (b - A\hat{x}) = 0 $$
(we are using $A^T$ because we want to multiply each column of $A$ by $b - A\hat{x}$).
This leads us to the normal equations:
$$ x = (A^T A)^{-1} A^T b $$
In [13]: def ls_naive(A, b):
return np.linalg.inv(A.T @ A) @ A.T @ b
In [14]: %timeit coeffs_naive = ls_naive(trn_int, y_trn)
45.8 µs ± 4.65 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
In [15]: coeffs_naive = ls_naive(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_naive)
Out[15]: (57.94102134545707, 48.053565198516438)
Normal Equations (Cholesky)
Normal equations:
$$ A^T A x = A^T b $$
If $A$ has full rank, the pseudo-inverse of $A$ is $(A^T A)^{-1} A^T$, and $A^T A$ is a square, hermitian positive definite matrix. The standard way of solving such a system is Cholesky factorization, which finds an upper-triangular $R$ such that $A^T A = R^T R$.
The following steps are based on Algorithm 11.1 from Trefethen:
In [16]: A = trn_int
In [17]: b = y_trn
In [18]: AtA = A.T @ A
Atb = A.T @ b
Warning: Numpy and Scipy default to different upper/lower for Cholesky
In [19]: R = scipy.linalg.cholesky(AtA)
In [196… np.set_printoptions(suppress=True, precision=4)
R
Out[196]: array([[ 0.9124, 0.1438, 0.1511, 0.3002, 0.2228, 0.188 ,
-0.051 , 0.1746, 0.22 , 0.2768, -0.2583],
[ 0. , 0.8832, 0.0507, 0.1826, -0.0251, 0.0928,
-0.3842, 0.2999, 0.0911, 0.15 , 0.4393],
[ 0. , 0. , 0.8672, 0.2845, 0.2096, 0.2153,
-0.2695, 0.3181, 0.3387, 0.2894, -0.005 ],
[ 0. , 0. , 0. , 0.7678, 0.0762, -0.0077,
0.0383, 0.0014, 0.165 , 0.166 , 0.0234],
[ 0. , 0. , 0. , 0. , 0.8288, 0.7381,
0.1145, 0.4067, 0.3494, 0.158 , -0.2826],
[ 0. , 0. , 0. , 0. , 0. , 0.3735,
-0.3891, 0.2492, -0.3245, -0.0323, -0.1137],
[ 0. , 0. , 0. , 0. , 0. , 0. ,
0.6406, -0.511 , -0.5234, -0.172 , -0.9392],
[ 0. , 0. , 0. , 0. , 0. , 0. ,
0. , 0.2887, -0.0267, -0.0062, 0.0643],
[ 0. , 0. , 0. , 0. , 0. , 0. ,
0. , 0. , 0.2823, 0.0636, 0.9355],
[ 0. , 0. , 0. , 0. , 0. , 0. ,
0. , 0. , 0. , 0.7238, 0.0202],
[ 0. , 0. , 0. , 0. , 0. , 0. ,
0. , 0. , 0. , 0. , 18.7319]])
check our factorization:
In [20]: np.linalg.norm(AtA - R.T @ R)
Out[20]: 4.5140158187158533e-16
$$ A^T A x = A^T b $$
$$ R^T R x = A^T b $$
$$ R^T w = A^T b $$
$$ R x = w $$
In [21]: w = scipy.linalg.solve_triangular(R, Atb, lower=False, trans='T')
It's always good to check that our result is what we expect it to be (in case we entered the wrong params, the function didn't return what we thought, or the docs are outdated):
In [22]: np.linalg.norm(R.T @ w - Atb)
Out[22]: 1.1368683772161603e-13
In [23]: coeffs_chol = scipy.linalg.solve_triangular(R, w, lower=False)
In [24]: np.linalg.norm(R @ coeffs_chol - w)
Out[24]: 6.861429794408013e-14
In [25]: def ls_chol(A, b):
R = scipy.linalg.cholesky(A.T @ A)
w = scipy.linalg.solve_triangular(R, A.T @ b, trans='T')
return scipy.linalg.solve_triangular(R, w)
In [26]: %timeit coeffs_chol = ls_chol(trn_int, y_trn)
111 µs ± 272 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
In [27]: coeffs_chol = ls_chol(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_chol)
Out[27]: (57.9410213454571, 48.053565198516438)
QR Factorization
$$ Ax = b $$
$$ A = QR $$
$$ QRx = b $$
$$ Rx = Q^T b $$
In [28]: def ls_qr(A,b):
Q, R = scipy.linalg.qr(A, mode='economic')
return scipy.linalg.solve_triangular(R, Q.T @ b)
In [29]: %timeit coeffs_qr = ls_qr(trn_int, y_trn)
205 µs ± 264 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
In [30]: coeffs_qr = ls_qr(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_qr)
Out[30]: (57.94102134545711, 48.053565198516452)
SVD
$$ Ax = b $$
$$ A = U \Sigma V^T $$
$$ \Sigma V^T x = U^T b $$
$$ \Sigma w = U^T b $$
$$ x = V w $$
SVD gives the pseudo-inverse
In [253… def ls_svd(A,b):
m, n = A.shape
U, sigma, Vh = scipy.linalg.svd(A, full_matrices=False, lapack_driver='gesdd')
w = (U.T @ b)/ sigma
return Vh.T @ w
In [32]: %timeit coeffs_svd = ls_svd(trn_int, y_trn)
1.11 ms ± 320 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
In [254… %timeit coeffs_svd = ls_svd(trn_int, y_trn)
266 µs ± 8.49 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
In [255… coeffs_svd = ls_svd(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_svd)
Out[255]: (57.941021345457244, 48.053565198516687)
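As a quick check of the claim above that the SVD gives the pseudo-inverse, the following cell (not in the original notebook; it assumes trn_int and y_trn from the earlier cells) compares ls_svd against np.linalg.pinv:

coeffs_pinv = np.linalg.pinv(trn_int) @ y_trn      # np.linalg.pinv is computed from the SVD
np.allclose(coeffs_pinv, ls_svd(trn_int, y_trn))   # expect True, up to floating point error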
Random Sketching Technique for Least Squares Regression
Linear Sketching (http://researcher.watson.ibm.com/researcher/files/us-dpwoodru/journal.pdf) (Woodruff)
1. Sample an r x n random matrix S, r << n (here n is the number of rows of A)
2. Compute S A and S b
3. Find exact solution x to the regression SA x = Sb
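A minimal sketch of this idea (not from the original notebook), using a Gaussian random matrix for S and the ls_qr solver defined above for the small sketched problem; the name ls_sketch and the sketch size heuristic are arbitrary choices for illustration:

def ls_sketch(A, b, r=None):
    # Sketched least squares: solve the much smaller problem (SA)x = Sb exactly
    m, n = A.shape
    if r is None:
        r = 5 * n                              # heuristic sketch size, r << m
    S = np.random.randn(r, m) / np.sqrt(r)     # Gaussian sketching matrix
    return ls_qr(S @ A, S @ b)

coeffs_sketch = ls_sketch(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_sketch)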
Timing Comparison
In [244… import timeit
import pandas as pd
In [245… def scipylstq(A, b):
return scipy.linalg.lstsq(A,b)[0]
In [246… row_names = ['Normal Eqns- Naive',
'Normal Eqns- Cholesky',
'QR Factorization',
'SVD',
'Scipy lstsq']
name2func = {'Normal Eqns- Naive': 'ls_naive',
'Normal Eqns- Cholesky': 'ls_chol',
'QR Factorization': 'ls_qr',
'SVD': 'ls_svd',
'Scipy lstsq': 'scipylstq'}
In [247… m_array = np.array([100, 1000, 10000])
n_array = np.array([20, 100, 1000])
In [248… index = pd.MultiIndex.from_product([m_array, n_array], names=['# rows', '# cols'])
In [249… pd.options.display.float_format = '{:,.6f}'.format
df = pd.DataFrame(index=row_names, columns=index)
df_error = pd.DataFrame(index=row_names, columns=index)
In [256… # %%prun
for m in m_array:
for n in n_array:
if m >= n:
x = np.random.uniform(-10,10,n)
A = np.random.uniform(-40,40,[m,n]) # removed np.asfortranarray
b = np.matmul(A, x) + np.random.normal(0,2,m)
for name in row_names:
fcn = name2func[name]
t = timeit.timeit(fcn + '(A,b)', number=5, globals=globals())
df.set_value(name, (m,n), t)
coeffs = locals()[fcn](A, b)
reg_met = regr_metrics(b, A @ coeffs)
df_error.set_value(name, (m,n), reg_met[0])
In [257… df
Out[257]:
# rows                      100                         1000                        10000
# cols                 20       100      1000      20       100      1000      20       100
Normal Eqns- Naive     0.001276 0.003634 NaN       0.000960 0.005172 0.293126  0.002226 0.021248
Normal Eqns- Cholesky  0.001660 0.003958 NaN       0.001665 0.004007 0.093696  0.001928 0.010456
QR Factorization       0.002174 0.006486 NaN       0.004235 0.017773 0.213232  0.019229 0.116122
SVD                    0.003880 0.021737 NaN       0.004672 0.026950 1.280490  0.018138 0.130652
Scipy lstsq            0.004338 0.020198 NaN       0.004320 0.021199 1.083804  0.012200 0.088467
In [252… df_error
Out[252]:
# rows                      100                         1000                        10000
# cols                 20       100      1000      20       100      1000      20       100
Normal Eqns- Naive     1.702742 0.000000 NaN       1.970767 1.904873 0.000000  1.978383 1.980449
Normal Eqns- Cholesky  1.702742 0.000000 NaN       1.970767 1.904873 0.000000  1.978383 1.980449
QR Factorization       1.702742 0.000000 NaN       1.970767 1.904873 0.000000  1.978383 1.980449
SVD                    1.702742 0.000000 NaN       1.970767 1.904873 0.000000  1.978383 1.980449
Scipy lstsq            1.702742 0.000000 NaN       1.970767 1.904873 0.000000  1.978383 1.980449
In [618… store = pd.HDFStore('least_squares_results.h5')
In [619… store['df'] = df
C:\Users\rache\Anaconda3\lib\site-packages\IPython\core\interactiveshell.py:2881: PerformanceWarning:
your performance may suffer as PyTables will pickle object types that it cannot
map directly to c-types [inferred_type->floating,key->block0_values] [items->
[(100, 20), (100, 100), (100, 1000), (1000, 20), (1000, 100), (1000, 1000),
(5000, 20), (5000, 100), (5000, 1000)]]
exec(code_obj, self.user_global_ns, self.user_ns)
Notes
I used the magic %prun to profile my code.
Alternative: least absolute deviation (L1 regression)
Less sensitive to outliers than least squares.
No closed form solution, but can solve with linear programming
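For illustration (this is not part of the course notebook), here is a minimal sketch of L1 regression as a linear program with scipy.optimize.linprog; the helper name ls_l1 is made up, the reformulation minimizes the sum of slack variables t subject to -t <= Ax - b <= t, and the cell assumes trn_int, y_trn, test_int, y_test, and regr_metrics from earlier:

import numpy as np
from scipy.optimize import linprog

def ls_l1(A, b):
    # Least absolute deviations: minimize sum(t) subject to -t <= Ax - b <= t
    m, n = A.shape
    c = np.concatenate([np.zeros(n), np.ones(m)])    # objective: sum of the slack variables t
    A_ub = np.block([[ A, -np.eye(m)],               #  Ax - t <= b
                     [-A, -np.eye(m)]])              # -Ax - t <= -b
    b_ub = np.concatenate([b, -b])
    bounds = [(None, None)] * n + [(0, None)] * m    # x free, t >= 0
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
    return res.x[:n]

coeffs_l1 = ls_l1(trn_int, y_trn)
regr_metrics(y_test, test_int @ coeffs_l1)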
Conditioning & stability
Condition Number
The condition number is a measure of how much the output changes in response to small changes in the input.
Question: Why do we care about behavior with small changes to the input in numerical
linear algebra?
The relative condition number is defined by
$$ \kappa = \sup_{\delta x} \frac{\left\| \delta f \right\| / \left\| f(x) \right\|}{\left\| \delta x \right\| / \left\| x \right\|} $$
where $\delta x$ is infinitesimal.
According to Trefethen (pg. 91), a problem is well-conditioned if $\kappa$ is small (e.g. $1$, $10$, $10^2$) and ill-conditioned if $\kappa$ is large (e.g. $10^6$, $10^{16}$).
Conditioning: perturbation behavior of a mathematical problem (e.g. least squares)
Stability: perturbation behavior of an algorithm used to solve that problem on a computer (e.g. least squares algorithms, Householder, back substitution, Gaussian elimination)
Conditioning example
The problem of computing eigenvalues of a non-symmetric matrix is often ill-conditioned
In [178… A = [[1, 1000], [0, 1]]
B = [[1, 1000], [0.001, 1]]
In [179… wA, vrA = scipy.linalg.eig(A)
wB, vrB = scipy.linalg.eig(B)
In [180… wA, wB
Out[180]: (array([ 1.+0.j, 1.+0.j]),
array([ 2.00000000e+00+0.j, -2.22044605e-16+0.j]))
Condition Number of a Matrix
The product $\| A \| \, \| A^{-1} \|$ comes up so often it has its own name: the condition number of $A$. Note that normally we talk about the conditioning of problems, not matrices.
The condition number of $A$ relates to:
computing $b$ given $A$ and $x$ in $Ax = b$
computing $x$ given $A$ and $b$ in $Ax = b$
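A quick numerical check of this definition (not in the original notebook), using the same matrix as in the eigenvalue example above:

A = np.array([[1., 1000.], [0., 1.]])
np.linalg.norm(A, 2) * np.linalg.norm(np.linalg.inv(A), 2), np.linalg.cond(A, 2)
# the two values agree: kappa(A) = ||A|| ||A^{-1}||, roughly 1e6 for this A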
Loose ends from last time
Full vs Reduced Factorizations
SVD
Diagrams from Trefethen:
QR Factorization exists for ALL matrices
Just like with SVD, there are full and reduced versions of the QR factorization.
Matrix Inversion is Unstable
In [197… from scipy.linalg import hilbert
In [229… n = 14
A = hilbert(n)
x = np.random.uniform(-10,10,n)
b = A @ x
In [230… A_inv = np.linalg.inv(A)
In [233… np.linalg.norm(np.eye(n) - A @ A_inv)
Out[233]: 5.0516495470543212
In [231… np.linalg.cond(A)
Out[231]: 2.2271635826494112e+17
In [232… A @ A_inv
Out[232]: array([[ 1. , 0. , -0.0001, 0.0005, -0.0006, 0.0105, -0.0243,
0.1862, -0.6351, 2.2005, -0.8729, 0.8925, -0.0032, -0.0106],
[ 0. , 1. , -0. , 0. , 0.0035, 0.0097, -0.0408,
0.0773, -0.0524, 1.6926, -0.7776, -0.111 , -0.0403, -0.0184],
[ 0. , 0. , 1. , 0.0002, 0.0017, 0.0127, -0.0273,
0. , 0. , 1.4688, -0.5312, 0.2812, 0.0117, 0.0264],
[ 0. , 0. , -0. , 1.0005, 0.0013, 0.0098, -0.0225,
0.1555, -0.0168, 1.1571, -0.9656, -0.0391, 0.018 , -0.0259],
[-0. , 0. , -0. , 0.0007, 1.0001, 0.0154, 0.011 ,
-0.2319, 0.5651, -0.2017, 0.2933, -0.6565, 0.2835, -0.0482],
[ 0. , -0. , 0. , -0.0004, 0.0059, 0.9945, -0.0078,
-0.0018, -0.0066, 1.1839, -0.9919, 0.2144, -0.1866, 0.0187],
[-0. , 0. , -0. , 0.0009, -0.002 , 0.0266, 0.974 ,
-0.146 , 0.1883, -0.2966, 0.4267, -0.8857, 0.2265, -0.0453],
[ 0. , 0. , -0. , 0.0002, 0.0009, 0.0197, -0.0435,
1.1372, -0.0692, 0.7691, -1.233 , 0.1159, -0.1766, -0.0033],
[ 0. , 0. , -0. , 0.0002, 0. , -0.0018, -0.0136,
0.1332, 0.945 , 0.3652, -0.2478, -0.1682, 0.0756, -0.0212],
[ 0. , -0. , -0. , 0.0003, 0.0038, -0.0007, 0.0318,
-0.0738, 0.2245, 1.2023, -0.2623, -0.2783, 0.0486, -0.0093],
[-0. , 0. , -0. , 0.0004, -0.0006, 0.013 , -0.0415,
0.0292, -0.0371, 0.169 , 1.0715, -0.09 , 0.1668, -0.0197],
[ 0. , -0. , 0. , 0. , 0.0016, 0.0062, -0.0504,
0.1476, -0.2341, 0.8454, -0.7907, 1.4812, -0.15 , 0.0186],
[ 0. , -0. , 0. , -0.0001, 0.0022, 0.0034, -0.0296,
0.0944, -0.1833, 0.6901, -0.6526, 0.2556, 0.8563, 0.0128],
[ 0. , 0. , 0. , -0.0001, 0.0018, -0.0041, -0.0057,
-0.0374, -0.165 , 0.3968, -0.2264, -0.1538, -0.0076, 1.005 ]])
In [237… row_names = ['Normal Eqns- Naive',
'QR Factorization',
'SVD',
'Scipy lstsq']
name2func = {'Normal Eqns- Naive': 'ls_naive',
'QR Factorization': 'ls_qr',
'SVD': 'ls_svd',
'Scipy lstsq': 'scipylstq'}
In [238… pd.options.display.float_format = '{:,.9f}'.format
df = pd.DataFrame(index=row_names, columns=['Time', 'Error'])
In [239… for name in row_names:
fcn = name2func[name]
t = timeit.timeit(fcn + '(A,b)', number=5, globals=globals())
coeffs = locals()[fcn](A, b)
df.set_value(name, 'Time', t)
df.set_value(name, 'Error', regr_metrics(b, A @ coeffs)[0])
SVD is best here!
DO NOT RERUN
In [240… df
Out[240]: Time Error
Normal Eqns- Naive 0.001334339 3.598901966
QR Factorization 0.002166139 0.000000000
SVD 0.001556937 0.000000000
Scipy lstsq 0.001871590 0.000000000
Another reason not to take inverse
Even if $A$ is incredibly sparse, $A^{-1}$ is generally dense. For large matrices, $A^{-1}$ could be so dense as to not fit in memory.
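A small demonstration of this (not from the original notebook), using a sparse tridiagonal matrix whose inverse is fully dense; the size is arbitrary:

from scipy.sparse import diags

k = 1000
T = diags([-1., 2., -1.], [-1, 0, 1], shape=(k, k))   # sparse tridiagonal matrix
T.nnz / k**2                                          # about 0.3% of the entries are nonzero
T_inv = np.linalg.inv(T.toarray())
np.mean(np.abs(T_inv) > 1e-12)                        # essentially every entry of the inverse is nonzero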
Runtime
Matrix Inversion: $2n^3$
Matrix Multiplication: $n^3$
Cholesky: $\frac{1}{3} n^3$
QR, Gram Schmidt: $2mn^2$, $m \geq n$ (chapter 8 of Trefethen)
QR, Householder: $2mn^2 - \frac{2}{3} n^3$ (chapter 10 of Trefethen)
Solving a triangular system: $n^2$
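As a rough check of these operation counts (not from the original notebook; exact timings depend on your BLAS/LAPACK build and hardware), Cholesky on a symmetric positive definite matrix should be noticeably cheaper than a QR factorization or an explicit inverse of the same matrix:

import timeit

k = 1000
M = np.random.randn(k, k)
AtA = M.T @ M                                   # symmetric positive definite

for name, fn in [('inv',      lambda: np.linalg.inv(AtA)),
                 ('cholesky', lambda: scipy.linalg.cholesky(AtA)),
                 ('qr',       lambda: scipy.linalg.qr(AtA))]:
    print(name, timeit.timeit(fn, number=5))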
Why Cholesky Factorization is Fast:
(source: Stanford Convex Optimization: Numerical Linear Algebra Background Slides (http://stanford.edu/class/ee364a/lectures/num-lin-alg.pdf))
A Case Where QR is the Best
In [65]: m=100
n=15
t=np.linspace(0, 1, m)
In [66]: # Vandermonde matrix
A=np.stack([t**i for i in range(n)], 1)
In [67]: b=np.exp(np.sin(4*t))
# This will turn out to normalize the solution to be 1
b /= 2006.787453080206
In [68]: from matplotlib import pyplot as plt
%matplotlib inline
In [69]: plt.plot(t, b)
Out[69]: [<matplotlib.lines.Line2D at 0x7fdfc1fa7eb8>]
Check that we get 1:
In [58]: 1 - ls_qr(A, b)[14]
Out[58]: 1.4137685733217609e-07
Bad condition number:
In [60]: kappa = np.linalg.cond(A); kappa
Out[60]: 5.827807196683593e+17
In [181… row_names = ['Normal Eqns- Naive',
'QR Factorization',
'SVD',
'Scipy lstsq']
name2func = {'Normal Eqns- Naive': 'ls_naive',
'QR Factorization': 'ls_qr',
'SVD': 'ls_svd',
'Scipy lstsq': 'scipylstq'}
In [74]: pd.options.display.float_format = '{:,.9f}'.format
df = pd.DataFrame(index=row_names, columns=['Time', 'Error'])
In [75]: for name in row_names:
fcn = name2func[name]
t = timeit.timeit(fcn + '(A,b)', number=5, globals=globals())
coeffs = locals()[fcn](A, b)
df.set_value(name, 'Time', t)
df.set_value(name, 'Error', np.abs(1 - coeffs[-1]))
In [76]: df
Out[76]: Time Error
Normal Eqns- Naive 0.001565099 1.357066025
QR Factorization 0.002632104 0.000000116
SVD 0.003503785 0.000000116
Scipy lstsq 0.002763502 0.000000116
The solution for least squares via the normal equations is unstable in general, although
stable for problems with small condition numbers.
Low-rank
In [258… m = 100
n = 10
x = np.random.uniform(-10,10,n)
A2 = np.random.uniform(-40,40, [m, int(n/2)]) # removed np.asfortranarray
A = np.hstack([A2, A2])
In [259… A.shape, A2.shape
Out[259]: ((100, 10), (100, 5))
In [260… b = A @ x + np.random.normal(0,1,m)
In [263… row_names = ['Normal Eqns- Naive',
'QR Factorization',
'SVD',
'Scipy lstsq']
name2func = {'Normal Eqns- Naive': 'ls_naive',
'QR Factorization': 'ls_qr',
'SVD': 'ls_svd',
'Scipy lstsq': 'scipylstq'}
In [264… pd.options.display.float_format = '{:,.9f}'.format
df = pd.DataFrame(index=row_names, columns=['Time', 'Error'])
In [265… for name in row_names:
fcn = name2func[name]
t = timeit.timeit(fcn + '(A,b)', number=5, globals=globals())
coeffs = locals()[fcn](A, b)
df.set_value(name, 'Time', t)
df.set_value(name, 'Error', regr_metrics(b, A @ coeffs)[0])
In [266… df
Out[266]: Time Error
Normal Eqns- Naive 0.001227640 300.658979382
QR Factorization 0.002315920 0.876019803
SVD 0.001745647 1.584746056
Scipy lstsq 0.002067989 0.804750398
Comparison
Our results from above:
In [257… df
Out[257]:
# rows                      100                         1000                        10000
# cols                 20       100      1000      20       100      1000      20       100
Normal Eqns- Naive     0.001276 0.003634 NaN       0.000960 0.005172 0.293126  0.002226 0.021248
Normal Eqns- Cholesky  0.001660 0.003958 NaN       0.001665 0.004007 0.093696  0.001928 0.010456
QR Factorization       0.002174 0.006486 NaN       0.004235 0.017773 0.213232  0.019229 0.116122
SVD                    0.003880 0.021737 NaN       0.004672 0.026950 1.280490  0.018138 0.130652
Scipy lstsq            0.004338 0.020198 NaN       0.004320 0.021199 1.083804  0.012200 0.088467
From Trefethen (page 84):
Normal equations/Cholesky is fastest when it works. Cholesky can only be used on symmetric, positive definite matrices. Also, normal equations/Cholesky is unstable for matrices with high condition numbers or that are low-rank.
Linear regression via QR has been recommended by numerical analysts as the standard
method for years. It is natural, elegant, and good for "daily use".