Introduction to Machine Learning


Homework 1 -- Numpy and ML

Due: Wednesday, February 15, 2023 at 11:00 PM

Welcome to your first homework! Homeworks are designed to be our primary teaching and learning mechanism, with conceptual, math, and coding questions that are designed to highlight the critical ideas in this course. You may choose to tackle the questions in any order, but the homeworks are designed to be followed sequentially. Often, insights from the early problems will help with the later ones.

You have 'free checking'! That means you can check and submit your answers as many times as you want. Your best submission (the one that gives you the most points, taking into account correctness and lateness) is the one that counts, so you don't have to worry about resubmitting.

After submitting your answers, even if you have gotten a perfect score, we highly encourage you to hit 'View Answer' to look at the staff solution. You may find the staff solutions approached the problems in a different way than you did, which can yield additional insight. Be sure you have gotten your points before hitting 'View Answer', however. You will not be allowed to submit again after viewing the answer.

Each week, we'll provide a Colab notebook for you to use to draft and debug your solutions to coding problems (you have better editing and debugging tools there); but you should submit your final solutions here to claim your points.

This week's Colab notebook can be found here: HW01 Colab Notebook.

The homework comes in two parts:

  1. Learning to use numpy
  2. Introduction to linear regression
Numpy

Machine learning algorithms almost always boil down to matrix computations, so we'll need a way to efficiently work with matrices.

numpy is a package for doing a variety of numerical computations in Python that supports writing very compact and efficient code for handling arrays of data. It is used extensively in many fields requiring numerical analysis, so it is worth getting to know.

We will start every code file that uses numpy with import numpy as np, so that we can reference numpy functions with the np. prefix. The fundamental data type in numpy is the multidimensional array, and arrays are usually generated from a nested list of values using the np.array command. Every array has a shape attribute, which is a tuple of dimension sizes.

In this class, we will use two-dimensional arrays almost exclusively. That is, we will use 2D arrays to represent both matrices and vectors! This is one of several times where we will seem to be unnecessarily fussy about how we construct and manipulate vectors and matrices, but it will make it easier to catch errors in our code. Even though [[1,2,3]] and [1,2,3] may look the same to us, numpy functions can behave differently depending on which format you use. The first has two dimensions (it's a list of lists), while the second has only one (it's a single list). Using only 2D arrays for both matrices and vectors gives us predictable results from numpy operations.
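A quick check of the shape attribute makes this difference concrete (a small sketch, not part of the assignment):

```python
import numpy as np

# A 1D array (a single list) versus a 2D array (a list of lists)
a = np.array([1, 2, 3])
b = np.array([[1, 2, 3]])

print(a.shape)  # (3,)  -- one dimension
print(b.shape)  # (1, 3) -- two dimensions: 1 row, 3 columns
```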

Using 2D arrays for matrices is clear enough, but what about column and row vectors? We will represent a column vector as a d\times 1 array and a row vector as a 1\times d array. So for example, we will represent the three-element column vector,

x = \left[ \begin{array}{c} 1 \\ 5 \\ 3 \\ \end{array} \right],
as a 3 \times 1 numpy array. This array can be generated with

~~~ x = np.array([[1],[5],[3]])

or by using the transpose of a 1 \times 3 array (a row vector) as in,

~~~ x = np.transpose(np.array([[1,5,3]]))

where you should take note of the "double" brackets.

It is often more convenient to use the array attribute .T, as in

~~~ x = np.array([[1,5,3]]).T

to compute the transpose.
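Putting the three constructions above together, a quick sanity check (using the same example vector) confirms they all produce the same 3 x 1 column vector:

```python
import numpy as np

# Three equivalent ways to build the 3x1 column vector [1, 5, 3]^T
x1 = np.array([[1], [5], [3]])            # nested lists, one inner list per row
x2 = np.transpose(np.array([[1, 5, 3]]))  # transpose of a 1x3 row vector
x3 = np.array([[1, 5, 3]]).T              # the .T attribute, same result

print(x1.shape)  # (3, 1)
print((x1 == x2).all() and (x2 == x3).all())  # True
```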

Before you begin, note that in this assignment we will not accept answers that use for or while loops. One reason for avoiding loops is efficiency: for many operations, numpy calls a compiled library written in C, and the library is far faster than interpreted Python (in part due to the low-level nature of C, optimizations like vectorization, and in some cases, parallelization). But the more important reason for avoiding loops is that using numpy library calls leads to simpler code that is easier to debug. So we expect you to be able to transform loop operations into equivalent operations on numpy arrays, and we will practice this in this assignment.

Of course, there will be more complex algorithms that require loops, but when manipulating matrices you should always look for a solution without loops.
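As a small illustration of replacing a loop with an array operation (a sketch, not part of the assignment), here is a sum of squares computed both ways:

```python
import numpy as np

v = np.array([[1.0, 2.0, 3.0, 4.0]])  # a 1x4 row vector

# Loop version: sum of squares, element by element (avoid this style)
total_loop = 0.0
for i in range(v.shape[1]):
    total_loop += v[0, i] ** 2

# Vectorized version: one numpy call, no explicit loop
total_np = np.sum(v ** 2)

print(total_loop, total_np)  # both 30.0
```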

You can find general documentation on numpy here.

Numpy functions and features you should be familiar with for this assignment:

Note that in Python, np.dot(a, b) is the matrix product a @ b, not the dot product a^T b.
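To see the distinction, compare np.dot (the matrix product, same as @) with * (the elementwise product) on small 2D arrays:

```python
import numpy as np

a = np.array([[1, 2],
              [3, 4]])
b = np.array([[5, 6],
              [7, 8]])

print(np.dot(a, b))  # matrix product: [[19, 22], [43, 50]]
print(a @ b)         # identical to np.dot(a, b)
print(a * b)         # elementwise product: [[5, 12], [21, 32]] -- a different operation
```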

If you're unfamiliar with numpy and want to see some examples of how to use it, please see this link: Numpy Overview.

Array Basics

Creating Arrays

Provide an expression that sets A to be a 2 \times 3 numpy array (2 rows by 3 columns), containing any values you wish.

Transpose

Write a procedure that takes an array and returns the transpose of the array. You can use np.transpose or the property T, but you may not use loops.

Note: as with other coding problems in 6.390 you do not need to call the procedure; it will be called/tested when submitted.

Shapes

Hint: If you get stuck, code and run these expressions (with array values of your choosing), then print out the shape using A.shape

Let A be a 4\times 2 numpy array, B be a 4\times 3 array, and C be a 4\times 1 array. For each of the following expressions, indicate the shape of the result as a tuple of integers (recall python tuples use parentheses, not square brackets, which are for lists, and a tuple with just one item x in it is written as (x,) with a comma). Write "none" (as a Python string with quotes) if the expression is illegal.

For example,

  • If the result array was [45, 36, 75], the shape is (3,)
  • If the result array was [[1,2,3],[4,5,6]], the shape is (2,3)

np.array([1,2,3])

np.array([[1,2,3]])

Reminder: A is 4\times 2, B is 4\times 3, and C is 4\times 1.

C*C

Reminder: A is 4\times 2, B is 4\times 3, and C is 4\times 1.

np.dot(C, C)

Reminder: A is 4\times 2, B is 4\times 3, and C is 4\times 1.

np.dot(np.transpose(C), C)

Reminder: A is 4\times 2, B is 4\times 3, and C is 4\times 1.

np.dot(A, B)

Reminder: A is 4\times 2, B is 4\times 3, and C is 4\times 1.

np.dot(A.T, B)

Hint: for more compact and legible code, use @ for matrix multiplication instead of np.dot. If A and B are matrices (2D numpy arrays), then A @ B is equivalent to np.dot(A, B).

Indexing vs. Slicing

The shape of the resulting array differs depending on whether you use indexing or slicing. Indexing refers to selecting particular elements of an array by using a single number (the index) to specify a particular row or column. Slicing refers to selecting a subset of the array by specifying a range of indices.

If you're unfamiliar with these terms, and the indexing and slicing rules of arrays, please see the indexing and slicing sections of this link: Numpy Overview (Same as the Numpy Overview link from the introduction). You can also look at the official numpy documentation here.
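As a warm-up with a different array (so as not to spoil the questions below), note how indexing an axis drops that dimension while slicing it keeps the dimension:

```python
import numpy as np

M = np.array([[10, 20, 30],
              [40, 50, 60]])

print(M[0, 1])            # 20     -- indexing both axes gives a scalar
print(M[0, 1:2])          # [20]   -- indexing one axis, slicing the other: 1D array
print(M[0:1, 1:2])        # [[20]] -- slicing both axes keeps a 2D array
print(M[0:1, 1:2].shape)  # (1, 1)
```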

In the following questions, let A = np.array([[5,7,10,14],[2,4,8,9]]). Tell us what the output would be for each of the following expressions. Use brackets [] as necessary. If the operation is invalid, write the python string "none".

Note: Remember that Python uses zero-indexing and thus starts counting from 0, not 1. This is different from R and MATLAB.

Indexing

A[1,2] =

Indexing, revisited

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[1,8] =

Slicing

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[0:1,1:3] =

Slicing, revisited

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[0:1,1:20] =

Lone Colon Slicing

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[1:,:2] =

Combining Indexing and Slicing

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[1,1:3] =

Combining Indexing and Slicing, revisited

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[:, 1:2] =

Combining Indexing and Slicing, revisited again

Reminder: A = np.array([[5,7,10,14],[2,4,8,9]])

A[:, 1] =

Differences

The difference between slicing and indexing is:

Debugging Advice

Check all the following that are helpful when debugging code:

Coding Practice

Now that we're familiar with numpy arrays, let's practice actually using numpy in our code!

In the following questions, you must get the shapes of the output correct for your answer to be accepted. If your answer contains the right numbers but the grader is still saying your answers are incorrect, check the shapes of your output. The number and placement of brackets need to match!

Row Vector

Write a procedure that takes a list of numbers and returns a 2D numpy array representing a row vector containing those numbers. Recall that a row vector in our usage will have shape (1, d) where d is the number of elements in the row.

Column Vector

Write a procedure that takes a list of numbers and returns a 2D numpy array representing a column vector containing those numbers. You can use the rv procedure.

Length

Write a procedure that takes a column vector and returns the vector's Euclidean length (or equivalently, its magnitude) as a scalar. You may not use np.linalg.norm, and you may not use loops.

Remember that the formula for the Euclidean length for a vector \mathbf{x} is:

{\rm length}(\mathbf{x}) = \sqrt{x_1^2 + x_2^2 + \cdots + x_n^2} = \sqrt{\sum_{i=1}^n x_i^2}

Normalize

Write a procedure that takes a column vector and returns a unit vector (a vector of length 1) in the same direction. You may not use loops. Use your length procedure from above (you do not need to define it again).

Last Column

Write a procedure that takes a 2D array and returns the final column vector as a two-dimensional array. You may not use loops. Hint: negative indices are interpreted as counting from the end of the array.

Matrix inverse

A scalar number x has an inverse x^{-1} such that x^{-1} x = 1; that is, their product is 1. Similarly, a matrix A may have a well-defined inverse A^{-1}, such that A^{-1} A = I, where matrix multiplication is used and I is the identity matrix. Such inverses generally only exist when A is a square matrix, and just as 0 has no well-defined multiplicative inverse, there are also cases when matrices are "singular" and have no well-defined inverses.
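As a quick numerical illustration of the defining property A^{-1} A = I (using np.linalg.inv, one of the np.linalg routines mentioned below):

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 3.0]])  # an invertible 2x2 matrix (determinant = 5)

A_inv = np.linalg.inv(A)

# A^{-1} A should equal the identity, up to floating-point error
print(np.allclose(A_inv @ A, np.eye(2)))  # True
```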

Write a procedure that takes a matrix A and returns its inverse, A^{-1}. Assume that A is well-formed, such that its inverse exists. Feel free to use routines from np.linalg.

Working with Data in Numpy

Representing data

Mat T. Ricks has collected weight and height data of 3 people and has written it down below:

Weight, Height
150, 5.8
130, 5.5
120, 5.3

He wants to put this into a numpy array such that each column represents one individual's weight and height (in that order), in the order of individuals as listed. Write code to set data equal to the appropriate numpy array:

Matrix Multiplication

Now he wants to compute, for each person, the sum of that person's height and weight, and return the results in a row vector with one entry per person. He does this by matrix multiplication using data and another numpy array. (Remember that column and row vectors are arrays and that, in 6.390, we will always represent these as two-dimensional arrays.) He has written the following incorrect code to do so and needs your help to fix it:

Beginning linear regression

We are beginning our study of machine learning with linear regression, which is a fundamental problem in supervised learning. Please study Sections 2.1 through 2.4 of the Chapter 2 - Regression lecture notes before starting on these problems.

A hypothesis in linear regression has the form

y = \theta^T x + \theta_0
where x is a d \times 1 input vector, y is a scalar output prediction, \theta is a d \times 1 parameter vector and \theta_0 is a scalar offset parameter.

This week, just to get warmed up, we will consider a simple algorithm for trying to find a hypothesis that fits the data well: we will generate a lot of random hypotheses and see which one has the smallest error on this data, and return that one as our answer. (We don't recommend this method in actual practice, but it gets us started and makes some useful points.)

Warm-up

Here is a data-set for a regression problem, with d = 1 and n = 5:

\mathcal{D} = \{([1], 2), ([2], 1), ([3], 4), ([4], 3), ([5], 5)\}
Recall from the notes that \mathcal{D} is a set of (x, y) (input, output) pairs.

Consider the hypothesis \theta = 1, \theta_0 = 1. Let our objective be J(\theta, \theta_0; \mathcal{D}) = \frac{1}{n}\sum_{i=1}^n (\theta x^{(i)} + \theta_0 - y^{(i)})^2

What is J(\theta, \theta_0; \mathcal{D})? (Note that you can type simple arithmetic expressions into the answer box.)

Linear prediction

Assume we are given an input x as a column vector and the parameters specifying a linear hypothesis. Let's compute a predicted value.

Write a Python function which is given:

  • x: input vector d \times 1
  • th: parameter vector d \times 1
  • th0: offset parameter 1 \times 1 or scalar

and returns:

  • y value predicted for input x by hypothesis th, th0

Lots of data!

Now assume we are given n points in an array, let's compute predictions for all the points.

Write a Python function which is given:

  • X: input array d \times n
  • th: parameter vector d \times 1
  • th0: offset parameter 1 \times 1 or scalar

and returns:

  • a 1\times n vector y of predicted values, one for each column of X for hypothesis th, th0

Try to make it so that your answer to this question can be used verbatim as an answer to the previous question.

Mean squared error

Given two 1 \times n vectors of output values, Y and Y_hat, compute a 1 \times 1 (or scalar) mean squared error.

Write a Python function which is given:

  • Y: vector of output values 1 \times n
  • Y_hat: vector of output values 1 \times n

and returns:

  • a 1\times 1 array with the mean square error

More mean squared error

Assume now that you have two k \times n arrays of output values, Y and Y_hat. Each row (0 \dots k-1) in a matrix represents the results of using a different hypothesis. Compute a k \times 1 vector of the mean squared errors associated with each of the hypotheses (averaged over all n data points, in each case).

  • Read about the axis and keepdims arguments to np.mean

(Try to make it so that your answer to this question can be used verbatim as an answer to the previous question.)

Write a Python function which is given:

  • Y: vector of output values k \times n
  • Y_hat: vector of output values k \times n

and returns:

  • a k\times 1 vector of mean squared error values

Linear prediction error

Use the mse and lin_reg_predict procedures to implement a procedure that takes

  • X: d \times n input array representing n points in d dimensions
  • Y: 1 \times n output vector representing output values for n points
  • th: parameter vector d \times 1
  • th0: offset 1 \times 1 (or scalar)

and returns

  • 1 \times 1 (or scalar) value representing the MSE of hypothesis th,th0 on the data set X, Y.

  • Read about the axis argument to np.mean

Our first machine learning algorithm!

The code is below. It takes in

  • X: d\times n input array representing n points in d dimensions
  • Y: 1\times n output vector representing output values for n points
  • k: a number of hypotheses to try

And generates as output

  • the tuple ((th, th0), error) where th, th0 is a hypothesis and error is the MSE of that hypothesis on the input data.

    def random_regress(X, Y, k):
1        d, n = X.shape
2        thetas = 2 * np.random.rand(d, k) - 1
3        th0s = 2 * np.random.rand(1, k) - 1
4        errors = lin_reg_err(X, Y, thetas, th0s.T)
5        i = np.argmin(errors)
6        theta, th0 = thetas[:,[i]], th0s[:,[i]]
         return (theta, th0), errors[i]

Note that in this code we use np.random.rand rather than np.random.randn as we will see in the lab. So some of the behavior will be different, and we'll ask some questions about that below.
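For reference, the two generators draw from different distributions; a quick sketch of the difference:

```python
import numpy as np

samples_uniform = np.random.rand(1000)   # uniform on the interval [0, 1)
samples_normal = np.random.randn(1000)   # standard normal: mean 0, std 1, unbounded

print(samples_uniform.min() >= 0.0 and samples_uniform.max() < 1.0)  # True
```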

Rather than asking you to write the code, we are going to ask you some questions about it.

a. Lines 2 and 3 generate k random hypotheses (th, th0). What problem would we face if we didn't multiply by 2 and subtract 1?

Pick one.

b. What is going on in line 5?

Pick one.

c. When we call lin_reg_err in line 4, we have objects with the following dimensions:

  • X: d \times n
  • ths: d\times k
  • th0s: 1\times k

If we want to get a matrix of predictions of all the hypotheses on all the data points, we can write np.dot(ths.T, X) + th0s.T. But if we do the dimensional analysis here, there's something fishy.

  • What is the dimension of np.dot(ths.T, X)?

    Enter a Python list of two strings (remember quotes). The strings can be one of: '1','k','d','n'.

  • What is the dimension of th0s.T?

    Enter a Python list (remember brackets) of two strings (remember quotes). The strings can be one of: '1','k','d','n'.

  • Why does this work?

    Pick one.

  • What would be an explicit numpy call to convert th0s into the same shape as np.dot(ths.T, X) so the addition is mathematically well defined without any Numpy trickery?

    Pick all that are correct.

Survey

(The form below is to help us improve/calibrate for future assignments; submission is encouraged but not required. Thanks!)

How did you feel about the length of this homework?

How did you feel about the difficulty of this homework?

Do you have any feedback or comments about any questions in this homework? Anything else you want us to know?