How to find linearly independent rows from a matrix How to find linearly independent rows from a matrix python python

How to find linearly independent rows from a matrix


First, your 3rd row is linearly dependent with 1t and 2nd row. However, your 1st and 4th column are linearly dependent.

Two methods you could use:

Eigenvalue

If one eigenvalue of the matrix is zero, its corresponding eigenvector is linearly dependent. The documentation eig states the returned eigenvalues are repeated according to their multiplicity and not necessarily ordered. However, assuming the eigenvalues correspond to your row vectors, one method would be:

import numpy as npmatrix = np.array(    [        [0, 1 ,0 ,0],        [0, 0, 1, 0],        [0, 1, 1, 0],        [1, 0, 0, 1]    ])lambdas, V =  np.linalg.eig(matrix.T)# The linearly dependent row vectors print matrix[lambdas == 0,:]

Cauchy-Schwarz inequality

To test linear dependence of vectors and figure out which ones, you could use the Cauchy-Schwarz inequality. Basically, if the inner product of the vectors is equal to the product of the norm of the vectors, the vectors are linearly dependent. Here is an example for the columns:

import numpy as npmatrix = np.array(    [        [0, 1 ,0 ,0],        [0, 0, 1, 0],        [0, 1, 1, 0],        [1, 0, 0, 1]    ])print np.linalg.det(matrix)for i in range(matrix.shape[0]):    for j in range(matrix.shape[0]):        if i != j:            inner_product = np.inner(                matrix[:,i],                matrix[:,j]            )            norm_i = np.linalg.norm(matrix[:,i])            norm_j = np.linalg.norm(matrix[:,j])            print 'I: ', matrix[:,i]            print 'J: ', matrix[:,j]            print 'Prod: ', inner_product            print 'Norm i: ', norm_i            print 'Norm j: ', norm_j            if np.abs(inner_product - norm_j * norm_i) < 1E-5:                print 'Dependent'            else:                print 'Independent'

To test the rows is a similar approach.

Then you could extend this to test all combinations of vectors, but I imagine this solution scale badly with size.


With you can find the linear independant rows using: sympy.Matrix.rref:

>>> import sympy >>> import numpy as np>>> mat = np.array([[0,1,0,0],[0,0,1,0],[0,1,1,0],[1,0,0,1]])  # your matrix>>> _, inds = sympy.Matrix(mat).T.rref()   # to check the rows you need to transpose!>>> inds[0, 1, 3]

Which basically tells you the rows 0, 1 and 3 are linear independant while row 2 isn't (it's a linear combination of row 0 and 1).

Then you could remove these rows with slicing:

>>> mat[inds]array([[0, 1, 0, 0],       [0, 0, 1, 0],       [1, 0, 0, 1]])

This also works well for rectangular (not only for quadratic) matrices.


I edited the code for Cauchy-Schwartz inequality which scales better with dimension: the inputs are the matrix and its dimension, while the output is a new rectangular matrix which contains along its rows the linearly independent columns of the starting matrix. This works in the assumption that the first column in never null, but can be readily generalized in order to implement this case too. Another thing that I observed is that 1e-5 seems to be a "sloppy" threshold, since some particular pathologic vectors were found to be linearly dependent in that case: 1e-4 doesn't give me the same problems. I hope this could be of some help: it was pretty difficult for me to find a really working routine to extract li vectors, and so I'm willing to share mine. If you find some bug, please report them!!

from numpy import dot, zerosfrom numpy.linalg import matrix_rank, normdef find_li_vectors(dim, R):    r = matrix_rank(R)     index = zeros( r ) #this will save the positions of the li columns in the matrix    counter = 0    index[0] = 0 #without loss of generality we pick the first column as linearly independent    j = 0 #therefore the second index is simply 0    for i in range(R.shape[0]): #loop over the columns        if i != j: #if the two columns are not the same            inner_product = dot( R[:,i], R[:,j] ) #compute the scalar product            norm_i = norm(R[:,i]) #compute norms            norm_j = norm(R[:,j])            #inner product and the product of the norms are equal only if the two vectors are parallel            #therefore we are looking for the ones which exhibit a difference which is bigger than a threshold            if absolute(inner_product - norm_j * norm_i) > 1e-4:                counter += 1 #counter is incremented                index[counter] = i #index is saved                j = i #j is refreshed            #do not forget to refresh j: otherwise you would compute only the vectors li with the first column!!    R_independent = zeros((r, dim))    i = 0    #now save everything in a new matrix    while( i < r ):        R_independent[i,:] = R[index[i],:]         i += 1    return R_independent