How to handle log(0) when using cross entropy

If you don't mind the dependency on scipy, you can use scipy.special.xlogy. You would replace the expression

np.multiply(np.log(predY), Y) + np.multiply((1 - Y), np.log(1 - predY))

with

xlogy(Y, predY) + xlogy(1 - Y, 1 - predY)

If you expect predY to contain very small values, you might get better numerical results using scipy.special.xlog1py in the second term:

xlogy(Y, predY) + xlog1py(1 - Y, -predY)

Alternatively, knowing that the values in Y are either 0 or 1, you can compute the cost in an entirely different way:

Yis1 = Y == 1cost = -(np.log(predY[Yis1]).sum() + np.log(1 - predY[~Yis1]).sum())/m

numpy machine-learning deep-learning

How do you usually handle this issue?

Add small number (something like 1e-15) to predY - this number doesn't make predictions much off, and it solves log(0) issue.

BTW if your algorithm outputs zeros and ones it might be useful to check the histogram of returned probabilities - when algorithm is so sure that something's happening it can be a sign of overfitting.

numpy machine-learning deep-learning

One common way to deal with log(x) and y / x where x is always non-negative but can become 0 is to add a small constant (as written by Jakub).

You can also clip the value (e.g. tf.clip_by_value or np.clip).

CodeHunter

How to handle log(0) when using cross entropy

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last