How to do gradient clipping in pytorch?

python machine-learning deep-learning pytorch gradient-descent

A more complete example

optimizer.zero_grad()        loss, hidden = model(data, hidden, targets)loss.backward()torch.nn.utils.clip_grad_norm_(model.parameters(), args.clip)optimizer.step()

Source: https://github.com/pytorch/pytorch/issues/309

python machine-learning deep-learning pytorch gradient-descent

clip_grad_norm (which is actually deprecated in favor of clip_grad_norm_ following the more consistent syntax of a trailing _ when in-place modification is performed) clips the norm of the overall gradient by concatenating all parameters passed to the function, as can be seen from the documentation:

The norm is computed over all gradients together, as if they were concatenated into a single vector. Gradients are modified in-place.

From your example it looks like that you want clip_grad_value_ instead which has a similar syntax and also modifies the gradients in-place:

clip_grad_value_(model.parameters(), clip_value)

Another option is to register a backward hook. This takes the current gradient as an input and may return a tensor which will be used in-place of the previous gradient, i.e. modifying it. This hook is called each time after a gradient has been computed, i.e. there's no need for manually clipping once the hook has been registered:

for p in model.parameters():    p.register_hook(lambda grad: torch.clamp(grad, -clip_value, clip_value))

python machine-learning deep-learning pytorch gradient-descent

Reading through the forum discussion gave this:

clipping_value = 1 # arbitrary value of your choosingtorch.nn.utils.clip_grad_norm(model.parameters(), clipping_value)

I'm sure there is more depth to it than only this code snippet.

CodeHunter

How to do gradient clipping in pytorch?

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last