How to select the rows with maximum values in each group with dplyr? [duplicate]

Try this:

result <- df %>%
  group_by(A, B) %>%
  filter(value == max(value)) %>%
  arrange(A, B, C)

Seems to work:

identical(
  as.data.frame(result),
  ddply(df, .(A, B), function(x) x[which.max(x$value), ])
)
#[1] TRUE

As pointed out in the comments, slice may be preferred here, as per @RoyalITS' answer below, if you strictly want only one row per group. This answer returns multiple rows for a group when several of its rows share the same maximum value.
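To make the difference concrete, here is a minimal sketch on made-up data. The data frame below is hypothetical and only reuses the column names from the question; group (a, x) is built with two rows tied at the maximum.

```r
library(dplyr)

# Hypothetical toy data: group (a, x) has two rows tied at the maximum value.
df <- data.frame(
  A = c("a", "a", "a", "b"),
  B = c("x", "x", "x", "y"),
  C = c(1, 2, 3, 4),
  value = c(5, 5, 1, 2)
)

# filter() keeps every row whose value equals the group maximum, ties included.
with_ties <- df %>%
  group_by(A, B) %>%
  filter(value == max(value)) %>%
  ungroup()

# slice(which.max()) keeps only the first maximal row in each group.
one_per_group <- df %>%
  group_by(A, B) %>%
  slice(which.max(value)) %>%
  ungroup()

nrow(with_ties)      # 3
nrow(one_per_group)  # 2
```

Which one you want depends on whether tied rows are meaningful in your data or just noise.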

r dplyr plyr greatest-n-per-group

df %>% group_by(A,B) %>% slice(which.max(value))


You can use top_n:

df %>% group_by(A, B) %>% top_n(n=1)

This will rank by the last column (value) and return the top n=1 rows.
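A small sketch of that default, assuming a hypothetical data frame in which value is deliberately placed last:

```r
library(dplyr)

# Hypothetical toy data; value is deliberately the last column.
df <- data.frame(
  A = c("a", "a", "b", "b"),
  B = c("x", "x", "y", "y"),
  value = c(1, 3, 7, 2)
)

# With no weighting column given, top_n() ranks by the last column (value).
top <- df %>%
  group_by(A, B) %>%
  top_n(n = 1) %>%
  ungroup()

top$value
```

Like the filter approach, top_n keeps tied rows, so a group can still return more than one row.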

Currently, you can't change this default without causing an error (see https://github.com/hadley/dplyr/issues/426).
