How to retrieve the most repeated value in a column present in a data frame

r dataframe max

tail(names(sort(table(Forbes2000$category))), 1)

r dataframe max

In case two or more categories may be tied for most frequent, use something like this:

x <- c("Insurance", "Insurance", "Capital Goods", "Food markets", "Food markets")tt <- table(x)names(tt[tt==max(tt)])[1] "Food markets" "Insurance"

r dataframe max

Another way with the data.table package, which is faster for large data sets:

set.seed(1)x=sample(seq(1,100), 5000000, replace = TRUE)

method 1 (solution proposed above)

start.time <- Sys.time()tt <- table(x)names(tt[tt==max(tt)])end.time <- Sys.time()time.taken <- end.time - start.timetime.taken

Time difference of 4.883488 secs

method 2 (DATA TABLE)

start.time <- Sys.time()ds <- data.table( x )setkey(ds, x)sorted <- ds[,.N,by=list(x)]most_repeated_value <- sorted[order(-N)]$x[1]most_repeated_valueend.time <- Sys.time()time.taken <- end.time - start.timetime.taken

Time difference of 0.328033 secs

CodeHunter

How to retrieve the most repeated value in a column present in a data frame

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last