Display rows where any value in a particular column occurs more than once
Use duplicated
with subset='website'
and keep=False
:
df[df.duplicated(subset='website', keep=False)]
Sample Input:
col1 website0 A abc.com1 B abc.com2 C abc.com3 D abc.net4 E xyz.com5 F foo.bar6 G xyz.com7 H foo.baz
Sample Output:
col1 website0 A abc.com1 B abc.com2 C abc.com4 E xyz.com6 G xyz.com