Pandas Change A Specific Column Value Of Duplicate Rows
Using the example from "Drop all duplicate rows in Python Pandas": let's say I don't want to drop the duplicates, but instead want to change the value in one of the columns of the subset.
Solution 1:
You could use cumcount and do something like this:
>>> c = df.groupby(["A","C"]).cumcount()
>>> c = c.replace(0, '').astype(str)
>>> df["A"] += c
>>> df
      A  B  C
0   foo  0  A
1  foo1  1  A
2   foo  1  B
3   bar  1  A
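This works because cumcount numbers the rows within each (A, C) group starting at 0, so the first occurrence gets an empty suffix and only later duplicates get a number appended. For this frame it gives us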
>>> df.groupby(["A","C"]).cumcount()
0    0
1    1
2    0
3    0
dtype: int64
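For reference, here is the same idea as a self-contained script. It is only a sketch: the DataFrame is reconstructed from the output shown above (so its exact contents are an assumption), the variable name suffix is made up for illustration, and it uses Series.where rather than replace to build the suffix, which is a small substitution for the original one-liner so that the strings are never mixed with integers.

```python
import pandas as pd

# Assumed input, reconstructed from the output printed in the answer.
df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "bar"],
    "B": [0, 1, 1, 1],
    "C": ["A", "A", "B", "A"],
})

# Position of each row within its (A, C) group: 0 for the first
# occurrence, 1 for the first duplicate, 2 for the next, and so on.
c = df.groupby(["A", "C"]).cumcount()

# Keep the counter as text only where it is nonzero; first
# occurrences get an empty suffix and are left unchanged.
suffix = c.astype(str).where(c > 0, "")

# Append the suffix to column A so duplicates become foo1, foo2, ...
df["A"] = df["A"] + suffix

print(df)
```

Changing the list passed to groupby changes which columns define a "duplicate"; here rows count as duplicates only when both A and C match.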