Skip to content Skip to sidebar Skip to footer

Add Space Before Capital Letters In A Dataframe Or Column In Python Using Regex

I need to have the values in the columns split up where the capital letters are. So it looks like this: West Afghanistan or North East Afghanistan I tried this so far and nothing

Solution 1:

Use Series.str.replace with replace uppercase by same vales with space before and then remove first space:

df = pd.DataFrame({'U.N.Region':['WestAfghanistan','NorthEastAfghanistan']})

df['U.N.Region'] = df['U.N.Region'].str.replace( r"([A-Z])", r" \1").str.strip()
print (df)
                U.N.Region
0         West Afghanistan
1   North East Afghanistan

Solution 2:

Another option would be,

import pandas as pd
import re


df = pd.DataFrame({'U.N.Region': ['WestAfghanistan', 'NorthEastAfghanistan']})

df['U.N.Region'] = df['U.N.Region'].str.replace(
    r"(?<=[a-z])(?=[A-Z])", " ")
print(df)

Solution 3:

Yet another solution:

df.apply(lambda col: col.str.replace(r"([a-z])([A-Z])",r"\1 \2"))  

Out: 
              U.N. Region   Centers
0  North East Afghanistan  Fayzabad
1        West Afghanistan  Qala Naw

Post a Comment for "Add Space Before Capital Letters In A Dataframe Or Column In Python Using Regex"