Removing space from columns in pandas
I am trying to remove spaces from a dataframe I have. The columns names look like below. I am trying to get the spaces between name out and replace it with "_" wherever present.
['join_date' 'fiscal_quarter' 'fiscal_year' 'primary_channel'
'secondary_channel' 'customer_count' 'new_members' 'revisit_next_day'
'revisit_14_day' 'demand_1yr' 'revisit_next_day_rate'
'revisit_14_day_rate' 'demand_1yr_per_new_member' u'ch_Ad Network'
u'ch_Affiliate' u'ch_Branded SEM' u'ch_DSP' u'ch_Daily Email'
u'ch_Daily Messaging' u'ch_Direct' u'ch_Direct Publisher' u'ch_Email'
u'ch_Feeds' u'ch_Native' u'ch_Non-Branded SEM' u'ch_Organic Search'
u'ch_Paid Social' u'ch_Site' u'ch_Special Email' u'ch_Television'
u'ch_Trigger Email' u'ch_UNMAPPED' u'ch_Unpaid Social' u'quarter_Q2'
u'quarter_Q3' u'quarter_Q4']
Solution 1:
- To remove white spaces:
- To remove white space everywhere:
df.columns = df.columns.str.replace(' ', '')
- To remove white space at the beginning of string:
df.columns = df.columns.str.lstrip()
- To remove white space at the end of string:
df.columns = df.columns.str.rstrip()
- To remove white space at both ends:
df.columns = df.columns.str.strip()
- To replace white spaces with other characters (underscore for instance):
- To replace white space everywhere
df.columns = df.columns.str.replace(' ', '_')
- To replace white space at the beginning:
df.columns = df.columns.str.replace('^ +', '_')
- To replace white space at the end:
df.columns = df.columns.str.replace(' +$', '_')
- To replace white space at both ends:
df.columns = df.columns.str.replace('^ +| +$', '_')
All above applies to a specific column as well, assume you have a column named col
, then just do:
df[col] = df[col].str.strip() # or .replace as above
Commands can be chained
df.columns = df.columns.str.strip().str.replace(' ', '_')