Remove Whitespace or specific characters in Pandas

To remove whitespace (including newlines) or specific characters on text data in a Series or DataFrame, use the following methods in Python Pandas:

  • strip(): Strip whitespace (including newlines) or specific characters from the left and right
  • lstrip(): Strip whitespace (including newlines) or specific characters from only the left side
  • rstrip(): Strip whitespace (including newlines) or specific characters from only the right side

Before moving further, we’ve prepared a video tutorial to remove whitespace or specific characters in Pandas:

strip() method

To strip whitespace (including newlines) or specific characters from both the left and right side of values in a Series or DataFrame, use the strip() method in Pandas. Let us see an example:

# strip() method in Python Pandas
# Code by Studyopedia

import pandas as pd

# Data to be stored in the Pandas Series
data = ["!Jacob", "Amit\n\n", "Trent", "Nathan\t", "Martin"]

# Create a Series using the Series() method
series = pd.Series(data)

# Display the Series
print("Series:\n", series)

# Strip the values
print("\nStrip from both the sides\n",series.str.strip("!\n\t"))

Output

Series:
0      !Jacob
1    Amit\n\n
2       Trent
3    Nathan\t
4      Martin
dtype: object

Strip from both the sides
0     Jacob
1      Amit
2     Trent
3    Nathan
4    Martin
dtype: object

lstrip() method

To strip whitespace (including newlines) or specific characters from the left side of values in a Series or DataFrame, use the lstrip() method in Pandas. Let us see an example:

# lstrip() method in Python Pandas
# Code by Studyopedia

import pandas as pd

# Data to be stored in the Pandas Series
data = ["!Jacob", "\n\tAmit\n\n", "!Trent!", "Nathan\t", "Martin"]

# Create a Series using the Series() method
series = pd.Series(data)

# Display the Series
print("Series:\n", series)

# Strip from the left
print("\nStrip from the left side:\n", series.str.lstrip("!\n\t"))

Output

Series:
 0          !Jacob
1    \n\tAmit\n\n
2         !Trent!
3        Nathan\t
4          Martin
dtype: object

Strip from the left side:
 0       Jacob
1    Amit\n\n
2      Trent!
3    Nathan\t
4      Martin
dtype: object

rstrip() method

To strip whitespace (including newlines) or specific characters from the right side of values in a Series or DataFrame, use the rstrip() method in Pandas. Let us see an example:

# rstrip() method in Python Pandas
# Code by Studyopedia

import pandas as pd

# Data to be stored in the Pandas Series
data = ["!Jacob", "\n\tAmit\n\n", "!Trent!", "Nathan\t", "Martin"]

# Create a Series using the Series() method
series = pd.Series(data)

# Display the Series
print("Series:\n", series)

# Remove characters from the right side
print("\nRemove from the right:\n", series.str.rstrip("\n\t!"))

Output

Series:
 0          !Jacob
1    \n\tAmit\n\n
2         !Trent!
3        Nathan\t
4          Martin
dtype: object

Remove from the right:
 0      !Jacob
1    \n\tAmit
2      !Trent
3      Nathan
4      Martin
dtype: object

If you liked the tutorial, spread the word and share the link and our website Studyopedia with others.


For Videos, Join Our YouTube Channel: Join Now


Read More:

String Operations on Text Data in Pandas
Find and Remove Duplicates from rows in Pandas
Studyopedia Editorial Staff
[email protected]

We work to create programming tutorials for all.

No Comments

Post A Comment