How do you remove unwanted characters from python?

The generic problem faced by the programmers is remove unwanted characters from a string using Python. But sometimes the requirement is way above and demands the removal of more than 1 character, but a list of such malicious characters. These can be in the form of special characters for reconstructing valid passwords and many other applications possible. Let’s discuss certain ways to perform this particular task.

Method 1: Removing symbol from string using str.isalnum()

Python String isalnum() method checks whether all the characters in a given string are alphanumeric or not. It returns a boolean as True – If all the characters are alphanumeric or else false – If one or more characters are not alphanumeric.

Python3

string = "Ge;ek * s:fo ! r;Ge * e*k:s !"

test_str = ''.join(letter for letter in string if letter.isalnum())

print(test_str)

Output:

GeeksforGeeks

Method 2: Removing symbol from string using replace() 

One can use str.replace() inside a loop to check for a bad_char and then replace it with the empty string hence removing it. This is the most basic approach and inefficient on a performance point of view.

Python3

bad_chars = [';', ':', '!', "*", " "]

test_string = "Ge;ek * s:fo ! r;Ge * e*k:s !"

print("Original String : " + test_string)

for i in bad_chars:

    test_string = test_string.replace(i, '')

print("Resultant list is : " + str(test_string))

Output : 

Original String : Ge;ek*s:fo!r;Ge*e*k:s!
Resultant list is : GeeksforGeeks

Method 3: Removing symbol from string using join() + generator 

By using Python join() we remake the string. In the generator function, we specify the logic to ignore the characters in bad_chars and hence construct a new string free from bad characters.

Python3

bad_chars = [';', ':', '!', "*", " "]

test_string = "Ge;ek * s:fo ! r;Ge * e*k:s !"

print("Original String : " + test_string)

test_string = ''.join(i for i in test_string if not i in bad_chars)

print("Resultant list is : " + str(test_string))

Output : 

Original String : Ge;ek*s:fo!r;Ge*e*k:s!
Resultant list is : GeeksforGeeks

Method 4: Removing symbol from string using translate() 

The most elegant way to perform this particular task, this method is basically used for achieving the solution to this kind of problems itself, we can translate each bad_char to an empty string and get the filtered string.

Python3

import string

bad_chars = [';', ':', '!', "*"]

test_string = "Ge;ek * s:fo ! r;Ge * e*k:s !"

print("Original String : " + test_string)

delete_dict = {sp_character: '' for sp_character in string.punctuation}

delete_dict[' '] = ''

table = str.maketrans(delete_dict)

test_string = test_string.translate(table)

print("Resultant list is : " + str(test_string))

Output : 

Original String : Ge;ek*s:fo!r;Ge*e*k:s!
Resultant list is : GeeksforGeeks

Method 5: Removing symbol from string using filter() 

This is yet another solution to perform this task. Using the lambda function, filter function can remove all the bad_chars and return the wanted refined string.

Python3

bad_chars = [';', ':', '!', "*"]

test_string = "Ge;ek*s:fo!r;Ge*e*k:s!"

print("Original String : " + test_string)

test_string = ''.join((filter(lambda i: i not in bad_chars,

                              test_string)))

print("Resultant list is : " + str(test_string))

Output: 

Original String : Ge;ek*s:fo!r;Ge*e*k:s!
Resultant list is : GeeksforGeeks

Method 6: Removing symbol from string using re.sub() function: 

Regular expressions are used to identify the bad character in the string and re.sub function is used to replace the bad_char from the string. 

Python3

import re

test_str = "Ge;ek * s:fo ! r;Ge * e*k:s !"

bad_char = [";", "!", "*", ":", " "]

print("The original string is : " + test_str)

temp = ''

for i in bad_char:

    temp += i

res = re.sub(rf'[{temp}]', '', test_str)

print("The strings after extra space removal : " + str(res))

Output: 

The original string is : Ge;ek * s:fo ! r;Ge * e*k:s !
The strings after extra space removal : GeeksforGeeks

Method 6 : Using in ,not in operators

Python3

bad_chars = [';', ':', '!', "*", " "]

test_string = "Ge;ek * s:fo ! r;Ge * e*k:s !"

print ("Original String : " + test_string)

s=""

for i in test_string:

    if i not in bad_chars:

        s+=i

print ("Resultant list is : " + str(s))

Output

Original String : Ge;ek * s:fo ! r;Ge * e*k:s !
Resultant list is : GeeksforGeeks


How do I remove junk characters from a string in Python?

Method 2: Removing symbol from string using replace() One can use str. replace() inside a loop to check for a bad_char and then replace it with the empty string hence removing it.

How do I remove unwanted characters from a string?

Example of removing special characters using replaceAll() method.
public class RemoveSpecialCharacterExample1..
public static void main(String args[]).
String str= "This#string%contains^special*characters&.";.
str = str.replaceAll("[^a-zA-Z0-9]", " ");.
System.out.println(str);.

How do I remove a specific character from a list in Python?

The remove() method removes the first matching element (which is passed as an argument) from the list. The pop() method removes an element at a given index, and will also return the removed item. You can also use the del keyword in Python to remove an element or slice from a list.

How do you remove a character from a file in Python?

In Python you can use the replace() and translate() methods to specify which characters you want to remove from the string and return a new modified string result. It is important to remember that the original string will not be altered because strings are immutable.