Multiple Spelling Results In A Dataframe 1
I have some data containing spelling errors. I'm correcting them and scoring how close the spelling is using the following code:

import pandas as pd
import difflib
Li_A = ['pot
Solution 1:
SequenceMatcher is correctly calculating the ratio using the method described by Ratcliff and Metzener, 1988. That is, for the number of characters found in common (CC) and the total number of characters in the two strings (CT):
ratio = 2 * CC / CT
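To check the formula by hand, here is a minimal sketch that rebuilds the ratio from SequenceMatcher's matching blocks and compares it with the built-in ratio(). The helper name ratcliff_ratio and the sample words are hypothetical, chosen only for illustration; they are not taken from the question's data.

```python
from difflib import SequenceMatcher


def ratcliff_ratio(a: str, b: str) -> float:
    """Rebuild the ratio as 2 * CC / CT from the matching blocks."""
    matcher = SequenceMatcher(None, a, b)
    # CC: total number of characters found in common (sum of block sizes).
    cc = sum(block.size for block in matcher.get_matching_blocks())
    # CT: total number of characters in the two strings.
    ct = len(a) + len(b)
    # difflib returns 1.0 when both strings are empty, so mirror that here.
    return 2 * cc / ct if ct else 1.0


# Hypothetical sample words, not the question's actual lists.
manual = ratcliff_ratio("potato", "potatoe")
builtin = SequenceMatcher(None, "potato", "potatoe").ratio()
print(manual, builtin)
```

Here "potato" and "potatoe" share 6 characters out of 13 in total, so both values should come out as 12/13.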
So it looks like the issue is with get_close_matches.
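The answer does not say what exactly goes wrong, so the following is only a sketch for inspecting what get_close_matches returns. It assumes the defaults documented for difflib (n=3, cutoff=0.6) and uses a hypothetical candidate list and input word, not the question's data. Each returned match is scored with the same SequenceMatcher ratio, so the two can be compared side by side.

```python
import difflib

# Hypothetical candidate spellings and input word, for illustration only.
candidates = ["potato", "tomato", "pomato"]
word = "potatoe"

# n caps the number of results; cutoff drops candidates scoring below it.
matches = difflib.get_close_matches(word, candidates, n=3, cutoff=0.6)

# Score each match the way get_close_matches does internally:
# the candidate is the first sequence and the input word is the second.
scored = [
    (match, difflib.SequenceMatcher(None, match, word).ratio())
    for match in matches
]

for match, score in scored:
    print(f"{match}: {score:.3f}")
```

Because results are ordered best first and filtered by cutoff, a word can yield several matches, one, or none, which is worth keeping in mind when each result is assigned to a single DataFrame cell.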