Python: splitting string by all space characters

python whitespace

Edit

It turns out that \u200b is not technically defined as whitespace , and so python does not recognize it as matching \s even with the unicode flag on. So it must be treated as an non-whitespace character.

http://en.wikipedia.org/wiki/Whitespace_character#Unicode

http://bugs.python.org/issue13391

import rere.split(ur"[\u200b\s]+", "some string", flags=re.UNICODE)

python whitespace

You can use a regular expression with enabled Unicode matching:

>>> re.split(r'(?u)\s', u'a\u200bc d')[u'a', u'c', u'd']

python whitespace

You can use re.split, like this:

import rere.split(u'\s|\u200b', your_string)

CodeHunter

Python: splitting string by all space characters

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last