How to match a new line character in Python raw string

In a regular expression, you need to specify that you're in multiline mode:

>>> import re>>> s = """cat... dog""">>> >>> re.match(r'cat\ndog',s,re.M)<_sre.SRE_Match object at 0xcb7c8>

Notice that re translates the \n (raw string) into newline. As you indicated in your comments, you don't actually need re.M for it to match, but it does help with matching $ and ^ more intuitively:

>> re.match(r'^cat\ndog',s).group(0)'cat\ndog'>>> re.match(r'^cat$\ndog',s).group(0)  #doesn't matchTraceback (most recent call last):  File "<stdin>", line 1, in <module>AttributeError: 'NoneType' object has no attribute 'group'>>> re.match(r'^cat$\ndog',s,re.M).group(0) #matches.'cat\ndog'

python regex rawstring

The simplest answer is to simply not use a raw string. You can escape backslashes by using \\.

If you have huge numbers of backslashes in some segments, then you could concatenate raw strings and normal strings as needed:

r"some string \ with \ backslashes" "\n"

(Python automatically concatenates string literals with only whitespace between them.)

Remember if you are working with paths on Windows, the easiest option is to just use forward slashes - it will still work fine.

python regex rawstring

def clean_with_puncutation(text):        from string import punctuation    import re    punctuation_token={p:'<PUNC_'+p+'>' for p in punctuation}    punctuation_token['<br/>']="<TOKEN_BL>"    punctuation_token['\n']="<TOKEN_NL>"    punctuation_token['<EOF>']='<TOKEN_EOF>'    punctuation_token['<SOF>']='<TOKEN_SOF>'  #punctuation_token    regex = r"(<br/>)|(<EOF>)|(<SOF>)|[\n\!\@\#\$\%\^\&\*\(\)\[\]\           {\}\;\:\,\.\/\?\|\`\_\\+\\\=\~\-\<\>]"###Always put new sequence token at front to avoid overlapping results #text = '<EOF>!@#$%^&*()[]{};:,./<>?\|`~-= _+\<br/>\n <SOF>\ '    text_=""    matches = re.finditer(regex, text)    index=0    for match in matches:     #print(match.group())     #print(punctuation_token[match.group()])     #print ("Match at index: %s, %s" % (match.start(), match.end()))        text_=text_+ text[index:match.start()] +" "               +punctuation_token[match.group()]+ " "        index=match.end()    return text_

CodeHunter

How to match a new line character in Python raw string

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last