Escaping special characters in elasticsearch

python elasticsearch replace lucene escaping

Yes, those characters will need to be replaced within content you want to search in a query_string query. To do that (assuming you are using PyLucene), you should be able to use QueryParserBase.escape(String).

Barring that, you could always adapt the QueryParserBase.escape source code to your needs:

public static String escape(String s) {  StringBuilder sb = new StringBuilder();  for (int i = 0; i < s.length(); i++) {    char c = s.charAt(i);    // These characters are part of the query syntax and must be escaped    if (c == '\\' || c == '+' || c == '-' || c == '!' || c == '(' || c == ')' || c == ':'      || c == '^' || c == '[' || c == ']' || c == '\"' || c == '{' || c == '}' || c == '~'      || c == '*' || c == '?' || c == '|' || c == '&' || c == '/') {      sb.append('\\');    }    sb.append(c);  }  return sb.toString();}

python elasticsearch replace lucene escaping

I adapted this code I found there:

escapeRules = {'+': r'\+',               '-': r'\-',               '&': r'\&',               '|': r'\|',               '!': r'\!',               '(': r'\(',               ')': r'\)',               '{': r'\{',               '}': r'\}',               '[': r'\[',               ']': r'\]',               '^': r'\^',               '~': r'\~',               '*': r'\*',               '?': r'\?',               ':': r'\:',               '"': r'\"',               '\\': r'\\;',               '/': r'\/',               '>': r' ',               '<': r' '}def escapedSeq(term):    """ Yield the next string based on the        next character (either this char        or escaped version """    for char in term:        if char in escapeRules.keys():            yield escapeRules[char]        else:            yield chardef escapeESArg(term):    """ Apply escaping to the passed in query terms        escaping special characters like : , etc"""    term = term.replace('\\', r'\\')   # escape \ first    return "".join([nextStr for nextStr in escapedSeq(term)])

python elasticsearch replace lucene escaping

to answer the question directly, below is a cleaner python solution using re.sub

import reKIBANA_SPECIAL = '+ - & | ! ( ) { } [ ] ^ " ~ * ? : \\'.split(' ')re.sub('([{}])'.format('\\'.join(KIBANA_SPECIAL)), r'\\\1', val)

however a better solution is to properly parse out the bad characters that get sent to elasticsearch:

import six.moves.urllib as urlliburllib.parse.quote_plus(val)

CodeHunter

Escaping special characters in elasticsearch

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last