Native shell command set to extract node value from XML

xml xmllint

--format is used only to format (indent, etc) the document. You can do that using --xpath (tested in Ubuntu, libxml v20900):

$ xmllint --xpath "//project/parent/version/text()" pom.xml1.5.0

xml xmllint

I've managed to solve it for the time being with this rather unwiedly script using xmllint --shell.

echo "cat //project/parent/version" | xmllint --shell pom.xml | sed '/^\/ >/d' | sed 's/<[^>]*.//g'

If the XML nodes have namespace attributes like my pom.xml had, things get heavier, basically extracting the node by name:

echo "cat //*[local-name()='project']/*[local-name()='parent']/*[local-name()='version']" | xmllint --shell pom.xml | sed '/^\/ >/d' | sed 's/<[^>]*.//g'

Hope it helps. If anyone can simply these expressions, I'd be grateful.

xml xmllint

I came here looking for a nice way to scrape a value from a website. The following example may be useful to those (unlike the poster) who have a version of xmllint which supports --xpath.

I needed to pull the most recent stable version of the elasticsearch .debfile and install it. The maintainers have helpfully put the version number in a span with the class "version".

version=`curl -s http://www.elasticsearch.org/download/ |\ xmllint --html --xpath '//span[@class="version"]/text()'\ 2>/dev/null - `;

What goes on:

We use the curl -s (silent) option.

curl -s http://www.elasticsearch.org/download/

We use the xmllint --html and --xpath switches. The xpath arguments (in single quotes)

'//span[@class="version"]/text()'

... looks for a <span> node with the class attribute (@class) "version", and extracts the text value (/text()).

Since xmllint is (surprise!) a linter, it will squawk about the inevitable garbage in your html stream. We direct the stderr to /dev/null in the usual way:

 2>/dev/null

Finally, note the " - " at the end of the xmllint command, which tells xmllint the stream is coming from stdin.

CodeHunter

Native shell command set to extract node value from XML

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last