1214 shaares
Xidel is a command line tool to download and extract data from HTML/XML pages as well as JSON APIs.
It supports:
- CSS 3 Selectors: to extract elements unchanged
- XPath 3.0: to extract values and calculate things with them.
- XQuery 3.0: to create new documents from the extracted values and to build Turing-complete scripts.
- Pattern matching: to extract several expressions in an easy way using an annotated version of the input page for pattern-matching.
See also:
- https://github.com/mgdm/htmlq Like jq, but for HTML. Uses CSS selectors to extract bits of content from HTML files.