Thursday, December 11, 2008

Ian Bicking: a blog :: lxml: an underappreciated web scraping library

Ian Bicking: a blog :: lxml: an underappreciated web scraping library: "One you have lxml installed, you have a great parser (which happens to be super-fast and that is not a tradeoff). You get a fairly familiar API based on ElementTree, which though a little strange feeling at first, offers a compact and canonical representation of a document tree, compared to more traditional representations."