dev@glassfish.java.net

Re: HTML parser

From: Kohsuke Kawaguchi <Kohsuke.Kawaguchi_at_Sun.COM>
Date: Wed, 21 May 2008 09:25:21 -0700

Jason Lee wrote:
> On Tue, May 20, 2008 at 12:22 PM, Lloyd L Chambers
> <Lloyd.Chambers_at_sun.com> wrote:
>> Anyone know of a good HTML (XHTML) parser?
>>
>> I've tried the JDK Swing parse, but it can't parse XHTML properly.
>
> I've heard good things about NekoHTML
> (http://sourceforge.net/projects/nekohtml), and it has had a release
> in the last month or so. FWIW, HttpUnit uses this.

+1. I've been using NekoHTML in most of the projects where I needed HTML
parsing.

-- 
Kohsuke Kawaguchi
Sun Microsystems                   kohsuke.kawaguchi_at_sun.com