users@fi.java.net

Re: Non-FI performance problems

From: Tatu Saloranta <cowtowncoder_at_yahoo.com>
Date: Fri, 20 Jan 2006 23:31:47 -0800 (PST)

--- Jimmy Zhang <crackeur_at_comcast.net> wrote:

> Real application code would be very helpful, but XML
> is used
> in so many ways so it is difficult to provide a
> general purpose app
> code that applies to all use cases. As an extreme
> case, I can parse
> XML then go into a infinite loop (while(true);) in
> this case, the parser
> performance would not matter.

Yes, I understand this. But it would be useful to
maybe come up with one scenario, to prove power of VTD
for a realistic use scenario?

Maybe I should actually try to do that, simple things
like calculating document statistics (as some
benchmarks do), to ensure all information is accessed.
And compare various approaches. Or perhaps existing
test frameworks could be modified.

...
> VTD-XML, like many technologies, isn't perfect, it
> is designed
> to provide an option and new possiblities...

Yes, that is a very reasonable goal. ;-)

> > (a) Dealing with namespace bindings is tricky; and
> in
> > general it is not possible to just replace
> sub-trees
> > in-place (not so much a problem for read-only)
>
> This is done by lookups in VTD-XML.

Hmm? What I meant was that if one modifies sub-trees,
and new bindings are needed, this may get tricky. This
is not always (or often?) the case, but dealing with
edge cases gracefully is needed to make tools
intuitive and usable by developers. That is, reading
namespace information should be easy. But do
modifications and following serialization take care to
keep namespace information consisten?

Anyway, thank you for the explanations. I really
honestly would like to see some benchmarks and example
code (I did check out vtd page earlier and there
wasn't very much there, although API was explained),
to know some ballpark estimates for some use cases. I
do not doubt VTD (and similar approaches) can improve
efficiency for operations. But it'd be nice to get
some ballpark figures.

-+ Tatu +-


__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com