[libxml-devel] ctrlA characters in the xml file

stefan lauer stefan.lauer at hps-technologies.de
Thu Mar 29 01:33:22 EDT 2007


Hello,

on my Server amd x86_64 ruby 1.8.4 and libXML 0.3.8.4 are installed. I
read in large xml-Files (my testfile has 7Mb and roundabout 120000  
lines)
and process it (that means delete some
nodes/elements) and write them out. The output file contains sometimes
suddenly ctrl+A characters.

I tried it also with ruby 1.8.5 and libXML 0.3.8.2. and crossover it. It
is always the same.

Sometimes the ctrlA is in the middle of the element-tag and the rest is
deleted. It looks like this for example
normal:
<attrib name="foobar">text</attrb>

with ctrlA
<attrib name="foo^A>text</attrib>

For me it looks like an overflow.

I found out the ctrlA comes in on the read in with
XML::Document.file(file)

If read in with File.new and write it out the ctrlA isn't in the xml
file.

Is this a known Bug? I can't find anything.
Is a workaround for that available?

Thanks for help
Stefan
--
Stefan Lauer <stefan.lauer at hps-technologies.de>
HPS Technologies GmbH
Firmensitz: Kempten
Anschrift: Am Prinzenbuckel 21a, 87490 Haldenwang, Deutschland
Registergericht: Amtsgericht Kempten, HRB 8690
Geschäftsführer: Frank Penning, Jan Ulbrich






More information about the libxml-devel mailing list