<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>4820</bug_id>
          
          <creation_ts>2005-09-03 01:30:35 -0700</creation_ts>
          <short_desc>hexadecimal HTML entities split across TCP packets are not parsed correctly</short_desc>
          <delta_ts>2005-09-12 01:48:13 -0700</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>DOM</component>
          <version>420+</version>
          <rep_platform>Mac</rep_platform>
          <op_sys>OS X 10.4</op_sys>
          <bug_status>VERIFIED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://nypop.com/~ap/webkit/spaces.html</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Alexey Proskuryakov">ap</reporter>
          <assigned_to name="Darin Adler">darin</assigned_to>
          <cc>darin</cc>
    
    <cc>mitz</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>18514</commentid>
    <comment_count>0</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-03 01:30:35 -0700</bug_when>
    <thetext>If an HTML entity is not completely contained in one TCP packet, it may be not handled as such, and get 
rendered as plain text instead.

Steps to reproduce: Load the page from bug URL. Won&apos;t reproduce if loaded from a local file or from 
cache, of course.

Regression: present in Safari 2.0 (as of Mac OS X 10.4.2 without updates); haven&apos;t checked earlier versions.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18515</commentid>
    <comment_count>1</comment_count>
    <who name="Mark Rowe (bdash)">mrowe</who>
    <bug_when>2005-09-03 01:47:25 -0700</bug_when>
    <thetext>Confirmed with WebKit 412.7 and ToT.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18546</commentid>
    <comment_count>2</comment_count>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-03 13:22:59 -0700</bug_when>
    <thetext>The issue is likely in htmltokenizer.cpp. The code looks like it&apos;s almost right to handle this case, just not 
quite.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18670</commentid>
    <comment_count>3</comment_count>
      <attachid>3755</attachid>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-04 11:17:30 -0700</bug_when>
    <thetext>Created attachment 3755
proposed patch

Apparently, it was only failing for hexadecimal entities - so, I&apos;ve just copied
a couple of lines from a decimal case.

This patch also fixes a bunch of compilation errors with TOKEN_DEBUG defined.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18671</commentid>
    <comment_count>4</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-04 11:24:31 -0700</bug_when>
    <thetext>Is XML DOM indeed a correct component for this? Perhaps, I do not understand the differences well 
enough, but this doesn&apos;t sound related to XML at all (maybe HTML DOM?)</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18678</commentid>
    <comment_count>5</comment_count>
      <attachid>3755</attachid>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-04 12:27:59 -0700</bug_when>
    <thetext>Comment on attachment 3755
proposed patch

I believe this patch reintroduces a bug we fixed a while back. It was trying to
fix that one that we introduced this bug. This was back when we didn&apos;t always
make tests for the bugs we fixed (bad!) so there&apos;s no layout test.

The original fix was back on 2003-11-17, you can see it in the ChangeLog.

3485925: Safari does not correctly parse eight-digit hex character entities

Also, please don&apos;t include the TOKEN_DEBUG changes along with the bug fix. Lets
handle those separately.

Here&apos;s some text from the original bug report:

--------

Safari does not correctly parse eight-digit hex character entities. I noticed
this at &lt;http://www.alanwood.net/unicode/deseret.html&gt;.

&quot;&amp;#x0010400;&quot; in HTML works fine (gives me the glyph for U+10400, &quot;DESERET
CAPITAL LETTER LONG I&quot;).  But if I use &quot;&amp;#x00010400;&quot;, the page renders
incorrectly; I get a Last Resort glyph followed by &quot;0;&quot;

It looks like the numeric entity parser only looks seven digits into the hex
string, so displays U+1040 followed by &quot;0;&quot;, instead of realizing that it&apos;s all
one entity and displaying U+10400.

--------

So we should make a new fix that works properly for both and probably make a
layout test for the older bug fix.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18691</commentid>
    <comment_count>6</comment_count>
      <attachid>3760</attachid>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-04 13:14:36 -0700</bug_when>
    <thetext>Created attachment 3760
test case for the older bug</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18692</commentid>
    <comment_count>7</comment_count>
      <attachid>3761</attachid>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-04 13:17:15 -0700</bug_when>
    <thetext>Created attachment 3761
proposed patch

Just changed the length from 9 to 10 - this seems to fix the issue...

I filed TOKEN_DEBUG patch as bug 4849.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18702</commentid>
    <comment_count>8</comment_count>
      <attachid>3761</attachid>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-04 13:52:58 -0700</bug_when>
    <thetext>Comment on attachment 3761
proposed patch

OK, r=me.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18747</commentid>
    <comment_count>9</comment_count>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-05 10:09:36 -0700</bug_when>
    <thetext>Gecko seems to allow hexadecimal entities with any number of digits; it lets extra leading digits overflow 
off the left.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18749</commentid>
    <comment_count>10</comment_count>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-05 10:15:51 -0700</bug_when>
    <thetext>I added the test case to layout tests.

Perhaps we should change the hexadecimal entity code to not have a limit on number of digits to match 
Gecko/Firefox? Seems that would be simple. I guess it depends on what the behavior of WinIE is, and 
probably worth investigating too-long decimal entities too.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>18803</commentid>
    <comment_count>11</comment_count>
      <attachid>3783</attachid>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-06 03:42:13 -0700</bug_when>
    <thetext>Created attachment 3783
Long entities test

Here is a test matrix for long and otherwise invalid entities with results from
several browsers.

Mozilla bug report about bignum entities is here:
&lt;https://bugzilla.mozilla.org/show_bug.cgi?id=215622&gt;.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19290</commentid>
    <comment_count>12</comment_count>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-10 12:56:44 -0700</bug_when>
    <thetext>Do the matrix entries for WinIE that say &quot;?&quot; mean an actual question mark character? Do the empty matrix 
entries for MacIE mean that no characters at all were emitted?

It seems like our behavior is not much like other browsers -- we should probably do a little work in this 
area?

What about when non-digit characters are included before the semicolon?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19302</commentid>
    <comment_count>13</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-10 13:41:08 -0700</bug_when>
    <thetext>(In reply to comment #12)
&gt; Do the matrix entries for WinIE that say &quot;?&quot; mean an actual question mark character? Do the empty
&gt; matrix entries for MacIE mean that no characters at all were emitted?

  Yes in both cases.

&gt; It seems like our behavior is not much like other browsers -- we should probably do a little work in 
&gt; this area?

  Probably... Would you prefer to have this done while resolving this bug, or in a separate one?

&gt; What about when non-digit characters are included before the semicolon?

  I would expect them to bail out immediately (otherwise, it&apos;s too easy to lose a lot of content because 
of a non-terminated entity)... I&apos;ll check on Monday.

  A note at &lt;http://www.w3.org/TR/REC-html40/charset.html#entities&gt; suggests some more evil 
cases to possibly deal with:

&gt;In SGML, it is possible to eliminate the final &quot;;&quot; after a character 
&gt;reference in some cases (e.g., at a line break or immediately before 
&gt;a tag). In other circumstances it may not be eliminated (e.g., in the 
&gt;middle of a word). We strongly suggest using the &quot;;&quot; in all cases 
&gt;to avoid problems with user agents that require this character to be present.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19304</commentid>
    <comment_count>14</comment_count>
    <who name="Darin Adler">darin</who>
    <bug_when>2005-09-10 14:07:20 -0700</bug_when>
    <thetext>I think we should make one or more new bug reports for the issues we turned up here. I&apos;m going to land 
your patch to fix the &quot;split across two separate tokenizer writes&quot; issue and close this bug report.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19450</commentid>
    <comment_count>15</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2005-09-12 01:48:13 -0700</bug_when>
    <thetext>(In reply to comment #14)

Done; bug 4948.</thetext>
  </long_desc>
      
          <attachment
              isobsolete="1"
              ispatch="1"
              isprivate="0"
          >
            <attachid>3755</attachid>
            <date>2005-09-04 11:17:30 -0700</date>
            <delta_ts>2005-09-04 13:17:15 -0700</delta_ts>
            <desc>proposed patch</desc>
            <filename>entities.txt</filename>
            <type>text/plain</type>
            <size>4504</size>
            <attacher name="Alexey Proskuryakov">ap</attacher>
            
              <data encoding="base64">SW5kZXg6IGh0bWx0b2tlbml6ZXIuY3BwCj09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09
PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT0KUkNTIGZpbGU6IC9jdnMvcm9vdC9X
ZWJDb3JlL2todG1sL2h0bWwvaHRtbHRva2VuaXplci5jcHAsdgpyZXRyaWV2aW5nIHJldmlzaW9u
IDEuMTA5CmRpZmYgLXAgLXUgLXIxLjEwOSBodG1sdG9rZW5pemVyLmNwcAotLS0gaHRtbHRva2Vu
aXplci5jcHAJMjcgQXVnIDIwMDUgMDA6MTI6MzMgLTAwMDAJMS4xMDkKKysrIGh0bWx0b2tlbml6
ZXIuY3BwCTQgU2VwIDIwMDUgMTg6MTQ6NTggLTAwMDAKQEAgLTU4Myw3ICs1ODMsNyBAQCB2b2lk
IEhUTUxUb2tlbml6ZXI6OnBhcnNlQ29tbWVudChUb2tlbml6CiAgICAgICAgIHNjcmlwdENvZGVb
IHNjcmlwdENvZGVTaXplKysgXSA9ICpzcmM7CiAjaWYgZGVmaW5lZChUT0tFTl9ERUJVRykgJiYg
VE9LRU5fREVCVUcgPiAxCiAgICAgICAgIHFEZWJ1ZygiY29tbWVudCBpcyBub3c6IColcyoiLAot
ICAgICAgICAgICAgICAgUUNvbnN0U3RyaW5nKChRQ2hhciopc3JjLmN1cnJlbnQoKSwga01pbigx
Niwgc3JjLmxlbmd0aCgpKSkuc3RyaW5nKCkubGF0aW4xKCkpOworICAgICAgICAgICAgICAgUUNv
bnN0U3RyaW5nKChRQ2hhciopc3JjLm9wZXJhdG9yLT4oKSwga01pbigxNlUsIHNyYy5sZW5ndGgo
KSkpLnFzdHJpbmcoKS5sYXRpbjEoKSk7CiAjZW5kaWYKIAogICAgICAgICBpZiAoc3RyaWN0KSB7
CkBAIC03NjMsMTkgKzc2MywyMCBAQCB2b2lkIEhUTUxUb2tlbml6ZXI6OnBhcnNlRW50aXR5KFRv
a2VuaXplCiAKICAgICAgICAgY2FzZSBIZXhhZGVjaW1hbDoKICAgICAgICAgewotICAgICAgICAg
ICAgaW50IGxsID0ga01pbihzcmMubGVuZ3RoKCksIDhVKTsKKyAgICAgICAgICAgIGludCBsbCA9
IGtNaW4oc3JjLmxlbmd0aCgpLCA5LWNCdWZmZXJQb3MpOwogICAgICAgICAgICAgd2hpbGUobGwt
LSkgewogICAgICAgICAgICAgICAgIFFDaGFyIGNzcmMoc3JjLT5sb3dlcigpKTsKICAgICAgICAg
ICAgICAgICBjYyA9IGNzcmMuY2VsbCgpOwogCiAgICAgICAgICAgICAgICAgaWYoY3NyYy5yb3co
KSB8fCAhKChjYyA+PSAnMCcgJiYgY2MgPD0gJzknKSB8fCAoY2MgPj0gJ2EnICYmIGNjIDw9ICdm
JykpKSB7CisgICAgICAgICAgICAgICAgICAgIEVudGl0eSA9IFNlYXJjaFNlbWljb2xvbjsKICAg
ICAgICAgICAgICAgICAgICAgYnJlYWs7CiAgICAgICAgICAgICAgICAgfQogICAgICAgICAgICAg
ICAgIEVudGl0eVVuaWNvZGVWYWx1ZSA9IEVudGl0eVVuaWNvZGVWYWx1ZSoxNiArIChjYyAtICgg
Y2MgPCAnYScgPyAnMCcgOiAnYScgLSAxMCkpOwogICAgICAgICAgICAgICAgIGNCdWZmZXJbY0J1
ZmZlclBvcysrXSA9IGNjOwogICAgICAgICAgICAgICAgICsrc3JjOwogICAgICAgICAgICAgfQot
ICAgICAgICAgICAgRW50aXR5ID0gU2VhcmNoU2VtaWNvbG9uOworICAgICAgICAgICAgaWYoY0J1
ZmZlclBvcyA9PSA5KSAgRW50aXR5ID0gU2VhcmNoU2VtaWNvbG9uOwogICAgICAgICAgICAgYnJl
YWs7CiAgICAgICAgIH0KICAgICAgICAgY2FzZSBEZWNpbWFsOgpAQCAtODc5LDEwICs4ODAsMTAg
QEAgdm9pZCBIVE1MVG9rZW5pemVyOjpwYXJzZVRhZyhUb2tlbml6ZXJTdAogICAgICAgICBjaGVj
a0J1ZmZlcigpOwogI2lmIGRlZmluZWQoVE9LRU5fREVCVUcpICYmIFRPS0VOX0RFQlVHID4gMQog
ICAgICAgICB1aW50IGwgPSAwOwotICAgICAgICB3aGlsZShsIDwgc3JjLmxlbmd0aCgpICYmICgq
KHNyYy5jdXJyZW50KCkrbCkpLmxhdGluMSgpICE9ICc+JykKKyAgICAgICAgd2hpbGUobCA8IHNy
Yy5sZW5ndGgoKSAmJiAoKihzcmMub3BlcmF0b3ItPigpK2wpKS5sYXRpbjEoKSAhPSAnPicpCiAg
ICAgICAgICAgICBsKys7CiAgICAgICAgIHFEZWJ1Zygic3JjIGlzIG5vdzogKiVzKiwgdHF1b3Rl
OiAlZCIsCi0gICAgICAgICAgICAgICBRQ29uc3RTdHJpbmcoKFFDaGFyKilzcmMuY3VycmVudCgp
LCBsKS5xc3RyaW5nKCkubGF0aW4xKCksIHRxdW90ZSk7CisgICAgICAgICAgICAgICBRQ29uc3RT
dHJpbmcoKFFDaGFyKilzcmMub3BlcmF0b3ItPigpLCBsKS5xc3RyaW5nKCkubGF0aW4xKCksIHRx
dW90ZSk7CiAjZW5kaWYKICAgICAgICAgc3dpdGNoKHRhZykgewogICAgICAgICBjYXNlIE5vVGFn
OgpAQCAtMTIyNCw3ICsxMjI1LDcgQEAgdm9pZCBIVE1MVG9rZW5pemVyOjpwYXJzZVRhZyhUb2tl
bml6ZXJTdAogCiAgICAgICAgICAgICBBdG9taWNTdHJpbmcgdGFnTmFtZSA9IGN1cnJUb2tlbi50
YWdOYW1lOwogI2lmIGRlZmluZWQoVE9LRU5fREVCVUcpICYmIFRPS0VOX0RFQlVHID4gMAotICAg
ICAgICAgICAga2REZWJ1ZyggNjAzNiApIDw8ICJhcHBlbmRpbmcgVGFnOiAiIDw8IHRhZ0lEIDw8
IGVuZGw7CisgICAgICAgICAgICBrZERlYnVnKCA2MDM2ICkgPDwgImFwcGVuZGluZyBUYWc6ICIg
PDwgdGFnTmFtZS5xc3RyaW5nKCkgPDwgZW5kbDsKICNlbmRpZgogCiAgICAgICAgICAgICAvLyBI
YW5kbGUgPHNjcmlwdCBzcmM9ImZvbyIvPiBsaWtlIE1vemlsbGEvT3BlcmEuIFdlIGhhdmUgdG8g
ZG8gdGhpcyBub3cgZm9yIERhc2hib2FyZApAQCAtMTM5OCw3ICsxMzk5LDcgQEAgdm9pZCBIVE1M
VG9rZW5pemVyOjphZGRQZW5kaW5nKCkKIHZvaWQgSFRNTFRva2VuaXplcjo6d3JpdGUoY29uc3Qg
VG9rZW5pemVyU3RyaW5nICZzdHIsIGJvb2wgYXBwZW5kRGF0YSkKIHsKICNpZmRlZiBUT0tFTl9E
RUJVRwotICAgIGtkRGVidWcoIDYwMzYgKSA8PCB0aGlzIDw8ICIgVG9rZW5pemVyOjp3cml0ZShc
IiIgPDwgc3RyIDw8ICJcIiwiIDw8IGFwcGVuZERhdGEgPDwgIikiIDw8IGVuZGw7CisgICAga2RE
ZWJ1ZyggNjAzNiApIDw8IHRoaXMgPDwgIiBUb2tlbml6ZXI6OndyaXRlKFwiIiA8PCBzdHIudG9T
dHJpbmcoKSA8PCAiXCIsIiA8PCBhcHBlbmREYXRhIDw8ICIpIiA8PCBlbmRsOwogI2VuZGlmCiAK
ICAgICBpZiAoIWJ1ZmZlcikKQEAgLTE4MjgsOCArMTgyOSw4IEBAIHZvaWQgSFRNTFRva2VuaXpl
cjo6cHJvY2Vzc1Rva2VuKCkKICAgICBpZiAoIGRlc3QgPiBidWZmZXIgKQogICAgIHsKICNpZmRl
ZiBUT0tFTl9ERUJVRwotICAgICAgICBpZihjdXJyVG9rZW4uaWQpIHsKLSAgICAgICAgICAgIHFE
ZWJ1ZyggInVuZXhwZWN0ZWQgdG9rZW4gaWQ6ICVkLCBzdHI6IColcyoiLCBjdXJyVG9rZW4uaWQs
UUNvbnN0U3RyaW5nKCBidWZmZXIsZGVzdC1idWZmZXIgKS5xc3RyaW5nKCkubGF0aW4xKCkgKTsK
KyAgICAgICAgaWYoY3VyclRva2VuLnRhZ05hbWUubGVuZ3RoKCkpIHsKKyAgICAgICAgICAgIHFE
ZWJ1ZyggInVuZXhwZWN0ZWQgdG9rZW46ICVzLCBzdHI6IColcyoiLCBjdXJyVG9rZW4udGFnTmFt
ZS5xc3RyaW5nKCkubGF0aW4xKCksUUNvbnN0U3RyaW5nKCBidWZmZXIsZGVzdC1idWZmZXIgKS5x
c3RyaW5nKCkubGF0aW4xKCkgKTsKICAgICAgICAgICAgIGFzc2VydCgwKTsKICAgICAgICAgfQog
CkBAIC0xODQ5LDEyICsxODUwLDEyIEBAIHZvaWQgSFRNTFRva2VuaXplcjo6cHJvY2Vzc1Rva2Vu
KCkKICAgICBkZXN0ID0gYnVmZmVyOwogCiAjaWZkZWYgVE9LRU5fREVCVUcKLSAgICBRU3RyaW5n
IG5hbWUgPSBnZXRUYWdOYW1lKGN1cnJUb2tlbi5pZCkucXN0cmluZygpOworICAgIFFTdHJpbmcg
bmFtZSA9IGN1cnJUb2tlbi50YWdOYW1lLnFzdHJpbmcoKTsKICAgICBRU3RyaW5nIHRleHQ7CiAg
ICAgaWYoY3VyclRva2VuLnRleHQpCiAgICAgICAgIHRleHQgPSBRQ29uc3RTdHJpbmcoY3VyclRv
a2VuLnRleHQtPnMsIGN1cnJUb2tlbi50ZXh0LT5sKS5xc3RyaW5nKCk7CiAKLSAgICBrZERlYnVn
KCA2MDM2ICkgPDwgIlRva2VuIC0tPiAiIDw8IG5hbWUgPDwgIiAgIGlkID0gIiA8PCBjdXJyVG9r
ZW4uaWQgPDwgZW5kbDsKKyAgICBrZERlYnVnKCA2MDM2ICkgPDwgIlRva2VuIC0tPiAiIDw8IG5h
bWUgPDwgZW5kbDsKICAgICBpZiAoY3VyclRva2VuLmZsYXQpCiAgICAgICAgIGtkRGVidWcoIDYw
MzYgKSA8PCAiVG9rZW4gaXMgRkxBVCEiIDw8IGVuZGw7CiAgICAgaWYoIXRleHQuaXNOdWxsKCkp
CkBAIC0xODY0LDcgKzE4NjUsNyBAQCB2b2lkIEhUTUxUb2tlbml6ZXI6OnByb2Nlc3NUb2tlbigp
CiAgICAgICAgIGtkRGVidWcoIDYwMzYgKSA8PCAiQXR0cmlidXRlczogIiA8PCBsIDw8IGVuZGw7
CiAgICAgICAgIGZvciAodW5zaWduZWQgbG9uZyBpID0gMDsgaSA8IGw7ICsraSkgewogICAgICAg
ICAgICAgQXR0cmlidXRlSW1wbCogYyA9IGN1cnJUb2tlbi5hdHRycy0+YXR0cmlidXRlSXRlbShp
KTsKLSAgICAgICAgICAgIGtkRGVidWcoIDYwMzYgKSA8PCAiICAgICIgPDwgYy0+bmFtZSgpLnFz
dHJpbmcoKQorICAgICAgICAgICAga2REZWJ1ZyggNjAzNiApIDw8ICIgICAgIiA8PCBjLT5sb2Nh
bE5hbWUoKS5xc3RyaW5nKCkKICAgICAgICAgICAgICAgICAgICAgICAgICAgICA8PCAiPVwiIiA8
PCBjLT52YWx1ZSgpLnFzdHJpbmcoKSA8PCAiXCIiIDw8IGVuZGw7CiAgICAgICAgIH0KICAgICB9
Cg==
</data>
<flag name="review"
          id="525"
          type_id="1"
          status="-"
          setter="darin"
    />
          </attachment>
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>3760</attachid>
            <date>2005-09-04 13:14:36 -0700</date>
            <delta_ts>2005-09-04 13:14:36 -0700</delta_ts>
            <desc>test case for the older bug</desc>
            <filename>hex-entities-length.html</filename>
            <type>text/html</type>
            <size>356</size>
            <attacher name="Alexey Proskuryakov">ap</attacher>
            
              <data encoding="base64">PEhFQUQ+Cjx0aXRsZT5IZXggSFRNTCBlbnRpdGllcyBvZiB2YXJpb3VzIGxlbmd0aHM8L3RpdGxl
Pgo8bWV0YSBodHRwLWVxdWl2PSJDb250ZW50LVR5cGUiIGNvbnRlbnQ9InRleHQvaHRtbDsgY2hh
cnNldD11dGYtOCI+CjwvSEVBRD4KPEJPRFk+CjxwPlRoZXJlIHNob3VsZCBiZSA2IGxpbmVzIGJl
bG93LCBlYWNoIHdpdGggdHdvIGNvcGllcyBvZiB0aGUgc2FtZSBjaGFyYWN0ZXIuPC9wPgo8cD5h
JiN4NjE7PC9wPgo8cD7huIEmI3gxZTAxOzwvcD4KPHA+8JCQgCYjeDEwNDAwOzwvcD4KPHA+8JCR
jSYjeDAxMDQ0RDs8L3A+CjxwPvCQkY4mI3gwMDEwNDRlOzwvcD4KPHA+8JCRjyYjeDAwMDEwNDRm
OzwvcD4KPC9CT0RZPgo=
</data>

          </attachment>
          <attachment
              isobsolete="0"
              ispatch="1"
              isprivate="0"
          >
            <attachid>3761</attachid>
            <date>2005-09-04 13:17:15 -0700</date>
            <delta_ts>2005-09-04 13:52:58 -0700</delta_ts>
            <desc>proposed patch</desc>
            <filename>entities2.txt</filename>
            <type>text/plain</type>
            <size>1155</size>
            <attacher name="Alexey Proskuryakov">ap</attacher>
            
              <data encoding="base64">SW5kZXg6IGh0bWx0b2tlbml6ZXIuY3BwCj09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09
PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT0KUkNTIGZpbGU6IC9jdnMvcm9vdC9X
ZWJDb3JlL2todG1sL2h0bWwvaHRtbHRva2VuaXplci5jcHAsdgpyZXRyaWV2aW5nIHJldmlzaW9u
IDEuMTA5CmRpZmYgLXAgLXUgLXIxLjEwOSBodG1sdG9rZW5pemVyLmNwcAotLS0gaHRtbHRva2Vu
aXplci5jcHAJMjcgQXVnIDIwMDUgMDA6MTI6MzMgLTAwMDAJMS4xMDkKKysrIGh0bWx0b2tlbml6
ZXIuY3BwCTQgU2VwIDIwMDUgMTg6MTQ6NTggLTAwMDAKQEAgLTc2MywxOSArNzYzLDIwIEBAIHZv
aWQgSFRNTFRva2VuaXplcjo6cGFyc2VFbnRpdHkoVG9rZW5pemUKIAogICAgICAgICBjYXNlIEhl
eGFkZWNpbWFsOgogICAgICAgICB7Ci0gICAgICAgICAgICBpbnQgbGwgPSBrTWluKHNyYy5sZW5n
dGgoKSwgOFUpOworICAgICAgICAgICAgaW50IGxsID0ga01pbihzcmMubGVuZ3RoKCksIDEwLWNC
dWZmZXJQb3MpOwogICAgICAgICAgICAgd2hpbGUobGwtLSkgewogICAgICAgICAgICAgICAgIFFD
aGFyIGNzcmMoc3JjLT5sb3dlcigpKTsKICAgICAgICAgICAgICAgICBjYyA9IGNzcmMuY2VsbCgp
OwogCiAgICAgICAgICAgICAgICAgaWYoY3NyYy5yb3coKSB8fCAhKChjYyA+PSAnMCcgJiYgY2Mg
PD0gJzknKSB8fCAoY2MgPj0gJ2EnICYmIGNjIDw9ICdmJykpKSB7CisgICAgICAgICAgICAgICAg
ICAgIEVudGl0eSA9IFNlYXJjaFNlbWljb2xvbjsKICAgICAgICAgICAgICAgICAgICAgYnJlYWs7
CiAgICAgICAgICAgICAgICAgfQogICAgICAgICAgICAgICAgIEVudGl0eVVuaWNvZGVWYWx1ZSA9
IEVudGl0eVVuaWNvZGVWYWx1ZSoxNiArIChjYyAtICggY2MgPCAnYScgPyAnMCcgOiAnYScgLSAx
MCkpOwogICAgICAgICAgICAgICAgIGNCdWZmZXJbY0J1ZmZlclBvcysrXSA9IGNjOwogICAgICAg
ICAgICAgICAgICsrc3JjOwogICAgICAgICAgICAgfQotICAgICAgICAgICAgRW50aXR5ID0gU2Vh
cmNoU2VtaWNvbG9uOworICAgICAgICAgICAgaWYoY0J1ZmZlclBvcyA9PSAxMCkgIEVudGl0eSA9
IFNlYXJjaFNlbWljb2xvbjsKICAgICAgICAgICAgIGJyZWFrOwogICAgICAgICB9CiAgICAgICAg
IGNhc2UgRGVjaW1hbDoK
</data>
<flag name="review"
          id="529"
          type_id="1"
          status="+"
          setter="darin"
    />
          </attachment>
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>3783</attachid>
            <date>2005-09-06 03:42:13 -0700</date>
            <delta_ts>2005-09-06 03:42:13 -0700</delta_ts>
            <desc>Long entities test</desc>
            <filename>entities-length.html</filename>
            <type>text/html</type>
            <size>1596</size>
            <attacher name="Alexey Proskuryakov">ap</attacher>
            
              <data encoding="base64">PEhFQUQ+Cjx0aXRsZT5IVE1MIG51bWVyaWMgZW50aXRpZXMgZXJyb3IgcmVjb3Zlcnk8L3RpdGxl
Pgo8bWV0YSBodHRwLWVxdWl2PSJDb250ZW50LVR5cGUiIGNvbnRlbnQ9InRleHQvaHRtbDsgY2hh
cnNldD11dGYtOCI+CjwvSEVBRD4KPEJPRFk+Cjx0YWJsZT4KIDx0cj4KIAk8dGQ+PC90ZD4KIAk8
dGQ+V2luSUU8L3RkPgogCTx0ZD5NYWNJRTwvdGQ+CiAJPHRkPkZpcmVmb3ggKE1hYyk8L3RkPgog
CTx0ZD5DdXJyZW50PC90ZD4KIDwvdHI+CiA8dHI+CiAJPHRkPiZhbXA7Izk3OzwvdGQ+CiAJPHRk
PmE8L3RkPgogCTx0ZD5hPC90ZD4KIAk8dGQ+YTwvdGQ+CiAJPHRkPiYjOTc7PC90ZD4KIDwvdHI+
CiA8dHI+CiAJPHRkPiZhbXA7IzAwMDAwMDA5Nzs8L3RkPgogCTx0ZD4/PC90ZD4KIAk8dGQ+YTwv
dGQ+CiAJPHRkPmE8L3RkPgogCTx0ZD4mIzAwMDAwMDA5Nzs8L3RkPgogPC90cj4KIDx0cj4KIAk8
dGQ+JmFtcDsjMDAwMDAwMDAwMDAwMDAwMDA5Nzs8L3RkPgogCTx0ZD4/PC90ZD4KIAk8dGQ+YTwv
dGQ+CiAJPHRkPmE8L3RkPgogCTx0ZD4mIzAwMDAwMDAwMDAwMDAwMDAwOTc7PC90ZD4KIDwvdHI+
CiA8dHI+CiAJPHRkPiZhbXA7IzI2ODQzNTU1MzsgKDB4MTAwMDAwNjEpPC90ZD4KIAk8dGQ+Pzwv
dGQ+CiAJPHRkPjwvdGQ+CiAJPHRkPmJyb2tlbjwvdGQ+CiAJPHRkPiYjMjY4NDM1NTUzOzwvdGQ+
CiA8L3RyPgogPHRyPgogCTx0ZD4mYW1wOyM1MzY4NzEwMDk7ICgweDIwMDAwMDYxKTwvdGQ+CiAJ
PHRkPj88L3RkPgogCTx0ZD48L3RkPgogCTx0ZD5icm9rZW48L3RkPgogCTx0ZD4mIzUzNjg3MTAw
OTs8L3RkPgogPC90cj4KIDx0cj4KIAk8dGQ+JmFtcDsjNDI5NDk2NzM5MzsgKDB4MTAwMDAwMDYx
KTwvdGQ+CiAJPHRkPj88L3RkPgogCTx0ZD48L3RkPgogCTx0ZD5hPC90ZD4KIAk8dGQ+JiM0Mjk0
OTY3MzkzOzwvdGQ+CiA8L3RyPgogPHRyPgogCTx0ZD4mYW1wOyN4NjE7PC90ZD4KIAk8dGQ+YTwv
dGQ+CiAJPHRkPmE8L3RkPgogCTx0ZD5hPC90ZD4KIAk8dGQ+JiN4NjE7PC90ZD4KIDwvdHI+CiA8
dHI+CiAJPHRkPiZhbXA7I3gwMDAwMDAwNjE7PC90ZD4KIAk8dGQ+PzwvdGQ+CiAJPHRkPmE8L3Rk
PgogCTx0ZD5hPC90ZD4KIAk8dGQ+JiN4MDAwMDAwMDYxOzwvdGQ+CiA8L3RyPgogPHRyPgogCTx0
ZD4mYW1wOyN4MDAwMDAwMDAwMDAwMDAwNjE7PC90ZD4KIAk8dGQ+PzwvdGQ+CiAJPHRkPmE8L3Rk
PgogCTx0ZD5hPC90ZD4KIAk8dGQ+JiN4MDAwMDAwMDAwMDAwMDAwNjE7PC90ZD4KIDwvdHI+CiA8
dHI+CiAJPHRkPiZhbXA7I3gxMDAwMDA2MTs8L3RkPgogCTx0ZD4/PC90ZD4KIAk8dGQ+PC90ZD4K
IAk8dGQ+YnJva2VuPC90ZD4KIAk8dGQ+JiN4MTAwMDAwNjE7PC90ZD4KIDwvdHI+CiA8dHI+CiAJ
PHRkPiZhbXA7I3gyMDAwMDA2MTs8L3RkPgogCTx0ZD4/PC90ZD4KIAk8dGQ+PC90ZD4KIAk8dGQ+
YnJva2VuPC90ZD4KIAk8dGQ+JiN4MjAwMDAwNjE7PC90ZD4KIDwvdHI+CiA8dHI+CiAJPHRkPiZh
bXA7I3gxMDAwMDAwNjE7PC90ZD4KIAk8dGQ+PzwvdGQ+CiAJPHRkPjwvdGQ+CiAJPHRkPmE8L3Rk
PgogCTx0ZD4mI3gxMDAwMDAwNjE7PC90ZD4KIDwvdHI+CjwvdGQ+CjwvdGFibGU+CjwvQk9EWT4K
</data>

          </attachment>
      

    </bug>

</bugzilla>