<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>34063</bug_id>
          
          <creation_ts>2010-01-24 18:57:53 -0800</creation_ts>
          <short_desc>fail to parse application/xhtml+xml files with encoding=&quot;iso-8859-1&quot; and libxml2 &gt;= 2.7.4</short_desc>
          <delta_ts>2010-08-30 08:09:19 -0700</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>XML</component>
          <version>528+ (Nightly build)</version>
          <rep_platform>PC</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>DUPLICATE</resolution>
          <dup_id>30508</dup_id>
          
          <bug_file_loc>http://www.vinc17.net/test/webkit-latin1.html</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>Major</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>0</everconfirmed>
          <reporter name="Vincent Lefevre">vincent-webkit</reporter>
          <assigned_to name="Nobody">webkit-unassigned</assigned_to>
          <cc>ap</cc>
    
    <cc>a.renevier</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>184130</commentid>
    <comment_count>0</comment_count>
      <attachid>47303</attachid>
    <who name="Vincent Lefevre">vincent-webkit</who>
    <bug_when>2010-01-24 18:57:53 -0800</bug_when>
    <thetext>Created attachment 47303
testcase

Webkit-based applications (midori, liferea, GtkLauncher) fail to parse XHTML files with encoding=&quot;iso-8859-1&quot;.

With the above URL (file also added as an attachment), under Linux (Debian) with libwebkit 1.1.19, I get:
  This page contains the following errors:
  error on line 2 at column 2: StartTag: invalid element name

and with a similar page (which validates with xmllint), under Mac OS X Tiger with Liferea and webkit-gtk 1.1.10, I get:
  This page contains the following errors:
  error on line 2 at column 2: Char 0x0 out of allowed range

(though there isn&apos;t such a character in the page).

There&apos;s no such problem with encoding=&quot;utf-8&quot;, e.g.
  http://www.vinc17.net/test/webkit-utf8.html

Note that these simplified examples contain only ASCII characters.

Also, I couldn&apos;t try with the latest nightly build (23 Jan) on my Mac OS X machine because it crashes immediately.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>184131</commentid>
    <comment_count>1</comment_count>
    <who name="Vincent Lefevre">vincent-webkit</who>
    <bug_when>2010-01-24 19:03:05 -0800</bug_when>
    <thetext>The bug occurs only when the file is served as application/xhtml+xml, not when it is served as text/html. That&apos;s bad because webkit declares to support application/xhtml+xml.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>184445</commentid>
    <comment_count>2</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2010-01-25 17:08:33 -0800</bug_when>
    <thetext>I cannot reproduce with Safari on Mac OS X.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>184589</commentid>
    <comment_count>3</comment_count>
    <who name="Vincent Lefevre">vincent-webkit</who>
    <bug_when>2010-01-26 07:31:02 -0800</bug_when>
    <thetext>I couldn&apos;t reproduce it either with Safari, but my machine is under Mac OS X Tiger, so that&apos;s quite old. Now, I wonder whether this is specific to GTK (but I don&apos;t see what GTK has to do with something related to the encoding or MIME type declaration).

Also I think that there were no such problems in the past (several months ago), but the bug still occurs with old Debian packages of midori and libwebkit-1.0-1.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>184592</commentid>
    <comment_count>4</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2010-01-26 07:38:36 -0800</bug_when>
    <thetext>Could be related to bug 30508.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>184669</commentid>
    <comment_count>5</comment_count>
    <who name="Vincent Lefevre">vincent-webkit</who>
    <bug_when>2010-01-26 12:19:35 -0800</bug_when>
    <thetext>(In reply to comment #4)
&gt; Could be related to bug 30508.

Yes, the bug occurs with the libxml2 2.7.4.dfsg-1 Debian package, but not with 2.7.3.dfsg-2.1.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>271436</commentid>
    <comment_count>6</comment_count>
    <who name="Vincent Lefevre">vincent-webkit</who>
    <bug_when>2010-08-30 08:09:19 -0700</bug_when>
    <thetext>The patch fixing bug 30508 also fixes the problem I&apos;ve reported. So, this is really a duplicate of bug 30508.

*** This bug has been marked as a duplicate of bug 30508 ***</thetext>
  </long_desc>
      
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>47303</attachid>
            <date>2010-01-24 18:57:53 -0800</date>
            <delta_ts>2010-01-24 18:59:33 -0800</delta_ts>
            <desc>testcase</desc>
            <filename>webkit-latin1.html</filename>
            <type>application/xhtml+xml</type>
            <size>299</size>
            <attacher name="Vincent Lefevre">vincent-webkit</attacher>
            
              <data encoding="base64">PD94bWwgdmVyc2lvbj0iMS4wIiBlbmNvZGluZz0iaXNvLTg4NTktMSI/Pgo8IURPQ1RZUEUgaHRt
bCBQVUJMSUMgIi0vL1czQy8vRFREIFhIVE1MIDEuMCBTdHJpY3QvL0VOIgogICJodHRwOi8vd3d3
LnczLm9yZy9UUi94aHRtbDEvRFREL3hodG1sMS1zdHJpY3QuZHRkIj4KPGh0bWwgeG1sbnM9Imh0
dHA6Ly93d3cudzMub3JnLzE5OTkveGh0bWwiPgo8aGVhZD4KPHRpdGxlPkRvY3VtZW50IHdpdGgg
ZW5jb2Rpbmc9Imlzby04ODU5LTEiPC90aXRsZT4KPC9oZWFkPgo8Ym9keT4KPHA+T0s8L3A+Cjwv
Ym9keT4KPC9odG1sPgo=
</data>

          </attachment>
      

    </bug>

</bugzilla>