<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>224289</bug_id>
          
          <creation_ts>2021-04-07 10:05:21 -0700</creation_ts>
          <short_desc>results.webkit.org should provide API for EWS to check flakiness of tests</short_desc>
          <delta_ts>2025-02-13 12:03:46 -0800</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>Tools / Tests</component>
          <version>WebKit Nightly Build</version>
          <rep_platform>Unspecified</rep_platform>
          <op_sys>Unspecified</op_sys>
          <bug_status>ASSIGNED</bug_status>
          <resolution></resolution>
          
          <see_also>https://bugs.webkit.org/show_bug.cgi?id=224434</see_also>
    
    <see_also>https://bugs.webkit.org/show_bug.cgi?id=224435</see_also>
    
    <see_also>https://bugs.webkit.org/show_bug.cgi?id=204368</see_also>
    
    <see_also>https://bugs.webkit.org/show_bug.cgi?id=254148</see_also>
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>InRadar</keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Aakash Jain">aakash_jain</reporter>
          <assigned_to name="Jonathan Bedard">jbedard</assigned_to>
          <cc>aakash_jain</cc>
    
    <cc>ap</cc>
    
    <cc>cgambrell</cc>
    
    <cc>clopez</cc>
    
    <cc>jbedard</cc>
    
    <cc>jenner</cc>
    
    <cc>ryanhaddad</cc>
    
    <cc>webkit-bug-importer</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>1747942</commentid>
    <comment_count>0</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2021-04-07 10:05:21 -0700</bug_when>
    <thetext>results.webkit.org should provide a REST API for EWS to check if the test is passing, consistently failing or flaky.

API should accept these parameters: test name(s), commit identifier, test-suite (layout-tests, api-tests etc,), platform (macos, iOS etc.), configuration (debug, release etc.) and any other necessary parameter.
API should return whether the test is consistently passing, consistently failing, flaky etc.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1749224</commentid>
    <comment_count>1</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2021-04-12 08:39:47 -0700</bug_when>
    <thetext>This might need discussion about the specifics of the API we might need for EWS, specifically for flakiness information. I think we can tackle the problem in two parts: API for dealing with flaky failures in EWS, API for dealing with consistent failures in EWS.

For consistent failures, I filed two specific API requests in Bug 224434 and Bug 224435.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1749247</commentid>
    <comment_count>2</comment_count>
    <who name="Jonathan Bedard">jbedard</who>
    <bug_when>2021-04-12 09:23:56 -0700</bug_when>
    <thetext>I think the way that this API should work is that is should provide a &quot;percent likelihood&quot; for each outcome of a given test with a given configuration at a given commit. We will need to toy with the algorithm a bit to figure out what the appropriate way to rank commits surrounding the commit in question is, I&apos;m envisioning a result that looks something like this:

{
    &quot;PASS&quot;: 80,
    &quot;FAIL&quot;: 10,
    &quot;TIMEOUT&quot;: 5,
    &quot;CRASH&quot;: 5
}

Meaning that given the configuration that the user provided, we would expect that the given test passes 80% of the time, fails 10% of the time, timeout 5% of the time and crashes 5% of the time. From that point, EWS can decide if the pass percentage is high enough to justify failing the build.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1750177</commentid>
    <comment_count>3</comment_count>
    <who name="Radar WebKit Bug Importer">webkit-bug-importer</who>
    <bug_when>2021-04-14 10:06:17 -0700</bug_when>
    <thetext>&lt;rdar://problem/76651206&gt;</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1762563</commentid>
    <comment_count>4</comment_count>
      <attachid>429332</attachid>
    <who name="Chris Gambrell">cgambrell</who>
    <bug_when>2021-05-21 13:40:33 -0700</bug_when>
    <thetext>Created attachment 429332
Current mock-up of script</thetext>
  </long_desc>
      
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>429332</attachid>
            <date>2021-05-21 13:40:33 -0700</date>
            <delta_ts>2021-05-21 13:40:33 -0700</delta_ts>
            <desc>Current mock-up of script</desc>
            <filename>results.py</filename>
            <type>text/x-python-script</type>
            <size>2400</size>
            <attacher name="Chris Gambrell">cgambrell</attacher>
            
              <data encoding="base64">IyEvdXNyL2Jpbi9lbnYgcHl0aG9uMwoKaW1wb3J0IGFyZ3BhcnNlCmltcG9ydCBqc29uCmZyb20g
dXJsbGliLnBhcnNlIGltcG9ydCBxdW90ZSwgdXJsZW5jb2RlCmZyb20gdXJsbGliLnJlcXVlc3Qg
aW1wb3J0IHVybG9wZW4KCnBhcnNlciA9IGFyZ3BhcnNlLkFyZ3VtZW50UGFyc2VyKGRlc2NyaXB0
aW9uPSdHZXQgdGhlIHBlcmNlbnQgY2hhbmNlIG9mIHBhc3MsIHRpbWVvdXQsIGNyYXNoLCBvciBm
YWlsdXJlIG9mIGEgdGVzdCcpCgpwYXJzZXIuYWRkX2FyZ3VtZW50KCd0ZXN0JywgdHlwZT1zdHIs
IGhlbHA9J1RoZSB0ZXN0IHRvIHJldHVybiByZXN1bHRzIGZvci4nKQpwYXJzZXIuYWRkX2FyZ3Vt
ZW50KCctYScsICctLWFyY2hpdGVjdHVyZScsIHR5cGU9c3RyLCBjaG9pY2VzPVsnYXJtNjQnLCAn
eDg2JywgJ3g4Nl82NCddLCBoZWxwPSdUaGUgYXJjaGl0ZWN0dXJlIHRvIHBhc3MgdG8gdGhlIHJl
c3VsdHMgZGIgY2FsbC4nKQpwYXJzZXIuYWRkX2FyZ3VtZW50KCctZicsICctLWZsYXZvcicsIHR5
cGU9c3RyLCBjaG9pY2VzPVsnZ3B1cHJvY2VzcycsICd3azEnLCAnd2syJ10sIGhlbHA9J1RoZSBm
bGF2b3IgdG8gcGFzcyB0byB0aGUgcmVzdWx0cyBkYiBjYWxsLicpCnBhcnNlci5hZGRfYXJndW1l
bnQoJy1sJywgJy0tbGltaXQnLCB0eXBlPWludCwgZGVmYXVsdD0xMDAsIGhlbHA9J051bWJlciBv
ZiBidWlsZHMgdG8gcmV0dXJuIHJlc3VsdHMgZm9yLicpCnBhcnNlci5hZGRfYXJndW1lbnQoJy1t
JywgJy0tbW9kZWwnLCB0eXBlPXN0ciwgY2hvaWNlcz1bJ2lQaG9uZSA4JywgJ2lQYWQgKDV0aCBn
ZW5lcmF0aW9uKScsICdpUGhvbmUgU0UgKDFzdCBnZW5lcmF0aW9uKScsICdNYWNtaW5pOCwxJywg
J01hY21pbmk5LDEnXSwgaGVscD0nVGhlIG1vZGVsIHRvIHBhc3MgdG8gdGhlIHJlc3VsdHMgZGIg
Y2FsbC4nKQpwYXJzZXIuYWRkX2FyZ3VtZW50KCctcCcsICctLXBsYXRmb3JtJywgdHlwZT1zdHIs
IGNob2ljZXM9WydHVEsnLCAnaW9zJywgJ21hYycsICd3aW4nLCAnd2luY2FyaW8nLCAnV1BFJ10s
IGhlbHA9J1RoZSBwbGF0Zm9ybSB0byBwYXNzIHRvIHRoZSByZXN1bHRzIGRiIGNhbGwuJykKcGFy
c2VyLmFkZF9hcmd1bWVudCgnLXMnLCAnLS1zdHlsZScsIHR5cGU9c3RyLCBjaG9pY2VzPVsnZGVi
dWcnLCAncmVsZWFzZSddLCBoZWxwPSdUaGUgc3R5bGUgdG8gcGFzcyB0byB0aGUgcmVzdWx0cyBk
YiBjYWxsLicpCgp2b2x1bWUgPSBwYXJzZXIuYWRkX211dHVhbGx5X2V4Y2x1c2l2ZV9ncm91cCgp
CnZvbHVtZS5hZGRfYXJndW1lbnQoJy12JywgJy0tdmVyYm9zZScsIGFjdGlvbj0nc3RvcmVfdHJ1
ZScpCnZvbHVtZS5hZGRfYXJndW1lbnQoJy1xJywgJy0tcXVpZXQnLCBhY3Rpb249J3N0b3JlX3Ry
dWUnKQoKYXJncyA9IHBhcnNlci5wYXJzZV9hcmdzKCkKCnF1ZXJ5X3N0cmluZyA9IHsKCSdsaW1p
dCc6IGFyZ3MubGltaXQKfQoKaWYgYXJncy5hcmNoaXRlY3R1cmU6CglxdWVyeV9zdHJpbmdbJ2Fy
Y2hpdGVjdHVyZSddID0gYXJncy5hcmNoaXRlY3R1cmUKaWYgYXJncy5mbGF2b3I6CglxdWVyeV9z
dHJpbmdbJ2ZsYXZvciddID0gYXJncy5mbGF2b3IKaWYgYXJncy5wbGF0Zm9ybToKCXF1ZXJ5X3N0
cmluZ1sncGxhdGZvcm0nXSA9IGFyZ3MucGxhdGZvcm0KCndpdGggdXJsb3BlbignaHR0cHM6Ly9y
ZXN1bHRzLndlYmtpdC5vcmcvYXBpL3Jlc3VsdHMvbGF5b3V0LXRlc3RzL3t9P3t9Jy5mb3JtYXQo
cXVvdGUoYXJncy50ZXN0KSwgdXJsZW5jb2RlKHF1ZXJ5X3N0cmluZykpKSBhcyByZXNwb25zZToK
CWRhdGEgPSBqc29uLmxvYWRzKHJlc3BvbnNlLnJlYWQoKSkKCQoJZm9yIGkgaW4gZGF0YToKCQly
ZXN1bHRzID0gewoJCQknY29uZmlndXJhdGlvbic6IGlbJ2NvbmZpZ3VyYXRpb24nXSwKCQkJJ3Jl
c3VsdHMnOiB7CgkJCQknUEFTUyc6IDAsCgkJCQknRkFJTCc6IDAsCgkJCQknVElNRU9VVCc6IDAs
CgkJCQknQ1JBU0gnOiAwCgkJCX0KCQl9CgoJCWlmIGxlbihpWydyZXN1bHRzJ10pID09IDA6CgkJ
CWJyZWFrCgoJCXBsYWNlID0gbGVuKGlbJ3Jlc3VsdHMnXSkKCQliYXNlID0gMAoJCWZvciBqIGlu
IHJhbmdlKDEsIHBsYWNlICsgMSk6CgkJCWJhc2UgKz0gagoKCQlmb3IgcmVzdWx0IGluIGlbJ3Jl
c3VsdHMnXToKCQkJaWYgcmVzdWx0WydhY3R1YWwnXSA9PSAnVEVYVCc6CgkJCQlyZXN1bHRbJ2Fj
dHVhbCddID0gJ0ZBSUwnCgkJCXJlc3VsdHNbJ3Jlc3VsdHMnXVtyZXN1bHRbJ2FjdHVhbCddXSAr
PSBwbGFjZQoJCQlwbGFjZSAtPSAxCgoJCWZvciByZXN1bHRfdHlwZSBpbiByZXN1bHRzWydyZXN1
bHRzJ106CgkJCXJlc3VsdHNbJ3Jlc3VsdHMnXVtyZXN1bHRfdHlwZV0gPSByZXN1bHRzWydyZXN1
bHRzJ11bcmVzdWx0X3R5cGVdIC8gYmFzZSAqIDEwMAoKCQlwcmludChqc29uLmR1bXBzKHJlc3Vs
dHMpKQoK
</data>

          </attachment>
      

    </bug>

</bugzilla>