IDL scraper is currently pulling in non-authoritative IDL

Open mdittmer opened this issue 8 years ago • 5 comments

At least some of this is coming from <pre> without class="idl"; e.g., from the WebIDL spec's description of IDL syntax.
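A minimal sketch of the difference between matching every `<pre>` and matching only `pre.idl`, using Python's standard `html.parser`. The names `PreCollector` and `extract_pre` are hypothetical and are not taken from the scraper in this repository; the sketch only illustrates why an unqualified `<pre>` match also picks up syntax illustrations.

```python
from html.parser import HTMLParser


class PreCollector(HTMLParser):
    """Collect the text of every <pre> block along with its class list."""

    def __init__(self):
        super().__init__()
        self.blocks = []    # list of (classes, text) tuples
        self._depth = 0     # > 0 while inside a <pre>
        self._classes = []
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag != "pre":
            return
        if self._depth == 0:
            # A bare `class` attribute has the value None, hence the `or ""`.
            self._classes = (dict(attrs).get("class") or "").split()
            self._chunks = []
        self._depth += 1

    def handle_endtag(self, tag):
        if tag != "pre" or self._depth == 0:
            return
        self._depth -= 1
        if self._depth == 0:
            self.blocks.append((self._classes, "".join(self._chunks)))

    def handle_data(self, data):
        if self._depth > 0:
            self._chunks.append(data)


def extract_pre(html, require_idl_class):
    """Return <pre> texts; keep only class="idl" blocks when required."""
    parser = PreCollector()
    parser.feed(html)
    parser.close()
    return [text for classes, text in parser.blocks
            if not require_idl_class or "idl" in classes]


# Hypothetical page: one real IDL block and one syntax illustration.
page = (
    '<pre class="idl">interface Foo {};</pre>'
    '<pre>interface identifier { /* members */ };</pre>'
)
all_blocks = extract_pre(page, require_idl_class=False)
idl_blocks = extract_pre(page, require_idl_class=True)
```

With the sample page, `all_blocks` contains both snippets while `idl_blocks` contains only the `interface Foo` block.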

mdittmer avatar Jan 19 '17 16:01 mdittmer

@LoonyBean FYI. This might be related to some false positives you've been seeing.

mdittmer avatar Jan 19 '17 16:01 mdittmer

Switching to matching on pre.idl instead of just pre results in losing data for the following items that appear in Blink's IDL (first two columns: count-of-things-in-current-data, count-of-things-in-pre.idl-data):

1       0       HTMLKeygenElement       https://html.spec.whatwg.org/
1       0       PermissionName          https://w3c.github.io/permissions/
1       0       MutationEvent           https://w3c.github.io/uievents/
1       0       KeyFormat               https://w3c.github.io/webcrypto/Overview.html
1       0       KeyUsage                https://w3c.github.io/webcrypto/Overview.html
1       0       SubtleCrypto            https://w3c.github.io/webcrypto/Overview.html
1       0       AlgorithmIdentifier     https://w3c.github.io/webcrypto/Overview.html
1       0       Crypto                  https://w3c.github.io/webcrypto/Overview.html
1       0       CryptoKey               https://w3c.github.io/webcrypto/Overview.html
1       0       PaymentAppRequestData   https://w3c.github.io/webpayments-payment-apps-api/
1       0       MediaImageInit          https://wicg.github.io/mediasession/
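A list like the one above can be produced by comparing per-item counts from the two scraper runs. The following is only a sketch under assumptions: the function name `lost_items` and the `(name, spec URL) -> count` dictionary layout are hypothetical, and the `Window` entry is made-up sample data rather than a result from the actual scrape.

```python
def lost_items(current, idl_only):
    """Return (current_count, idl_count, name, url) rows for every item
    present in the current data but absent from the pre.idl data."""
    rows = []
    # Sort by spec URL, then by name, for a stable report.
    for key in sorted(current, key=lambda k: (k[1], k[0])):
        count = current[key]
        new_count = idl_only.get(key, 0)
        if count > 0 and new_count == 0:
            name, url = key
            rows.append((count, new_count, name, url))
    return rows


# Sample data; only the first two entries come from the list above.
html_spec = "https://html.spec.whatwg.org/"
webcrypto = "https://w3c.github.io/webcrypto/Overview.html"
current_data = {
    ("HTMLKeygenElement", html_spec): 1,
    ("Crypto", webcrypto): 1,
    ("Window", html_spec): 1,   # hypothetical item found by both runs
}
pre_idl_data = {
    ("Window", html_spec): 1,
}

for row in lost_items(current_data, pre_idl_data):
    print("\t".join(str(field) for field in row))
```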

mdittmer avatar Jan 19 '17 16:01 mdittmer

Also suspicious, but not in Blink:

1       0       ResponseBodyInit                        https://fetch.spec.whatwg.org/
1       0       PresentationConnectionClosedReason      https://w3c.github.io/presentation-api/
1       0       ProximitySensor                         https://w3c.github.io/sensors/
1       0       ProximitySensorDirection                https://w3c.github.io/sensors/
1       0       ProximitySensorOptions                  https://w3c.github.io/sensors/
1       0       ProximitySensorPosition                 https://w3c.github.io/sensors/
1       0       ProximitySensorReading                  https://w3c.github.io/sensors/
1       0       Keyframe                                https://w3c.github.io/web-animations/
1       0       KeyframeArgument                        https://w3c.github.io/web-animations/
1       0       PropertyIndexedKeyframes                https://w3c.github.io/web-animations/
1       0       ComputedKeyframe                        https://w3c.github.io/web-animations/
1       0       AesCbcParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       JsonWebKey                              https://w3c.github.io/webcrypto/Overview.html
1       0       KeyAlgorithm                            https://w3c.github.io/webcrypto/Overview.html
1       0       KeyType                                 https://w3c.github.io/webcrypto/Overview.html
1       0       NamedCurve                              https://w3c.github.io/webcrypto/Overview.html
1       0       Pbkdf2Params                            https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedImportParams                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedKeyAlgorithm                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedKeyGenParams                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaKeyAlgorithm                         https://w3c.github.io/webcrypto/Overview.html
1       0       RsaKeyGenParams                         https://w3c.github.io/webcrypto/Overview.html
1       0       RsaOaepParams                           https://w3c.github.io/webcrypto/Overview.html
1       0       RsaOtherPrimesInfo                      https://w3c.github.io/webcrypto/Overview.html
1       0       RsaPssParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesCtrParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesDerivedKeyParams                     https://w3c.github.io/webcrypto/Overview.html
1       0       AesGcmParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesKeyAlgorithm                         https://w3c.github.io/webcrypto/Overview.html
1       0       AesKeyGenParams                         https://w3c.github.io/webcrypto/Overview.html
1       0       Algorithm                               https://w3c.github.io/webcrypto/Overview.html
1       0       BigInteger                              https://w3c.github.io/webcrypto/Overview.html
1       0       CryptoKeyPair                           https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyAlgorithm                          https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyGenParams                          https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyImportParams                       https://w3c.github.io/webcrypto/Overview.html
1       0       EcdhKeyDeriveParams                     https://w3c.github.io/webcrypto/Overview.html
1       0       EcdsaParams                             https://w3c.github.io/webcrypto/Overview.html
1       0       GlobalCrypto                            https://w3c.github.io/webcrypto/Overview.html
1       0       HashAlgorithmIdentifier                 https://w3c.github.io/webcrypto/Overview.html
1       0       HkdfParams                              https://w3c.github.io/webcrypto/Overview.html
1       0       HmacImportParams                        https://w3c.github.io/webcrypto/Overview.html
1       0       HmacKeyAlgorithm                        https://w3c.github.io/webcrypto/Overview.html
1       0       HmacKeyGenParams                        https://w3c.github.io/webcrypto/Overview.html
1       0       RTCInboundRTPStreamStats                https://w3c.github.io/webrtc-pc/
1       0       RTCOutboundRTPStreamStats               https://w3c.github.io/webrtc-pc/
1       0       RTCRTPStreamStats                       https://w3c.github.io/webrtc-pc/
1       0       RTCStatsType                            https://w3c.github.io/webrtc-pc/
1       0       DetectedObject                          https://wicg.github.io/shape-detection-api/
1       0       VertexAttribFVSource                    https://www.khronos.org/registry/webgl/specs/latest/1.0/

mdittmer avatar Jan 19 '17 17:01 mdittmer

@foolip PTAL. WDYT of these lists? Note that the mentioned commit stores a manual snapshot of the first list, but not the second.

mdittmer avatar Jan 19 '17 17:01 mdittmer

Hmm, sure looks like some IDL that we do want in there. By requiring class=idl, what bogus things do we get rid of? Maybe those can be skipped in some other way?

foolip avatar Jan 19 '17 23:01 foolip