General approach to capability negotiation #176

samuelweiler · 2021-03-19T17:01:42Z

I thank the editors for what appears to be an excellent fingerprinting analysis. This is exactly the sort of thing I'm looking for in specs.

As a general thing, why are we exposing device capabilities to the app for purposes of negotiation? Couldn't we instead have sites expose available media formats and have browsers (perhaps in a way not exposed the application) pick the one they like best? That way a browser wishing to be more privacy preserving could simply make a consistent choice, without having to fake an answer to this API, as recommended in https://w3c.github.io/media-capabilities/#decoding-encoding-fingerprinting.

chcunningham · 2021-03-20T03:28:16Z

I'm thrilled the fingerprinting analysis is good.

This section of the explainer lays out the design philosophy for the current API shape. It mentions your idea as well a potential follow up that could work in tandem this design (so far, not something we pursued). There are some cases having the browser pick works well. For instance, <video> may have multiple nested <source> tags and the UA does choose between those. But modern video APIs like MSE, EME, and WebRTC, have increasingly moved in the direction of letting sites choose what to stream and we tried to align our design with that direction.

Having the browser say which format it prefers is sometimes still compatible with the newer APIs. For instance, with MSE as used by sites like YouTube, this could work fine. But, with EME and WebRTC, its more complicated. For EME, a site like Netflix may balance the most performant stream configuration against the most secure stream configuration. When these are the alligned, the choice is easy. But they are not always aligned, and the site is in a better position to break a tie. With WebRTC, you may have a preferred format, but you have to participate in a negotiation with your peers to arrive at a format that everyone supports. An API that can tell you info about each possible format is better suited to populating the format negotiation ladder.

samuelweiler · 2021-03-31T17:49:57Z

@chcunningham, Thank you!

I'm thrilled the fingerprinting analysis is good.

This section of the explainer lays out the design philosophy for the current API shape. It mentions your idea as well a potential follow up that could work in tandem this design (so far, not something we pursued).

This isn't quite as detailed as I had hoped for. As you say, it mentions the possibility of the UA picking, but it says little about why that path isn't being chosen.

... But modern video APIs like MSE, EME, and WebRTC, have increasingly moved in the direction of letting sites choose what to stream and we tried to align our design with that direction.

If "we're following the example" is the argument, then I'd like to push back. I'm not convinced that these others got it right, and I'd like to take a fresh look here.

Having the browser say which format it prefers is sometimes still compatible with the newer APIs. For instance, with MSE as used by sites like YouTube, this could work fine. But, with EME and WebRTC, its more complicated. For EME, a site like Netflix may balance the most performant stream configuration against the most secure stream configuration. When these are the alligned, the choice is easy. But they are not always aligned, and the site is in a better position to break a tie.

The user might have an opinion in this case, also. e.g., the user might have low power availability and might prefer the lower power choice. As as you point out, there may be misalignment. Given that, I would argue that the UA - as the "user's agent" - is in the better place to break the tie, not the site.

With WebRTC, you may have a preferred format, but you have to participate in a negotiation with your peers to arrive at a format that everyone supports. An API that can tell you info about each possible format is better suited to populating the format negotiation ladder.

The more-than-two-party case (of WebRTC) seems different than the two-party case, and my understanding was that this API is for two-party cases, right? In that case, having the site provide information about what it supports - rather than the UA supply that information - would seem to provide sufficient completeness, right?

chcunningham · 2021-04-13T07:13:44Z

If "we're following the example" is the argument, then I'd like to push back. I'm not convinced that these others got it right, and I'd like to take a fresh look here.

The fresh look is welcome, but I think the proposed design is not feasible at this time. The MediaCapabilities API is widely implemented and used. We have an opportunity to make additions, improvements, refinements, etc... but we cannot make a breaking change of this magnitude.

The user might have an opinion in this case, also. e.g., the user might have low power availability and might prefer the lower power choice. As as you point out, there may be misalignment. Given that, I would argue that the UA - as the "user's agent" - is in the better place to break the tie, not the site.

Sites may offer users this choice while also factoring in their secret sauce for whatever they think makes the best user experience.

The more-than-two-party case (of WebRTC) seems different than the two-party case, and my understanding was that this API is for two-party cases, right? In that case, having the site provide information about what it supports - rather than the UA supply that information - would seem to provide sufficient completeness, right?

For WebRTC usage, this API is not limited to two parties. The API can describe the send and receive capabilities of the local machine. The app could then exchange this information with the N parties in a conference call setup as part of format negotiation.

samuelweiler · 2021-04-14T13:57:31Z

The fresh look is welcome, but I think the proposed design is not feasible at this time. The MediaCapabilities API is widely implemented and used. We have an opportunity to make additions, improvements, refinements, etc... but we cannot make a breaking change of this magnitude.

Please correct me if I'm wrong, but isn't this the first time the WG has sought the Privacy IG's review of this spec?

chcunningham · 2021-04-14T21:27:16Z

You are correct, this is the first time review has been requested. I accept responsibility for the delay in making the request. This spec was my first time navigating the w3c process.

My aim in the comment about "feasibility" is to provide important background. I'm happy to continue discussion on the merits of various designs.

pes10k · 2021-04-15T02:22:12Z

Just to second @samuelweiler (twice):

I think the substance of Sam's issue is important, given that for some users the values here will be highly identifying for browser fingerprinting (and if the approach Sam is suggesting isn't workable, then other fingerprinting protections are needed in the spec. I appreciate and agree with Sam that the text discussing fingerprinting issues is great, but the spec also needs normative protections against the fingerprinting risk)

I think the process points I read in Sam's comment are important too. The purpose of reviews is to identify privacy risks in specs, and make sure they're addressed before things move to REC. Doubly so when the spec touches on topics called out as needing extra care by TAG Design Principals. I see Sam identifying a place where the current spec doesn't seem to follow the least-power principal the TAG suggests (or align with the fingerprinting risks PING is generally concerned with).

@chcunningham are you saying that the WG isn't interested in moving the spec in a direction more in line with TAG guidance (and reducing fingerprinting risk)? Or that a capability navigation approach (or something else more in line with the TAG principals) sounds good, but would need to be achieved in a different way that has been discussed in the thread so far? Or that its simply too late to make any significant changes (in this respect) at all?

chcunningham · 2021-04-15T04:50:59Z

@chcunningham are you saying that the WG isn't interested in moving the spec in a direction more in line with TAG guidance (and reducing fingerprinting risk)?

No. I am happy to make changes that reduce fingerprinting risk. I think being transparent about feasibility of proposed changes is essential to having a good faith conversation about making improvements.

Or that a capability navigation approach (or something else more in line with the TAG principals) sounds good, but would need to be achieved in a different way that has been discussed in the thread so far?

Feasibility aside, I do not think the capability approach sounds good. I gave a few examples of issues in my earlier comments. IMO those examples demonstrate that the current API does align with the least power principal (more power was needed).

chcunningham · 2021-04-15T04:55:44Z

given that for some users the values here will be highly identifying for browser fingerprinting

Can you elaborate on this? I'd like to explore other mitigations.

mwatson2 · 2021-04-30T18:45:28Z

@samuelweiler wrote:

Couldn't we instead have sites expose available media formats and have browsers (perhaps in a way not exposed the application) pick the one they like best?

At least in our system (Netflix) and I imagine in others, the available media formats varies significantly by title and requires a small amount of work server-side to compute. In our case, also, computing the CDN URLs for the various streams involves a larger amount of server-side work. At the moment we do these tasks in a single network request. These tasks can be done speculatively to a certain extent (when there are signals as to which title might be presented next), but we would not want to waste resources on the CDN calculations for stream formats that are not supported by the device. If they cannot be done speculatively, then doing them in a single request is desirable from a responsiveness point of view, rather than one request to get the available formats and another to get the URLs and other metadata for the chosen format.

If I understand correctly, an API that allows the browser to choose a format from a provided list exposes all the same information about device capabilities, since it could be called repeatedly with different lists, so the privacy advantage of that approach is only that those requests could be rate limited and abuse might then be easier to detect. A site like Netflix, though, would need to call this frequently at first as we drive speculative preparation for titles visible in the gallery, for example, so heavy throttling could have a user experience impact and differentiating between normal usage and abuse may not be so easy anyway.

It should always be possible for privacy-sensitive browsers to monitor whether sites request capability information (in general, not just this API) and then do not go on to use the capabilities detected. Browsers can also choose to advertise only a common baseline capability and offer users the choice to expose more information only when a site actually uses that capability.

chrisn · 2023-12-13T10:44:48Z

Discussed in Media WG meeting 12 December 2023 (minutes). Next step: update our privacy considerations.

mwatson2 · 2024-01-09T23:18:29Z

Sorry, I missed the discussion last month. Happy to help draft text for the streaming case, based on the note above. I can prepare a PR if no one else is doing it.

chrisn · 2024-01-10T15:52:54Z

@mwatson2 Thank you, that would be really helpful. It sounds like a good way to start, then we can also add the WebRTC considerations from @aboba.

Partial fix for #176

samuelweiler added the privacy-needs-resolution Issue the Privacy Group has raised and looks for a response on. label Mar 19, 2021

samuelweiler mentioned this issue Mar 19, 2021

Media Capabilities w3cping/privacy-request#35

Closed

w3cbot mentioned this issue Mar 19, 2021

General approach to capability negotiation w3cping/tracking-issues#197

Open

samuelweiler mentioned this issue Mar 29, 2021

capability negotiation, big picture w3c/charter-media-wg#20

Closed

pes10k mentioned this issue May 5, 2023

Align exposing scalabilityMode with the WebRTC "hardware capabilities" check w3c/webrtc-svc#92

Open

AdamSobieski mentioned this issue Aug 18, 2023

Audio Hardware and Configuration Data #206

Open

aboba mentioned this issue Dec 12, 2023

Move to Candidate Recommendation w3c/webrtc-svc#45

Open

chrisn added this to the V1 milestone Dec 13, 2023

chrisn mentioned this issue Dec 13, 2023

PING: Align exposing scalabilityMode with the WebRTC "hardware capabilities" check #209

Open

aboba added a commit that referenced this issue Jan 15, 2024

Capabillity negotiation model

5af1b34

Partial fix for #176

aboba mentioned this issue Jan 15, 2024

RTC capabillity negotiation #212

Open

mwatson2 added a commit to mwatson2/media-capabilities that referenced this issue Feb 12, 2024

Issue w3c#176: Add comments to privacy section to address issue.

0d4b45f

chrisn mentioned this issue Feb 13, 2024

Issue #176: Add comments to privacy section to address issue. #217

Open

chrisn added the TPAC2024 Topic for discussion at TPAC 2024 label Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

General approach to capability negotiation #176

General approach to capability negotiation #176

samuelweiler commented Mar 19, 2021

chcunningham commented Mar 20, 2021 •

edited

Loading

samuelweiler commented Mar 31, 2021

chcunningham commented Apr 13, 2021

samuelweiler commented Apr 14, 2021

chcunningham commented Apr 14, 2021

pes10k commented Apr 15, 2021

chcunningham commented Apr 15, 2021 •

edited

Loading

chcunningham commented Apr 15, 2021

mwatson2 commented Apr 30, 2021

chrisn commented Dec 13, 2023

mwatson2 commented Jan 9, 2024

chrisn commented Jan 10, 2024

General approach to capability negotiation #176

General approach to capability negotiation #176

Comments

samuelweiler commented Mar 19, 2021

chcunningham commented Mar 20, 2021 • edited Loading

samuelweiler commented Mar 31, 2021

chcunningham commented Apr 13, 2021

samuelweiler commented Apr 14, 2021

chcunningham commented Apr 14, 2021

pes10k commented Apr 15, 2021

chcunningham commented Apr 15, 2021 • edited Loading

chcunningham commented Apr 15, 2021

mwatson2 commented Apr 30, 2021

chrisn commented Dec 13, 2023

mwatson2 commented Jan 9, 2024

chrisn commented Jan 10, 2024

chcunningham commented Mar 20, 2021 •

edited

Loading

chcunningham commented Apr 15, 2021 •

edited

Loading