Proposal: Include certificate fingerprint(s) in NEL reports #127

ivanr · 2020-05-15T17:05:56Z

I would like to propose that a certificate hash/fingerprint is included with reports that indicate TLS failures as well as successful reports. When there is a problem, knowing the hash would be very helpful to assist with troubleshooting. In fact, it would be great if either the entire chain is included, or at least the hashes of the entire chain. Given CT, the hashes are sufficient to rebuild the chain in most cases.

Identifying leaf certificate by fingerprint would be most helpful to assist preventing certificate expiration when success sampling is enabled. Although one can monitor from multiple vantage points, catching all servers is impossible with complex infrastructures.

The proposed workflow is as follows: 1) success sampling is enabled; 2) reporting endpoints examine TLS fingerprints to detect expiring certificates that have not been replaced and 3) alert the administrators to take action.

neilstuartcraig · 2020-05-19T15:47:58Z

+1 from me. I'd really like to know the full chain, fingerprints or certs.

By way of illustration, we had an issue with a small percentage of our users (those on Android 6 and older IIRC) whereby they couldn't use our website. They saw the full page interstitial showing a TLS error and we received NEL reports of "TLS cert authority invalid". After a few days of digging, we found that one of our edge services had a cert which had been replaced but without the "extended support" cert chain (i.e. the ultimate root was too new for Android 6 and older). The fix was simple, we added 2 certs to our chain.
If we'd have received the cert chain in the reports, we'd have seen immediately what the issue was.

This would also be useful where 3rd parties (e.g. CDN's, cloud/*aaS vendors) manage the TLS on our behalf. If they replace certs and it causes problems, we'd be able to identify that easily without interaction with them.

We would also be able to use this to distinguish errors caused by our certs versus those caused by AV/MitM TLS proxies.

Also, picking up on the idea for monitoring cert expiration. If NEL had a severity (as per #125 or similar), this could be an event with severity of warn when a cert was encountered with e.g. < 30 days validity.

ivanr · 2020-05-22T09:42:42Z

@neilstuartcraig If you wanted NEL to assign severity to expiring certificates, you'd have to tell it how to recognise them. There's a trend toward using short-lived certificates that might have lifetimes of as little as a week. Granted, not many people use them today, but it's likely that the trend will continue in the long term. Additionally, it's easy to achieve what you want via a smarter monitoring component.

neilstuartcraig · 2020-05-26T11:35:54Z

Yeah that's (shorter lifetime certs) true, i guess it's a question of how long it's likely to take to get there.
I think it's worth considering using NEL as a reporting mechanism for expiration for the same sort of reason that HTTP errors are included in NEL - not all devs/operators have access to other methods (this argument was made in #126 (comment))

sleevi · 2021-05-11T17:00:33Z

I think this is unlikely to be implemented for Chrome. We've consistently rejected such efforts for privacy and fingerprintability reasons. It's a giant fingerprint whether or not a user is, say, behind a proxy or MITM if the fingerprint diverges from what the server expects. We considered this during the design, and made an intentional decision not to expose.

ivanr · 2021-05-19T18:50:46Z

@sleevi You know that threat model better, but would it make any difference if Chrome would support this type of reporting only for certificates issued from public roots? That would still enable owners to be alerted about expiring certificates with much reduced (although probably not 100% eliminated) opportunity for fingerprinting. I think with this change an operator would at best be able to tell if a client is behind some sort of proxy, but that's about it.

sleevi · 2021-05-19T18:52:01Z

@ivanr No, because that effectively reveals when the user is using a proxy/local interception agent, which is a fingerprinting datapoint itself.

ivanr · 2021-05-19T19:32:52Z

@sleevi I am not sure you can completely avoid fingerprinting, though. For example, I suspect it would be easier to detect interception agents by looking at the TLS handshake parameters and other TLS aspects.

Is Chrome's threat model publicly documented? I'd be curious to learn more about what you're protecting against exactly.

sleevi mentioned this issue May 11, 2021

Consider removing nextHopProtocol as it may expose whether visitor is using VPN / proxy w3c/resource-timing#221

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: Include certificate fingerprint(s) in NEL reports #127

Proposal: Include certificate fingerprint(s) in NEL reports #127

ivanr commented May 15, 2020

neilstuartcraig commented May 19, 2020

ivanr commented May 22, 2020

neilstuartcraig commented May 26, 2020

sleevi commented May 11, 2021

ivanr commented May 19, 2021

sleevi commented May 19, 2021

ivanr commented May 19, 2021

Proposal: Include certificate fingerprint(s) in NEL reports #127

Proposal: Include certificate fingerprint(s) in NEL reports #127

Comments

ivanr commented May 15, 2020

neilstuartcraig commented May 19, 2020

ivanr commented May 22, 2020

neilstuartcraig commented May 26, 2020

sleevi commented May 11, 2021

ivanr commented May 19, 2021

sleevi commented May 19, 2021

ivanr commented May 19, 2021