[PATCH] conf/nginx.conf: add example "privacy" log_format

Hans-Christoph Steiner hans at guardianproject.info
Wed Jan 13 11:50:31 UTC 2021



Anton Luka Šijanec:
> Hans-Christoph Steiner <hans at guardianproject.info> @ Wed, 13 Jan 2021 10:27:42 +0100:
>> The standard log_formats store detailed information which falls under
>> data regulations like the EU's GDPR and California's CCPA. This merge
>> request adds a suggested "privacy" log_format that generates logs that
>> cannot be used to identify users. This has been developed and used by
>> Tor Project, Guardian Project, and F-Droid.
> 
> IANAL, so: Are there any exceptions in EU's GDPR that allow short-stored logs of user-identifiable information? That would seem useful, as *some* logging is useful when detecting and reporting fraudalent activities and for detecting spam. Logs are rotated and are sometimes useful when a data breach happens.
> 
> I've also seen some examples of ISPs having to store info, that would be classified as user data, for 6 months for detecting illegal activities. See [1].
> 
> Again, IANAL, but [0] describes some allowances regarding log data. I agree with adding the privacy option, but is that really a must when dealing with EU customers?

Both GDPR and CCPA allow log data to be gathered, stored, and used.  Those are 
regulated though, that means they must be considered when a user requests you 
give them their data, to delete all references to a user, etc.  You must also 
consider the legal definition of "for no longer than is necessary for the 
purposes for which the personal data are processed" in the context of your 
business activities and data you're gathering.  These are all non-trivial.

The goal of the "privacy" log mode is to guarantee that the log files do not 
fall under GPDR/CCPA regulation, but still provide useful information.  Then 
those log files can remain outside of GDPR/CCPA reviews.

IANAL, I am a researcher focused on privacy and metadata.  Those log files do 
not contain Personally Identifying Information (PII) and also do not contain 
enough info to identify someone.  They might contain enough data to identify 
someone in combination with other large data sets, like all of a user's browsing 
data.

.hc

-- 
PGP fingerprint: EE66 20C7 136B 0D2C 456C  0A4D E9E2 8DEA 00AA 5556
https://pgp.mit.edu/pks/lookup?op=vindex&search=0xE9E28DEA00AA5556


More information about the nginx-devel mailing list