Port has been successfully running in production since 2020, and has provided the software for dozens of researchers to undertake privacy-preserving data donation studies. In Port, the data extraction happens in a respondent's own browser: we run Python on the page with pyodide to pull out only the parts that are relevant for a given study so the original file never leaves their device. Respondents can look at the rows and choose to remove ones that are sensitive.
The in-the-browser constraint is critical: it's local computation at the moment of collection, and it's a pattern we should expect to see more of in statistics as we seek more privacy and bigger data. As a consequence, we need better ways to scale down our methods and more insight into the impact.
ReferenceBoeschoten, L., de Schipper, N. C., Mendrik, A. M., van der Veen, E., Struminskaya, B., Janssen, H., & Araujo, T. Port: A software tool for digital data donation. Journal of Open Source Software, 8(90), 5596 (2023). doi:10.21105/joss.05596 ↗
Request
A participant makes a GDPR-backed request for their data.
Extract, locally
Extraction runs in their own browser, with the full export parsed entirely on their own machine.
Review & delete
They see their actual data, row by row, and delete what they'd rather not share before agreeing to the donation.
Your comments
Comments you've left on TikTok videos.
| Comment | Date |
|---|