Anonymize BLM Protest Images

Last update: Oct 13, 2022

Overview

This repository automates @BLMPrivacyBot, a Twitter bot that posts anonymized versions of protest images to help keep protesters safe. Use our interface at blm.stanford.edu.

What's happened? Arrests at protests from public images

Over the past weeks, we have seen an increasing number of arrests at BLM protests. Images circulating around the web enable automatic identification of individuals, and the subsequent arrests hamper protest activity. The main concern is protest images shared on social media.

Numerous applications have emerged in response to this threat that aim to anonymize protest images and enable people to continue protesting in safety. For these to work, the public has to recognize the issue, and an easy and effective method for anonymization has to surface. In an ideal world, platforms like Twitter would enable an on-platform solution.

So what's your goal? AI to help alleviate some of the worst parts of AI

The goal of this work is to leverage our group's knowledge of facial recognition AI to offer the most effective anonymization tool that evades the state of the art in facial recognition technology. AI facial recognition models can still recognize blurred faces. Rather than blurring or pixelating, this work covers faces with an opaque mask, to discourage attempts to recognize or reconstruct them. We use the BLM fist emoji as that mask for solidarity. While posting anonymized images does not delete the originals, we are starting with awareness and hope Twitter and other platforms will offer an on-platform solution (might be a tall order, but one can hope).
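To make the difference between blurring and opaque masking concrete, here is a minimal sketch of the masking step. It is an illustration under stated assumptions, not this repository's code: the image is a plain nested list of RGB tuples, the box coordinates are hypothetical detector output, and a solid fill stands in for the fist emoji. The point is that every pixel inside a box is overwritten, so none of the original face information survives.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom); right/bottom exclusive


def mask_boxes(image: Image, boxes: List[Box], fill: Pixel = (0, 0, 0)) -> Image:
    """Return a copy of `image` with every box overwritten by a solid colour."""
    height = len(image)
    width = len(image[0]) if height else 0
    masked = [row[:] for row in image]  # copy rows so the input is left untouched
    for left, top, right, bottom in boxes:
        # Clamp the box to the image so out-of-range boxes cannot raise.
        for y in range(max(top, 0), min(bottom, height)):
            for x in range(max(left, 0), min(right, width)):
                masked[y][x] = fill
    return masked


# A 4x4 all-white image with one hypothetical face box covering the centre.
white = (255, 255, 255)
image = [[white] * 4 for _ in range(4)]
masked = mask_boxes(image, [(1, 1, 3, 3)])
```

A blur would instead compute each output pixel from its neighbours, which is why some identifying signal can remain after blurring.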

Importantly, this application does not save images. We hope the transparency of this repository will allow for community input. The Twitter bot posts anonymized images based on the Fair Use policy; however, if your image is used and you'd like it to be taken down, we will do our best to do so immediately.

Q&A

How can AI models still recognize blurred faces, even if they cannot reconstruct them perfectly?

Recognition is different from reconstruction. Facial recognition technology can still identify many blurred faces and is better than humans at it. Reconstruction is a much more arduous task (see the difference between discriminative and generative models, if you're curious). Reconstruction has also recently been shown to be heavily biased (see lessons from PULSE). Blurring faces carries the added risk of encouraging certain people or groups to de-anonymize images, either by reconstructing the faces or by directly identifying individuals through recognition.

Do you save my pre-anonymized images?

No. The goal of this tool is to protect your privacy, and saving the images would be antithetical to that. We don’t save any images you give us or any of the anonymized images created by the AI model (sometimes they’re not perfect, so saving them would still not be great!). If you like technical details: the image is passed into the AI model on the cloud, then the output is passed back and displayed directly on your screen as a base64-encoded JPEG.
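As a rough sketch of the last step described above, image bytes can be turned into a base64 data URI entirely in memory, so nothing has to be written to disk before the browser displays it. The function name and the placeholder bytes are assumptions for illustration; this is not the repository's actual handler.

```python
import base64


def to_data_uri(jpeg_bytes: bytes) -> str:
    """Encode in-memory JPEG bytes as a data URI that a browser can render directly."""
    encoded = base64.b64encode(jpeg_bytes).decode("ascii")
    return "data:image/jpeg;base64," + encoded


# Placeholder bytes standing in for the model's JPEG output (not a real image).
fake_jpeg = b"\xff\xd8\xff\xe0 placeholder \xff\xd9"
uri = to_data_uri(fake_jpeg)
```

The resulting string can be used as the `src` of an `<img>` element, which matches the "displayed directly" behaviour described in the answer.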

The bot tweeted my image with the fists on it. Can you take it down?

Yes, absolutely. Please DM the bot or reply directly.

Can you talk a bit more about your AI technical approach?

We build on state-of-the-art crowd counting AI, because it offers huge advantages for anonymizing crowds over traditional facial recognition models. Traditional methods can only find a few faces (fewer than 20, or even fewer than 5) in a single image, whereas a single image of BLM protesters often contains 50 or more people and can contain hundreds or thousands. The model we use in this work has been trained on over 1.2 million people in the open-sourced research dataset QNRF, with crowds ranging from a few people to thousands. False negatives are the worst error in our case, because a missed face is left unmasked. The pretrained model weights come from the LSC-CNN repository that we build on; precisely, they are in a Google Drive folder linked from its README.
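One way to picture why false negatives matter most: if a detector returns a confidence score per head, a stricter score cut-off drops uncertain detections and leaves those people unmasked, while a more lenient cut-off masks them at the cost of a few extra masks on non-faces. The sketch below assumes such a per-detection score and uses made-up values; it is not the LSC-CNN API, and the thresholds are hypothetical.

```python
from typing import List, Tuple

# (x, y, width, height, score); all values here are made up for illustration.
Detection = Tuple[int, int, int, int, float]


def select_for_masking(detections: List[Detection], min_score: float) -> List[Detection]:
    """Keep every detection whose score is at least `min_score`."""
    return [d for d in detections if d[4] >= min_score]


detections = [
    (10, 12, 8, 8, 0.95),
    (40, 15, 6, 6, 0.55),
    (70, 20, 4, 4, 0.20),  # small, low-confidence head at the back of a crowd
]

strict = select_for_masking(detections, min_score=0.5)   # misses the third head
lenient = select_for_masking(detections, min_score=0.1)  # masks all three
```

For anonymization, an unnecessary mask is a cosmetic problem, while a missed head can expose a person, so a lenient setting is the safer default in this sketch.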

Other amazing tools

We would love to showcase other parallel efforts (please propose any we have missed here!). Not only that, if this is not the tool for you, please check these tools out too:

Image Scrubber by @everestpipkin
Censr (iOS and Android app)

And more...

Built by and built on

This work is built by the Stanford Machine Learning Group. We are Krishna Patel, JQ, and Sharon Zhou.

Flask-Postgres Template by @sharonzhou

https://github.com/sharonzhou/flask-postgres-template

Image Uploader by @christianbayer

https://github.com/christianbayer/image-uploader

LSC-CNN by @vlad3996

https://github.com/vlad3996/lsc-cnn

Paper associated with this work:

@article{LSCCNN20,
    Author = {Sam, Deepak Babu and Peri, Skand Vishwanath and Narayanan Sundararaman, Mukuntha and Kamath, Amogh and Babu, R. Venkatesh},
    Title = {Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection},
    Journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
    Year = {2020}
}

Offline mode

See the offline branch to run this work offline using Docker. This awesome code was contributed by @matthiaszimmermann.
