conspiracy

machine and human apophenia

created for The New Inquiry's Conspiracy issue

view it live here

setup

before installing requirements, install dlib dependencies:

sudo apt-get install build-essential cmake libgtk-3-dev libboost-all-dev -y

to set up darkflow (for the yolo object recognition model):

git clone git@github.com:thtrieu/darkflow.git
cd darkflow
python3 setup.py build_ext --inplace
pip install -e .

the model weights are linked to in that repo.

facenet.py is from https://github.com/davidsandberg/facenet. the model weights are linked to in that repo as well.

phantomjs is needed to screenshot websites. download and install it with:

wget -O phantomjs.tar.bz2 https://bitbucket.org/ariya/phantomjs/downloads/phantomjs-2.1.1-linux-x86_64.tar.bz2
tar jxf phantomjs.tar.bz2
sudo mv phantomjs-2.1.1-linux-x86_64/bin/phantomjs /usr/local/bin/

tesseract for OCR:

sudo apt-get install tesseract-ocr
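
once tesseract is installed, it can be called from python through its command line interface. the snippet below is a minimal sketch, not the code this project uses: the function names `ocr_command` and `ocr` are hypothetical, and it assumes the `tesseract` binary is on your PATH with the `eng` language data available.

```python
import subprocess


def ocr_command(image_path, lang="eng"):
    """Build the tesseract command line for one image (hypothetical helper)."""
    # passing `stdout` as the output base makes tesseract print the text
    # instead of writing it to a .txt file
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr(image_path, lang="eng"):
    """Run tesseract on an image and return the recognized text."""
    result = subprocess.run(
        ocr_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract fails
    )
    return result.stdout.strip()
```

calling `ocr("screenshot.png")` would return whatever text tesseract finds in that (hypothetical) file.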

many images are sourced from wikimedia commons. you can grab the latest dump here: https://dumps.wikimedia.org/commonswiki/latest/commonswiki-latest-image.sql.gz. extract it to assets/commonswiki-latest-image.sql, then run parse_images_dump.py from that directory to build image urls from the dump. this will take a long time.

note that parse_images_dump.py only builds the urls; it doesn't download the images. you can download a random sample of these commons urls like so:

from images import sampler
sampler.fetch_sample(100)
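
for reference, here is a sketch of how a commons url can be built from a file name in the dump. this is an assumption about what the url-building step involves, not a copy of parse_images_dump.py: the function name `commons_url` is hypothetical. it relies on the standard wikimedia upload layout, where files are sharded into directories named after the first one and two hex characters of the md5 of the file name.

```python
import hashlib
from urllib.parse import quote

COMMONS_BASE = "https://upload.wikimedia.org/wikipedia/commons"


def commons_url(img_name):
    """Build an upload.wikimedia.org url for a commons file name (hypothetical helper)."""
    # file names are stored with underscores in place of spaces
    name = img_name.replace(" ", "_")
    # the directory shards come from the md5 hex digest of the name:
    # <first char>/<first two chars>/<name>
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    # percent-encode the name so non-ascii titles produce a valid url
    return f"{COMMONS_BASE}/{digest[0]}/{digest[:2]}/{quote(name)}"
```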

this program expects to be registered as a listener for reality, which creates a FIFO file (fifo) that main.py polls for new articles to process. run listen.py to listen to the FIFO queue and automatically run object/face detection on new images.
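
as a rough illustration of the FIFO side, the sketch below reads items from a named pipe. it is not the project's own listener: the function name `read_fifo_lines` is hypothetical, and the one-item-per-line message format is an assumption. it uses `os.mkfifo`, so it only works on unix-like systems.

```python
import os


def read_fifo_lines(path="fifo"):
    """Yield non-empty lines written to a named pipe (hypothetical helper)."""
    # create the pipe if the writer hasn't already done so
    if not os.path.exists(path):
        os.mkfifo(path)
    while True:
        # open() blocks until some process opens the pipe for writing
        with open(path) as fifo:
            for line in fifo:
                line = line.strip()
                if line:
                    yield line
        # the writer closed its end; loop around to wait for the next one
```

iterating with `for item in read_fifo_lines("fifo"): ...` blocks until something is written to the pipe, so no sleep-based polling loop is needed.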

conspire

to generate a conspiracy, run generate.py.

you can view the latest conspiracy by running an HTTP server from public/, e.g.:

cd public
python3 -m http.server