TANDEM in toto

  TANDEM: A Web-Based Text and Image Data Generator   Kelly Blanchat, Jojo Karlin, Stephen Real, Christopher Vitale DH Praxis Spring 2015   ABSTRACT   TANDEM is a Python-based Django web-application that generates text and image data from files submitted by the user. TANDEM is for scholars seeking quantitative insight into a corpus consisting of picture books, comics, advertisements, and other images with overlaid text. The TANDEM application compiles three existing open source technologies: Tesseract OCR, Open Source Computer Vision (OpenCV), and a natural language processing library called Natural Language Toolkit ... Read more

Week 13 Project Update

WEEK 13 TANDEM PROJECT UPDATE: We are happy to announce that the initial version of our near-polished UI is up and functioning on http://dhtandem.com/. This development means that you can now go to the site and walk through uploading files as well as review some early versions of our documentation. Immediate next steps for our team include updating the text on the documentation pages to the more robust things we have patiently waiting in the wings while we finalize the connection of the ... Read more

Tandem Git Repository

The Python Script is available on Github. The repo is here. The core program is Tandem0.2.py in the tandem folder.   Read more
Skip to toolbar