data-collection engine

Description (in English)

Missed Connections is a 2-Channel internet-aware software piece that continuously fetches the latest posts in the "missed connections" section of Craigslist.org. Each post is presented one at a time, and is filtered by looking for so-called stopwords. Computer Scientists define stopwords as those words that do not convey the meaning of a message. In essence, they are considered signal noise in the stream of potential information. Each post is presented simultaneously in two ways: one just with stopwords, the other with non-stopwords, and in both cases the filtered words are displayed as dashed lines, akin to the way words are presented in the game Hangman. Thus, both posts present the same "graphical" structure, but have the potential for very different readings. Source: work description at author's website

Multimedia
Remote video URL
Technical notes

The piece uses Craigslist's RSS feature to obtain new feeds to add to the XML database that the software uses. Once new feeds are obtained, a screen scraping routine is employed to obtain the full text of the post. The software operates in real time, but it keeps a cache of posts to cycle through. This cache is periodically flushed, its period determined by the number of missed connections posts that the program obtains in a day. Like many of my other pieces, Missed Connections was developed in Java. XML reading and writing was made possible via JDOM and the RSS component used ROME for parsing the feeds.

Type
Language
Year
Record Status
Abstract (in English)

Digital Humanities in Practice: Project Work on Developing a Scholarly Database of Electronic Literature

Students work with scholars on a current international research project "Electronic Literature as a Model for Creativity in Practice" (ELMCIP) in particular working on the development and editing of the Knowledge Base for Electronic Literature. The Knowledge Base is a scholarly, relational database programmed in Drupal that documents works, events and actors in the field of electronic literature. In addition to participating in practical project-based work with an established team of senior and junior researchers, students read scholarship on digital humanities as a field and explore and read articles related to the digital humanities.

In individual projects, students develop expertise in a particular field of research in e-lit. In that respect, the course offers students ways to create interpretative frameworks for a specific set of data and trains students in adapting "digital methods" critically. To be agreed upon with individual students skillsets and interests, practices in the course include:

  • reflective editing and documentation: researching, writing, and editing entries about electronic literature in the Knowledge Base
  • development: working on the Drupal backend to the Knowledge Base in collaboration with other project team members, either conceptually or taking part in the programming according to the students prior skills
  • web design and user interface development
  • project planning and implementation; team work and collaboration in academia

After completing the course, students will have assessed the usefulness of a range of digital humanities strategies in specific scholarly work, have experience in discussing organizational and design choices in developing a scholarly database, and have investigated in the community of electronic literature.

 Note: The complete reading list appears in the attached syllabus.

Database or Archive Referenced