corpus

By Li Yi, 29 August, 2018
Author
Language
Year
University
Record Status
Abstract (in English)

In his 1966 essay “Rhétorique et enseignement,” Gérard Genette observes that literary studies did not always emphasize the reading of texts. Before the end of the nineteenth century, the study of literature revolved around the art of writing. Texts were not objects to interpret but models to imitate. The study of literature emphasized elocutio, or style and the arrangement of words. With the rise of literary history, academic reading approached texts as objects to be explained. Students learned to read in order to write essays (dissertations) where they analyzed texts according to prescribed methods. This new way of studying literature stressed dispositio, or the organization of ideas. Recent developments in information technology have further challenged paradigms for reading literature. Digital tools and resources allow for the study of large collections of texts using quantitative methods. Various computational methods of distant as well as close reading facilitate investigations into fundamental questions of the possibilities for literary creation. Technology has the potential for exploring inventio, or the finding of ideas that can be expressed through writing. One possibility is the Word Vector Topic Generator (https://github.com/mbwolff/WVTG), a Python script that makes use of vector space models of words. These models represent relationships between words from a defined corpus in spatial terms and can be used to calculate semantic similarities and differences. With a corpus and a given text it is possible to generate a new text according to how language was used within the corpus. Considered as an algorithmic topos in the Aristotelian sense, the WVTG instantiates an opposition to the thesis of an asserted text through analogy. Three inputs are required: the asserted text, the corpus from which a vector space model of words is derived, and a pair of words establishing an analogy for substitutions in the text. For instance, a corpus of 117 texts by Honoré de Balzac produces a vector space model of words that can generate a new text from Charles Baudelaire’s prose poem Enivrez-vous! by replacing each word in the poem with a word in the vector space model that best completes an analogy from the opposition bénir/maudire as expressed in Balzac’s writing. The code allows a user to easily experiment with different corpora and analogies to generate different texts. Unlike a traditional notion of invention positing that arguments to persuade an audience are discoverable within a shared and uncontested discursive space, the algorithmic invention of WVTG parameterizes both the discursive space and the relationships between words. Rhetorical invention such as this explores the potentiality of language as members of the Oulipo have done with techniques such as Jean Lescure’s S+7 method, Marcel Bénabou’s aphorism formulas and the ALAMO’s rimbaudelaire poems. The WVTG implements analytical tools from the digital humanities as a means for creating e-literature. With technology we can explore not only how something was written and why it was written, but also what was possible to write given a historical linguistic context. 

Description in original language
Description (in English)

Simon Biggs with Mark Shovman developed a virtual interactive artwork in response to a commission of the Poetry Beyond Text project. Tower is inspired by the story of the Tower of Babel. Inter-subjective relations are central to this work, which evokes the idea of first-, second- and third-person perspectives. Tower is an interactive work which creates an immersive 3D textual environment combining visualisation, physical interaction, speech recognition and predictive text algorithms. Viewers (or inter-actors) occupy one of three roles: as central inter-actor, wearing a VR head-display, as one of several inter-actors, wearing 3D spectacles, or as spectator, standing outside the interactive zone. The central inter-actor is located at the vertiginous pinnacle of a virtual spiral word structure. When the inter-actor speaks their spoken words appear to float from their mouth and join the spiralling history of previously spoken words. As the uttered word emerges other words, predicted on the basis of statistical frequency within a textual corpus, spring from the spoken word. The second-person inter-actors see words appearing from the first-person inter-actor's mouth and the spiral gradually growing, with the first-person inter-actor at its pinnacle, while the third-person observers stand outside the interactive zone, observing the tableau. As it grows the spiral comes to resemble a Tower of Babel composed of words, spoken and potential. (Source: Poetry Beyond Text)

Technical notes

Tower requires a high powered gaming type PC with multi-screen capability, 3D stereo projection (two instances of), matching 3D spectacles, a head mounted display system, spatial tracking system, a voice recognition optimised wireless microphone and Windows 7 or XP.