
Why use visualizations to study poetry?

[Note: This post was a DHNow Editor’s Choice on May 1, 2012.]

My current research uses visualizations to show latent patterns that computational tools, such as topic modeling, can detect in a set of poems. In particular, I'm looking at poetry that takes visual art as its subject, a genre called ekphrasis, in an attempt to distinguish the types of language poets tend to invoke when creating a verbal art that responds to a visual one. Studying words' relationships to images, and then creating more images to represent those patterns, calls to mind a longstanding contest between modes of representation: which one represents information "better"? Since my research is dedicated to revealing the potential for collaborative and kindred relationships between modes of representation historically seen as competing with one another, using images to further demonstrate patterns of language might be seen as counter-productive. Why use images to make literary arguments? Do images tell us something "new" that words cannot?

Without answering that question, I’d like instead to present an instance of when using images (visualizations of data) to “see” language led to an improved understanding of the kinds of questions we might ask and the types of answers we might want to look for that wouldn’t have been possible had we not seen them differently—through graphical array.

Currently, I'm using a tool called MALLET to create a model of the possible "topics" found in a set of 276 ekphrastic poems. There are already several excellent explanations of what topic modeling is and how it works (many thanks to Matt Jockers, Ted Underwood, and Scott Weingart, who posted these explanations with humanists in mind), so I'm not going to spend time explaining what the tool does here. I will say, however, that working with a set of 276 poems is atypical: topic modeling was designed to work on millions of words, and 276 poems doesn't even come close. Part of the project has been to determine a threshold at which we can get meaningful results from a small dataset, so this particular experiment is playing with the lower thresholds of the tool's usefulness.
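For anyone who wants to follow along, here is a minimal sketch of the workflow, driving MALLET's standard import-dir and train-topics commands from Python. The paths and directory names are hypothetical placeholders for my own setup:

```python
# Minimal sketch: drive MALLET from Python via its command-line interface.
# Paths are hypothetical placeholders; adjust to your installation.
import subprocess

MALLET = "/path/to/mallet/bin/mallet"

# 1. Import a directory of plain-text poems (one poem per file).
subprocess.run([
    MALLET, "import-dir",
    "--input", "poems/",
    "--output", "poems.mallet",
    "--keep-sequence",        # required by train-topics
    "--remove-stopwords",
], check=True)

# 2. Train the model; --num-topics is the knob I keep adjusting.
subprocess.run([
    MALLET, "train-topics",
    "--input", "poems.mallet",
    "--num-topics", "15",
    "--output-topic-keys", "topic_keys.txt",   # keywords per topic
    "--output-doc-topics", "doc_topics.txt",   # topic proportions per poem
], check=True)
```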

When you run a topic model (train-topics) in MALLET, you tell the program how many topics to create, and when the model runs, it can output a variety of results. As part of the tinkering process, I've been varying the number of topics MALLET uses to generate the model, and I was just about to despair that the real tests I wanted to run wouldn't be possible at 276 poems. Perhaps it was just too few poems to find recognizable patterns. For each topic, MALLET assigns an ID number and a set of "topic keys," the keywords that characterize that topic. Usually, when the topic model is working, the results are "readable" because they represent similar language. MALLET would not call a topic "Sea," for example, but might instead provide the following keywords:

blue, water, waves, sea, surface, turn, green, ship, sail, sailor, drown

The researcher would look at those terms and think, "Oh, clearly that's a nautical/sea/sailing topic," and dub it as such. My results for 15 topics over 276 poems, however, were not readable in the same way. For example, topic 3 included the following topic keys:

3          0.04026           with self portrait him god how made shape give thing centuries image more world dread he lands down back protest shaped dream upon will rulers lords slave gazes hoe future

I don't blame you if you don't see the pattern there. I didn't. Except, well, knowing some of the poems in the set pretty well, I know that it put together "Landscape with the Fall of Icarus" by W.C. Williams with "The Poem of Jacobus Sadoletus on the Statue of Laocoon" with "The New Colossus" with "The Man with the Hoe Written after Seeing the Painting by Millet." I could see that we had lots of kinds of gods represented, farming, and statues, but that's only because I knew the poems. Without topic modeling, I might have put this category together as a "masters" grouping, but it's not likely. Rather than look for connections, I was focused on the fact that the topic keys didn't make a strong case for being placed together, and other categories seemed similarly opaque. However, just to be sure that I could, in fact, visualize the results of future tests, I went ahead and imported the topic associations by file. In other words, MALLET can also produce a file that lists each topic (0–14 in this case) with each file name in the dataset and a percentage representing the degree to which the topic is present inside each file. I imported this MALLET output into Google Fusion Tables and created a dynamic bar graph with file-ids along the vertical axis and, along the horizontal axis, the degree to which a given topic (in this case topic 3) is present in each file. As I clicked through each topic's graph, I figured I was seeing results that demonstrated MALLET's confusion, since the dataset was so small. But then I saw this: [Below should be a Google Visualization. You may need to "refresh" your browser page to see it. If you still cannot see it, a static version of the file is visible here.]
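For anyone following along at the command line: the file I imported is the output of MALLET's --output-doc-topics option, and it can be parsed and charted without Fusion Tables. A rough sketch, assuming the older doc-topics format in which each row lists a document id and file name followed by alternating topic-id/proportion pairs (newer versions of MALLET instead print one proportion per topic, in topic order); the file name is a hypothetical placeholder:

```python
# Sketch: read MALLET's --output-doc-topics file and chart one topic.
# Assumes the older format: doc-id, filename, then topic/proportion pairs.
import matplotlib.pyplot as plt

def topic_proportions(path, topic_id):
    """Map each file name to its proportion of the given topic."""
    props = {}
    with open(path) as f:
        for line in f:
            if line.startswith("#"):       # skip the header line
                continue
            fields = line.split()
            name, pairs = fields[1], fields[2:]
            for topic, prop in zip(pairs[0::2], pairs[1::2]):
                if int(topic) == topic_id:
                    props[name] = float(prop)
    return props

props = topic_proportions("doc_topics.txt", topic_id=3)
names = sorted(props, key=props.get)
plt.barh(names, [props[n] for n in names])
plt.xlabel("Proportion of topic 3")
plt.tight_layout()
plt.show()
```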

If the graph's visualization is working, passing your mouse over the bars higher than 0.4 reveals the file-id number (a random number assigned during the course of preparing the data). Each of these files begins with the same prefix: GS. In my dataset, that means the files with the highest representation of topic 3 can all be found in John Hollander's collection The Gazer's Spirit. This anthology is considered one of the most authoritative and diverse, beginning with classical ekphrasis and running all the way up to and including poems from the 1980s and 1990s. Given the disparity in time periods, I had expected the poems from this collection to be the most difficult to group together, because the diction of the poems changes dramatically from the beginning of the volume to the end. In other words, I would have expected these poems to blend with the other ekphrastic poems throughout the dataset on the basis of similar diction more than anything else. MALLET has no way of knowing that these files are included in the same anthology; all of the bibliographical information about the poems has been stripped from the text being tested. There has to be something else. What that something else might be requires another layer of interpretation. I will need to return to the topic model to see if a similar pattern is present when I use other numbers of topics, or if I add some non-ekphrastic poems to the set being tested, but seeing the affinity in language among the poems included in The Gazer's Spirit, in contrast to other ekphrastic poems, proved useful. Now I'm not inclined to throw the whole test away, but instead to perform more tests to see if this pattern emerges again in other circumstances. I'm not at square one. I'm at a square 2 that I didn't expect.
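Checking that hunch programmatically is a few lines on top of the previous sketch, reusing the hypothetical topic_proportions helper defined there:

```python
# Sketch: which files exceed the 0.4 threshold for topic 3, and what
# filename prefixes do they share? Reuses topic_proportions() above.
from collections import Counter

props = topic_proportions("doc_topics.txt", topic_id=3)
high = {name: p for name, p in props.items() if p > 0.4}

# My file names encode the source anthology as a prefix, e.g. "GS"
# for The Gazer's Spirit; tally prefixes among high-scoring poems.
prefixes = Counter(name[:2] for name in high)
print(prefixes)                      # hypothetical output: Counter({'GS': ...})
for name, p in sorted(high.items(), key=lambda kv: -kv[1]):
    print(f"{name}\t{p:.3f}")
```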

The visualization in the end didn't produce "new knowledge." It isn't hard to imagine that an editor would choose poems that construct a particular argument about what "best" represents a particular genre of poetry; however, if these poems truly represented the diversity of ekphrastic verse, wouldn't we see other poems also highly associated with a "Gazer's Spirit topic"? What makes these poems stand out so clearly from others of their kind? Might their similarity mark a reason why critics of the '90s and 2000s define the tropes, canons, and traditions of ekphrasis in a particular vein? I'm now returning to the test and to the texts to see what answers might exist there that I and others have missed as close readers. Could we, for instance, run an analysis that determines how closely other kinds of ekphrasis are associated with The Gazer's Spirit's definition of ekphrasis? Is it possible that poetry by male poets is more frequently associated with that strain of ekphrastic discourse than poetry by female poets?

This particular visualization doesn't make an "argument" in the way humanists are accustomed to making them. It doesn't necessarily produce anything wholly "new" that couldn't have been discovered some other way; however, it did help me get past a particular kind of blindness and see alternatives—to consider what has been missed along the way—and there is, and will be, something new in that.

Chunks, Topics, and Themes in LDA

[NB: This post is the continuation of a conversation begun on Ted Underwood's blog under the post "A touching detail produced by LDA"—in which he demonstrates an overlap between the works of the Shelley/Godwin family and a topic that includes the terms mind / heart / felt. Rather than hijack his post, I'm responding here to questions having to do more with process than content; however, to understand fully the genesis of this conversation, I encourage you to read Ted's post and the comments there first.]

Ted-

I appreciate your response because it is making me think carefully about what I understand LDA "topics" to represent. I'm not sure that I'm on board with thinking of topics in terms of discourse or necessarily "ways" of writing. Honestly, I'm not trying to be difficult here; rather, I'm trying to parse for myself what I mean when I talk about my expectation that particular terms "should" form the basis for a highly probable topic. It seems to me that what one wants from topic modeling is lexical themes—in other words, lexical trends over the course of particular chunks of text. I'm taking to heart here Matt Jockers's recent post on the LDA buffet, in which he articulates the assumption that LDA analysis makes: that the world is composed of a certain number of topics (and in MALLET, we specify that number when we run the topic-modeling application). As a result, when I run a topic model analysis in MALLET, I am looking at the way graphemes (because the written symbol, of course, is divorced from its meaning) relate to other similar graphemes. So, though topics may not have a one-to-one semantic relationship with particular volumes as the "main topic" or "supporting topics," one might reasonably expect that a text with a 90% probability of including a list of graphemes from an LDA topic lexicon (for lack of a better word) would correspondingly address a thematic topic that depends heavily on a closely related vocabulary. Similarly, the frequent use of words in a topic lexicon increases the probability that the LDA topic, through the repetition of those words, carries semantic weight—though the degree to which this is the case wouldn't likely be determined by that initial topic probability.

I'm chasing the rabbit down a hole here, but I do so for the purpose of agreeing with your earlier claim that the kinds of results we get, their reliability, and their usefulness are largely determined by the kinds of questions we're asking in the first place. I agree that when we use LDA to describe texts, that's fundamentally different from using it to test assumptions/expectations. In my research, I have attempted to draw very clear distinctions between when I am testing assumptions about the kinds of language that dominate a particular genre of poetry and when I am using LDA to generate a list of potential word groups that could then be used to describe poetic trends. I see those as two very different projects. When I'm working with poetry, and specifically with ekphrasis, I am testing what people who write about this particular genre assume to be true: that the word "still," or variations of it, will be one of the most commonly used words across all ekphrastic texts, used at a higher rate than in any other genre of poetry. It's true that the word "still" could be a semantic topic in many other kinds of poetry; however, what we're trying to get at is whether a group of words closely allied with "still" will be the most dominant and recurring trend across all ekphrastic verse. The next determination to be made, then, is whether or not that discovery carries semantic weight. If "still," "stillness," "death," "breathless," etc. are not actually a dominant trend, have we overstated the case?
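To make that concrete: the assumption-testing half of the comparison doesn't even require LDA; a simple relative-frequency count will do. A minimal sketch, assuming the poems live as plain-text files in two hypothetical directories (one ekphrastic, one not), with the word list as a stand-in:

```python
# Sketch: is "still" (and allied words) used at a higher rate in
# ekphrastic poems than in other poems? Paths are hypothetical.
import os, re

LEXICON = {"still", "stillness", "death", "breathless"}

def rate(directory):
    """Occurrences of lexicon words per 1,000 tokens across a directory."""
    hits, total = 0, 0
    for fname in os.listdir(directory):
        with open(os.path.join(directory, fname)) as f:
            tokens = re.findall(r"[a-z']+", f.read().lower())
        total += len(tokens)
        hits += sum(1 for t in tokens if t in LEXICON)
    return 1000 * hits / total

print("ekphrastic:", rate("poems/ekphrastic"))
print("other:     ", rate("poems/other"))
```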

It seems that what you're saying (and please intervene if I'm not articulating this correctly), and what I tend to agree with, is that "chunk size" should be determined by the questions being asked, and that stating the way in which the data has been chunked signals the types of results we want to get in return. Taking this into consideration has certainly helped the way I position what I'm doing. For me it is significant to chunk at the level of the individual poem; however, were I to change my question to something like, "Which poets trend more toward ekphrastic topics than others?", that question would seem to require chunking by volume rather than by individual poem.
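In code, the difference between the two chunkings is nothing more than a decision about what gets written to disk as one document before MALLET's import step. A minimal sketch, with hypothetical directory names (one subdirectory per volume, one file per poem); chunking by poem needs no work at all, since each poem is already its own file:

```python
# Sketch: chunk by volume by concatenating each volume's poems
# into a single document. Paths are hypothetical placeholders.
import os

def chunk_by_volume(volume_dirs, out_dir):
    """Write one concatenated document per volume into out_dir."""
    os.makedirs(out_dir, exist_ok=True)
    for vol in volume_dirs:
        parts = []
        for poem in sorted(os.listdir(vol)):
            with open(os.path.join(vol, poem)) as f:
                parts.append(f.read())
        out_path = os.path.join(out_dir, os.path.basename(vol) + ".txt")
        with open(out_path, "w") as f:
            f.write("\n".join(parts))

chunk_by_volume(["corpus/gazers_spirit", "corpus/another_volume"], "by_volume/")
```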

In other news, test models on the whole 4,500 poems in my dataset, chunked at the level of the individual poem, yielded much more promising initial results than we thought we would get. I would guess that it has something to do with the number of topics we assign when we run the model, and maybe one of the other ways forward is to talk about the threshold number of topics we need to assign in order to garner meaningful results from the model. (Obviously, people like Matt and Travis have hands-on experience with this; however, I'm wondering if the type of question we're asking should have a definable impact on how many topics we generate for the different types of tests….) Hopefully, in the near future I'll be able to share some of those very preliminary results… but I'm still in the midst of refining my queries and configuring my data.

Again, I'm engaged because I find what you're doing both relevant and useful, and I think that having these mid-investigation conversations does help to inform the way ahead. As you mention, perhaps many of these kinds of questions are answered in Matt Jockers's book, but it is unlikely I'll be able to make use of it before this first iteration of my project is done in the next month or two. I believe that hearing anecdotal conversation about the low-level kinds of tests people are playing with really does help others along in their own work, since we're still figuring out what exactly we can do with this tool.


Preparing texts for network visualization

When I presented at MSA 13 earlier this month, I was unsatisfied with my methods for creating network visualizations of texts. I knew that automatic preprocessing would not work yet, since I have yet to identify precisely how I want to designate nodes across larger bodies of poems. What I've been looking for is a way to mark texts up descriptively, using some form of markup language (XML, TEI), that would be uniform enough to render data that could be meaningfully displayed, and then to find a visualization software package with an algorithm that would "work" the way I wanted it to. The problem, of course, is that when you're a rogue DH scholar out in the world borrowing tools and using whatever tends to fall your way, you're not going to be sure about how each tool works (unless you have a CS or social science degree that includes learning about network algorithms, which I do not have), and this is going to detract from the validity of how and what you say about your object of study. On the flip side, tools and text analysis software are becoming more widely available, and so doing what I've done—which is to say, Googling "discourse network tool" and finding Philip Leifeld's Discourse Network Analyzer (DNA)—is actually possible.

What is remarkable about how DNA, a GUI text-processing application, works is that it is designed as an interpretive tool for marking texts up in XML so that they can be displayed using free network visualization software such as Visone, Ucinet, or Netdraw. The designed purpose of Leifeld's DNA software is to collect articles on a topic area and to use those articles to create network visualizations of agreement and disagreement between individuals and groups. For example, the sample dataset used for a tutorial on the software comes from someone at the University of Maryland named Dana R. Fisher (I have no idea who she is… but I'm definitely going to look her up!), who marked up articles, testimony, and other texts about climate change. Essentially, she could input each text into the DNA software, create a basic XML document with very minimal encoding (document type, author, dates, title), and then use DNA to select portions of text that create a "statement" about climate change. By tagging the speaker, the organization the speaker is affiliated with, the content type (a restricted list of terms created by the user to describe the topic being discussed), and whether or not the speaker agreed or disagreed with the topic, she could create networks of statements made about climate change that also included the individuals involved in the climate change debate and their organizations. Such a visualization helps us understand how much any two groups (say, the Senate and the EPA) agree with one another, identify the issues on which they agree and disagree, and also understand affiliations (which speakers are affiliated with which sides of the climate change debate).
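To make the underlying data model concrete: as I understand it, every tagged statement boils down to a speaker, an organization, a category, and an agree/disagree flag, and the network falls out of co-occurrence. The sketch below is my own reconstruction of that idea in Python using the networkx library, not DNA's actual code; the statements are invented examples, and GraphML is just one output format that visualization tools such as Visone can read.

```python
# Sketch of the statement data model behind a discourse network:
# (speaker, organization, category, agrees). Rows are invented examples;
# this reconstructs the idea, not DNA's implementation.
import networkx as nx

statements = [
    ("Speaker A", "Senate",         "emissions caps", True),
    ("Speaker B", "EPA",            "emissions caps", True),
    ("Speaker C", "Industry Group", "emissions caps", False),
]

# Connect speakers who take the same position on the same category.
G = nx.Graph()
for s1, org1, cat1, pos1 in statements:
    G.add_node(s1, organization=org1)
    for s2, org2, cat2, pos2 in statements:
        if s1 < s2 and cat1 == cat2 and pos1 == pos2:
            G.add_edge(s1, s2, category=cat1, agreement=pos1)

nx.write_graphml(G, "discourse_network.graphml")  # readable by e.g. Visone
```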

This isn't *exactly* what I had in mind, but it's really darn close. The power of this particular piece of software is that I can be in charge of what constitutes an article (a poem), what constitutes a speaker (the poetic speaker, the author, the third-person omniscient… all of them), and the "content" to be described. Granted, the "organization" classification is less helpful to me, but in the instance of "The Venus Hottentot (1825)" I could use this feature to differentiate the speakers of the poem's first section from those of the second. Using the software this way does not begin to utilize its real power, which is to read topics and speakers across large corpora of texts in similar ways. For now, I'm looking at one poem; however, I could see, in the future, taking this poem and situating it in a larger public discourse about black female subjectivity. I could import, for example, Sander Gilman's article "Black Bodies, White Bodies: Toward an Iconography of Female Sexuality in Late Nineteenth-Century Art, Medicine, and Literature," which we know Elizabeth Alexander read before writing the poem. We could also bring in articles by Sadiah Qureshi on "Displaying Sara Baartman" or Terri Francis's "I and I: Elizabeth Alexander's Collective First-Person Voice, the Witness and the Lure of Amnesia," or chapters from Deborah Willis's Black Venus 2010, and demonstrate how Alexander's poem participates in a larger act of social recovery.

There are, as with any tool, limitations. So far, the only way to create the visualizations is by using the speakers, organizations, and categories, with directional lines indicating agreement or disagreement. I have not found a way of creating networks of "statements"—in other words, a way to pull a category and then visualize the network of statements about that category and how they relate to each speaker; then again, I have only begun the process of creating visualizations. Another complication is that I have only found ways to associate a statement with a single category. I'm fairly certain I can find a workaround for that, but for the moment it's not worked out. I will say, though, that having to choose among fixed category designations (ones of my own creation) made me very attuned to my assumptions about the text. That process helped me realize how my visualizations of these networks will always be limited, and reminded me that I need to make those limitations transparent when I write about what the visualization actually visualizes.

In the meantime, even though I am not teaching right now, I'm really excited about what this kind of software could mean for my students. In the English 101 courses at the University of Maryland, students write three linked assignment papers on a self-selected research topic. These are position papers, in which the student must make purposeful arguments for what he or she believes and respond to the discourse of the field in which the selected debate is ongoing. We generally assign an annotated bibliography as the first part of that linked assignment as a way of getting students to read the work and then explain who agrees with whom on particular points and who disagrees. The hard part of this assignment is that each entry is generally two paragraphs long and the bibliography includes only 8–10 sources, and getting the students to actually compare arguments, identifying points of agreement and disagreement, is difficult. However, if the assignment were to use the Discourse Network Analyzer to import each article and then go through each article tagging "statements," "speakers," "organizations," and "categories" (for example, whether the speakers are arguing that a particular action should be taken or that one event causes another) as well as "agreement" or "disagreement" with each statement, students might begin to see how their readings create a network of ideas; by understanding who agrees and what they agree upon, a student might be better able to situate him- or herself within the discourse of that issue. It's an intriguing idea to me, and at some point when I'm teaching again, I think I'm going to make use of this technology.

"The Venus Hottentot (1825)" as a network

At MSA 13 this week, I will be presenting a couple of ways I have started mapping ekphrasis using social network analysis. The following visualization is a very early working-through of how to identify "nodes" in the poem and how to define their relationships. In this case, I have "named" the subjectivities, voices, locations, languages, and "actors" within the poem. Then, in an Excel spreadsheet, I placed any subject initiating an action (defined as describing, narrating, relating, comparing, envoicing, placing, observing, etc.) in the first column and the correlating object of that action in the second column. In other words, this formalizes my understanding that ekphrasis is something done to something else by someone else for someone else. The following is only a very preliminary visualization using the Network Diagram tool in Many Eyes.

[The Many Eyes network diagram should appear here.]
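For anyone who wants to reproduce this arrangement outside of Many Eyes, the two-column spreadsheet is simply a directed edge list. Here is a minimal sketch in Python using the networkx library; the CSV file name and column names are hypothetical stand-ins for my actual spreadsheet:

```python
# Sketch: turn the two-column subject/object spreadsheet into a
# directed graph. File name and column names are hypothetical.
import csv
import networkx as nx

G = nx.DiGraph()
with open("venus_hottentot_edges.csv") as f:
    for row in csv.DictReader(f):      # columns: subject, object, action
        G.add_edge(row["subject"], row["object"],
                   action=row.get("action", ""))

# Which subjects initiate the most acts of describing, narrating,
# envoicing, and so on? (Highest out-degree first.)
print(sorted(G.out_degree(), key=lambda kv: -kv[1]))
```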

Mapping a network by hand

As I've mentioned elsewhere, mapping networks by hand is a predictable way of beginning to define how one might go about determining "nodes" and "edges." I've been using a presentation tool called Prezi to begin playing with what the "nodes" could be in an ekphrastic poem, and doing so has also made me think very specifically about what I mean by "exchange" as a way of defining the relationships between those nodes. Here is one of my initial hand drawings of the relationships between various voices, subjects, artworks, and locations in Elizabeth Alexander's "The Venus Hottentot (1825)."