Category Archives: Reading and practicum reflection

Clio Wired: Week 13 Reflection

This week’s readings focus on innovative techniques for teaching history, emphasizing students’ (lack of) ability to critically evaluate primary and secondary sources.  Sam Wineburg’s piece “Thinking Like a Historian” sets the stage for the rest of the readings, exploring the reasons for students’ lack of interest and ability in historical study.  Often, this stems from a prior emphasis on memorizing facts and dates rather than on thinking.  Students can’t begin to imagine that doing history actually involves critical thinking, discovery, and uncertainty, because their only exposure to the field involves regurgitating bullet points.

Wineburg and Daisy Martin explore how the site Historical Thinking Matters guides students through modules on a certain event in history, while teaching them to critically interpret primary sources from various sides of the issue.  The site not only tells students about the importance of sourcing, contextualizing, close reading, and using background knowledge; it actually shows historians thinking out loud as they encounter a new document.  By showing rather than simply telling, the site allows students to understand how history is done.

When I went through the HTM module on the Scopes trial as part of the practicum, I felt that the site was very effective.  It definitely deepened my understanding of the various viewpoints involved in the trial.  I really liked how the first page of the module provided background for the event.  Then I was able to see an example of a historian working through a document, which I was encouraged to do myself with the rest of the series of primary sources.  I liked that each primary source came with a brief introduction and questions which could be revealed after analyzing the source on your own.  I also liked how the overall question or thesis of the module asked the student to complicate or problematize the notion that the Scopes trial was simply a battle between creationists and evolutionists.  As the rest of the readings showed, one of the most difficult obstacles students face when learning history is understanding that ambiguity and uncertainty are often history’s results.  After spending years reading from what seem like authoritative textbooks, it is quite difficult for students to understand that history is not about finding the answer.

Mills Kelly certainly brought innovative history teaching to a new level with his course “Lying About the Past.”  I was fascinated by Kelly’s description of his course and the fallout it caused in the scholarly community.  It was shocking that a course which was able to teach students real research skills and the highly important ability to detect unreliable sources would end up being so vilified.  Although the students did produce a few hoaxes online, they were careful to reveal each hoax fairly quickly; in fact, the public revelation was a chance for those who hadn’t taken the course to sharpen their critical thinking skills by learning to question what they read on the internet (or anywhere else).  That Kelly was banned from Wikipedia and treated like a criminal by many in the scholarly community actually serves to prove his point about jumping to conclusions without weighing all of the facts of the situation.  I think if the people attacking Kelly with such vitriol had actually understood his goals and the success of his students, they would have moderated their views.  It’s a shame that someone who took the lead in truly innovating history teaching ended up being pilloried rather than emulated or praised, especially in light of the difficulties of getting students truly involved in critical history work.

My lesson for this week’s practicum is inspired by the case studies in “Ways of Seeing: Evidence and Learning in the History Classroom”.  I particularly latched onto Jaffee’s, Felton’s, and Wies’s explorations of how students’ ability to analyze primary sources seems to evaporate when they are faced with images.  As an art historian, I found this particularly worrisome, but also not especially surprising, as historians often ignore their counterparts in the art history field (there, I said it!).  Why these professors did not consult with their colleagues in art history was puzzling, to be honest.  While it’s true that art historians often have the same issues when trying to get students to analyze artworks, clearly they have more ironed-out techniques for getting students to think about images.  Even the introductory chapter in the survey textbook Gardner’s Art through the Ages gives an overview of how students should be prompted to think about art, with questions like “how old is it?”, “what is its style?”, “what is its subject?”, “who made it?” and “who paid for it?”.  It also directs students to think about various types of evidence: documentary, visual, stylistic, physical.

I have never made a lesson plan before, but my idea centers on images of leadership and power.  I would split students into groups and assign each group an image of a leader, chosen from various time periods.  Examples could include the Egyptian pharaoh Menkaure, the Roman emperor Augustus, the Byzantine emperor Justinian, Louis XIV, the Medici pope Leo X, and George Washington.

I would give the students some background information on each leader and society.  I would then ask students to prepare a short PowerPoint or Prezi presentation, using the image as its main focus and using comparison images if necessary.  In the presentation they would have to explain how the image communicates the leader’s power, leadership style, type of government, etc.  The students would need to point to specific elements such as material, audience, style, location of the image (if known), accessories, dress, expression, other figures, etc.  They would also present further background information they had researched in order to substantiate their claims.  Further background might include textual primary sources or secondary sources.  The goal of the lesson would be to show how images of power are constructed, and would hopefully teach students to question the imagery they see in their everyday lives.

Clio Wired: Week 12 Reflection

What I thought of the readings:

Being a wanna-be art historian and librarian, I was admittedly more interested in the open access (rather than open source) portion of the readings this week.  However, some of the same issues clearly come up in both realms–questions of ownership of “intellectual property,” free knowledge exchange, the overreach of copyright or patents, and the role of for-profit companies in IP law.

I enjoyed Lawrence Lessig’s Free Culture, which argues that our once free culture is rapidly becoming a “permission culture.”  The exchange of ideas which used to be part and parcel of the way we communicate, share, and generate culture is threatened by powerful companies whose commercial interests run counter to this model.  Lessig asserts that because this cultural exchange is now more public, recordable, and effortless due to the advent of the internet, companies have severely ramped up their efforts to strengthen laws which protect so-called intellectual property.  He points out that copyright trolls like Disney–by pushing for ever longer and more stringent copyright protection–affect not only their own proprietary works, but all works that fall under copyright legislation–in essence, all cultural objects.  The original intent of copyright–to ensure that the creator could make a reasonable profit for a few years, and then, by design, open the work up for the public’s benefit–has essentially been thrashed by these companies.  Fair use, which is extremely ill-defined anyway, does not seem to be a sufficient defense.

Scary disclaimer about using a picture of M!ckey Mouse

Veering slightly (and delightfully) into hyperbole, Lessig compares the ever-more extremist copyright climate to the system of feudalism, in which a relatively small number of individuals or entities own all property, and which depends on maximum control and little freedom.  He implores government to resist the pull of large corporations and to preserve the tradition of free culture.  He advocates a “middle way” between “all rights reserved” and “no rights reserved” which gives creators freedom to distribute their works as they see fit.  Implied in Lessig’s title Free Culture is not just the adjective free as in “free speech, not free beer”, but also the verb “to free.”  Lessig wants us to unshackle our cultural objects and traditions.

Steering clear of the thorny territory of for-profit content, Elena Giglia and Peter Suber explore the meaning and implications of Open Access (OA) in the scholarly world.  OA literature is “digital, online, free of charge, and free of most copyright and licensing restrictions.”  Importantly, however, OA does not give a free pass to plagiarism–in this model, the author is always credited for his or her work.  Suber (whose book is ironically not yet fully open) argues that OA is basically a no-brainer for the scholarly arena.  He asserts that scholars are uniquely situated to benefit from open access, as their work model has never rested upon selling content; rather, they are paid a salary by universities or grant-funders to research, peer review, and publish in the normal course of their work.  Scholars benefit in their careers when their works have maximum impact and citations; the larger audience and heightened visibility facilitated by the OA model, then, greatly benefit them.

My one question about this model (which is perhaps answered in one of Suber’s chapters that is not currently open access) is whether it would affect researchers’ ability to use the highly effective search tools provided by databases.  If libraries no longer had to pay for access to journals through various databases, how would researchers comb through vast amounts of material without being able to search by keyword, subject, or any number of highly effective limiters?  How would they search many works at one time, rather than hunting through each title?  Even if every journal I ever wanted to read were freely open on the web, I know I wouldn’t want to be limited to Google or to painstakingly combing through tables of contents.  Perhaps someone would build a search engine for scholarly research considerably more sophisticated than Google Scholar.
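
Out of curiosity, I sketched out what the core of such a fielded search might look like.  This is just a toy with invented article records–real scholarly search engines add relevance ranking, stemming, controlled vocabularies, and much more:

    # Toy sketch of fielded search over open-access article metadata.
    # The records and field names are invented for illustration.
    from collections import defaultdict

    articles = [
        {"id": 1, "title": "The Scopes Trial Reconsidered",
         "subjects": ["legal history", "religion"], "year": 1998},
        {"id": 2, "title": "Open Access and the Monograph",
         "subjects": ["scholarly communication"], "year": 2010},
    ]

    # Inverted index: subject term -> set of article ids.
    index = defaultdict(set)
    for art in articles:
        for term in art["subjects"]:
            index[term].add(art["id"])

    def search(subject, year_from=None):
        """Find articles by subject term, with an optional date limiter."""
        hits = [a for a in articles if a["id"] in index.get(subject, set())]
        if year_from is not None:
            hits = [a for a in hits if a["year"] >= year_from]
        return [a["title"] for a in hits]

    print(search("scholarly communication", year_from=2005))
    # -> ['Open Access and the Monograph']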

Although one would probably have to be an actual programmer to fully appreciate Karl Fogel’s book on avoiding failure in open source projects, I did appreciate his history of open source.  The anecdote about Richard Stallman–disenchanted with the rise of proprietary software–creating the GNU General Public License (GPL) explicitly to stick it to the man was quite entertaining.  The GPL asserts that code may be copied or modified without restriction, and that both copies and derivative works must be distributed under the license with no additional restrictions.  This license provides protection for free software and disallows the “enemy”–proprietary software–from benefiting from it.  I also appreciated Fogel’s exploration of the evolution of the term open source from the formerly used “free software.”  Fogel explains that free is a tricky word in English, having no Romance-language distinction between gratis and libre; programmers were always having to explain “think free as in freedom–think free speech, not free beer.”  More importantly, however, the term open source was easier to pitch to the corporate world, which didn’t associate it with free’s implication of theft, piracy, or not-for-profit.

Trying it out for myself

The Creative Commons licenses are flexible and relatively easy to implement, though I did have to Google how to attach one to my WordPress blog.  You can create a ShareAlike license which, like the GNU GPL, requires that derivative works remain equally open, or you can simply block commercial uses.  Importantly, you can also specify whom to credit and how.  The fact that CC generates a tidy block of embed code which is easy to copy and paste is convenient, and I of course enjoy the little emblem it creates as well.

Clio Wired: Week 11 Reflection

This week’s readings on preservation of digital materials seem to speak more to the concerns of librarians and archivists than humanities scholars themselves.  They reminded me a lot of the discussions I had while studying for my master’s in library science, during which I also focused on archival work.  What really struck me during those studies, and from the readings this week, is the sheer amount and ephemeral quality of born-digital materials.  Despite the fact that we all mostly acknowledge the superior capabilities for creating data, projects, etc. in digital formats, it is also true that we have not come up with a better medium than paper for long-term storage.  Not only does paper not need an appropriate “reader”, such as a CD drive, floppy disc drive, or VCR, it is also highly stable in most cases, and is still readable after sustaining some damage.  Moreover, preservation of paper is mostly passive (keeping it out of the way of water, fire, acid, etc.), while preservation of digital materials requires constant recopying, either to the same type of media (CDs begin to deteriorate after about 10-15 years) or to a completely new medium (if we aren’t going to keep a museum’s worth of old readers, we need to eliminate data storage on obsolete media types).  This requires tons of human-power, funding, time, planning, etc.  Of course, paper isn’t a cure-all either, especially for born-digital projects.  Obviously, no one is going to print out every single one of their thousands of emails for posterity, and many digital works aren’t simply text, so they cannot feasibly be stored in paper format.  In some ways, it feels like we’ve opened a Pandora’s box with the creation of such an overwhelming amount of born-digital material, but of course all we can do is adapt and try to intelligently create best practices as we go along.

The authors this week have obviously thought a great deal about these issues, but certainly don’t offer a cure-all; it is heartening, however, that they have offered plans for a way forward.  I especially like the goals or steps laid out by The NINCH Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials (a small sketch of one of these steps in practice follows the list):

  • Identifying the data to be preserved
  • Adopting standards for file formats
  • Adopting standards for storage media
  • Storing data on and off site in environmentally secure locations
  • Migrating data
  • Refreshing data
  • Putting organizational policy and procedures in place
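
To make one of these steps concrete: “refreshing data” is usually paired with fixity checking–recording a checksum for each file when it enters the repository, then re-hashing on a schedule to catch silent corruption.  A minimal sketch of the idea (the file path and recorded digest are hypothetical):

    # Minimal fixity-checking sketch: hash each file at ingest, then
    # re-hash periodically; a mismatch means the file must be restored
    # from another copy. The path and digest below are hypothetical.
    import hashlib

    def sha256_of(path):
        """Compute a file's SHA-256 digest, reading in chunks."""
        h = hashlib.sha256()
        with open(path, "rb") as f:
            for chunk in iter(lambda: f.read(8192), b""):
                h.update(chunk)
        return h.hexdigest()

    manifest = {"letters/1794-03-12.tif": "9f2b..."}  # recorded at ingest (truncated)

    def audit(manifest):
        for path, recorded in manifest.items():
            if sha256_of(path) != recorded:
                print(f"FIXITY FAILURE: {path} no longer matches its checksum")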

The ethical and technological issues raised by Matthew G. Kirschenbaum in “Digital Forensics and Born Digital Content in Cultural Heritage Collections”–in terms of mining a donor or subject’s computer to find historically pertinent information–show that librarians, archivists, and scholars will not only need the technological capabilities to engage in this activity, but will also need to seriously consider the ramifications of having access to data that may not have been intended for public view.  Of course, this is not necessarily a new problem; as we have seen in recent years with revelations about Thomas Jefferson’s dealings with slaves, for example, even manuscript or printed materials created during a person’s life do not necessarily leave the legacy he or she intended.  Issues of provenance or authenticity when it comes to born-digital data also have a basis in the techniques and policies of dealing with physical media; however, while techniques such as materials analysis or handwriting analysis may not be applicable, chain of ownership, word-usage analysis, and the like will still be valuable tools in the arsenal.  In fact, the text mining techniques we have discussed in other weeks could be an increasingly valuable tool for analyzing bodies of writing and determining their authenticity.
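
As a very rough illustration of what that might look like, here is a sketch comparing the relative frequencies of common function words in two texts–a standard stylometric signal, though real authorship studies use many more markers and proper statistics (Burrows’s Delta, for instance).  The input files are hypothetical:

    # Crude stylometric sketch: compare relative frequencies of common
    # function words in two texts. Real studies use far more markers
    # and a proper distance measure; the input files are hypothetical.
    import re
    from collections import Counter

    FUNCTION_WORDS = ["the", "of", "and", "to", "in", "that", "it", "was", "but", "upon"]

    def profile(text):
        """Relative frequency of each function word in the text."""
        words = re.findall(r"[a-z']+", text.lower())
        counts = Counter(words)
        total = len(words) or 1
        return [counts[w] / total for w in FUNCTION_WORDS]

    def distance(a, b):
        """Mean absolute difference between two frequency profiles."""
        return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

    known = profile(open("known_letters.txt").read())
    disputed = profile(open("disputed_letter.txt").read())
    print(distance(known, disputed))  # smaller = stylistically closer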

As for the concerns about terminology for collections of online scholarship or documents discussed by Kenneth M. Price in “Edition, Project, Database, Archive, Thematic Research Collection: What’s in a Name?” and Kate Theimer in “The Problem with the Scholar as ‘Archivist,’ or Is There a Problem?” and “Archives in Context and as Context,” I think it is important both to acknowledge the correct usage of terms and to acknowledge that the nature of language is that words evolve and change meaning over time.  However, as a librarian, I also fully understand Theimer’s concern about the implicit disregard for her profession when the term “archive” is used very loosely.  Librarians and archivists both have a lot of trouble communicating their worth and professional status to the outside world–even to scholars.  It would be ideal if the scholarly community banded together with librarians and archivists to express the worth of our collective fields in the face of ever-increasing budget cuts and the disparagement of cultural institutions and academia in society.

Clio Wired: Week 10 Reflection

What I thought of the readings:

This week’s readings echo many topics addressed in past weeks, such as the need to acknowledge collaborative work, the benefits of open access, and the need for academia to “count” digital history or public history work (as opposed to only the scholarly monograph) towards tenure and promotion.

Addressing the need for new modes of peer review, authorship, and publication, Kathleen Fitzpatrick’s Planned Obsolescence brings a sense of urgency to these concerns by tying them to existential threats to the humanities:  She links the “fundamentally conservative nature” of academia not just to the inability of younger or more technologically savvy scholars to get their work recognized, but to society-at-large’s dismissal of the university in general and the humanities in particular.  In other words, by resisting digital technologies and all the new modes that come with them (open access, open peer review, rethinking of intellectual property rights), academia is further isolating itself from public life, and therefore confirming the public’s misconception that scholarship (especially in the humanities) is not worthy of public interest, respect, or funding.  For Fitzpatrick, academia must take responsibility for communicating its worth to the public, and embracing new technologies and forms of communication is a fundamental step.  Moreover, Fitzpatrick emphasizes that academia does not have a choice between adapting to the new technological landscape and remaining in its conservative bubble:  Change is inevitable and academia must react.

In publishing his born-digital article for the American Historical Review, William Thomas experienced both the fundamentally conservative nature of academia addressed by Fitzpatrick and the benefits of embracing new modes of review and publication.  It is interesting that the summary of the digital article which appeared in the print version of the journal was mistaken by some scholars as the “real” version of the work.  Among reviewers of the digital version, there seemed to be a fundamental misunderstanding of the difference between simply publishing text on the web and creating a dynamic digital history project.  Aside from (rightly) criticizing the gimmicky use of Flash or other convoluted navigation features, reviewers saw the digital project as having “no argument” due to its lack of linearity and the perceived abdication of authorial control.  For Thomas, these obstacles to having his work accepted by historians show that we need new conventions for “reading” in the digital medium.

In “Re-Visioning Historical Writing,” Dorn and Tanaka also address the need for new modes of historical reading and writing.  Dorn emphasizes that digital projects reveal history to be more than just a “polished argument about the past.”  Rather, history is a messy proposition involving many voices, contradictions, and narratives, perhaps best suited to a hypertextual, dynamic, ever-evolving presentation rather than the static, linear narrative of a monograph or journal article.  Tanaka also cautions against fixating on the “correct” interpretation of the past rather than a heterogeneity of interpretations.  He proposes that the evolving role of the historian will involve corralling a multitude of data in a skilled and reliable way, rather than simply mastering knowledge in a specific area of expertise and presenting that knowledge in an authoritative way.  (To me, this sounds a lot like the job description of librarians–professionals who constantly need to justify their worth to students, funders, and sometimes even scholars.)

Related to the above authors’ calls for openness and change in academia are the Working Group on Evaluating Public History Scholarship’s guidelines for fair and transparent evaluation of public history faculty.  Again, these guidelines show that change is already upon us, and academia must adapt in order to promote not only fairness to scholars but continuing relevance to the outside world.

Practicum:

The act of commenting on Open Review actually brought up many of the issues addressed by the essay itself.  I found myself wondering whether my comment could actually be useful to the writers, who are subject specialists and have much higher academic credentials than I do.  I also wondered who would be responsible for reading my comment, and for what period of time it is actually useful to receive further commentary.  Reading the essay and making a brief comment on one paragraph did not feel to me like terribly helpful or legitimate peer review.  As the essay itself notes, different levels of engagement would be required for open review to be feasible, such as a certain number of reviewers committing to read the entire work as well as make granular comments.  I do like the idea of opening up works to the scrutiny of any interested commenter, but wonder if it could be difficult for authors and editors to cut through the noise to respond to truly useful recommendations.

I have to admit that I was a bit stumped when it came to developing my own guidelines for evaluating digital history scholarship, not least because I am really not familiar with the process of evaluating even traditional scholarship for tenure or promotion purposes.  Therefore, I did some Googling and found various examples of guidelines, such as those provided by the American Association for History & Computing, based on guidelines by the MLA.  Both sets of guidelines seem fairly comprehensive, focusing not only on the responsibilities of the reviewers, but on the responsibilities of the candidates in advocating for themselves.  Although candidates are advised to document and explain collaborative relationships, one aspect I thought these guidelines left out was the responsibility of reviewers to fully understand the collaborative nature of digital projects and to seek methods for fairly evaluating this work.  Also, while these guidelines are more general, there are a few specific actions on the part of reviewers I thought could be added:

  • Consider the audience for a digital project; it may not be directed toward scholars, but toward the public, undergraduates, etc.
  • Attempt to explore the digital project through various paths, as the full story of the project may be best communicated through various trials and revisits
  • Evaluate design as an aspect of the project’s argument, thesis, or purpose
  • Take user feedback into account if the project has been opened to the public
  • Understand that the project may be ongoing and evolving, rather than in a final or a static state

What do you think of these recommendations?

Clio Wired: Week 8 Reflection

This week’s readings point out the important distinction between data and interface in digital projects.  While data is contained within a randomly accessible database, the author/editor/curator of a project presents this data in what Lev Manovich calls a “hyper-narrative” form.  In other words, the website author, through various controls such as information architecture, design, or other cues, leads the user through a series of possible “narratives.”  One iteration of this hyper-narrative experience may be called a linear narrative (in a sense different from the sole narrative presented in a work such as a monograph).  Unlike when directly accessing the raw database, the user here does have some element of control, but the author of the interface ultimately guides the experience.

What struck me about this article was Manovich’s caution that true interactivity is not constituted by the user’s ability to access a site’s pages in various orders.  I think it would be very useful to contemplate the true parameters of interactivity, especially in light of our grant projects.  It occurs to me that interactivity can exist on several layers.  From simplest to most complex, these might include the option to leave commentary, to add content (as in Philaplace, where users can add their own Philadelphia-related stories), to choose data sets or other information to be displayed in various ways, or–much more involved–to extract openly available data and create a totally new interface.  Great examples of the fruits of this last type of “interactivity” are the “Irish in Australia: History Wall” and “Invisible Australians” projects, which use digitized data from institutional archives to create totally new projects.  What other types of interactivity have people thought about?

Dan Cohen advocates (here and here) strongly for this type of interactivity, or as he would probably call it, freeing content for reuse and reinterpretation.  Although Cohen is clearly an advocate of digital history projects which guide the user through the site and have a specific message or thesis, he also believes that the data should be “freed” so that scholars can manipulate it for uses unanticipated by the author, or can create an interpretation of their own.  As Cohen makes clear, this open source data model brings up questions of credit for scholarly work; however, as a community, academics should be able to integrate data creation into the products (like monographs, articles, and slowly-but-surely digital history sites) for which scholars receive credit and acknowledgement.

I am also intrigued by Cohen’s idea that the separation of interface and data leads to a longer life for that data.  In other words, even when the interface has gone by the wayside due to lack of upkeep, antiquated technology, or the advent of newer scholarly methodologies, the data can still persist.  If the original creator of this data–presumably the author of the interface–is no longer acting as its steward, who will take the responsibility?  Cohen thinks that this might suggest new roles for libraries, which could become responsible repositories of data.  This is certainly an intriguing suggestion, as the future of libraries in light of new technology is a constant debate in the library world.  However, being a data repository does not necessarily shore up libraries’ brick-and-mortar existence (unless they are to transform into huge server farms).  At any rate, it is certainly worth thinking about how the valuable data presented by digital history projects will be maintained once those sites are defunct.  What might be other solutions to this data-maintenance problem?
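
One way to picture this data/interface separation: keep the data itself in a plain, documented format, and treat any given presentation as a disposable script layered over it.  A trivial sketch (the file names and record fields are hypothetical):

    # The durable layer is a plain JSON file; this script is just one
    # throwaway "interface" over it. If the website dies, the JSON can
    # still be read, migrated, or reinterpreted by someone else.
    import json

    with open("project_data.json") as f:
        records = json.load(f)  # e.g. [{"name": ..., "year": ..., "place": ...}]

    # One of many possible presentations of the same data.
    rows = "\n".join(
        f"<li>{r['name']} ({r['year']}): {r['place']}</li>" for r in records
    )
    with open("index.html", "w") as f:
        f.write(f"<html><body><ul>\n{rows}\n</ul></body></html>")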

Clio Wired: Week 7 Reflection

What I thought of the readings:

This week’s topic of spatial history built upon last week’s discussion of data mining and visualizations, exemplified by Franco Moretti’s work Graphs, Maps, Trees.  Delving deeper into the “maps” aspect of visualization, the readings this week show how historical topics can be enhanced, explained, modeled, synthesized, etc. through the use of spatial visualizations.  It is important to note for our discussion–as emphasized by Richard White–that these digital visualizations are not simply static illustrations accompanying text, but can be dynamic visual aids which allow the user/reader to understand how events unfold over time and space, to ask new questions, and to scrutinize assumptions.

Todd Presner presents a rich spatial history resource, HyperCities, which allows many users to create mapping projects through its interface.  Presner makes the important point that HyperCities differs from simple, commercial mapping projects in that rather than focusing on information like traffic, weather, and commercial interests, these visualizations’ main focus is humanities scholarship related to “urban, cultural, and historical transformations of city spaces.”  Projects ranging from presenting the history of Los Angeles from prehistoric times until now to the mapping of protests in Iran’s 2009 elections show how HyperCities in particular, and spatial history in general, has the ability to present a breadth of scholarship in dynamic and innovative ways.

Presner’s emphasis on spatial history and visualization as legitimate forms of humanities scholarship is also addressed by Jo Guldi and Martyn Jessop.  Although she does not directly address visualizations, Guldi explores the “spatial turn” in a myriad of scholarly areas, explaining how in fields as diverse as psychology, anthropology, history, and art history, scholars between 1880 and 1960 came to reflect on humans’ “nature as beings situated in space.”  Rather than continuing to concentrate on great personalities, for example, historians began to focus on history as a function of nation or city, and later, as a function of region or center/periphery.  Jessop shows how graphic aids to humanities scholarship are not actually new or out of the blue, but rather have a long history, ranging from early modern Kunstkammern, to museums, to film, to theater.  For Jessop, digital technology has simply created a new medium for visualization.

Trying it out myself:

Jessop’s assertion that humanists lack education in visual literacy certainly hit home for me as I was attempting to use the various tools this week.  Visualizing events in space has never been a particular strong suit of mine.  I remember reading Michael Shaara’s Civil War novel The Killer Angels in middle school and hating every minute of it; I couldn’t make heads or tails of Shaara’s descriptions of troop movements, which at the time seemed to make up the entirety of the book.  (If someone had made a nifty visualization of the book back then, maybe I could’ve gotten into it!)  Trying to use many of the tools this week brought back that same sense of frustration.  Neatline, for example, has a very steep learning curve.  I really couldn’t figure out how to do anything effective with the site; its demos only showed what masters were able to create, but did not show how novices could learn to use the tool.  I tried to perform the simple task of plotting my birthday in time and space, but couldn’t even figure out how to do that.

Trying and failing to use Neatline

I was a bit more successful with Google Earth, where I made a map of some of the museums I visited this summer in the Netherlands.  However, as Presner points out, I am not sure that Google Earth on its own is really a digital humanities tool, though clearly some other digital humanities sites, like the historical maps repository at the David Rumsey Maps Collection, have made use of its data.

Museum visits on Google Earth

David Rumsey Maps Collection using Google Earth

Clearly many of this week’s spatial visualization tools are very useful and can help scholars produce some unique and intellectually rigorous projects.  I think, though, that I would need a lot more training in order to produce something worthwhile.

Clio Wired: Week 6 Reflection

What I thought of the reading:

This week’s readings were enlightening because they demonstrate how digital tools are useful not only in presenting history to the public or other audiences, but also in the process of researching and creating historical scholarship.

Franco Moretti’s Graphs, Maps, Trees was a nice introduction to what exactly can be done by manipulating and visually presenting historical data.  For Moretti, visualizations of trends, patterns, and cycles in literary history do not replace close reading of individual texts.  Rather, they add new layers of information, and sometimes even debunk generally held assumptions about literature’s history.  Tim Burke praises Moretti’s approach, noting that viewing quantitative data about literature can problematize many commonplace assumptions about it.  However, Burke cautions that, while numbers can seem quite concrete and infallible, they can still be misleading.  For example, quantifying publication does not actually tell us about readership.  He also criticizes Moretti’s lack of emphasis on authors’ agency and on the breaks and ruptures (as opposed to gradual divergence) in literary history.  However, I think Moretti is still useful in demonstrating how these tools can be used not just in the social and hard sciences, but also in the humanities.  Burke’s criticisms show that despite these visualizations’ seeming authoritativeness, the way in which they are interpreted or presented is still quite subjective.

While Moretti mostly deals with publication data for various genres, the rest of the authors focus on data mining specific texts or corpuses of texts in order to analyze them in new ways.  Daniel Cohen and Gregory Crane focus on the new scholarly opportunities presented by large digital collections such as Google Books or Project Gutenberg.  In conjunction with close examination of a limited number of texts, scholars who use various data mining/text mining tools can, in the words of Cohen, “find patterns, determine relationships, categorize documents, and extract information from massive corpuses.”  For example, one might perform a statistical analysis of how often two keywords or phrases appear together, or find specific types of documents (such as syllabi) by assessing frequently used words in these texts.
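
Here is a minimal sketch of that kind of co-occurrence analysis, counting how many documents in a (tiny, invented) corpus contain both terms:

    # Minimal co-occurrence sketch: count documents containing both
    # terms versus each term alone. The corpus is invented.
    import re

    def tokens(doc):
        """Lowercased word set for one document."""
        return set(re.findall(r"[a-z']+", doc.lower()))

    def cooccurrence(corpus, term_a, term_b):
        both = sum(1 for d in corpus if {term_a, term_b} <= tokens(d))
        a_any = sum(1 for d in corpus if term_a in tokens(d))
        b_any = sum(1 for d in corpus if term_b in tokens(d))
        return both, a_any, b_any

    corpus = ["The syllabus lists required readings for the course.",
              "Readings on the war were required for the seminar."]
    print(cooccurrence(corpus, "readings", "required"))  # -> (2, 2, 2)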

Unfortunately, these large digital libraries can have some drawbacks, such as “noise” from incorrect OCR, missing texts due to copyright restrictions or cost of digitization, and inability to present or crawl texts in non-Roman alphabets.  For these reasons, scholars need to be careful about drawing conclusions from potentially-incomplete data sets.

Trying it out myself:

As I played around with some web-based text mining tools, it was obvious that some of them are better suited to entertainment than serious scholarship.  Wordle, which generates text clouds of the most frequently used words in a document, creates aesthetically pleasing visualizations.  However, aside from giving a general idea about the topics or keywords of a text, I am not sure that this tool has any serious scholarly use.  Here is my text cloud for Grimm’s Fairy Tales:

Wordle for Grimm’s Fairy Tales

Another tool which was entertaining but probably not statistically sound is Google’s Ngram Viewer.  Because you cannot control which texts are included in the analyzed corpus, the data may be misleading.  However, for general information rather than scholarly purposes, the Ngram Viewer can give a nice idea of when certain terms may have come in and out of fashion.  For example, in the Ngram below, you can see the shift from using the term Great War to the term World War:

Ngram: Great War vs. World War

Because of the user’s ability to choose texts and because of its myriad analytical tools, Voyant was the most promising tool for scholarly research.  I chose to analyze the same Grimm’s Fairy Tales text I tried in Wordle, available through Project Gutenberg.  I like how the user can manipulate the data provided by Voyant in many ways.  Not only can you see the most frequently used words, but you can also compare the frequency of two words against each other and see words in context.  Voyant also provides a word cloud, which seems to be generated using a different algorithm than Wordle’s, as they came out differently.

Voyant analysis of Grimm’s Fairy Tales

Although I felt like I couldn’t take full advantage of Voyant’s tools since I wasn’t undertaking an actual text-mining project, I did find it interesting that Voyant identified “said” as the most frequently used word in Grimm’s Fairy Tales.  This might say something useful about the structure of the tales or how the narrative action is pushed forward.  As you can see above, Wordle actually eliminated “said” from its word cloud, perhaps because it is too commonly used; this shows how lack of control over the algorithm or data manipulation of tools like Wordle and Ngram can lead to misleading information.
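
The discrepancy is easy to reproduce: the same frequency count gives very different “top words” depending on the stop-word list applied.  A quick sketch, assuming a local plain-text copy of the tales downloaded from Project Gutenberg:

    # Same frequency count with and without a stop-word filter.
    # Assumes a local plain-text copy of the tales, e.g. "grimm.txt".
    import re
    from collections import Counter

    STOPWORDS = {"the", "and", "a", "to", "of", "he", "she", "was", "said", "in", "it"}

    words = re.findall(r"[a-z']+", open("grimm.txt").read().lower())
    counts = Counter(words)

    print(counts.most_common(5))    # raw counts: "said" ranks very high
    filtered = Counter({w: c for w, c in counts.items() if w not in STOPWORDS})
    print(filtered.most_common(5))  # filtered: "said" silently disappears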

Clio Wired: Week 5 Reflection

What I thought of the reading:

The key concept of this week’s readings is crowdsourcing.  The authors explore how the power of the crowd can be harnessed in order to create and improve history content on the web.  This decentralized method of doing history involves many complex issues in the realms of scholarship, human psychology, and technology:  Can crowdsourced history be high quality?  How can we motivate users to perform large amounts of tagging or transcription?  How do institutions learn to trust anonymous users?  How can we create intuitive, effective, and easy-to-use tools for a project’s volunteers?

For me, the most useful reading this week is by Trevor Owens, whose blog addresses many of these questions.  Owens very usefully breaks down the term “crowdsourcing,” emphasizing that users are not really an undifferentiated crowd, but rather engaged, enthusiastic volunteers and amateurs–in other words, the type of people that museums and libraries have been relying on for years.  Moreover, “sourcing” should not be defined as labor or exploitation, but as meaningful work.  Importantly, the main motivator for users is that the task speaks to their personal identities and gives them a sense of purpose.  Rose Holley, in addition to addressing users’ feeling of purpose, speaks about more concrete types of motivation, such as progress bars, top user rankings, rewards, leveling-up, etc.

Motivational tools on Old Weather

Owens also makes an important distinction between types of crowdsourcing projects:  “Human Computation” projects require users to perform short, discrete tasks which are more easily accomplished by a human than a computer.  “Wisdom of Crowds/Why Wasn’t I Consulted” projects, on the other hand, require users to present knowledge in a more free-form manner, as in the encyclopedia project Wikipedia.  In my mind, the major distinction between these types of projects is that the former improves access to existing content, while the latter creates new content altogether.  From the projects I’ve sampled this week, I have gathered that Human Computation projects tend to deal with primary source materials provided by a centralized, authoritative institution, while Wisdom of Crowds projects create secondary historical content in a democratized fashion.  The article in History News on “radical trust” presents reactions from institution employees on letting go of some authority by soliciting crowdsourced data; while some interviewees seem thrilled by the prospect, others wished to hold more tightly to the reins of authority.
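
One common pattern for building that trust–I don’t know whether these particular projects use it–is redundancy: give each item to several volunteers and accept an answer only once a majority agrees, so no single anonymous contribution is trusted on its own.  A sketch:

    # Majority-vote reconciliation, a common human-computation pattern
    # (not necessarily the mechanism of the projects discussed above).
    from collections import Counter

    def reconcile(responses, quorum=3):
        """Return the majority answer once enough volunteers have responded."""
        if len(responses) < quorum:
            return None  # keep the item in the task queue
        answer, votes = Counter(responses).most_common(1)[0]
        # No clear majority? Escalate to staff instead of guessing.
        return answer if votes > len(responses) / 2 else None

    print(reconcile(["spiral", "spiral", "disc"]))  # -> 'spiral'
    print(reconcile(["spiral", "disc"]))            # -> None (needs more votes)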

The discomfort that some institutions might feel in allowing anonymous users to provide metadata for their digital collections is echoed in scholarly concerns about the validity of Wikipedia.  This debate over Wikipedia has been raging for a long time now.  Wikipedia’s problems (errors, inconsistent coverage, bias toward the subject at hand, and poor writing) are perhaps old news at this point.  However, as Rosenzweig points out, Wikipedia’s model of open access and collaboration provides an important example for future digital history projects, and challenges traditional scholarship’s insistence on individualism and hiding behind a pay wall.  Rosenzweig makes a very shrewd recommendation when he implores those who despise Wikipedia to make their own content as easily accessible to users.

Trying it out myself:

I tried my hand at many of the crowdsourcing projects mentioned in the readings this week.  Some, like NYPL’s What’s on the menu? and GalaxyZoo, are fun and quick.  The tools are easy to use, and the actual transcribing or identifying does not require much skill or practice.  The experience of transcribing Papers of the War Department felt like a completely different animal.  In fact, I consider the experience a bit high stress!  Deciphering handwriting is not an easy task and can be very time-consuming; also, for this project in particular, I felt a lot of responsibility.  Since these letters are of Great Historical Importance, it feels like making a mistake would be doing a very terrible disservice to American history.  On the other hand, if I mistakenly identify a disc-shaped galaxy as round, life will probably go on.

Easy transcription tool on NYPL’s “What’s on the menu?”

Although editing Wikipedia is also a big responsibility, at least I have complete control over what I write, and there isn’t much chance of making an accidental mistake.  I can also see how, for a scholar, editing or creating a Wikipedia page might not hold much appeal, since Wikipedia’s NPOV policy and prohibition against original material prevent adding much beyond factual or commonly-known information.

Public history website review:

Cleveland Historical lets users explore Cleveland’s history in a non-linear, non-narrative fashion.  The site’s informational content is presented in the form of Stories, which also make up various Tours.  A Story consists of a specific location, like Hough Bakery, while a Tour like “Cleveland Food Traditions” points users to Hough Bakery and other food locales.  There are two ways to access Stories and Tours from the homepage:  either from the left column list of Tours, or from the navigation bar across the top, which offers direct access to both Tours and Stories.

Cleveland Historical homepage

The ability to access Stories through a unifying Tour or from a list offers the user many options for exploring the site.  A framing narrative like “Cleveland Food Traditions” might appeal to some users, while being able to browse and stumble upon information on Hough Bakery might appeal to others.

The site’s blue and gray palette, fonts, and graphics are attractive, as is the layout of its Tour and Story pages.  These pages have a clear and logical three column format, with informational text sandwiched between two columns of photos, audio files, and tags.

Cleveland Historical Story page

These pages make each Story not only approachable but rich in content.  Being able to read a reasonable amount of text and then having the option to view photographs and listen to oral histories allows the user to customize his or her learning experience.  The geolocation map is a unique feature, which should work especially well in the mobile versions of the site.

The self-guided aspect of the site and the casual feel of the design make it fun and entertaining.  The user feels in control of the experience, while at the same time one’s trust in the site’s scholarly credentials is not lost.  This site reflects more modern conceptions of presenting history to the public.  Rather than presenting a one-path, one-perspective, all-encompassing grand narrative, Cleveland Historical lets users play in a historical sandbox, following their own interests and drawing their own conclusions.  My only criticism of the site is that it does not allow users to add their own comments or content.  It would make more sense for a historical site with such a sandbox feel to give users the opportunity to share reminiscences, opinions, photos, etc.  This would not only give users even more ability to control their experience with the site, but would also enrich the content by drawing on crowdsourced (and of course moderated) material.

Clio Wired: Week 4 Reflection

This week’s readings get down to the nitty gritty of creating webpages, with concrete advice on what to do and what not to do.  The surprising, and frankly comforting, aspect of these readings is that the authors do not condone throwing away all tenets of good design from non-digital media.  In fact, Cohen and Rosenzweig specifically point to print and book design as important examples for functional and appealing web design.  Legible print, manageable column width, attractive colors, judicious use of images, thematic unity, and even clear organization–these are all lessons that we do not have to create out of thin air, but which we can draw from our extensive experience with print media.  In fact, while Cohen and Rosenzweig do extol the benefits of audio, video, high quality images, and the capabilities of hypertext, they caution against using all available technology just because you can.  Rather, use of technological capabilities should be as thoughtfully chosen as the words themselves.  Just as one would ask, “Do I need this rhetorical flourish, or would my writing be clearer without it?” one should also ask, “Do I need this Flash video, or would my site be cleaner/clearer/more accessible without it?”

Although the principles of good design were fairly universally shared among the authors this week, one major point of contention was the debate about long form writing on the web.  Cohen and Rosenzweig come down on the side of appropriately used lengthy prose, emphasizing that giving in to the impulse for short “chunked” text further shortens internet readers’ attention spans, creating a vicious cycle that does not allow serious scholarship to be presented on the web.  Steve Krug, in his sardonically titled book Don’t Make Me Think, purports to have a more realistic view of internet users’ behavior.  For Krug, true internet use involves scanning and “satisficing”–clicking on the first thing that looks good, rather than taking the time to find the optimal information.  While I agree this is most likely the behavior used on commercial or business websites, I think that the emergence of scholarly writing on the web does in fact require long form essays, and that users who are serious about accessing this information will muster up the patience to read.

For the practicum this week, the principles of good design discussed by the authors were at the forefront of my mind.  I took a bit of a non-traditional approach for this practicum, not comparing two completely different sites, but comparing two iterations of one site.  I compared the relaunched website of The Phillips Collection with its old site, which I accessed through the Internet Archive’s Wayback Machine.  Surprisingly, I liked the Phillips’s old website a lot more; I find their new site overcrowded, over-stimulating, and busy.  Here is a shot of the old site:

The Phillips Collection’s former homepage

For me, this site is ideal.  You can view the entire homepage without scrolling down, the navigation is simple and self-explanatory, there is easily accessible key information about the museum, and there is even a representative photo of what you will experience in the galleries.  The homepage’s minimalistic design, with a subtle and limited color palette, speaks to the museum’s mission of displaying modern art masterpieces, many of which emphasize color, line, and geometry.  To me, this site is “distinctive, natural, brand-appropriate, subtly memorable, and quietly but unmistakably engaging,” in the words of Jeffrey Zeldman.  In other words, it exudes “Phillips Collection.”

Unfortunately, while the new site offers more content, I do not believe it lives up to Zeldman’s tenets.  Unlike the old site, I feel that this new design is forgettable and does not uniquely identify the Phillips.  It has a large, corporate feel which does not evoke the Phillips’s clean, intimate museum space or the qualities of its art collection.  Moreover, the homepage requires a lot of scrolling before you can access the information, and has a dizzying juxtaposition of rotating images in the background and top banner:

The Phillips Collection’s new homepage

As you can see, the banner takes up so much space that there is barely any informational content above the “fold.”  Once you scroll down, however, there are so many blocks of text and images that there is nowhere to rest your eye; it is difficult to concentrate and find the information you want.

Lower half of the new Phillips Collection homepage

This layout seems more applicable to a newspaper homepage, and is also probably difficult to use for the visually impaired using a screen reader, or for those with a slow internet connection.  This new website might be more flashy and up with the times, but I think good design and brand-awareness were sacrificed.  The designers of this new site seem to have used web capabilities just because they could–not because they enhanced the quality of the webpage.  The Phillips Collection is a wonderful institution, and it deserves to have a website that fits its intimate, artistic, and non-commercial personality.  While the information architecture of these two sites is not significantly different (the new site has more pages cascading off of sub-pages rather than the homepage, and has more pages in general), the design choices make all the difference.

Clio Wired: Week 3 Reflection

This week’s readings introduce key concepts in digital humanities, attempting both to define digital history (or digital humanities, or new media) and to tease out its limitations, possibilities, and successes.  Susan Hockey presents a somewhat straightforward history of the digital humanities, from its inception in the 1950s with Father Busa’s concordance of Aquinas, to the technological limitations which existed through the 1990s, to the medium’s proliferation beyond concordances, dictionaries, etc.  Hockey pinpoints the personal computer and the Internet as key drivers of the digital humanities’ popularity and capabilities, both in the realm of scholarly communication (collaboration over listservs, email, blogs) and in presenting cultural heritage to academics and the public.

Dan Cohen and Roy Rosenzweig offer a brief history of the digital humanities as well, but delve more deeply into the problems and possibilities of the genre.  They point out an important distinction in types of history web resources:  archival websites, which seek to make troves of primary sources accessible and searchable, and websites that act as secondary sources—those which offer interpretations of primary source material.  For Cohen and Rosenzweig, the capabilities offered by the web and hypertext have not always been fully realized; many of these secondary source websites are simply recapitulations of something already written in print.  In other words, they maintain the linear narrative form and do not encourage free-form browsing or true interactivity.  Cohen and Rosenzweig urge those undertaking a digital history project to become familiar with digital history’s best practices and to seriously consider the project’s audience and goals.

William Cronon’s focus is not primarily on digital history, yet his recommendations for improving history PhD programs include the exhortation that new media products be “branded” in some way to certify scholarly rigor and excellence.  He emphasizes that the history PhD should not primarily be a gate-keeping tool, but rather a means through which future historians learn history’s best practices, immerse themselves in a wide range of study, and gain excellence in reading, writing, and teaching.  The kind of intellectual collaboration and community-building that Cronon recommends, as well as his emphasis on learning to teach many audiences (such as the public or undergraduates), has implications for the field of new media which are picked up in other readings:  for example, the importance of creating digital history projects which have a targeted audience in mind, and allowing digital and/or collaborative works to count toward tenure awards.  That new media is slowly but surely being adopted by historians is made clear by Robert Townsend’s survey of 4,000 historians; it is essential, then, that formal channels for judging new media works be put in place.

The discussion moderated by the Journal of American History brings up these issues and many more.  Although the speakers discuss a breadth of topics and ideas, two in particular stood out for me:  first, the distinction between interactivity as entertainment and interactivity as intellectual exchange or engagement.  The ability of a new media project to be a legitimate form of scholarship rests on this distinction.  Moreover, this will be even more crucial as the ability to include moving graphics, sound, and multimedia increases.  Related to this quality is the importance of breaking out of linear, didactic narratives in favor of provocative, problematizing, and free-flowing interactivity.  The other key point for me was the speakers’ emphasis on open-source and open-access projects.  Not only does open source allow a greater segment of historians to employ new media, it also increases the capabilities of digital scholarship.  Although many scholars have traditionally feared making their work “free,” the panelists in this discussion emphasize that works that are not hidden behind gates or walls actually increase a scholar’s impact and recognition.  Bound up in this discussion of open access is the need for new avenues for peer review and scholarly recognition of new media projects.

Tim Sherratt’s blog about his project “Invisible Australians” provides a great example of the power of new media and emphasizes its ability to be a democratizing force.  In his project, Sherratt manipulated the digitized records of the National Archives of Australia; taking already-digitized documents, he arranged them for his own purposes and was able to shed the constraints, biases, and organizational decisions of the archive.  Sherratt shows the power of creating new, parallel interfaces or finding aids to construct a new narrative of the past.

Two other new media projects I examined are the Medici Archive Project http://documents.medici.org/medici_index.cfm and Medieval Illuminated Manuscripts (National Library of the Netherlands) http://www.kb.nl/manuscripts/.  My judgment of these archival sites was based on the following criteria: ease and effectiveness of search function; overall interface; metadata; and unique or helpful features.

The Medici Archive Project is a work in progress with the goal of making available the entire contents of the Medici Granducal Archive in Florence.  This project is clearly useful for anyone studying the Medici family, Florence, art, science, etc. during the 15th through 18th centuries.  The search interface allows researchers to search for places, named people, and specific documents or volumes.  Within each category, one can use a keyword search, or select various drop-downs, such as “gender” or “occupation” in the people search.  In the places category, one can specify the “link type,” which is a very clever feature:  for example, if a user enters “London,” she can specify whether this location is linked with a sender location, recipient location, death place, or birth place.  The search capability is bilingual, in Italian or English, and is generally intuitive; for questions about appropriate search terms, there are good explanations linked to question marks beside each field.  The interface is also quite intuitive and helpful, letting the user sort search results, mark results, or refine searches.  The metadata provided for each entry is very thorough, and includes the date of the document, the correspondents and their locations, people referenced in the document, a synopsis in English with an excerpt of the original Tuscan, and topics or keywords.  The site is also generally attractive, with nice selections of images from the archive.  There is also a form to submit feedback.

Record metadata from Medici Archive Project

Medieval Illuminated Manuscripts is a database or catalog of 11,000 illuminations from illuminated manuscripts in the National Library of the Netherlands.  These images are divorced from the texts that accompany them, so they are mainly useful for visual or iconographical analysis.  For a more general user, the browse function would be most useful, as it allows one to drill down by topic to useful and interesting categories.  The advanced search feature is also useful in that indexes of subject terms are provided; however, it might take a more serious scholar to dedicate the energy to perusing these indexes.  There is also a unique categorization feature called Iconclass, which assigns alphanumeric codes to certain topics for greater searching accuracy; again, this would require greater than average dedication to searching the site.  One major downside to the search function is the lack of support for Boolean operators.  The overall interface can be a bit clunky in that some links direct the user away from the database itself onto the general library website, so navigation can be confusing.  However, there is a very nice feature for enlarging selected images, with the option to zoom in and save details.  Metadata for each image is limited to its manuscript of origin and a short description; however, metadata for entire manuscripts is much more detailed, and even contains a bibliography of literature about the manuscript.  Both this site and the Medici Archive site are very rich, useful archives of primary sources which could be used to develop scholarly, educational, and interactive interpretive sites.
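
Since I mention Boolean operators: here is a minimal sketch of what AND/OR searching over subject metadata amounts to (the records are invented for illustration):

    # Minimal Boolean subject search: AND narrows to records matching
    # every term; OR widens to records matching any term.
    records = {
        1: {"subjects": {"annunciation", "angel", "mary"}},
        2: {"subjects": {"angel", "shepherds"}},
        3: {"subjects": {"mary", "nativity"}},
    }

    def search(terms, mode="AND"):
        wanted = set(terms)
        if mode == "AND":
            return [i for i, r in records.items() if wanted <= r["subjects"]]
        return [i for i, r in records.items() if wanted & r["subjects"]]

    print(search(["angel", "mary"], mode="AND"))  # -> [1]
    print(search(["angel", "mary"], mode="OR"))   # -> [1, 2, 3]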
