Digital Humanities
@ Pratt

Inquiries into culture, meaning, and human value meet emerging technologies and cutting-edge skills at Pratt Institute's School of Information

TimelineJS

TimelineJS is a tool that helps create a timeline with various media content. It can also be described as an aggregator of online materials that pertain to a certain theme with a chronological time set.

TimelineJS can be useful for a number of purposes, especially for a digital humanist. Timeline allows one to create according to their project needs. It also proves a s useful tool for teaching and learning because of the level of interactive material that can be added and the visual aspect of it. Instead of just having a traditional timeline it allows users to get more information and explore that information to a great extent.

The video below shows one how to get started in order to create their own timeline (using a google spreadsheet) and the various media content that can be added.

I have included a screenshot that may help to understand exactly how the cells work:

Screen Shot 2013-04-29 at 4.28.10 PM
Retrieved from: http://www.seomoz.org/ugc/using-timelines-to-create-great-content-and-build-links

Video:

Creating a Verite Timeline with a Google Spreadsheet from DHSkillshare on Vimeo.

Google Refine

Google Refine is a desktop application that allows you to tidy up messy data. Google Refine allows you to set up filters and allows you to view more rows than an ordinary spreadsheet would. Google Refine can help the Digital Humanities when going through large, unorganized data sets.

Basic Gephi Tutorial

Gephi Tutorial from DHSkillshare on Vimeo.

This is a basic Gephi tutorial (apologies for the coughing not being totally edited out yet, having trouble getting the export settings right in my editor for a screen cap video). Here are some notes about data prep for Gephi:

Gephi Tutorial from DHSkillshare on Vimeo.

There are two tables: edge and node. Nodes are each of the points on the network map and the edges are the links between them. You can import one or the other first depending on what data you have, and while you can use the data lab in the same way as a spreadsheet manipulation program I would recommend getting your data in order before importing it to Gephi, because the program can crash often and takes lots of processing power to run so it can be slow.

The nodes table should have all the data about your points that you might want to use in Gephi. Most important is a value called, ID. ID is what the node is to Gephi, i.e. identifies, the nodes as distinct. You can set one of the columns in your spreadsheet as ID when importing the data, but it’s easier to pick a column ahead of time and let Gephi discover it on its own. ID is also important because it must correspond to the values in the edge table in order to create the correct edges. ID can be a number; it can be a name; it can be a place; it can be anything so long the values are repeated across the tables.

The edges table has two columns: source and target. Like an arrow, source is where the edge is coming from and target is where it goes. Both of these columns must be named and the values must correspond to the ID values. If you import the edge table first it will create a nodes table using the values in the two columns and you can click the “create missing nodes” box to tell it do so.

 

Basic OCR (optical character recognition)

The first step in many digital humanities projects is to digitize whatever corpus is being used. In order to use digitized text without having to manually transcribe it or laboriously cut and paste it, Optical Character Recognition (OCR) is often used. Using two scanned pages from Virginia Woolf’s novel The Waves, the following screencast offers some basic instruction on how to OCR scanned pages. A few choices of OCR software are also briefly evaluated.

A free, community version of a popular OCR software is tested as well as a free, web-based version and the quality of these is examined. Finally a trial version of a moderately expensive software is tested and evaluated. By the end of the screencast, viewers should have information that will give them a head start when it comes to undertaking a small scanning/OCR project.

LIS-657 Digital Humanities – DHskillshare post – OCR demo from Lauren Spiro on Vimeo.

How to Write an HTML Page

HTML stands for HyperText Markup Language. It is a computer language used to write computer programs. HTML can be useful for digital humanists for connecting parts of websites that have been otherwise digitized. One starts the html document by writing <html> And </html> Then within these one types <head> </head> For the head section. For the body one types <body>…

Using XML to Code Charles Darwin

The term “greater than the sum of its parts” is a bit of a cliché, but there’s no better way to describe XML. When I was planning this Skillshare video, my in-house IT pointed out that “you can teach someone to use XML in five minutes.” And while that’s somewhat true, the concept behind using it as a markup language, and the reasons for doing so, are hard to separate out from the instruction.

So the video below explains a few things: What XML is, how it works, and why we use it at the Darwin Manuscripts Project, an ongoing project based at the American Museum of Natural History, where we transcribe Charles Darwin’s handwritten notes into plain text.

(The video plays best in high definition—click the little gear box at the bottom of the video screen and choose “720 HD”—and I also recommend viewing it as a full screen—click the bracket button at the bottom right-hand corner of the video screen.)

Introduction to Datacite

This screencast provides an introduction to Datacite, a not-for-profit organization formed in London in 2009, whose goals include establishing easier access to research data on the internet, increasing acceptance of research data as legitimate, citable contributions to the scholarly record and supporting data archiving that will permit results to be verified and re-purposed for future study.

This screencast takes viewers on a tour of the Datacite website and explores the purpose of Datacite as laid out by its founding members. It describes the advantages to researchers of citing data, such as enabling easy reuse and verification of data, which promotes scholarship and unfettered access to information. It also discusses things like how to cite data, which is similar to how one would cite a book or article. Further areas of discussion include participating member institutions, how to become a member of Datacite, and other opportunities for involvement such as conference and networking opportunities.

In addition to these areas, the various services offered by Datacite are outlined such as the Datacite Metadata Store and DOI Citation Formatter. Datacite also provides a variety of open access resources such as the DataCite Metadata Schema, which are explored in more depth herein.

It is my hope that this screencast will increase awareness about Datacite and encourage researchers, scholars and students to start or continue citing and sharing their data.
Caitlin Bronner