Weeks 9: Humanities Programming and Distant Reading – March 12

Readings:

Jean-Baptiste Michel et al., “Quantitative Analysis of Culture Using Millions of Digitized Books,” Science 331, no. 6014 (January 14, 2011): 176–82.
Ted Underwood, “Seven ways humanists are using computers to understand text,” The Stone and the Shell: Using Large digital libraries to advance literary history, https://tedunderwood.com/2015/06/04/seven-ways-humanists-are-using-computers-to-understand-text/
Mining the Dispatch: http://dsl.richmond.edu/dispatch
Google N-Gram: http://books.google.com/ngrams
New York Times Chronicle http://chronicle.nytlabs.com/

Lab:

The Programming Historian, available online, http://programminghistorian.org. Do the Introduction to Python lessons.

  1. Python Introduction and Installation
    1. We will use the Python IDLE (Python GUI) instead of the text editor suggested by the lessons, which is already installed on the lab computers. So we will skip the installation lesson.
    2. Search and open Python IDLE
    3. It should open the Python 2.7 Shell
    4. Try this: >>>print ‘hello world’
      1. Hit enter
    5. Click “File” and “Open New File”
    6. You are ready for the next lesson.
  2. Understanding Web Pages and HTML
  3. Working with Text Files in Python
  4. Code Reuse and Modularity in Python
  5. Downloading Web Pages with Python
  6. Manipulating Strings in Python
  7. From HTML to List of Words (part 1)
  8. From HTML to List of Words (part 2)
  9. Normalizing Textual Data with Python
  10. Counting Word Frequencies with Python
  11. Creating and Viewing HTML Files with Python
  12. Output Data as an HTML File with Python
  13. Keywords in Context (Using n-grams) with Python
  14. Output Keywords in Context in an HTML File with Python

Leave a Reply

Your email address will not be published. Required fields are marked *