Welcome to webdubois.org


Digital Humanities Projects
 — by Robert W. Williams

This page is updated for 1 November 2023. I added Section 9, which contains a work flow using regular expression search protocols to decipher illegible letters in the words of a digitized Du Bois manuscript.

Page Contents

• Retextualizer project: Sections  1   •   2   •   3   •   4   •   5 

• Digital Humanities: Online Texts & Resources: Section 6

• Websites: Blogs, Centers, & DH Projects: Section 7

• Regular Expressions (Regexes): Tutorials & Testing: Section 8

• Regex Usage: Interpreting Illegible Letters: Section 9



 The General Purpose of the Retextualizer Project


 Retextualizer is a browser-based application for digital humanities ​(DH) research that is designed to facili­tate new inter­pre­ta­tions of a text, spe­cif­i­cally by disassem­bling texts into mean­ing­ful com­po­nents (here sentences), and then reassem­bling the com­ponents into dif­ferent con­figura­tions, whether in reverse order or in random arrange­ments.


 Retextualizer rearranges the original essay by juxta­posing sentences​​perhaps jar­ringly​​that were not initially so posi­tioned; it thereby can pro­vide the con­di­tions for new insights into the text, its ideas, and its themes.


 Each Retextualizer web page repeats the project's general pur­pose, as well as the instruc­tions, which also can be read below. In addition, each project page will contain further information relevant to that specific essay.



 Retextualizing the Works of W.E.B. Du Bois


 "Souls of Black Folk" [SBFI](The Independent, 1904)


 "The Individual and Social Conscience" [IASC](1905)


 "Address to the Country" [ATTC](1906)


 "The Nature of Intellectual Freedom" [IFRE](1949)


 "Apologia", Suppression of the African Slave-Trade [SSTA](1954)

 "Postscript", The Ordeal of Mansart [PSOM](1957)



 The Project Goals of Retextualizer


Digital Humanities Research:


 Texts can be read in sequence as created and/or pre­sented, pub­licly or other­wise. With computers, we can digitally interact with such works. In their written forms as types­cripts or manu­scripts texts can be digitized and then can be (re-)ana­lyzed and (re-)inter­preted via computer software. Section 6 below lists online resources that cover various dimensions of the digital humanities. Many more resources can be located via Internet searches.


 The digital manipulation of texts includes deformance, as Jerome McGann and Lisa Samuels called it in "Deformance and Interpretation" ​[New Literary History, 30:1 (1999): 25-56; Accessible online]. Literary works have unstable meanings, McGann and Samuels argued, and dis­cussed several methods to use on literary texts, typically poems, including a reversal of the poem's lines.


 Such techniques of deformation have received support, such as:


 Cohen, Matt. 2006. "Trangenic Deformation: Literary Translation and the Digital Archive." Walt Whitman Archive [Website]. Online.


 Sample, Mark. 2012. "Notes towards a Deformed Humanities." Samplereality [Blog], (Posted May 2). Online.


 Criticisms of literary deformance have been put forward:


 Hoover, David L. 2005. "Hot-Air Textuality: Literature after Jerome McGann." TEXT Technology, 14:2. Online (PDF).


 Hoover, David L. 2007. "The End of the Irrelevant Text: Electronic Texts, Linguistics, and Literary Theory." Digital Humanities Quarterly, 1:2. Online.


Robert W. Williams's Research:


 The current Retex­tu­al­izer applica­tion builds on a previous version which had no copying, viewing, or sentence-numbering features. I initially coded the basic ran­domizing and display functions in May 1999 as a way to create and present ran­dom­ized versions of essays by Immanuel Kant and Walter Benjamin.


 Since that first version, digital humanities research has come to influ­ence my schol­ar­ship, most notably by means of computer applica­tions, such as conc­ordancers and col­lation soft­ware. Those digital tools help me to under­stand how Du Bois paired words and phrases within their con­texts, and also to illus­trate how he re-used and mod­i­fied text in dif­ferent works over time. The Retex­tu­al­izer project con­tinues this avenue of my research.

 I have created several hypertext presentations on issues related to DH. ​[This subsection was posted for the 1 February 2019 update.]


"The Intertextuality of Du Bois's Idea of Humanity: A Collation Analysis": The African American Studies and Research Center at Purdue University hosted the 30th Symposium on African American Culture and Philosophy on 1-3 December 2016. The symposium theme was "Exploring the 'Humanity' in the Digital Humanities".


"Algorithmic Displacement and the Black Atlantic: Retextualizing the 'Souls' Essay by W.E.B. Du Bois": I presented this at the 2018 African American Digital Humanities Conference, held at the University of Maryland, College Park, on 20 October 2018. I covered the use of the Retextualizer application on the SBFI essay.





 Digital Humanities: Online Texts and Related Resources


Bailey, Moya Z. 2011. "All the Digital Humanists Are White, All the Nerds Are Men, but Some of Us Are Brave." Journal of Digital Humanities, 1:1 (Winter). Online at JDH.
Drucker, Johanna. 2012. "Humanities Theory and Digital Scholarship." In Debates in the Digital Humanities, 2012 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.​cuny.edu/​debates/​text/34
Drucker, Johanna. 2012. "Representation and the Digital Environment: Essential Challenges for Humanists." Posted at the University of Minnesota Press blog, 16 May 2012. URL: http://uminnpressblog.com/​2012/​05/​representation-and-digital-environment.html
Gallon, Kim. 2016. "Making a Case for the Black Digital Humanities." In Debates in the Digital Humanities, 2016 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.​cuny.edu/​debates/​text/55
Gold, Matthew K. & Lauren F. Klein (Eds.). 2016. Debates in the Digital Humanities, 2016 Ed. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.​cuny.edu/​debates?id=2.
Kim, Dorothy & Jesse Stommel (Eds.). N.D. Disrupting the Digital Humanities. URL: www.disruptingdh.com/​position-papers/
Liu, Allan. 2013. "The Meaning of the Digital Humanities." PMLA: Proceedings of the Modern Language Association, 128:2; pp.409-423. URL: http://escholarship.org/​uc/​item/​5gc857tw.
Nowviskie, Bethany. 2016. "On the Origin of 'Hack' and 'Yack.'" In Debates in the Digital Humanities, 2016 Ed., edited by Matthew K. Gold and Lauren F. Klein. Minneapolis: University of Minnesota Press. URL: http://dhdebates.gc.​cuny.edu/​debates/​text/58
Price, Kenneth M. & Ray Siemens. 2013-2015. Literary Studies in the Digital Age: An Evolving Anthology. URL: https://dlsanthology.mla.​hcommons.org/.
Risam, Roopika. 2015. "Beyond the Margins: Intersectionality and the Digital Humanities." Digital Humanities Quarterly, 9:2. Online at DHQ.
Risam, Roopika. 2015. "On Disruption, Race, and the Digital Humanities." Disrupting the Digital Humanities, Digital Edition. URL: www.disruptingdh.com/​on-disruption-race-and-the-digital-humanities.


[Section 7 was created for the 1 October 2020 update.]

Websites: Blogs, Centers, and DH Projects

African American History, Culture & Digital Humanities (UMD)
URL: https://aadhum.umd.edu

African Diaspora PhD
URL: https://africandiasporaphd.com

Alliance of Digital Humanities Organizations (ADHO)
URL: https://adho.org

Anna Julia Cooper Center
URL: https://ajccenter.com

Black Book Interactive Project
URL: http://bbip.ku.edu

Black Past: Online Reference Guide to African American History
URL: http://blackpast.org

Black Press Research Collective
URL: http://blackpressresearchcollective.org

Black Quotidian: Everyday History in African-American Newspapers
URL: http://blackquotidian.com

Carolina Digital Humanities (UNC-Chapel Hill)
URL: https://cdh.unc.edu/

Center for Digital Research in the Humanities (UNL)
URL: https://cdrh.unl.edu

Center for South Asian and Indian Ocean Studies
URL: https://as.tufts.edu/​csaios/​digitalHumanities

Credo: Special Collections & University Archives, W.E.B. Du Bois Library, Uni­ver­sity of Massachusetts Amherst (Papers of Horace Mann Bond, W.E.B. Du Bois, and others)
URL: http://credo.library.umass.edu/

Colored Conventions Project: Bringing 19th Century Organizing to Digital Life
URL: www.coloredconventions.org

DHCommons.org: A Collaboration Hub
URL: http://www.dhcommons.org

Diaspora Hypertext: Black Femme History and Futures
URL: http://dh.jmjafrx.com/​tag/​howard-ramsby-ii/

Digital Colored American Magazine
URL: http://coloredamerican.org/

Digital Harlem: Everyday Life 1915-1930
URL: http://digitalharlem.org

Digital Humanities Association of Southern Africa
URL: http://digitalhumanities.org.za

Digital Humanities Initiative
URL: http://www.dhinitiative.org

Digital Schomburg African American Women Writers of the 19th Century
URL: http://digital.nypl.org/schomburg/writers_aa19/

East Asian Digital Humanities Lab
URL: https://guides.library.​harvard.edu/​EADH

Frederick Douglass in Britain and Ireland
URL: http://frederickdouglassinbritain.com

HASTAC: Humanities, Arts, Science, & Technology Alliance & Collaboratory
URL: https://www.hastac.org

History of Women Philosophers
URL: https://historyofwomenphilosophers.org

Malcolm X: A Research Site
URL: http://www.brothermalcolm.net

O Say Can You See: Early Washington, D.C., Law & Family
URL: http://earlywashingtondc.org

Recovering the U.S. Hispanic Literary Heritage Blog
URL: https://recoveryprojectappblog.wordpress.com

The Ward: Race and Class in Du Bois' Seventh Ward
URL: http://www.dubois-theward.org

WWP: Women Writers Project
URL: http://www.wwp.northeastern.edu


[Section 8 was created for the 1 March 2022 update.]

Regular Expressions (Regexes)


[Section 9 was created for the 1 November 2023 update.]

Regex Usage: Interpreting Illegible Letters

I created a multi-part tweet in January 2023 on one way to use regular expressions to determine possible characters that otherwise were indecipherable in words arising from printed or handwritten documents. I present that tweet below in its original parts, but with a few additions and reconfigurations to enhance clarity.

#Regex Technique: Unknown Words
The unpublished works of #WEBDuBois sometimes contain handwritten words that I can't read.
To find potential words that fit the sentence's meaning I use regular expression searches of a word list loaded into a text editor or concordancer.
1/11

2/11
Case Study: Handwritten Words
I screen-captured one line within what seems to be a never-finished manuscript in which Du Bois discussed his underlying philosophy.

File: Unpublished Du Bois document archived in the Credo repository (UMASS Library's Special Collections).
Title: "Steps Toward a Science of How Men Act"
Typescript: 4 pages + 3 pages of handwritten notes by Du Bois
ID: mums312-b213-i071

3/11
fragment
What are the 3rd & 4th handwritten words in the image?
Admittedly, those words in the image might be understandable within the context of the sentence fragment.
For purposes of illustrating the #regex technique, I will try to decipher those words in this tweet thread.

4/11
#Regex coded by these indicators:
=Discernible letters (based on similarities w/ letters in known words).
=Number of letters in the word (create range of letters if boundaries are indistinct).
=Contractions need to be expanded for the word list, not for the dictionary.

5/11
fragment
Third word:
=Starting "e"
="g" or other letters with descenders?
=How many letters: 7/8?

Regex
   \be[a-z]{1,3}[gjpqy][a-z]{4,5}\b

Regex briefly annotated (metacharacters manage the search):
\b  =word boundary
e  =literal letter "e"
[a-z]{1,3}  =range of 1-3 letters: "a" through "z"
[gjpqy]  =match any 1 letter
[a-z]{4,5}  =range of 4-5 letters: "a" through "z"
\b  =word boundary

Regex results of 3rd word: 1000+ matches.

6/11
Disambiguation of the irrelevant matches is needed because regex results of 3rd word exceeded 1000 matches.

The more letters we know, or can hypothesize about, the fewer the matches.

fragment
Hypothesize "v" and "h":
   \bev[a-z]{4,5}h[a-z]{1,2}\b

Regex briefly annotated:
\b  =word boundary
ev  =literal letters "e" and "v"
[a-z]{4,5}  =range of 4-5 letters: "a" through "z"
h  =literal letter "h"
[a-z]{1,2}  =range of 1-2 letters: "a" through "z"
\b  =word boundary

Results: 11 matches, including "everythin"
Possibly no final "g" in the original.

7/11
fragment
Unknown fourth word:
=Starting "f"
=Final letters: "ch"?
=How many letters: 5/6?

Regex
   \bfe[a-z]{1,2}ch\b

Regex briefly annotated:
\b  =word boundary
fe  =literal letters "f" and "e"
[a-z]{1,2}  =range of 1-2 letters: "a" through "z"
ch  =literal letters "c" and "h"
\b  =word boundary

Results: 6 matches, including "fetich"

Plausible interpretation of the fragment in context:
"Force - in everything - Fetich"

8/11
I often need to create different regexes: I can
=Change the range of possible letters to match.
=Change the initial or other specified letters to find more possible words.
I then repeat the searching & disambiguation phases.
This #regex technique may not find plausible candidates.

9/11
#Regex technique assumptions:
=Standardized spelling
=Era-appropriate word list or dictionary
=Case insensitive [configurable]
=Patterns: same letter is written in a recognizable style
=Discernible letter boundaries that permit a counting of the letters (or a number range).

10/11
Useful Resources: Words
A. Word list: Alphabetical [No numbers or symbols]
https://github.com/dwyl/english-words/blob/master/words_alpha.txt
[Lowercase]

B. Project Gutenberg: Webster's Unabridged Dictionary
https://www.gutenberg.org/ebooks/29765
[Upper- & lowercase]

11/11
Useful Resources: Regular Expressions
A. #Regex tutorials & guides
* https://regular-expressions.info  (Jan Goyvaerts)
* https://www.rexegg.com
* https://ryanstutorials.net/regular-expressions-tutorial/
* https://riptutorial.com/regex
* http://regextutorials.com

B. Regex testing
* https://regex101.com
* https://regexr.com

END of thread