rehashing archaeology

Shawn Graham

Rehashing Archaeology
Citations
Pivots
Ok, so how do you do this?
- Examples of good teaching?

slides: https://shawngraham.github.io/presentations/harvard.html (resize your browser window a bit smaller, make it square, for best viewing).

video: https://www.youtube.com/watch?v=PGHFt-d9SCo

notes:

Rehashing Archaeology

—

To rehash:

v. To bring forth again in another form without significant alteration: rehashing old ideas. v. To discuss again.

—

Bit of a negative connotation there.

—

I asked on twitter, as you do, what people thought I meant by ‘rehashing archaeology’, as I hadn’t figured it out yet. Titles first, thinking later, eh?

—

Stu: saying the same thing that has been said before but in a slightly different way to avoid plagiarism?

Ted: Fossilized hash browns. Or possibly HashMaps.

Lorna: rearranging what it stands for/means, away from current understanding?

—

I like these.

I want to discuss digital archaeology in all of these terms. Digital archaeology is nothing unless it gets rehashed. The problem is, how do you teach this?

—

Citations

Let’s start with Stu’s thoughts.

Confession – I google myself to see where my work is being cited. Numbers look good; this is a kind of reuse of my archaeological data, right? My studies are being cited, ergo the work I did is useful to someone else, ergo, it is being reused. But when you go deeper into this ‘reuse’, you find that it’s a surface-y reuse. Bit disappointing, really, to think your career is filler for someone else’s padding.

—

That your entire life’s work is just an academic ‘required figure’, as Paul O’Donnel once put it.

—

I have a topic model of 20 000 archae journal articles, from the 1930s to the 2010s, and no topic within it uses the words ‘reuse’ ‘reproducible’, ‘re-analyze’ in any meaningful way. ‘Data’ is prominent in two topics, but always new data. While we might ‘revisit’ an issue from time to time, no one is analyzing someone else’s data in any meaningful way in the broader noise of academia. Play with the topic models yourself at http://graeworks.net/digitalarchae/

—

Nobody is rehashing archaeology.

—

A sobering thought. Part of the problem is packaging. We don’t play nice with each other.

In our heart of hearts, we don’t want other people looking at our data. You see this in embryo when you try to force students to work together on a project. Re-using data is a group project where your group members live in the future, and you just know they’re going to be angry with you.

—

And will the data they play with be the data you left behind? How will they know? Be on guard for deliberate alterations.

[guard image] —

This leads me to make a tie in to Ted’s comment about hashmaps. # hash

A hash function takes any input and maps it to a particular value. It’s deterministic. If the input changes – say a text file gets edited and something gets deleted – the hash at the other end is changed.

—

This is useful in security, to test that a message hasn’t been changed en-route; it’s useful for searching a database only for things that have changed; there are many useful things one can do with it. I direct you to wikipedia for more… In the current attacks on rationality, knowing that our data has not been corrupted is going to become ever more important. We need to become familiar with hash functions in order to guard against malicious use of archaeology for political ends.

—

There are other uses for hash functions in archaeology, of course.

—

In my own work w/ Damien Huffer, we’ve been using Lincoln Mullen’s R package for detecting text-reuse via hash functions to trace patterns in the trade in human remains online (turns out, it’s a very useful signature for individual buyers and sellers’ writing patterns, even when there’s no id otherwise associated with the post. Be sure your sins will find you out.)

—

but again, how do you teach archaeology students these things? it doesn’t ‘look’ like archaeology

Which leads me to Lorna Richardson’s comment on digital archaeology perhaps being a pivot away from how we’ve done things in the past to how we might do things in the future.

—

Pivots

Lorna’s research is extremely important for digital archaeology. One of the things she focuses on is what might be called the consumption side of archaeology within the web – of what is lately known as ‘fake news’ but formerly, ‘propaganda’.

We want people to engage with archaeology, right? Well, take a look at what’s happening on neo-nazi websites (I will not use the term alt-right.)

If people reuse archaeology, how do we teach them to do it correctly? How do we counter this? Are our current approaches to teaching archaeology effective for the wider impact of archaeological knowledge in society?

—

I understand digital archaeology as necessarily a public archaeology.

—

Ok, so how do you do this?

Let’s return to the first problem, of surfacey reuse by academics.

—

Eric and I hatched a scheme last summer to promote re-use of materials in OpenContext.

[img opencontext prize poster] Incentives!, we thought, the good neoliberal dh-tools that we are. We offered real money – up to a $1000 in prizes. We promoted the hang out of it. We made films, we wrote tutorials, we contacted professors across the anglosphere.

—

We had very little uptake.

[img tumbleweeds]

— ## Why?

—

Money is not enough. But I wonder if part of the problem is that we’re dealing with a sunk-cost effect. So much of our computational archaeological infrastructure is proprietary software. Software, and databases, for which we’ve paid licensing fees. We’ve paid so much money, we damn well make sure that there is someone in our department, in our company, who can use Arcgis. Because this department, this company, uses ArcGis, it makes sense to teach that. But all this open source stuff? All these new formats? Who uses it? It’s a classic chicken-and-egg problem.

Two other problems: there is no culture of undergraduate teaching with actual datasets ready to take advantage of the opportunity. Two, working with digital data still requires a level of digital literacy that we haven’t yet reached.

—

Examples of good teaching?

Take a look at what the digital historians are doing: in particular, people like Lincoln Mullen.

—

Lincoln’s work builds capacity in others. https://github.com/ropensci/historydata

— Lincoln’s work is shared, and it’s explicitly about teaching how to use the materials he’s pulling together in his teaching

http://lincolnmullen.com/projects/worksheets/

— Lincoln addresses how to get your materials online in a format that others can then use

https://lmullen.github.io/rmd-notebook/

— Ben Marwick, who is with you today, is doing this as well, and so as not to pre-empt him or otherwise get in his way, I’ll just tell you all to listen to him very closely!

— ## My own teaching fails

I hide in a history department these days, so I try to teach these things to my undergrads. First lesson I’ve learned is to never underestimate the social context of whom you’re trying to teach.

Anxiety – my digital history students, if I can coax them through the door, are not in any way shape or form ‘naturally conversant’ with any of this. “If I wanted to do computers, I wouldn’t have taken history” said a student in one of my (officially non-dh) courses. “What if it breaks? What if it doesn’t work? How do I get an ‘a’?”

Fear of Failure – we have not taught students how to fail productively. we have created systems where the risks of trying something different are usually catastrophic. Digital work requires the ‘screwmeneutical imperative’, as Ramsay famously put it.

— Tech tutorials are awful and I’m guilty of this as anyone: see http://workbook.craftingdigitalhistory.ca

— Tech changes too rapidly, and the kinds of machines that students are sold are not necessarily the kind that can be usefully employed in this work. I’ve had students turn up to my dataviz DH course with nothing more than an iPad mini. I also end up spending two or three weeks solid getting everyone’s machine properly configured.

My mom thinks I mostly do tech support. She’s mostly right.

— # Ok, so what do we do?

We turn our teaching inside out. We do it in public. I’m not talking about MOOCs, though I suppose they have a role here, as educational tourism.

No, I mean, we literally put all of our teaching out there and invite the public to take part alongside our formal students

—

we share with our students (and public) what has worked for us, and what hasn’t. we publish studies where the hypothesis didn’t work out. we replicate (with our students, and our public) someone else’s study.

— this will be dangerous. you’re a white male? put that to use. get out there and take risks, and make it safe for others to do so too.

— And it will require special tools. I’m building one right now with Neha Gupta, Michael Carter, and Beth Compton.

— # ODATE

The Open Digital Archaeology Textbook Environment

[img: DHBox + ProgrammingHistorian + Slack]

funded by ecampusontario: thanks!

— By building this, I’m trying to shift the infrastructure cost of learning to do digital archaeology from the student, from the individual, to myself. Build once, deploy everywhere as they used to say. To do digital archaeology should require only a browser. mitigate sunk costs

Taking our cue from OpenContext, we will include recipes for different kinds of tasks alongside the more formal learning activities.

Want to use ODATE for purely research purposes? No problem. Just use it.

Everything will be hashed properly so that you’ll know that, should you install it locally for yourself, you’ve got the true copy.

— What I need now are case studies. Data sets to work with. Reproducible research. Help?

— Digital archae is an opportunity to rethink how we do things, to put a positive hash on rehashing the past.

— This is what rehashing digital archaeology really means. we have to enable people to reuse our data for themselves, so at least the possibility will exist that they can see the truth of what is argued.

Rehashing is going to take a lot of work. But the results will be delicious.

—

Thank you.