Setting the record straight about Wikipedia

Sunday, August 17th by Ben Yates

Three things that you really should know

Okay. So Wikipedia has a problem: most of the public still does not understand how it works. Even half the people who think they understand Wikipedia do not.

This is partly because the way Wikipedia works is just incredibly weird.

And it’s partly because once people start learning more about wikipedia, they’re inculturated into the wikipedian social sphere, and they start hanging out more with people who already understand wikipedia, and less with people who don’t.

(This is the story of the entire internet, by the way.)

Alright, class.

1. Wikipedia is not an encyclopedia

The first thing people think when they see Wikipedia is OMFG, these guys have everything!

People don’t discover Wikipedia by typing “wikipedia.org”. They discover it by typing “south america” or “techno” or “pope ratzinger” into Google, and then by staring at the screen for an hour discovering that every tiny facet of their favorite thing is covered in exhaustive detail.

They don’t read the “about” pages. They don’t even know those pages exist. They read the articles. And they see the logo in the upper-left corner, which lets them know they’re reading an encyclopedia.

But Wikipedia is actually not an encyclopedia — not in any normal sense of the word. It’s a million miles away from any other encyclopedia; the name obscures more than it reveals.

300px-EncycBrit1913.jpg

  • Every other encyclopedia is written by an identifiable group of people — usually credentialed experts who were recruited and paid by the encyclopedia company.
  • Every other encyclopedia is difficult and expensive to produce. Each edition of Britannica takes decades to complete. When Microsoft created Encarta, they didn’t write it themselves because they knew that would be fucking impossible; they bought and renamed an existing encyclopedia.
  • Every other encyclopedia has gone through a traditional editing process wherein one person creates a document and a few other people fix its spelling mistakes and structural problems.
  • Every other encyclopedia is released in canonical editions — the culmination of a long process that produces final, finished articles. The reader only sees the final version.

Wikipedia is none of these things.

300px-PunkLondon01_ST_02.jpg

  • Unlike the staff of an encyclopedia, Wikipedia’s “staff” is permeable and informal. New people join and leave all the time, without stepping through a defined process — sometimes without identifying themselves or even using a pseudonym. It’s more like a subculture than a company.
  • Wikipedia is officially nonprofit (not just unofficially, like Britannica :P ). It’s run like a charity, but it grows like a Kudzu. It’s already much larger than Britannica, and it’s only a few years old.
  • Every Wikipedia article is written collaboratively, like a whiteboard. The most-edited wikipedia articles are effectively authorless! You can dimly see the imprint of individual authors if you read closely, and you can even view the changes each author made behind the history tab, but the cumulative effect is of reading a document created by a hive of bees. (Wikipedia’s distinct house style is a side effect of its fuzzy editing process.)
  • In general, there’s no such thing as a finished Wikipedia article. See deletion by the numbers.

Wikipedia may be an encyclopedia in the abstract, in the same way that the State of Michigan is a nation. But it is not like any other encyclopedia; it is the black swan.

2. Anyone cannot edit Wikipedia

200px-FluorescentCells.jpg

The second thing people realize about Wikipedia is that anyone can edit it. (This realization comes hard because universal editorship is such a strange idea. Wikipedia used to get panicked emails every day from people who thought they’d been granted improper access to the editing screen.)

And their next thought is holy shit, anyone could have edited this article? Anyone could have added misinformation or errors?

But anyone cannot really edit wikipedia — or rather, the question is philosophical: if you edit an article but your change is rolled back after 30 seconds, did you really edit it?

Basically, Wikipedia is very good at reversing people’s edits. There are two ways to look at how Wikipedia does this.

A. The immune system model.

Okay, so the first time your body encounters a harmful invader, it’s like “oh shit” and it scrambles to wipe out the bug after it’s already started breeding. But the second time it encounters the invader, it’s learned to recognize it. It wipes the bug out in no time flat.

Wikipedia has an immune system. Part of it is the recent changes page, which lists every new edit, anywhere on Wikipedia. Part of it is the hundreds of people who dedicate their time to checking facts and fighting vandals.

300px-US_Customs_and_Border_Protection_officers.jpg

And part of it is the reflexes that these vandal-fighters have developed after months or years in the trenches, and the attitudes they pass on to each other. They’re like policemen on a dark city street — Does that guy look suspicious? He must be up to something, look at the way he’s standing around.

If you submit an article that’s not written in the house style, if you make an edit without registering an account first, if you look at someone a little funny, your edit might get reversed — you’ll look like a hoax-writer or a vandal or a PR guy, even if you’re not.

Okay, make that the immune system/police force model.

B. The subculture model.

People clump. They clump a lot in high school, but also out in the real world, and especially in situations where there’s no formal hierarchy.

Different subcultures have different ways of thinking, different ways of doing things. In theory, you could go up to a bunch of london punks and just sit down and chill with them. But in practice, you can’t — not unless they think you view the world the same way they do.

Anyone can be a wikipedian, but not anyone can be a wikipedian. This is a bit of a problem, but it does work pretty well at keeping articles free from vandalism.

3. Wikipedia tends to be pretty accurate (and when it’s not, it’s easy to tell)

Really, accuracy is generally not a problem. The level of misinformation in Wikipedia is low. For example, the journal Nature compared wikipedia favorably to britannica back in 2005. (Nature’s study did have some flaws.)

But you can rigorously gauge Wikipedia’s accuracy by reading about topics you already know a lot about.

And you can even estimate the accuracy of articles about unfamiliar topics — for example, by checking the article’s revision history and contacting the authors of individual edits, or by going back to the primary sources listed at the bottom of the article.

If you read wikipedia enough, you’ll start to develop an intuition for its accuracy. This sounds mystical, but it’s the same way you determine which newspapers and TV shows to trust.

In the meantime, I recorded a few screencasts to demonstrate clues you can pick up from the article’s context (these are all from a book I’m co-authoring, How Wikipedia Works).

First, check for outgoing Wikilinks. (If you can’t see the embeds, the videos are all here.)


Second, check for incoming Wikilinks.


Third, check for a stub notice.


Fourth, see if the article’s topic is particularly controversial.

Jimmy Wales talked about this a little:

Neutrality Point of View (NPOV) is absolute and non-negotiable in Wikipedia. The problems come up in obscure topics, such as Japanese anime. For common topics people come together and make a decent statement on what it is. It turns out what is really important is that participants have a shared vision of what they want to accomplish.

Mutually-assured destruction is inherent in Wikipedia. People who want to push an agenda end up having to write “for the enemy” rather than to those who share the same bias. Most people are pretty reasonable, but you don’t get that sense from TV where they put up two people on opposite sides. Most people are in the middle and aware of pros and cons of issues.


Fifth, check for categories


Okay, class dismissed.

But if you have any other techniques — or questions, or annoyances — there’s a discussion at the cracked forums.

3 Responses to “Setting the record straight about Wikipedia”

  1. Ben Yates Says:

    I’m going to copy a few of those Cracked posts here.

  2. Ben Yates Says:

    Disco Stu:

    I find Wikipedia useful as a first point of call for any research project. It gives a good background to most issues, and more importantly, almost always gives links to other sources which gives a more in-depth coverage of the subject. Any discrepancies should become apparent as you do more digging into the topic.

    Onebrain:

    I’m ambivalent on the subject of Wikipedia in schools because at that stage I’m not sure if the students are able to identify subtle vandalism as well as they should. I have taught pre-university students and been presented with some howlers in the past culled from wikipedia.

    Mostly they involve anonymous trolls inserting false facts or amending dates (e.g. 1812 becomes 1912). These edits are changed quickly but junior students aren’t aware of how to evaluate wiki version histories so on vandal heavy days often pull up incorrect information and then never re-check information.

    I’d be more confident of them using Wikipedia if articles had a kind of ‘health warning’ that was easy to understand.

    Johnny Roastbeef:

    One problem that I think wikipedia has in terms of acceptance by schools is their “No Self Censorship” policy. What it results in are visual aides that you would not find in any other academic resource.

    Some NSFW examples.
    Mammary Intercourse
    Ejaculation, containing pictures and video of men ejaculating
    Fellatio
    Encyclopedia Dramatica (also very NSFW) has an extensive list

    These aren’t exactly medical diagrams. And censorship or not, I think a good many parents of school children find this kind of material objectionable. The article for “vagina” has a close up of a woman’s vagina in a pose that could have been taken directly from porn. While I can appreciate that the intent of inclusion is different, what parents and schools see is that the outcome is the same.

    DevHyfes:

    My favorite part of Wikipedia is picking some sort of ancient battle (Such as the Battle of Tours) and just clicking on links. It is amazing how many articles people create. Even better, historical experts have done a lot to fill in small definitions that you don’t often see in regular encyclopedias. For example, Wikipedia has tended to have some great overviews of historical weapons, including pictures and details. Often these little points are missed in an encyclopedia where the experts had limited resources.

    That said, I do have a couple of problems with it.

    1) Specifically around historical research, there is often way, way too much import given to revisionist historians. You see a little of it in the Battle of Tours wikipedia article cited above. But if one author makes a striking historical claim, it can derail an entire article into a discussion of the pros and cons of both, giving an indication that there is heavy dispute when indeed the matter is generally settled.

    2) Stubs: Often times people fill out stubs a little more- kind of drawing up a strawman article. These can be extremely inaccurate and a careless person can mistake them for a real article if they have trained themselves not to watch the top of the page for notifications.

    3) Legitimacy: I find myself using wikipedia a lot more for informal cites, especially in forums such as HB. I think that there are appropriate times for this- such as when you are giving people a high level overview of a subject. Even when trying to buttress controversial claims, wikipedia can be useful because some articles have already aggregated quite a bit of research. So long as the person citing the article keeps these points in mind, Wikipedia can be very useful. But too often, people don’t drill down the extra bit to verify the veracity of the supporting cites, and as a result they can find themselves relying on Wikipedia (not the underlying citations) as a source of truth. Wikipedia should be considered the Poor Man’s Graduate Student. Often, professors writing papers have grad students go out and locate, organize and even summarize materials for research. Wikipedia is good for that, but like a good professor, you need to be reminded of its fallibility.

    kelvinc:

    Most articles in my (supposed) area of expertise (economics) have been edited by people far more knowledgeable than myself on the subject, which is why I trust them pretty well. It may also explain why some people are so skeptical of Wikipedia: it takes a certain amount of technical education to appreciate a well-written Wikipedia article, but the skills you’ve explained are sufficient to mark out complete bullocks.

    The Furlinator:

    As a student myself, I know that some teachers at my high school are vehemently anti-Wikipedia, and some won’t let us use the site for research even if we don’t cite it in our paper. They give us the obvious explanation of “Anyone can edit it, so it’s unreliable”. Then something occurred to me. Sure, most anyone can edit an article if they want, but that doesn’t mean they are going to. Because you know you can edit Wikipedia, are you going to? What loser goes “Hahahaha! I’m going to spend three hours changing every major fact in this article!!!” only to have it reversed ten seconds later.

    Teachers at my high school make it seem like Wikipedia is some unorganized social get-together of people who think they know something.

    Tlogmer, your article should be standard reading for all teachers. Nice job.

    The Evil Sloth:

    When I was a teacher, I was vehemently anti-wikipedia. This was, however, largely because the areas I taught weren’t technical and the ‘facts’ were much less important than the analysis of the writing. Or to put it another way, there are maybe a dozen - at most - commentators on Nietzsche who are worth reading. However there are literally thousands of ill-informed college students and graduates who have opinions on Nietzsche, all of which are, as a rule, extremely suspect. Hence, the wikipedia article on Nietzsche has little gems like this:

    Nietzsche’s works remain controversial, and no real consensus exists on their meaning. The interpretation of his works seems shakier than the interpretative literature on most other major philosophers. One can readily identify some key concepts, but the meaning of each, let alone the relative significance of each, remains contested.

    Which is patent nonsense. There are excellent treatments of Nietasche’s work; Heideggar and Kaufman leap out as great examples.

    Wikipedia also cites Morgan as authority for the proposition that ‘most commentators consider Nietzsche an atheist’. To begin with, Morgan’s text is 60-odd years old and Morgan is a minor, christian scholar with an obvious bias against Nietzsche’s claims. It’s a terrible authority for a position which is untenable; most rigorous contemporary scholarship posits that Nietzsche was a theist, though certainly an unconventional one.

    These misconceptions exist because Nietzsche is a popular philosopher - there are lots of people with a non-academic, or not serious academic interest in his work - and there are not enough serious Nietzschean scholars to rectify the misconceptions; or at least they’re not editing wikipedia. The great tragety is that, I suspect, an enormous number of undergrads taking introduction to philosophy courses are going to read that page and come away with some fundamental misunderstandings about Nietzschean philosophy.

    And it’s not just this topic; a few years ago I set a paper on Natural Law theory and had to fail half a dozen students whose research was heavily influenced by the wikipedia article as it, again, contained fundamental errors about the subject matter.

    When it comes to Technical articles on Wikipedia’s I assume, as a non-trained person that they’re a relatively helpful overview of a topic. However, when I examine articles on subjects that require a high level of analytical ability - philosophy, literature - the articles are of a very poor standard.

    Kathana:

    I teach middle school, and I allow Wikipedia as a reference material. I do compare it to a print encyclopedia. And while college level text books are fine, middle and high school level are terrible. There’s no academic prestige in writing books for grade school students, and the lower vocabulary seems to make the authors leave out all sorts of important information. I absolutely detest the Georgia Studies book I’ve been given, and refuse to check it out to the students.

  3. Fajro Says:

    Wikipedia is not an encyclopedia by definition… At least according to the definition from the Wikipedia in Spanish:

    “Una enciclopedia es un compendio exhaustivo del conocimiento humano. La enciclopedia reúne y divulga datos especializados o dispersos que no podrían ser hallados con facilidad y que presentan un importante servicio a la cultura del hombre moderno.”

    Therefore there are more deletionists in the Wikipedia in Spanish.

    The lack of a firm definition of the objectives of the project derived in the sterile fight between inclusionists and deletionists

    Wikipedia is successful because is not an encyclopedia (Is bigger and more useful) and because promises the impossible ideal of the NPOV. NPOV could be subjective like Justice and Beauty, but we want it anyway.

    IMHO Wikipedia should fork in a “Wikibase” (a site for inclusionists, and a source for the other proyects), a “general Wikipedia” (A selection of “the most encyclopedic content” for a printed version and the new main battlefield for POV pushers), and multiple Thematic Encyclopedias (Where super-specific articles will not be erased).

    We just need more Transwikists xD

Leave a comment:

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Lookup any word on eNotes with our dictionary. Highlight the word and press SHIFT + D for a definition, or SHIFT + T for a synonym.