The bias of social software

I am in Web 2.0 ignorance recovery. For two months I’ve been playing catch-up about web 2.0 and social software. If I say something stupid or ask questions already answered please just comment me a link and I’ll shut up. (If you’re new too, start here).

Some folks have criticized blogs for being an echo-chamber: meaning that many blogs simply echo each other and people debate within narrow boundries. I’m not making this point, but I am leapfrogging over any echo debates to ask a larger question.

del.icio.us is a canonical example of both social software and Web 2.0. Simply put: in this example of social software (SS) people mark items that interest them and people can view the results. Looking at the top marks, it reflects a curious bias. Take a look:

Exhibit A: the most recent list of most popular del.icio.us items.

Generalizing from top ten lists has many downsides, given long tails and power laws, but it is an easy catalyst for questions.

popular delicious

This list seems not like what’s popular with most of the people I know, but what’s popular with the Computer science / web development/ blogging crowd (some of the people I know). Ajax is a development tool. Ning, the highest ranked item on the list, is a new kind of social software development platform. Half of the top ten are items reflect the kinds of things the makers of delicious are likely interested in (I mean come on: Monty python is just a dead give away. Although the cleaning link made me consider a programmer/OCD connection). This popularity snapshot expresses an interesting kind of bias: the users of this social software reflect the interests of the makers of the social software. I suspect the majority of delicious users, today, have similiar interests to the makers of delicious.

Exhibit B: Popular delicious tags

Tagging, which is another key aspect of social software, also makes visible some interesting bias effects.

top tags

The most popular tags again reflect the interest of people who make the tools. Web 2.0 and social software is new enough that the early adopters are not just super interested users, but tool builders themselves (See this for more tags). I am not suggesting why this happens, just making the observation that it may be true.

Exhibit C: Popular 43things.com tags

At another social software project, 43things.com, there’s a broader distribution:

43 things top tags

Here are my questions:

1) Why does one social tagging system have such a different kind of focus (delicious) while the other doesn’t? Delicious and 43things are both available to the world. What explans the wide gap in their user’s interests? How do the demographics for Flickr.com, a service believed to be consumer oriented but which many of my thirty something friends have never heard of, compare to other kinds of tools? Other kinds of social software? Are they younger? More affluent? Are there generation gaps tied to social software?

2) In using friendster, linked-in, orkut and other social networking tools, I had the experience of meeting the same people in different systems. It was like being in college and moving the party from building to building (only without the drugs. Well, at least on the networks I was invited onto). Did I just not know enough people? Or is the social software crowd, today, smaller than we admit? Is anyone examining if diversity is growing with size?

3) What are the risks of the frameworks for social software being defined so strongly by early adopters? Do the frontier settlers self-interest, and ability to define their world, restrict the kinds of followers than will come (I think so)? (Insert favorite Mcluhanreference here)

4) How does a user new to a social software project establish a sense for how his interest match with the popular interests of the most active users? Where are the the tools that let me search against delicious like data to see what’s popular with people who have traits or interests I care about? For example: there will never be a luddite group on meetup.com. What other hidden biases are there? The digital divide as an important but easy example, but there are more subtle ones.

5) Are there inherent biases that most active users in social software have (e.g. technical, high math SAT scores, etc.)? How does this impact how social software should be designed? A traditional software designer can shape the design around different, and possibly under-represented, user’s needs – but if social software is user driven what counterbalances are there?

6) Are the things that interest social software early adopters the same things that will interest more mainstream and diverse people? Should anyone care?

8 Responses to “The bias of social software”

  1. Loren Davie

    Hi Scott – a couple of selected thoughts:

    Social software, Web 2.0, AJAX etc., may be the web in-crown buzz du jour, but this stuff has not crossed over into the mainstream. It hasn’t “crossed the chasm”, in Geoff Moore’s language.

    An alternate example, podcasting, probably has more mainstream traction right now due to being included as an iTunes feature. For social software to penetrate mainstream consciousness, there needs to be a widely demanded “killer application” for it. Probably besides hanging out, that is.

    To cherry pick some of your questions (btw you have two #3 questions):

    3. (first one): Theres a risk, but over a long enough time scope its less than you think. The web of today has all of the good and bad qualities of mainstream culture. On the other hand, what you term as “risk” may be considered a feature. One could argue that the distributed nature of the internet (not really a feature of mainstream culture – see TV, radio, Telephones etc) is one of the primary contributors of its success. This is a legacy from the architects and early adopters of the Internet.

    4. Yes, its probably the early adopter characteristics. Above average intelligence, tolerance for change and risk, etc. As the tech goes mainstream this bias will dissappear. This is usually accompanied by some entrepreneur “dumbing down” the tech to appeal to the mainstream. Have you seen 37 signals Writeboard? (Not an endorsement – I haven’t tried using it seriously yet). Its basically a wiki thats been made “mainstream-friendly”.

    5. Probably not exactly. You should care if you’re launching a social software service and you’re trying to appeal to an early-adopter or mainstream crowd.

    Reply
  2. Marilyn Langfeld

    Hi Scott,

    I like your questions. I’ve also been diving into the subject, trying to get ahold of it. I’d like to focus on one word you use: popularity. What is the connection between popularity and veracity, for instance. Or importance. Or…

    I began using del.icio.us as an aid for blogging (haven’t used it lately)–an easy way to categorize sites I’ve seen that I might want to use in a post of one kind or another. I was always pleased to be the first to tag a url, as opposed to being one in a million that had tagged something. Popularity is fine, but not necessarily the only goal you can have. That’s my main concern with all the buzz about social software. Is popularity the best way to judge relevance?

    But, for a general audience, popularity is probably key. Just depends on your goals with this type of thing.

    I’m working my way through your book and am really enjoying it, by the way. I’m a graphic designer, dipping my toes into project management of web development projects.

    Reply
  3. keith bohanna

    Hi Scott,

    enjoyed that post and not going to go too deep into it – for it to be thought provoking was enough for me.

    On top of all of your points is English V social networking in other languages, geographic distribution. Never mind cultural and race issues :-)

    keith

    Reply
  4. blogopogo

    annotation, annotation, annotation…. i never learned enough code to flesh out a ‘7 layer burrito’ sort of thing i thought would be the ultimate knowledge base, i was thinking of it as some combination of timestamp/uid + blob, which picks up title, pointer, type, categories, various key/value pairs and on and on to whatever nth works for the application. then the pile of it all picks up annotations from those who search through it looking for answers, patterns, references, whathaveyou. indexing emerges, eventually polished & focused catalogues are spun from it using the very same iterated/aggregated/popmuzik annotation thingies. 1999 tho! with the right tools i could have gotten so much done… anyway, now that pretty much all the ideas that came from angsting for useful implementations of them are turning up done well, fairly accessible, and no doubt to be thoroughly integrated any moment now (thinking google spreadsheets & ical, opml, georef, vcard, & all the other categories of info bits you can pile into google lately.. the spreadsheet sort of opens it all up (or it will once it’s all glued together) b/c you can set up your stuff however you wanna so no need to wait on this or that, data dump now & xslt it all better later.

    Reply
  5. blogopogo

    oh & no kidding re consider a programmer/OCD connection

    Reply

Pingbacks

Leave a Reply to Holgar Maass

* Required

Click here to cancel reply.