Tipping Point, Power Laws, Linked, and Weblogs


Tipping Point, Power Laws, Linked, and Weblogs

 

The Tipping Point

The Tipping Point

 I posted a link to this online review of:

 "The Tipping Point: How Little Things Can Make a Big Difference" by Malcom Gladwell

to my weblog a few weeks ago, but for some reason Google didn't pick it up in the index. So here it is again.

The books main ideas are:

Tipping Point - Net Version

How to Start a Revolution

paraphrasing the main ideas in Malcolm Gladwell's book The Tipping Point (See Link To Author Happy Feet)

 
 
   


HappyFeet has made the best effort possible to put these items in some form of coherent order. This book used alot of marketing/business angles. I chose to replace those examples, etc. with art, creativity, and revolution. Use this to make the truth bloom.

THE TIPPING POINT IS:

to top

THE LAW OF THE FEW


There are exceptional people out there who are capable of starting epidemics. All you have to do is find them. With an epidemic, a tiny majority of the people do the work. Once critical factor in epidemics is the nature of the messenger. Messengers make something spread.
Word of mouth is still the most important form of human communication. Rumors are the most contagious of all social messages. Connectors

  1. People with a special gift for bringing the world together, people specialists
  2. Know lots of people
  3. Have an extraordinary knack of making friends and acquaintances, making social connections.
  4. Have mastered the "weak tie"; a friendly, yet casual social connection.
  5. Manage to occupy many different worlds and subcultures and niches. By having a foot in so many different worlds, they have the effect of bringing them all together.
  6. Acquaintances represent a source of social power, and the more acquaintances you have the more powerful you are.
  7. Social glue: they spread the message

    Mavens

    • Information specialists
    • Once they figure out how to get that great deal, they want to tell you about it too.
    • Solves his own problems, his own emotional needs, by solving other people's problems.
    • Have knowledge and the social skills to start word-of-mouth epidemics.
    • A teacher and a student
    • In a social epidemic, Mavens are data banks. They provide the message.

     

    Salespeople

    • Have the skills to persuade when we are unconvinced of what we are hearing.
    • Little things can make as much of a difference as big things.
    • Gives nonverbal clues that are more important than verbal clues.
    1. "Interactional synchrony": human interaction has a rhythmic physical dimension. We dance to each other's speech…we're perfectly in harmony.
    2. Motor mimicry: we imitate each other's emotions as a way of expressing support and caring and, even more basically, as a way of communicating with each other. Emotion is contagious. "Senders" are very good at expressing emotions and feelings. They are far more emotionally contagious than the rest of us.
    • Persuasion often works in ways that we do not appreciate
    • You draw others into your own rhythms and dictate the terms of the interaction.

to top

THE STICKINESS FACTOR


There is a simple way to package information that, under the right circumstances, can make it irresistible/sticky and compels a person into action. All you have to do is find it. In order to be capable of sparking epidemics, ideas have to be memorable and move us into action. Content of the message matters too.
 

  1. What is needed is a subtle but significant change in presentation to make most messages stick.
  2. The elements that make an idea sticky turn out to be small and trivial.
  3. "Clutter" has made it harder and harder to get any one message to stick. The information age has created a stickiness problem.
  4. Pay careful attention to the structure and format of your material, and you can dramatically enhance stickiness.
  5. Can tip a message by tinkering, on the margin, with the presentation of their ideas THE POWER OF CONTEXT

We don't necessarily appreciate that our inner states are the result of our outer circumstances. We are more than just sensitive to changes in context. We're exquisitely sensitive to them. And the kinds of contextual changes that are capable of tipping an epidemic are very different than we might ordinarily suspect. The impetus to engage in a certain kind of behavior is not coming from a certain kind of person but from a feature of the environment.
 

  1. Small changes in context can be just as important in tipping epidemics.
  2. An environmental argument.
  3. What really matters is little things
    • "Broken Windows Theory": in a city, relatively minor problems like graffiti, public disorder, and aggressive panhandling, are all the equivalent of broken windows, invitations to more serious crimes (Rudy Gulliani's belief)
  4. An epidemic can be reversed/tipped by tinkering with the smallest details of the immediate environment.
  5. There are specific situations so powerful that they can overwhelm our inherent predispositions.
  6. Human beings invariably make the mistake of overestimating the importance of fundamental character traits and underestimating the importance of the situation and context. We are a lot more attuned to personal cues than contextual cues.
  7. Character is more like a bundle of habits and tendencies and interests, loosely bound together and dependent, at certain times, on circumstances and context.
  8. The convictions of your heart and the actual contents of your thoughts are less important, in the end, in guiding your actions then the immediate context of your behavior.

to top

THE MAGIC NUMBER 150


"There seems to be some limitation built into us either by learning or by the design of the nervous systems, a limit that keeps our channel capacities in this general range (i.e. the human minds inability to comprehend things beyond sets 7)" —George Miller "The Magical Number Seven"


"The figure of 150 seems to represent the maximum number of individuals with whom we can have a genuinely social relationship, the kind of relationship that goes with knowing who they are and how they relate to us. Putting it another way, it's the number of people you would not feel embarrassed about joining uninvited for a drink if you happened to bump into them in a bar." —Robin Dunbar,
 

  1. Even relatively small increases in the size of a group [beyond 150] creates a significant additional social and intellectual burden.
  2. The rule of 150 suggests that the size of a group is another one of those subtle contextual factors that can make a big difference.
  3. Peer pressure is much more powerful than a concept of a boss
  4. Transactive memory: we store information with other people. Since mental energy is limited, we concentrate on what we do best.
  5. Groups of 150 are an organized mechanism that makes it far easier for new ideas and information moving around the organization to tip; to go from one person or one part of the group to the entire group all at once.

CONCLUSION

First Lesson of the Tipping Point


Starting epidemics requires concentrating resources on a few key areas. Your resources ought to be solely concentrated on the Connectors, Mavens, and Salesmen.


Second Lesson of the Tipping Point


The world does not accord with our intuition. Those who are successful at creating social epidemics do not just do what they think is right. They deliberately test their intuitions.

to top
Important Conclusion!


What must underlie successful epidemics, in the end, is a bedrock belief that change is possible, that people can radically transform their behavior or beliefs in the face of the right kind of impetus. Tipping Points are a reaffirmation of the potential for change and the power of intelligent action. Look at the world around you. It may seem like an immovable, implacable place. It is not. With the slightest push; just in the right place; it can be tipped. NOTES, ETC.


Diffusion model: a detailed, academic way of looking at how a contagious idea or "product" or innovation moves through a population.

  1. Innovators: the adventurous ones. Visionaries.
    • Connectors, mavens, and salesmen make it possible for innovations to connect with the early adopters. They are translators: they make ideas and information from a highly specialized world and translate them into a language the rest of us can understand. They drop extraneous details and exaggerate other details so that the message itself acquires a deeper meaning.
  2. Early adopters: the slightly larger group that is infected by the innovators. Visionaries.
  3. Early Majority: the deliberate and the skeptical mass, who would never try anything until the most respected of this group try it first.
  4. Late Majority
  5. Laggards: the most traditional group that see no urgent reason to change.

Social Networks

from PeterMe:
 

It's basically to see how complex human systems really work. It can be a project team in an organization, or a terrorist team, or it could be a community where a virus is spreading, and figuring out how and why it's spreading a particular way. It allows us to look at and map what are normally invisible dynamics inside a community. Once you know about it and start thinking about it, you start to realize there's a lot of relational data out there. If you start putting it together piece by piece, you put together an interesting picture, more than any of the pieces by itself.

valdis_19 (6k image)

That's what I did with that terrorist network map. Everyone was talking about terrorist networks, but no one was putting it together. So I slowly started gathering data from various sources, and with a couple hundred data points, it started looking like something.


sna_moreno (13k image)

 Initially there was a thing called a socio-gram, which Jacob Levi Moreno developed in the early 30s. These were drawn out by hand. You would observe a group, and draw a diagram, where the circles are the people, the lines are who talked to who. Other researchers started to follow.

(ed.: For a good history on the development of social group visualization, read "Visualizing Social Groups," (.PDF) by leading social network theorist Linton C. Freeman. And dig his hair and shirt!)

 

Visual Neighborhood. Blogstreet launches visual neighborhood. Blogstreet just launched a new tool that uses Java to let you view your Blogstreet "neighborhood" and click on your neighbors to expand and see their neighborhoods, etc. You get the idea. The tool is on their site and the developer, Veer, blogs about it.  [Joi Ito's Web -> Ross Mayfield's Weblog]
1:07:22 PM    

Power Laws

 

Jon Udell summarizes Power Laws and Scale-free networks:

"Power law distribution" captures this idea nicely for me. The term "scale-free networks" is less intuitive. In this context, "scale" refers to the connectivity "embodied by the average node and fixed by the peak of the degree distribution." Thus, a bell curve has intrinsic scale. In contrast, a "scale-free" network -- whose graph asymptotically approaches many weakly-connected nodes at one end, and a few highly-connected nodes at the other -- has no intrinsic scale.


8:41:36 AM    
Origin of power laws

Abstract:
- What are power laws?
- Where do power laws come from?
- A hypothesis
- Testing
- Results
- Conclusion

Power laws are characteristic of randomly distributed values that come
from a scarce resource.


[interconnected.org]


Origin of power laws


Abstract:
- What are power laws?
- Where do power laws come from?
- A hypothesis
- Testing
- Results
- Conclusion


* WHAT ARE POWER LAWS?

I've been wondering where power laws come from. They're found in all
kinds of places:
- earthquake size
- size of cities
- community size on LiveJournal
- fluctuation magnitude of a share price

Power laws are characterised by a property:
- the distribution looks the same, regardless of the scale you consider

This is called scale invariance.

So for example: with earthquakes,
if you double the energy, earthquakes become four times as rare
...and this happens at whatever energy you start with.

City growth in the USA: For any city size, there are four times that
many cities at half the size. So, if there's one city of four million
people, there will be four at two million, and sixteen with one million
inhabitants.

That adjustment factor (four times as rare; four times the number of
cities) isn't the important bit -- that can be any number as long as it
stays the same regardless of the scale.


* WHERE DO POWER LAWS COME FROM?

But where do these power laws come from? "Ubiquity" by Mark Buchanan
(from where I've also lifted many of these examples) suggests a number
of different types of game that produce these power laws:
- earthquake system models
- randomly chosen adaptive evolution
but doesn't actually say why these laws emerge (or at least, not before
the last three chapters...)

Other commonly found distributions are:
- Normal distribution (values distributed around a point, for example
the heights of people are all distributed around 5 foot 10 inches, with
many close to that and very few further away)
- Poisson distribution (as Normal distribution, but skewed because 0
acts as a lower bound)

but Buchanan points out that although people *thought* one of these
distributions would explain earthquakes or share price fluctuation, they
don't: power laws do.


* A HYPOTHESIS

What's missing in these other statistical distributions is a concept
that the values are all interlinked. For example, if a person goes and
lives in a city, they can't live in another city. If an earthquake
happens, that energy can't be used for another earthquake.

There's a dependency on a scarce resource.

So my hypothesis is that numbers picked randomly from a scarce resource
will organise themselves into a power law distribution.


* TESTING

The directory
http://interconnected.org/notes/2002/12/power_laws/
contains the script and notes.

resource_allocation.pl
- starts off with an initial resource
- takes a random amount of that resource, and remembers it
- continues until there's no resource left

The script sorts the random chunks taken off the resource in different
ways.
- into percentile buckets (that is, how many chunks are between 42% and
43% of the initial resource)
- into scales. Scales go down as a factor of a half. (That is, how many
chunks are between a half and a quarter the size of the initial
resource.)

(to even things out a bit, this resource allocation is done many, many
times, and the results added together.)

If a random allocation from a scarce resource does produce a power law,
then:
- the percentile bucket sorting should show that there are X times more
chunks in the 50th percentile bucket as the 100th percentile, and X
times more again in the 25th percentile. X has to stay the same
- the scale bucket sorting should show the number of chunks at each
scale staying roughly the same if it's a power law where X is about 2


* RESULTS

In that same directory,

- chunks_per_percentile.gif
does indeed show that the number of chunks doubles as you halve the size
of the chunk you're looking at. So the X factor is about 2 here

- chunks_per_halving.gif
shows that the number of chunks visible at each scale (halving the chunk
size for each scale) remains pretty much the same.

These are characteristic of a power law.


* CONCLUSION

Power laws are characteristic of randomly distributed values that come
from a scarce resource.
 

Power Laws Apply to Langauge

The richness of human languages is a fine-tuned compromise between the needs of speakers and of listeners, explain Ramon Ferrer i Cancho and Ricard Solé of the Universitat Pompeu Fabra in Barcelona. Just a slight imbalance of these demands prevents the exchange of complex information, they argue.

... Human languages, say the duo, seem to sit right on this sudden change. When it happens, the frequency of word usages develops a distinctive mathematical form, called a power law. The power law disappears on either side of the communication jump.

It has been known since the 1940s that human languages do indeed show just this kind of statistical distribution of word usage - the social scientist George Kingsley Zipf spotted the power-law behaviour. But it has never been satisfactorily explained before, although Zipf himself speculated that it might represent some kind of "principle of least effort".

[nature.com]

How do Power Laws apply to Weblogs?

Doc Searl on Clay:

Blogger effect.

Like Dave, I wish Clay Shirky would start a blog too. I know it's not Clay's style, just like it's not, say, Jerry Michalski's (although Jerry has a blog and Clay doesn't). Some things are best learned by doing, and it strikes me that in the absence of doing Clay hasn't learned how poorly the power story applies in the blogosphere.

In Power Laws, Weblogs, and Inequality, Clay runs blogging through the power law mill, and the result is akin to running a cat through a Cuisinart: you get easily measured stuff that bears no resemblence to the subject of the study.

One of his points:

Given the ubiquity of power law distributions, asking whether there is inequality in the weblog world (or indeed almost any social system) is the wrong question, since the answer will always be yes. The question to ask is "Is the inequality fair?" Four things suggest that the current inequality is mostly fair.

The inequality Clay talks about — the fact that some of us are graced with more links and readers than others — is of a purely numerical sort. It says nothing about why people write blogs, and why readers read blogs.

To illustrate my own point here, go sort your email by name. Is there a power curve there? If there is, does it matter?

For many of us (me included), blogs serve a kind of public email function. Since a bunch of correspondents wrote to me asking for my opinion about Clay's essay, I'm writing back to the bunch of them, plus everybody else who cares (a sum which, as a percentage of the true everybody, verges on zero).

.......

Launching good new blogs is no less easy than it ever was, and good new blogs grow no slower than they ever did. Blogging tools like Trackback and Technorati don't signal the drift of blogging toward some kind of portalesque middle-distance, but rather a constant increase in the number and variety of tools available to the practice — plus the constant change those tools bring to the practice itself. Blogging is radically different than it was six months ago. Technorati alone has changed my blogging life, largely by acquainting me with all the worthy blogs with which I had previously been unfamiliar.

The whole blogosphere is characterized by a high exponent of the experimenter effect. To borrow from Forrest Gump, blogging is as blogging does, and that changes every time somebody blogs something interesting that somebody else blogs about. It's wild.

[The Doc Searls Weblog]

Power Laws, Weblogs, and Inequality

First published February 8, 2003 on the 'Networks, Economics, and Culture' mailing list.
Subscribe to the mailing list.

Version 1.1: Changed 02/10/03 to point to the updated "Blogging Ecosystem" project, and to Jason Kottke's work using Technorati.com data. Added addendum pointing to David Sifry's "Technorati Interesting Newcomers" list, which is in part a response to this article.

A persistent theme among people writing about the social aspects of weblogging is to note (and usually lament) the rise of an A-list, a small set of webloggers who account for a majority of the traffic in the weblog world. This complaint follows a common pattern we've seen with MUDs, BBSes, and online communities like Echo and the WELL. A new social system starts, and seems delightfully free of the elitism and cliquishness of the existing systems. Then, as the new system grows, problems of scale set in. Not everyone can participate in every conversation. Not everyone gets to be heard. Some core group seems more connected than the rest of us, and so on.

Prior to recent theoretical work on social networks, the usual explanations invoked individual behaviors: some members of the community had sold out, the spirit of the early days was being diluted by the newcomers, et cetera. We now know that these explanations are wrong, or at least beside the point. What matters is this: Diversity plus freedom of choice creates inequality, and the greater the diversity, the more extreme the inequality.

In systems where many people are free to choose between many options, a small subset of the whole will get a disproportionate amount of traffic (or attention, or income), even if no members of the system actively work towards such an outcome. This has nothing to do with moral weakness, selling out, or any other psychological explanation. The very act of choosing, spread widely enough and freely enough, creates a power law distribution.

A Predictable Imbalance

Power law distributions, the shape that has spawned a number of catch-phrases like the 80/20 Rule and the Winner-Take-All Society, are finally being understood clearly enough to be useful. For much of the last century, investigators have been finding power law distributions in human systems. The economist Vilfredo Pareto observed that wealth follows a "predictable imbalance", with 20% of the population holding 80% of the wealth. The linguist George Zipf observed that word frequency falls in a power law pattern, with a small number of high frequency words (I, of, the), a moderate number of common words (book, cat cup), and a huge number of low frequency words (peripatetic, hypognathous). Jacob Nielsen observed power law distributions in web site page views, and so on.

We are all so used to bell curve distributions that power law distributions can seem odd. The shape of Figure #1, several hundred blogs ranked by number of inbound links, is roughly a power law distribution. Of the 433 listed blogs, the top two sites accounted for fully 5% of the inbound links between them. (They were InstaPundit and Andrew Sullivan, unsurprisingly.) The top dozen (less than 3% of the total) accounted for 20% of the inbound links, and the top 50 blogs (not quite 12%) accounted for 50% of such links.


Figure #1: 433 weblogs arranged in rank order by number of inbound links.
The data is drawn from N.Z Bear's 2002 work on the blogosphere ecosystem.
The current version of this project can now be found at http://www.myelin.co.nz/ecosystem/.

The inbound link data is just an example: power law distributions are ubiquitous. Yahoo Groups mailing lists ranked by subscribers is a power law distribution. (Figure #2) LiveJournal users ranked by friends is a power law. (Figure #3) Jason Kottke has graphed the power law distribution of Technorati link data. The traffic to this article will be a power law, with a tiny percentage of the sites sending most of the traffic. If you run a website with more than a couple dozen pages, pick any time period where the traffic amounted to at least 1000 page views, and you will find that both the page views themselves and the traffic from the referring sites will follow power laws.


Figure #2: All mailing lists in the Yahoo Groups Television category, ranked by number
of subscribers (Data from September 2002.)


Figure #3: LiveJournal users ranked by number of friends listed.
(Data from March 2002)

Rank Hath Its Privileges

The basic shape is simple - in any system sorted by rank, the value for the Nth position will be 1/N. For whatever is being ranked -- income, links, traffic -- the value of second place will be half that of first place, and tenth place will be one-tenth of first place. (There are other, more complex formulae that make the slope more or less extreme, but they all relate to this curve.) We've seen this shape in many systems. What've we've been lacking, until recently, is a theory to go with these observed patterns.

Now, thanks to a series of breakthroughs in network theory by researchers like Albert-Laszlo Barabasi, Duncan Watts, and Bernardo Huberman among others, breakthroughs being described in books like Linked, Six Degrees, and The Laws of the Web, we know that power law distributions tend to arise in social systems where many people express their preferences among many options. We also know that as the number of options rise, the curve becomes more extreme. This is a counter-intuitive finding - most of us would expect a rising number of choices to flatten the curve, but in fact, increasing the size of the system increases the gap between the #1 spot and the median spot.

A second counter-intuitive aspect of power laws is that most elements in a power law system are below average, because the curve is so heavily weighted towards the top performers. In Figure #1, the average number of inbound links (cumulative links divided by the number of blogs) is 31. The first blog below 31 links is 142nd on the list, meaning two-thirds of the listed blogs have a below average number of inbound links. We are so used to the evenness of the bell curve, where the median position has the average value, that the idea of two-thirds of a population being below average sounds strange. (The actual median, 217th of 433, has only 15 inbound links.)

Freedom of Choice Makes Stars Inevitable

To see how freedom of choice could create such unequal distributions, consider a hypothetical population of a thousand people, each picking their 10 favorite blogs. One way to model such a system is simply to assume that each person has an equal chance of liking each blog. This distribution would be basically flat - most blogs will have the same number of people listing it as a favorite. A few blogs will be more popular than average and a few less, of course, but that will be statistical noise. The bulk of the blogs will be of average popularity, and the highs and lows will not be too far different from this average. In this model, neither the quality of the writing nor other people's choices have any effect; there are no shared tastes, no preferred genres, no effects from marketing or recommendations from friends.

But people's choices do affect one another. If we assume that any blog chosen by one user is more likely, by even a fractional amount, to be chosen by another user, the system changes dramatically. Alice, the first user, chooses her blogs unaffected by anyone else, but Bob has a slightly higher chance of liking Alice's blogs than the others. When Bob is done, any blog that both he and Alice like has a higher chance of being picked by Carmen, and so on, with a small number of blogs becoming increasingly likely to be chosen in the future because they were chosen in the past.

Think of this positive feedback as a preference premium. The system assumes that later users come into an environment shaped by earlier users; the thousand-and-first user will not be selecting blogs at random, but will rather be affected, even if unconsciously, by the preference premiums built up in the system previously.

Note that this model is absolutely mute as to why one blog might be preferred over another. Perhaps some writing is simply better than average (a preference for quality), perhaps people want the recommendations of others (a preference for marketing), perhaps there is value in reading the same blogs as your friends (a preference for "solidarity goods", things best enjoyed by a group). It could be all three, or some other effect entirely, and it could be different for different readers and different writers. What matters is that any tendency towards agreement in diverse and free systems, however small and for whatever reason, can create power law distributions.

Because it arises naturally, changing this distribution would mean forcing hundreds of thousands of bloggers to link to certain blogs and to de-link others, which would require both global oversight and the application of force. Reversing the star system would mean destroying the village in order to save it.

Inequality and Fairness

Given the ubiquity of power law distributions, asking whether there is inequality in the weblog world (or indeed almost any social system) is the wrong question, since the answer will always be yes. The question to ask is "Is the inequality fair?" Four things suggest that the current inequality is mostly fair.

The first, of course, is the freedom in the weblog world in general. It costs nothing to launch a weblog, and there is no vetting process, so the threshold for having a weblog is only infinitesimally larger than the threshold for getting online in the first place.

The second is that blogging is a daily activity. As beloved as Josh Marshall (TalkingPointsMemo.com) or Mark Pilgrim (DiveIntoMark.org) are, they would disappear if they stopped writing, or even cut back significantly. Blogs are not a good place to rest on your laurels.

Third, the stars exist not because of some cliquish preference for one another, but because of the preference of hundreds of others pointing to them. Their popularity is a result of the kind of distributed approval it would be hard to fake.

Finally, there is no real A-list, because there is no discontinuity. Though explanations of power laws (including the ones here) often focus on numbers like "12% of blogs account for 50% of the links", these are arbitrary markers. The largest step function in a power law is between the #1 and #2 positions, by definition. There is no A-list that is qualitatively different from their nearest neighbors, so any line separating more and less trafficked blogs is arbitrary.

The Median Cannot Hold

However, though the inequality is mostly fair now, the system is still young. Once a power law distribution exists, it can take on a certain amount of homeostasis, the tendency of a system to retain its form even against external pressures. Is the weblog world such a system? Are there people who are as talented or deserving as the current stars, but who are not getting anything like the traffic? Doubtless. Will this problem get worse in the future? Yes.

Though there are more new bloggers and more new readers every day, most of the new readers are adding to the traffic of the top few blogs, while most new blogs are getting below average traffic, a gap that will grow as the weblog world does. It's not impossible to launch a good new blog and become widely read, but it's harder than it was last year, and it will be harder still next year. At some point (probably one we've already passed), weblog technology will be seen as a platform for so many forms of publishing, filtering, aggregation, and syndication that blogging will stop referring to any particularly coherent activity. The term 'blog' will fall into the middle distance, as 'home page' and 'portal' have, words that used to mean some concrete thing, but which were stretched by use past the point of meaning. This will happen when head and tail of the power law distribution become so different that we can't think of J. Random Blogger and Glenn Reynolds of Instapundit as doing the same thing.

At the head will be webloggers who join the mainstream media (a phrase which seems to mean "media we've gotten used to.") The transformation here is simple - as a blogger's audience grows large, more people read her work than she can possibly read, she can't link to everyone who wants her attention, and she can't answer all her incoming mail or follow up to the comments on her site. The result of these pressures is that she becomes a broadcast outlet, distributing material without participating in conversations about it.

Meanwhile, the long tail of weblogs with few readers will become conversational. In a world where most bloggers get below average traffic, audience size can't be the only metric for success. LiveJournal had this figured out years ago, by assuming that people would be writing for their friends, rather than some impersonal audience. Publishing an essay and having 3 random people read it is a recipe for disappointment, but publishing an account of your Saturday night and having your 3 closest friends read it feels like a conversation, especially if they follow up with their own accounts. LiveJournal has an edge on most other blogging platforms because it can keep far better track of friend and group relationships, but the rise of general blog tools like Trackback may enable this conversational mode for most blogs.

In between blogs-as-mainstream-media and blogs-as-dinner-conversation will be Blogging Classic, blogs published by one or a few people, for a moderately-sized audience, with whom the authors have a relatively engaged relationship. Because of the continuing growth of the weblog world, more blogs in the future will follow this pattern than today. However, these blogs will be in the minority for both traffic (dwarfed by the mainstream media blogs) and overall number of blogs (outnumbered by the conversational blogs.)

Inequality occurs in large and unconstrained social systems for the same reasons stop-and-go traffic occurs on busy roads, not because it is anyone's goal, but because it is a reliable property that emerges from the normal functioning of the system. The relatively egalitarian distribution of readers in the early years had nothing to do with the nature of weblogs or webloggers. There just weren't enough blogs to have really unequal distributions. Now there are.

Addendum:
David Sifry, creator of the Technorati.com, has created the Technorati Interesting Newcomers List, in part spurred by this article. The list is designed to flag people with low overall link numbers, but who have done something to merit a sharp increase in links, as a way of making the system more dynamic.

First published February 8, 2003 on the 'Networks, Economics, and Culture' mailing list.
Subscribe to the mailing list.

Monday, February 17, 2003
Weblogs and power laws

Many systems and phenomena are distributed according to a power law distribution. A power law applies to a system when large is rare and small is common. The distribution of individual wealth is a good example of this: there are a very few rich men and lots & lots of poor folks. A familiar way to think about power laws is the 80/20 rule: 80% of the wealth is controlled by 20% of the population.

It's been shown that the distribution of links on the web scales according to a power law, so it comes as no surprise that the distribution of links to weblogs does as well. Taking the top 100 most linked to weblogs on Technorati as a data set (specifically from 1/24/03), I used Excel to plot and fit a curve to the data:

weblogs obeying the mighty power law

....

This NEC study reveals that the deviation of a set of data from the power law correlates to how much competition is present in the system. The better the fit, the more competitive the environment is. Again, no surprise that the system of weblogs is a highly competitive one.

But what are weblogs competing for? Matt Webb posits that power laws arise due to scarcity.... The scarcity of people's time results in the distribution of links that can be described using power laws.

The idea is that instead of using a quadratic or cubic equation that kinda fits the data, you use a power law equation generated by the data itself to exactly fit the data (or nearly so). The power law equation I derived using the limited sample of the top 100 list is:

y = 5989.8x^(-0.8309)

where y is the # of inbound blogs and x is the rank of the site. I plotted the top 100 data again and tried to fit three curves to it:

fitting three curves to the technorati top 100 data

The dotted blue line is a linear equation, the dashed red line is a quadratic equation, and the solid black line is the aforementioned power law equation. As you can see, the linear and quadratic equations fit the data poorly. The R-squared for the linear equation is 0.31, 0.55 for the quadratic, and 0.99 for the power law equation. So the quadratic is an improvement over the linear equation, but neither compare to the excellent fit of the power law and the excellent results that would follow from using it for Technorati's interesting recent blogs lists.

[Kottke.org]


11:58:17 PM    
Breaking the (power) law

I thought about the problem that this presented to a traditional link engine.  When you rank bloggers simply by the number of people who link to them, you get a very static list of "a-list" bloggers, as shown by the Technorati Top 100.  What I wanted to do was to break that power law, and give more exposure to the lesser known, but still interesting bloggers, especially on days when they stand out and do something interesting.

Siry explains how he inverted the power law to flatten the curve

Basically, the idea is that for a relatively obscure blogger who has, say, 40 people currently linking to his blog, getting 4 or 5 new blogs linking to him can have the same effect as a a-list blogger getting 40 or 50 new links.

This is interesting research for me, but the most satisfying thing about it is that I've found a way to identify interesting new writers and add them to my blogroll.

[Sifry's Alerts]


11:46:58 PM    

a detailed analysis of Power laws as applied to Weblogs, Newspapers and Movies:

http://homepage.mac.com/kevinmarks/powerlaws.html

The conclusions he comes to are:

1. Weblog links do follow a power law
2. This saturates less quickly than other media, due to low barriers to entry
3. Therefore the many lightly linked weblogs outnumber the few heavily linked ones

[Kevin Marks]


11:26:34 PM    

Power Less

... Investigating the "blogosphere" is interesting, but not nearly as interesting as recognizing that there are blogospheres. And these spheres are neither wholly seperate nor wholly integrated with the rest of the network. What constitutes a border for such a sphere? What does this differentiation mean to the structure of the whole.

[Alex.Halavais.net]

The author, Alexander Halavais, works at School of Infomatics at U. Buffalo.  Either we're brining him in for a student speakers talk or I'm gonna go visit him. Or both.

brains, cities, and software trying to prepare for a 8000 word article I have to write for Illume on the future of information. I've been thinking about just this issue for the last month. I think that trying to connect the discussion about emergence with this issue is key to understanding how blogs are different.

...  Although the search engines and metaindexes are useful, they are no longer the first place you go. I read my RSS news feeds before I go searching on a portal for news. As Dave says, don't know most of the blogs on the top 100 list and I don't care. We are organized into more intelligent communities and although there is a power law of sorts with respect to blogs that get a lot of attention, there are many local peaks. I think it looks much more like clusters of blogs with interconnections between communities. A lot like a strength of weak ties sort of map.

...

Technorati top 100 ranking is not as important to me as WHO is linking to me. When I was running Infoseek, all I cared about was HOW MANY pages views we were getting. Sure I brag about my page views to people who don't blog because that's a metric they understand, but the really interesting stuff is going on at a higher level I think.

So... How do we capture the next higher level of order? Well, that's what I hoped you might have an answer for. I think there are ways to look at this subjectively.

One way might be to track a meme through blogspace. See how an idea like your article gets picked up, quoted and where it ends up. Map that and you have one space. Each meme is like a tracer. Some communities will pick up certain ideas, while other will not. You can find the weak ties between to communities as these memes make their way across networks. MANY memes will end up being very local, and SOME will end up on EVERY blog. But I don't know how to do this.

I also want to read his paper on the connection between weblogs, power laws, and democracy.

[Joi Ito]

I am posting this to be sure that I can still find Ross's work later
Ecosystem of Networks.

My post on Distribution of Choice was a little long winded, so let me sum up:

  1. Not all links are created equal
  2. Conversational relationships are not scale-free
  3. Applying these principles reveals a Network Ecosystem Model that helps us understand the political economy of weblogs

Network Size Description Distribution
Political Network ~1000s Blogs as mass media Power-law (scale-free)
Social Network ~150 Blogging Classic Bell-curve (random)
Creative Network ~12 Blogs as dinner conversation Dense (equal)

A link to a site you read isnt the same as a link to someone you know through their blog or someone you actively collaborate with. 

After reviewing data of work relationships, information flows and knowledge exchanges from hundreds of consulting assignments inside Fortune 2000 organizations Valdis Krebs did not see much evidence of power laws in this data. His data is of confirmed ties [both persons agreed/recognized their mutual interactions/flows/relationships] from a worldwide pool of clients dating back to 1988. Of course he found some people were better connected than others, but the extreme hubs found in power law networks just were not evident.

Adapting a famous line from the movie "Blazing Saddles" Valdis concluded: "Power Law? There ain't no stinkin' power law in this data!"

This conclusion fits well with Duncan Watts observation that the more you ratchet up the requirements for a link, recognized connections diminish, and the less you see power laws. 

Which makes all the noise about Power-laws off target.  I had the pleasure of having a dinner conversation with Clay last night.  Yes, he should start a weblog, but he has his own reasons for not doing so yet, which I'll let him explain for himself.  But studying the structure of the weblog ecosystem does not have to be an anthropological exercise.  Its a wonderful testament to the energy of blogspace that Dave Sifry created a new index to reveal the neglected tail of the Power-law distribution of a Political Network.  But we don't have to screw the Power-law or use statistical techniques to reveal a different distribution.  This approach has tremendous value in allowing new cream to arise to the top.  Both innovations are still attempting to filter the wrong set of data and to generalize all of blogspace.  What matters isnt breaking these laws, but the perspective that weblogs, aside from the Political Network publishing dynamics, are communication tools for group forming in Social Networks and Creative Networks.  Meg asks the right question: what if these tools can expand our capacities?  What if 12 and 150 become averages instead of limits? 

Other people are thinking in similar terms from an anthropological perspective as participants.  The Social and Creative Networks are where the new and valuable interpersonal connections are being made.

In the coming days I will build upon the Network Ecosystem Model to explain the Distribution of Influence and Distribution of Social Capital.  My head hurts, but this is getting interesting.

[Ross Mayfield's Weblog]
© Copyright 2003 Micah Alpern.
Last update: 4/3/2003; 11:53:18 AM.