A local department of a political party of which I am a member (never mind which party) had a discussion rolling some time ago concerning spam mail, which led our web editor Rasmus Larsen to ask a few questions concerning Kaplak :
Let’s say that I am organizing an independently financed political forum on the web, with a range of interesting articles by a mixed group of connected people – some writing small newspaper pieces, others longer dissertations. [...] The purpose is not to generate a profit, but 1) to create attention around the webforum as a supplier of meaningful political articles, and 2) to inspire and influence the activities of the target groups, as a kind of prolonged think tank activity.
If there’s an article which supply something innovative on integration policies, it needs to get out in some way to all relevant people who’re already occupied with the subject and are active online. It could be high profile debaters and thinkers, people within different political parties, which leads particular workgroups, certain students and researchers at universities etc.
Now I face these challenges :
- How do I get this article out to the target groups described here, without a firm grasp of who and where they are?
- How do I make sure, they won’t consider it spam or unimportant?
- How do I get it out to the target groups, which I haven’t even considered exists?
These questions hit the nail on a crucial challenge also for Kaplak : “Search” pre-conditions a pre-knowledge, a core of conscious information, which makes someone able to search for something. How do we reach the other someones, who are interested in what we do, without knowing who they are, and they not knowing who we are?
The answer is deceptively simple, yet incredibly hard work. The answer is hyperlinks. Most people don’t realize how important they are. A search engine, for instance, is really nothing but a very advanced index of hyperlinks and hyperlinked webpages. So to be visible for the someones who do search for you, if they know who you are or what your “product” is – let’s imagine you manage to get that information to them by some other means – you have to build a strong interlinked system of hyperlinks, pointing to your site from related sites, networks, communities, blogs etc., which will make search engines better pick up your site and rate it correctly and appropriately.
You can use special techniques often referred to as Search Engine Optimization (SEO) to optimize your visibility for people who search for you on the web. But your efforts will be most efficient, if 1) your target group know beforehand that you exist or already are looking for what you offer, or 2) you can define precisely or near precisely, who your target group is and what they will search for.
The bottom line is still, however, links, links, links. A well placed link in a good spot will direct the right kind of people to your product or message.
So what is ‘a good spot’? I’ll discuss this in a second.
Concerning the questions about how recipients won’t consider your message or product ’spam’, and how to reach groups you haven’t considered existed, I’d like to flip the questions around a bit. In other words, what will recipients consider spam? And how do you reach people, who haven’t even considered your existence?
To the first, clearly most people consider unsolicited mail spam. Information they seek out themselves, are motivated for and have accepted to receive is not spam. If you want your message out, you have to find a way to reach people where this is the case or seems to be the case. The second point has really the same answer. In order to reach people, who don’t know you exist, you have to create a situation, where they seek you out themselves. So how do you do this?
You find out what the good spots are. Get your hyperlinks to the places where you find people who are potentially interested in your message and won’t consider it irrelevant.
At first, this may seem like an impossible task, especially considering the enormous growth of the web. But if one reconsiders, there’s good reason to praise this further atomization of the internet. The growth of the number of sites in the world also means, that there’s a much greater variety and finer segmenting of target groups. If you can define them right and precisely enough, that is. Segments can be anything on the web which has an audience or some point of communication. They can be website communities, but need not be. They need even not be on the web – they could also be mailing lists, intranets or darknets, i.e. closed p2p networks. A segment can be as small as the group of friends around a Facebook profile, or as large as Barack Obamas following on Twitter.
To describe and predict segments like these are exceedingly and increasingly difficult to do from any central point of view (although any SEO will certainly try). This is why we need to utilize local forces and filters (by means of “peer production”), which will help decide for us, what kind of segments will find what pieces of information. Local peers have what we don’t : the expertise in knowing their communities and segments much more precisely than we do. Connecting with these mediators is how we find ‘the good spots’.
In a way, this is what is already happening now all over the internet. An atomization followed by more precise forms of segmenting and reaching audiences and markets. There are a lot of affiliate programs and products such as Google AdWords/AdSense which help mediators make these connections. But in Kaplak we don’t see really efficient solutions which helps the producers on the very slim end of the long tail, because these customers are not really the concern of most market players operating on the web.
What we propose at Kaplak is (among other things) to introduce a capital bonus (i.e. kaplak) to those peers who successfully connect a niche product with a niche market. This, supported by other tools, will help speed up the connecting of products with their markets in online niche contexts and generate larger margins for our customers – as well as for the mediators.
Or do algorithmic search really scale better, work faster and ensure better quality than ’socially produced’ services? A few days ago, I had an interesting exchange of POV’s with Danish SEO Mikkel deMib Svendsen, known among other things from the SEO radio show Strikepoint.
I replied to Mikkel’s blog post on ‘Search – before, now and in the future’, where I tried to make the point that search as a communications solution suffers from key preconditions, which are far from optimal. Among these the fact that in order to search for something, you need to know what to look for before you search, and you need to deliberately and consciously use a search engine to look for it.
In other words, search is a deliberate and conscious affair. This makes it difficult, for instance, to use search to market products, which are not well known, such as niche products, or to address problems or needs, which are not yet consciously thought or expressed.
Add to this the present growth rates of information on the web. For each new website added to the web, you increasingly risk that your information will never reach the queries seeking it. We’re talking exponential growth rates in several millions of new websites, added each month to the web. As Search faces these increasingly greater amounts of information, this problem, which we’ve so far dubbed the mainstream problem, will only become more apparent.
Mikkel, however, firmly believes in the future of algorithmic search, so these claims didn’t go uncommented. First, he argues that machines will always work faster and scale better than social services, which has great filtering and quality challenges :
I am completely in line with Louis Monier [founder of Altavista], and am 100% certain that algorithmic search will remain dominant. Manual data processing, like in the [online] social services, simply suffers from too many scaling and quality assessment-issues to compete in the long run. Only machines are scalable on the necessary scale and with a continued central quality assessment. … [my own translation, MB]
I have a few problems with these arguments for the quality of search as a communications method. I wanted to analyze them a bit here, in order to make them part of our process to find out more about the effectiveness of online communication and it’s niche/longtail effects. Over the past months, I’ve come to question the widespread naturalization of search as ‘the best’ and ‘natural’ method of making information available and visible online. True, right now Search is the dominant method for obtaining information online. It is also a billion dollar business, although this is mostly due to the success of Google AdWords/AdSense, which is quite a different product. However, this may not be so in the future.
The part of Mikkel’s argument which makes a distinction between social services on one hand, which are manually created and processed (by humans), and search on the other, which uses machines and algorithms, and therefore scale better etc, is fundamentally flawed.
First, search results are ‘peer production’ almost as much as any online social bookmarking service is, i.e. they are socially produced. Peer production is a term coined by Harvard professor Yochai Benkler (in The Wealth of Networks (2006) – which can be downloaded freely here). That search results are peer production means that they create value by putting together websites from different peers (i.e. companies, organizations or individuals) in order to respond to a search query. Google does this without even asking the peers first (it’s an opt-out, not an opt-in system), and so the peers used to create value may not even know that they contribute value to this system. However, this doesn’t detract from the fact that it is the sourcing and pooling together of the work of different peers as a response to a human search query, which creates value in a search result. A search result is socially produced, even though the work done filtering and presenting it in few seconds is done by advanced programming, software, hardware – and cables.
These advanced architectures however, are also created by humans, which means there are someone sitting and using their human capabilities to decide what categories and what variables should factor in with how much weight in the algorithms which control the process of finding and delivering information, when somebody searches for something particular.
The problem with this is not that the process is just as human-influenced as any online social bookmarking service for instance, but that the someone deciding what variables should factor in, is (most likely) not an expert on what the someone at the other end typing in a search query is looking for. The other someone is. It’s one size fits all. One architecture (in principle) for all queries in the world.
I tried asking Mikkel how he could be sure, that a query actually met with a usable result. Even if a query is answered by a number of search results, this doesn’t mean that the search results are actually usable and delivers the answer to the query. If this user experience is bad, search fails in delivering an answer, even if there are a million hits on the query.
Let’s take a look at something completely different, i.e. this page at Wikipedia. Notice the edits happening to the article “Tsunami” in december 2004? A page which before december 2004 had minimal contributions and edits made to it, literally exploded with new information, when a tsunami this month devastatingly hit the coasts of Sri Lanka and Thailand. Everything was frequently updated as events rolled along and people in different parts of the world found out new things about what had happened, complete with a small animation to go along with it.
Wikipedia aims to make knowledge freely accessible to anyone on the planet. Like providers of algorithmic search, Wikipedia uses lots of machinery to deliver it’s information, as well as an advanced complex of software architectures. Wikipedia’s articles are peer produced, but much more directly and consciously so than the algorithmically created search result we saw earlier. Even the software is peer produced. MediaWiki is free software, which can be copied and worked upon by anyone who wishes to do so, and any changes may be adopted by the main package.
A second point is the difference in value created. With an example from his own work, Mikkel illustrates how Search Engine Optimization (SEO) done right directly creates great surplus of value for the companies he and other SEO’s work for. Regular SEO maneuvres help direct lots of relevant traffic to the corporate websites.
That SEO helps create value, however, by more directly targeting traffic at corporate websites can’t be said to be an argument for the quality of search as a communications solution in and of itself, but rather for the quality of Mikkel’s and his colleagues’ work. There’s a lot of money in SEO, and that’s not because search is a brilliant solution to a communications problem. It is rather because search is inherently insufficient as a solution to the problem of connecting a query/demand with an answer/product, especially for a company which wants to stay alive and gain a competitive edge. And this problem will grow a lot bigger. I predict that Mikkel and his SEO colleagues will be paid even better in years to come.
It is first and foremost a problem of visibility, not particularly of search. We need to create better ways to make information accessible to the people who need it, without swamping those who don’t. Second, it is a problem of speed, because we need information fast, to better meet the challenges we face, as individuals, organizations and societies.
As a non-profit, Wikipedia doesn’t make any money on the processes involved in creating and building a quality article, but the value that an improved Wikipedia article (such as the tsunami article) provides for millions of journalists, for instance, and the newspapers and media companies which employ them, is indispensable. I know for a fact that reporters use Wikipedia a lot, and with good reason. It is the fastest and most scalable source of information online, beyond any doubt. And when as many contributors as in the tsunami article come together, it also proves a highly reliable and credible source. It beats the crap out of trawling search results pages without finding what you’re looking for. But it is only a small example of what peer production is capable of, given the right architectures and tools.