Nov 11, 2007

Yup, it’s on again. Northern Voice, that is, for 2008. February 22–23, 2008, at the Forestry Sciences Centre at UBC, where we held it last year. It will be the same format (Moose Camp unconference on Friday, conference on Saturday), with some sort of party on the Thursday evening. More details will appear on the web site as we (that is, the organising committee) figure them out.

If you’re interested in personal/educational blogging or social media, check out the Northern Voice web site for more details, including how to submit your ideas for talks you’d like to hear, or talks you’d like to give. Previous years have been a lot of fun, and I’m only a little bit biased.

Nov 09, 2007

The technical component of Web 2.0 includes XML, Ajax, the Atom Publishing Protocol (APP), various programming languages, plug-ins and widgets, and the REST architecture. All of these have a role to play in supporting the web sites that incorporate Web 2.0 features, though many predate the Web 2.0 phenomenon. There are far too many interesting technical features for me to talk about all of them in one post, of course, but this post should at least introduce you to some of the more interesting acronyms.

[Image: tag cloud] Obligatory tag cloud: this one contains some technical terms.

Developing Web 2.0 applications is easier than developing large enterprise-style applications. The developer toolkits are a lot easier to use, and it’s much faster to create something. 37signals, who make Basecamp, amongst other tools, say they put it up in four months with 2.5 developers using Rails, a development framework. For developers there’s now a range of language options, from PHP to C++ or Java EE, with newer languages and frameworks like Ruby on Rails grabbing mindshare as well. People can program in the system they’re comfortable with, and although there’s a certain amount of snooty disparagement of each language from proponents of some other one, what matters in the end is using the right tool for the job. I’ve seen bad code written in Java and good code in PHP, and a system that does less but does it well is preferable, to my mind, to one that does a lot really badly.

Ajax (Wikipedia link) is another important Web 2.0 technology. It’s really shorthand for a bundle of technologies (HTML, CSS, the DOM, JavaScript) that work together in the browser to create a richer environment, combining scripting with a way to request information from the server without forcing the entire page to be reloaded. It’s powerful and interactive and can be much faster than other methods of adding interactivity to web pages. There are lots of books on the subject, which is a reasonable indicator of the interest in it.

Since Ajax combines a lot of different technologies, debugging can be a problem. Some basic rules that I’ve found useful are: first make sure your HTML/XHTML validates, then make sure your CSS validates, then use Firefox with the Firebug extension to debug the rest. Once you have that working, you can make the changes for other browsers as appropriate.

Poorly written Ajax does have some problems, such as results that can’t be bookmarked, or the back button not going back to the right place. The big problem is the non-standardized XMLHttpRequest object in JavaScript, the object that lets your page talk to the server and get the right information. The way it works varies between different browsers and different versions of the same browser (IE 6 to IE 7, for example). Although the W3C is starting to work on standardizing it, that will take some time. Another problem is the “A” in Ajax: it’s asynchronous, which means that internet latency can be an issue.
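
To make the incompatibility concrete, here’s a minimal sketch of the classic workaround: feature-test for the native object and fall back to the Internet Explorer ActiveX version. The URL and the element id are made up for illustration.

    function createXHR() {
      // Native object: most browsers, including IE 7
      if (window.XMLHttpRequest) {
        return new XMLHttpRequest();
      }
      // IE 6 and earlier expose it as an ActiveX control
      if (window.ActiveXObject) {
        return new ActiveXObject("Microsoft.XMLHTTP");
      }
      throw new Error("This browser doesn't support Ajax");
    }

    var xhr = createXHR();
    xhr.onreadystatechange = function () {
      // readyState 4 = response complete; status 200 = OK
      if (xhr.readyState === 4 && xhr.status === 200) {
        document.getElementById("output").innerHTML = xhr.responseText;
      }
    };
    // "/latest-news" is a made-up endpoint; true = asynchronous, the "A" in Ajax
    xhr.open("GET", "/latest-news", true);
    xhr.send(null);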

These problems can be solved. There are Ajax toolkits available which hide XMLHttpRequest and other browser incompatibilities, some applications have figured out the back button and bookmarkable-URL issues, and the asynchronous issues can be dealt with by breaking the application up into small segments that take into account the fact that the other end may never respond. As a result of these toolkits and techniques, Ajax is now a major component of many websites, even those that aren’t Web 2.0 startups.

REST is an architectural style that explains a lot of why the web is so successful. Roy Fielding’s PhD thesis was the first place where it was codified (and he coined the term). Basically, the idea is that everything you can reach on the web should be a resource with a web address (URI) that you can manipulate with the standard HTTP verbs, and that will have other URIs embedded in it. There’s more to REST, of course, and I’m sure the purists will take issue with my over-simplified description.
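
As a rough illustration of the resource-plus-verbs idea: the same address answers to different HTTP verbs, each with its generic meaning. The /books/42 URI and the payload are invented for this sketch, not any real API.

    // One resource, one URI, manipulated only through the standard verbs.
    // "/books/42" is an illustrative URI, not a real service.
    var xhr = new XMLHttpRequest();  // or createXHR() from the Ajax sketch above
    xhr.open("PUT", "/books/42", true);                // replace the resource's state
    xhr.setRequestHeader("Content-Type", "text/xml");
    xhr.onreadystatechange = function () {
      if (xhr.readyState === 4) {
        alert("Server replied: " + xhr.status);        // e.g. 200 OK, 404 Not Found
      }
    };
    xhr.send("<book><title>Example</title></book>");

    // GET /books/42 would read it, DELETE /books/42 would remove it, and
    // POST /books would add a new book to the collection.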

REST is widely used in what I call Ajax APIs: the APIs that various applications provide to let people get access to their data. Mash-ups, where you take data from one service and combine it with another service, use these APIs all the time. The classic example of a mash-up was to take Craigslist rental data and mash it with Google mapping data onto a third web site (HousingMaps), without Craigslist or Google being involved to start with. There are now vast numbers of mash-ups and lots of toolkits to help you create them. One problem with mash-ups is that the people providing the data may not care to have you take it (for example, if they run ads on their sites); the Web 2.0 solution is that if you own the data, you add more value to it that can’t be mashed as easily. Amazon has book reviews on top of the basic book data, for example, so people use Amazon as a reference link.
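
In code, the mash-up pattern is little more than two Ajax calls to two different services, with the results combined on the page. Both API URLs and the response fields below are invented for illustration; real services each publish their own API (and terms of use).

    // Fetch JSON from a service's API and hand the parsed result to a callback.
    function fetchJSON(url, callback) {
      var xhr = new XMLHttpRequest();
      xhr.onreadystatechange = function () {
        if (xhr.readyState === 4 && xhr.status === 200) {
          // Pre-JSON.parse era: eval() was the common (if risky) approach
          callback(eval("(" + xhr.responseText + ")"));
        }
      };
      xhr.open("GET", url, true);
      xhr.send(null);
    }

    // Combine rental listings from one (made-up) service with
    // coordinates from another, then plot them on an embedded map.
    fetchJSON("/rentals-api?city=vancouver", function (rentals) {
      fetchJSON("/geocoder-api?q=" + encodeURIComponent(rentals[0].address),
        function (loc) {
          // plot the listing at (loc.lat, loc.lng) on the map
        });
    });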

The concept of mash-ups goes further into platforms that support plug-ins and widgets. One of the appealing things about Facebook is the fact that application developers can write widgets to do various things (from the trivial to the heavyweight) that use the information that Facebook provides (this has privacy implications, but more about that in a later post). In a sense, this is about sites (usually commercial sites) using the social aspect of Web 2.0 (user-created content) to provide more features to their users, and it is tightly tied to the process implications of Web 2.0 (more about that in the next post).

The Atom Publishing Protocol is fairly recent. Atom is the cleaned-up version of RSS and gives you a feed of information, tagged with metadata such as author, published date, and title. There is now also a protocol to go with it, designed for editing and publishing web resources using HTTP. It can be used as a replacement for the various blog-based publishing APIs, which were used to allow people to post to their blogs from different editors, but it’s now obvious that it can carry other information as well, and not just for blogs. Since it’s a REST-based API that uses basic HTTP, it can be used for more general client-server HTTP-based communication. A good overview is on the IBM developer site.
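
A hedged sketch of what publishing with APP looks like: POST an Atom entry to a collection URI, and the server replies 201 Created with the new resource’s address in the Location header. The /blog/entries URI is invented here; a real server advertises its collections in a service document.

    var entry =
      '<entry xmlns="http://www.w3.org/2005/Atom">' +
      '  <title>Hello, APP</title>' +
      '  <author><name>Example Author</name></author>' +
      '  <content type="text">Posted over plain HTTP.</content>' +
      '</entry>';

    var xhr = new XMLHttpRequest();
    xhr.open("POST", "/blog/entries", true);  // made-up collection URI
    xhr.setRequestHeader("Content-Type", "application/atom+xml;type=entry");
    xhr.onreadystatechange = function () {
      if (xhr.readyState === 4 && xhr.status === 201) {  // 201 Created
        // The Location header holds the URI of the freshly created entry
        alert("Published at " + xhr.getResponseHeader("Location"));
      }
    };
    xhr.send(entry);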

One of a series on Web 2.0, taken from my talk at the CSW Summer School in July 2007. Here’s the series introduction. Coming up next: process aspects of Web 2.0.

Nov 08, 2007

Lest anyone think that physicists don’t care about the real world, Bob Park publishes a short weekly newsletter that touches on subjects ranging from scientific hoaxes to inconsistencies in the way the U.S. Administration handles various issues. It mostly concentrates on science and technology, but not exclusively. The Friday, October 26, 2007 newsletter also discusses the successful methods WWII soldiers used to interrogate Nazis, while the Friday, November 2, 2007 newsletter includes the quote “John Marburger, head of the White House science office, realized that the situation she described was serious; decisive action was needed at once — so he deleted half the report.”

The tagline on the site is “Opinions are the author’s and are not necessarily shared by the University, but they should be.” I’ve been reading the newsletter for years and it’s always been interesting.

Professor Park also wrote a book, Voodoo Science: The Road from Foolishness to Fraud, that neatly debunks a lot of hoax (or misguided, to be more charitable) science in a readable way.

Nov 08, 2007

The social and collaboration part of Web 2.0 mostly revolves around the concepts of social networking, user-generated content, and the long tail.

[Image: social tag cloud]

Social networking is the idea that people can meet and talk and organise their social lives using the Web instead of, or in addition to, more traditional methods such as talking face to face, or on the phone. It’s a web-based extension of Usenet and bulletin boards, with more features. Social networking sites tend to go through phases; everyone was into Orkut for a while, now it’s MySpace and Facebook, or Ravelry if you’re a knitter. Features and focus vary, but the idea of creating an online community remains the same.

User-generated content is the idea that non-professionals can contribute content. I don’t like the term much, so I’m going to use the variant user-created content to show that it’s a creative process, not just some machine generating content. The concept of user-created content isn’t new; the Web was first designed as a collaboration platform, the read/write web. In practical terms, however, it was difficult for those without lots of technical knowledge to publish on the web. Things like blogging and commenting are now relatively easy for people to do; just a few years ago they weren’t. Previously only a few people could make their opinions widely known: in practice, professionals with access. Don’t forget that one of the reasons Benjamin Franklin could make such a difference in the early years of the US was that he owned a printing press!

Now basically everyone with access to the internet who’s interested can publish their opinions, their photos, or their videos to their friends and the world. It’s easier to keep in touch with friends far away, or find out what life’s like in some far-off place, or contribute a snippet of knowledge to Wikipedia. Some of these publishers (bloggers, commenters, photo-uploaders) have a large audience; many have an audience that is large enough for them (which may mean just the family, or just themselves, or a few hundred strangers).

One of the downsides of this “democratization”, as it’s sometimes called, is that it can be hard to find the really good information or entertainment; you hear a lot about the “cult of the amateur” and “90% of everything is crap”. Some of this comes from those who are threatened by the availability of information from other sources: journalists and newspapers in particular are right to be scared, since they’re now going to have to work harder to convince the world that they add value. Whether the entertainment created by amateurs that’s available on the web is better than that created by the mass entertainment industry depends on your view of how good a job the latter does at finding and nurturing talent.

The long tail is another aspect of Web 2.0 that you hear about a lot. Booksellers are a good example of how the long tail works: whereas your average bookseller, even Waterstones or Blackwell’s, has maybe a few thousand or a few tens of thousands of books, an internet seller can have millions. The comparison is perhaps not quite fair, since an internet bookseller, just like your local bookseller, can order from the publisher and will usually count that as being part of the inventory for bragging reasons. And, of course, you can always go to Powell’s Books in Portland, which claims to have over a million books physically in its store. It’s big; they hand out maps at the entrance so you don’t get lost.

The long-tail aspect is this: it turns out that most of the revenue doesn’t come from selling the Harry Potter books, big sellers though those are, but from selling the books that aren’t individually big sellers. The total volume of sales in those niche areas is larger than that of the best-sellers. Other companies that make good use of this, of course, are eBay, where you can buy things that you can’t get downtown, uptown, or potentially anywhere in your town, and the video rental company Netflix, which rents out some 35,000 different titles among the one million videos it sends out each day.
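
A back-of-the-envelope illustration, assuming (purely for the sake of argument) that sales fall off in a Zipf-like 1/rank pattern: with a million titles, the top thousand “hits” account for only about half the volume, and the niche titles supply the rest.

    var total = 0, hits = 0;
    for (var rank = 1; rank <= 1000000; rank++) {
      var sales = 1 / rank;            // assumed Zipf-like fall-off, not real data
      total += sales;
      if (rank <= 1000) hits += sales; // the best-sellers
    }
    // Prints roughly 0.52: about half the volume is in the top 1,000 titles,
    // and the long tail of 999,000 niche titles supplies the other half.
    alert("Best-seller share: " + (hits / total).toFixed(2));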

And, of course, the long tail applies to blogs and other online sites. In other words, no matter how specialised your blog is, someone out there in blog-reading land is likely to find it interesting. The big problem is how those potential readers find out about it.

One of a series on Web 2.0, taken from my talk at the CSW Summer School in July 2007. Here’s the series introduction. Coming up next: technical aspects of Web 2.0.

Nov 07, 2007

Like any hyped technology, Web 2.0 has a lot of buzzwords. They include the tag (as in tag cloud), the folksonomy, the long tail (more about that in a later post), and social software.

Social software is there to support networking and social activities via the internet. Lots of people spend lots of time interacting with friends online, whether they’ve ever met them in person or not. For people who are embedded in that world, it’s a natural way to interact. For everyone else, it can be slightly creepy to think that complete strangers read everything you write and know a lot about you. Lots of real-life friendships have blossomed from online activities, and more than a few problems have occurred as well. The social aspect, that is, people interacting with other people, is probably the most important aspect of Web 2.0 sites.

The idea behind tags is to label things, so they’re loosely related to categories or (even more loosely) ontologies. Tags typically aren’t applied by specialists; in keeping with the Web 2.0 philosophy, they are applied by the person writing the blog post, or uploading the photo, or storing the bookmark. So you get near-duplications, misspellings, incorrect usages, double meanings, etc., but at least you do have some sort of categorisation applied to these bits of content. And many people go to quite a lot of effort to see what sorts of tags other people use, and then pick the same ones where possible. This then ends up being a folksonomy.

[Image: Web 2.0 tag cloud] This image shows a tag cloud, which is a collection of tags where the tags in bigger fonts are the more important ones (which usually means they show up more often). Unlike, say, topic maps or RDF, the spatial distribution of the tags doesn’t usually mean anything, although in theory you could use it to show relationships between the tags. Since generally there is no formal relationship between them (other than that from natural language), this would be tricky to automate, and most people just fiddle with the cloud to make it look nice.
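
For the curious, here’s a minimal sketch of how such a cloud is typically produced: scale each tag’s font size linearly between a minimum and a maximum according to how often it occurs. The tag counts are made-up sample data.

    var counts = { ajax: 42, rest: 17, folksonomy: 8, atom: 3 };  // sample data
    var min = Infinity, max = -Infinity, tag;
    for (tag in counts) {
      if (counts[tag] < min) min = counts[tag];
      if (counts[tag] > max) max = counts[tag];
    }
    for (tag in counts) {
      // Interpolate between 10px and 32px; "|| 1" guards against all-equal counts
      var size = 10 + 22 * (counts[tag] - min) / ((max - min) || 1);
      document.write('<span style="font-size:' + size + 'px">' + tag + ' </span>');
    }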

The other buzzwords on the slide are the important ones from a couple of years ago; these days there would be a few more. There’s also a version of the slide with the words linked to the relevant Wikipedia articles.

One of a series on Web 2.0, taken from my talk at the CSW Summer School in July 2007. Here’s the series introduction. Coming up next: social and collaboration aspects of Web 2.0.
