Kottke quotes from Steven Levy’s Wired magazine article on the syntax and evolving language of search queries:
Google’s synonym system understood that a dog was similar to a puppy and that boiling water was hot. But it also concluded that a hot dog was the same as a boiling puppy. The problem was fixed in late 2002 by a breakthrough based on philosopher Ludwig Wittgenstein’s theories about how words are defined by context.
Being reasonably acquainted with Wittgenstein, I found myself wondering which of his ideas came so integrally into play in solving this problem. The Wired article only links to Wittgenstein’s Stanford Encyclopedia of Philosophy article, which includes a survey of all his major concepts and works. Was it his distinction between sense and nonsense? His arguments against a private language? His work on the connection between seeing and saying and his example of the “duckrabbit”? Or perhaps it was something he didn’t discover but simply weighed in on, like ostensive definitions or contextualism?
The strongest candidate, though, might be his concepts of language games and family resemblance. Wittgenstein’s best-known example of a language game is the “builder’s language.” Here’s how he describes it:
The language is meant to serve for communication between a builder A and an assistant B. A is building with building-stones: there are blocks, pillars, slabs and beams. B has to pass the stones, in the order in which A needs them. For this purpose they use a language consisting of the words “block”, “pillar” “slab”, “beam”. A calls them out; — B brings the stone which he has learnt to bring at such-and-such a call.
This is a very small kit of parts; a lexicon of just four elements, combined in a certain way. But by uttering these words in the right context, a building gets built. The meaning these words have comes from their ability to activate the builder’s assistant to do what the master builder is asking. And their family resemblance has to do with this limited language, in which these words’ meaning is defined by their context and shared by the two builders. After the workday is through, the builder might look forward to how his children “beam” at him when he arrives home, and the context is entirely different.
The Wired article continues,
As Google crawled and archived billions of documents and Web pages, it analyzed what words were close to each other. “Hot dog” would be found in searches that also contained “bread” and “mustard” and “baseball games” — not poached pooches. That helped the algorithm understand what “hot dog” — and millions of other terms — meant.
A rock is a rock. It’s also a stone, and it could be a boulder. Spell it “rokc” and it’s still a rock. But put “little” in front of it and it’s the capital of Arkansas. Which is not an ark. Unless Noah is around.
Oh, and on the headline above — just my humble attempt to confuse the hell out of Google.

