More details emerge on Powerset’s engine

Dan Farber has written a good article on Powerset.

It mostly talks about their grandiose marketing plans and how NLP (Natural Language Processing) will change the world, however it also has a reasonable explanation of what they are doing with fairly transparent references to (expanded) WordNet, named entity recognition, event extraction and semantic web technologies.

It is also . . . → Read More: More details emerge on Powerset’s engine

“State of the art” NLP Wiki

As a starting NLP/CL researcher, I find it really hard to wade through the fragmented community’s research efforts, software and evaluation methods. I am sure, eventually I will settle down into my specific area and will know most of the important works, however I want to have a better view of the general field now. I . . . → Read More: “State of the art” NLP Wiki

Laying out penn treebank output of Stanford parser

I am trying to use Stanford NLP parser for my research and I need to look at the trees it produces for large, complex sentences. I have found several packages for laying out the output as trees, but they are all seem to be targeted at visualizing smaller sentences, suitable for illustrating a point in the . . . → Read More: Laying out penn treebank output of Stanford parser

I have 3 Spock beta invitations to give away

I have written about Spock – the supposedly computational linguistics heavy search engine – before. I have to admit that I could not see what people were getting so excited about. Recently, I have received an invitation to the Spock’s beta. And I am still not very excited.

Similarly to constantly shifting FreshNotes/Knover, this company started from . . . → Read More: I have 3 Spock beta invitations to give away

Spock announces an Entity Resolution competition

Netflix prize must be doing well, as there are now other companies willing to tap into web’s researchers with deep knowledge of computational techniques.

The latest company is Spock, a company so new that you have to read 3rd party sites to figure out exactly what they do. Even to use it, one has to signup for . . . → Read More: Spock announces an Entity Resolution competition

UIMA – a “very quick” quick start guide

UIMA (Unstructured Information Management Architecture) project has recently transitioned from IBM to the Apache incubator. This is only for the open source part, the commercial part is – and will stay – with IBM.

I have last written about UIMA a very long time ago (1, 2) , so I decided to give it another look.

It is . . . → Read More: UIMA – a “very quick” quick start guide

Computational Linguistics – News update for Nov 15, 2006

Lots of new sightings of CL/NLP technologies since the last update:

On the commercial speech recognition front, Nexidia is currently in beta with phonemes-mapping audio search. But don’t go to the company’s site. Instead, read the explanation and collection of links is in the ResourceShelf’s article.
If, instead of waiting for commercial offerings, you would like to contribute . . . → Read More: Computational Linguistics – News update for Nov 15, 2006

Lirix – computational linguistics aspects

In my last update on applied computational linguistics, I have written about PodZinger that uses speech recognition to figure out which advertisement to match to the podcast you are searching with their service.

Another company is claiming to do that with songs – Lirix. Their upcoming AdLirix platform is supposed to be so effective that Lirix would . . . → Read More: Lirix – computational linguistics aspects

There! are the blogs of computational linguists

Nine months ago, I had asked “Where are the blogs of computational linguists?” Now, there is an answer.

The Association for Computational Linguistics has moved its documents (formerly ACL Universe) into the Wiki and there is now a separate page for blogs. It has all of the blogs I found so far and more. It even has . . . → Read More: There! are the blogs of computational linguists

Computational Linguistics – News update for Oct 9, 2006

Couple of interesting things happened recently in the Computational Linguistics related fields that I thought were worth linking to:

ACM Queue had an interview with Mike Cohen of Google (previously of Nuance Communications) discussing recent advances and changes in speech recognition technology.
Pluggd, with its hotly discussed demo of HearHere, uses speech recognition and some sort of topic . . . → Read More: Computational Linguistics – News update for Oct 9, 2006