How e-books could revolutionize language-learning

[This article also appears in a slightly edited form as a TeleRead entry]

Ever tried learning a foreign language? Noticed how the books you could read were often boring,and the books you wanted to read were just that bit too hard to understand? Wished,you could have a quick translation of a . . . →Read More:How e-books could revolutionize language-learning

FreshNotes:Web 2.0 company using computational linguistics

FreshNotes (currently in alpha) uses basic named entity extraction and maybe information extraction to produce a website that allows to search and navigate relationships between people and/or topics. The interface,but of course it is all pre-baked at the moment.

From the CL point of view,I can see that there is very little smarts . . . →Read More:FreshNotes:Web 2.0 company using computational linguistics

UIMA’s expectations of the user

I have written about UIMA,IBM’s Natural Language Processing framework before. Since then,I had a couple of attempts to get a feel for it. Unfortunately,it kept feeling uncomfortable and confusing. Finally,I figured out why.

UIMA’s extensive documentation expects that you are committed to the framework. So,the documentation makes sure you understand . . . →Read More:UIMA’s expectations of the user

Creative use of the Named Entity Recognition techniques

Even the basic techniques from the computational linguistics field can make for interesting and intriguing applications. Gutenkarte takes public domain books,extract geographic names present in the text and plots them on the map. The result is an automatic clustering of place references,both visually and (within single click) textually.

The site itself is self-explanatory, . . . →Read More:Creative use of the Named Entity Recognition techniques

Screencast about IBM’s UIMA text processing architecture

UIMA is a new-ish framework on the block competing/cooperating with GATE framework to do NLP processing,annotation and search. Jon Udell recorded a screencast with a couple of IBM-ers to show off and explain UIMA.

While the screencast moves a little slow for a person familiar with sentence tokenizing principles,it is still interesting to . . . →Read More:Screencast about IBM’s UIMA text processing architecture

Fake research paper detector

If I ever do get to write a PhD,I will have to make sure to run it through this detector (as well covered in the New Scientist). Seriously though,this sounds like a great way to show off the computational linguistics (or more specifically data/text mining) experiments. Hopefully such projects will make the field . . . →Read More:Fake research paper detector

Where are the blogs of computational linguists?

I have a lay interest in the field of computational linguistics. So I want to read the current thoughts of the people in the people. But where are they all? You would think they would blog,after all they are good with computers and they are good with languages.

But no! I can find teachers . . . →Read More:Where are the blogs of computational linguists?