By arafalov,on October 9th,2006 [This article also appears in a slightly edited form as a TeleRead entry] Ever tried learning a foreign language? Noticed how the books you could read were often boring,and the books you wanted to read were just that bit too hard to understand? Wished,you could have a quick translation of a . . . →Read More:How e-books could revolutionize language-learning By arafalov,on July 14th,2006 FreshNotes (currently in alpha) uses basic named entity extraction and maybe information extraction to produce a website that allows to search and navigate relationships between people and/or topics. The interface,but of course it is all pre-baked at the moment. From the CL point of view,I can see that there is very little smarts . . . →Read More:FreshNotes:Web 2.0 company using computational linguistics By arafalov,on July 9th,2006 I have written about UIMA,IBM’s Natural Language Processing framework before. Since then,I had a couple of attempts to get a feel for it. Unfortunately,it kept feeling uncomfortable and confusing. Finally,I figured out why. UIMA’s extensive documentation expects that you are committed to the framework. So,the documentation makes sure you understand . . . →Read More:UIMA’s expectations of the user By arafalov,on June 28th,2006 Even the basic techniques from the computational linguistics field can make for interesting and intriguing applications. Gutenkarte takes public domain books,extract geographic names present in the text and plots them on the map. The result is an automatic clustering of place references,both visually and (within single click) textually. The site itself is self-explanatory, . . . →Read More:Creative use of the Named Entity Recognition techniques By arafalov,on May 2nd,2006 UIMA is a new-ish framework on the block competing/cooperating with GATE framework to do NLP processing,annotation and search. Jon Udell recorded a screencast with a couple of IBM-ers to show off and explain UIMA. While the screencast moves a little slow for a person familiar with sentence tokenizing principles,it is still interesting to . . . →Read More:Screencast about IBM’s UIMA text processing architecture By arafalov,on April 25th,2006 If I ever do get to write a PhD,I will have to make sure to run it through this detector (as well covered in the New Scientist). Seriously though,this sounds like a great way to show off the computational linguistics (or more specifically data/text mining) experiments. Hopefully such projects will make the field . . . →Read More:Fake research paper detector By arafalov,on January 30th,2006 I have a lay interest in the field of computational linguistics. So I want to read the current thoughts of the people in the people. But where are they all? You would think they would blog,after all they are good with computers and they are good with languages. But no! I can find teachers . . . →Read More:Where are the blogs of computational linguists? | |