Jump to content

Talk:Computational linguistics

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Francis Tyers (talk | contribs) at 14:36, 28 February 2009 (Merge with NLP?). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

WikiProject iconComputing Start‑class
WikiProject iconThis article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
???This article has not yet received a rating on the project's importance scale.
WikiProject iconComputer science Start‑class High‑importance
WikiProject iconThis article is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of Computer science related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
HighThis article has been rated as High-importance on the project's importance scale.
Things you can help WikiProject Computer science with:

If you folks don't mind, I'm going to make some changes to the Origins section of this article some time in the next week. I'm going to add some material about the work of David G. Hays. He led the machine translation effort at RAND back in the 50s and 60s, was one of the authors of the ALPAC report, wrote the first textbook in computational linguistics, was instrumental in founding the Association for Computational Linguistics, and the International Committee on Computational Linguistics, was the first editor of the ACL's journal and so forth. I'll be adding an article about Hays himself, then I'll link it to this article, and, as I said, make a some changes to the history section.

AI is is generally considered to have originated in 1956 with a conference at Dartmouth, where the term "artificial intelligence" was coined. MT was under-way before that time. When MT morphed into CL during the mid-60s it maintained a somewhat separate identity from AI and, so far as I know, does to this day. AI, CL, and NLP are closely related, obviously, and some researchers may consider themselves to be two or three of those, or only one. Bill 21:22, 16 March 2006 (UTC)[reply]

Merge with NLP?

Judging from the discussions, there doesn't seem to emerge a consensus on whether or not this article should be merged with Natural Language Processing. I think they should be merged since

  • Although it might be historically justified to speak about a division, this is meaningless as the fields stands today. Whether you call it NLP or CL is today arbitrary and mostly a question of what aspect you stress.
  • Current articles in the Computational Linguistics journal, and at the Coling and ACL conferences, are judged on whether they are useful rather than on whether they give any insight on how humans process language.
  • Most of the material in this article belongs in the NLP article anyway.
  • We can't seem to come up with a definition of CL, and where to draw the line between the two "fields".

So I vote for a merge. Kallerdis (talk) 20:06, 29 February 2008 (UTC)[reply]


Dear All,

I am painfully aware of the many names used for this area (CL, NLP, Language Engineering, Human Language Technology, Language Technology, ...). Many of these terms denote nearly identical areas, but not all. My experience is that, in many countries, one uses two separate concepts, one narrow and one broad:

  1. The narrow one being defined (as also cited below by others) as a study of language using the methods and inspiration from computer science. This definition puts much weight on the automatic parsing, especially using linguistically motivated grammar models.
  2. The broad concept includes the narrow one as its core, but also includes much material from the areas where the core methods can be applied. It is also common that speech technology is included as a part of the broad area.

I would not be alone in proposing that, in the Wikipedia, we would establish two concepts (=articles) one for the narrow one and use the title computational linguistics for it, and another with title language technology (or human language technology). Both could have an account of the possible synonyms or alternative terms. The distinction is, anyway present in many languages other than English (Finnish 'tietokonelingvistiikka' vs. 'kieliteknologia', Swedish, Danish and Norwegian roughly 'datalingvistik' vs. 'språkteknologi').

Together with some colleagues, we would like to use Wikipedia as a platform for creating consistent multilingual terminologies for CL/LT. This would be feasible if there are consistent concepts which can be linked between languages.

--
Kimmo Koskenniemi,
prof. of computational linguistic
(but teaching language technology)
--

HI....

I'm a student at a computational linguistics department (CoLi at Saarbruecken), and I have been a (visiting) student at a natural language processing department (HCRC at Edinburgh). I think each might have been close to being the biggest in the world in their respective "fields". But honestly, I can't tell the difference!!!

I think the key is: how are the terms commonly used? Once, (while I was stationed at Edinburgh but making a weekend trip to Cambridge), a Cambridge prof came up to me and said, "So, you're from Edinburgh, I bet you do NLP." Yes, I replied, not wanting to get into the details about really being from CoLi at Saarbruecken. "Heh," he says, "NLP is totally unfounded. At Cambridge, we do computational linguistics." Now (for those of us who are not irony impared ;) I think this is pretty strong evidence that the terms are used as synonyms within the field. And if they are used as synonyms within the field, then what exactly is this page describing?

PS... with the ACL quote... I think it might be decades old. If there is an "engineering/science" distinction between NLP and CL, then the ACL quote has got to be ironic because most of the research in their journal and at their conferences is clearly engineering.

-- pobody




I don't disagree that Natural Language Processing is useful and important. I'm just not convinced that Computational Linguistics is the same thing as Natural Language Processing.

According to the The Association for Computational Linguistics web page defining Computational Linguistics, ( http://www.aclweb.org/archive/what.html ) the definition is broader than appears on this page:

computational linguistics is the scientific study of language from a computational perspective. Computational linguists are interested in providing computational models of various kinds of linguistic phenomena.


i'm changing the article heavily and splitting large portions off into [Natural Language Processing] to reflect the fact that these two fields are related, but not at all the same thing. --jkominek

Well, but I do not think that NLP and CL do not overlap, I think they have a lot in common. At my university we do mostly NLP in computational linguistics. In the first year, there are the foundations (theoretical linguistics, computer science, logic, relations, trees, grammars), and the second year class is called "Natural language processing systems". According to the professor, computational linguistics IS about designing natural language processing systems. But I do not have anything written on my desk here (will deliver this later, ok?). I must also admit that he was CS prof before, so he may be biased ;-). Of course it is ok to have a separate entry, as CL and NLP are not synonyms.

I recommend to delete the sentence "... which is in the domain of computer science", as this implies NLP does not belong to CL. Instead, I would write "An important task of CL _is_ NLP" or sth. similar.

I also do not agree that CL is (only) a subfield of linguistics, as it is an interdisciplinary field, somewhere in between theoretical CS/maths, applied computer science, AI, and linguistics.

As I am new here, and still want to think about, I do not make changes immediatly (I also do not have the time atm to build proper, good-looking sentences, as I am at work).

Best regards,

-- zeno


Computational linguistics is the original term for the field of language processing that developed following the collapse of MT with the ALPAC report in the 1960s. Originally, it was thought that computers could perform translation of language quite easily, whereas in fact its exactly the opposite. I would characterize the difference between computational linguistics and natural language processing as the difference between computer science and software development, i.e., between the theoretical and the practical. Some computational linguists are not really interested in natural language processing and prefer to work exclusively on theoretical problems, whereas others are actual practitioners of the field--same as in computer science.

Also the assumption that computational linguists are linguists is wrong. Computational linguists came into the field from a variety of disciplines including mathematics, computer science and psychology. Computational linguistics predates AI. It is a separate and parallel field of those concerned with having computers process language in any way. It's major subfields include: speech generation, speech recognition, parsing theory, text generation and of course its original impetus, mechanical (or machine) translation.

Whereas today I expect you can easily study computational linguistics in a linguistics department, in the 1970s and early 1980s this was rare.

Dr. Robert A. Amsler, Sr. Computational Linguist, SEA/DOE


Computational linguistics would seem to involve more statistical (pattern recognition, markov, etc.) and NLP more determining parts of speech and trying to get meaning. Wouldn't it? User:KellyCoinGuy

No. both are used/covered by both "fields". --zeno 00:37, 21 Dec 2004 (UTC)
According to the Handbook of Computational Linguistics (2007), "Computational Linguistics" is used when a substantial amount of linguistic knowledge is incorporated into the computational model whereas "natural language processing" usually refers to a mainly stochastic language modeling. --Tomonori 03:31, 1 August 2007 (UTC)[reply]

I had it explained to me something like as follows:

  • Computational linguistics is where we make programs that should work, but don't and we don't know why
  • Natural language processing is where we make programs that shouldn't work, but do and we don't know why

- Francis Tyers · 14:36, 28 February 2009 (UTC)[reply]

Computational Historical Linguistics

In the last decade computational methods have been applied to historical linguistics. This article does not cover this aspect. Really a separate article is required with a change to the title of this one to separate it. Adresia (talk) 18:52, 29 November 2007 (UTC)[reply]

Stub template

Is there any stub template for use in articles related to computational linguistics? I haven't found any, but I think it would be viable, if it really didn't exist. -- Sandius 20:09, 7 February 2006 (UTC)[reply]

"Free online introductory book"?

As of this time, the external link Free online introductory book on Computational Linguistics has not responded at all for several hours. I have no way of knowing if this is temporary, but the link should be removed if this continues. 75.15.115.31 (talk) 08:21, 15 December 2008 (UTC)[reply]

Found it in the Internet Archive. --Thüringer ☼ (talk) 12:46, 15 December 2008 (UTC)[reply]