arxiv: cmp-lg/9706024 · v1 · submitted 1997-06-18 · cmp-lg · cs.CL

A Lexicalist Approach to the Translation of Colloquial Text

classification cmp-lg cs.CL
keywords translation, found, text, approach, colloquial, english, system, television

Colloquial English (CE), as found in television programs or typical conversations, differs from the text found in technical manuals, newspapers, and books. Phrases tend to be shorter and less sophisticated. In this paper, we look at some of the theoretical and implementational issues involved in translating CE. We present a fully automatic, large-scale, multilingual natural language processing system for translating CE input text, as found in the commercially transmitted closed-caption television signal, into simple target sentences. Our approach is based on Whitelock's Shake and Bake machine translation paradigm, which relies heavily on lexical resources. The system currently translates from English to Spanish, with translation modules for Brazilian Portuguese under development.
