Google translate assessment with error analysis : an attempt to reduce errors - USD Repository

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

(11)

(12)

(13)

1

CHAPTER I

INTRODUCTION

A. Background of the Study

In our current world, there is no more limitation since technology is

developing rapidly. People are facilitated sufficiently to get everything easily,

including information. There is a lot of information accessed by people from

media. The media can be written media such as newspapers, books, or magazine,

audio-visual media such as television and radio, and online media such as internet.

In accessing them, however, people still get obstacle in case of language.

Therefore the existing technology provides very vast information which is not

only from one area/country that means one language but also from many countries

with their own languages. Ramis (2006, 1) says that "still, the language barrier is

the only obstacle for this vast information to be fully shared by all users". Ramis

states that accessing information optimally requires people to be literate in more

than one language. It becomes problems for those who only master one or two

languages. In this context, translation plays an important role to help people to

access and understand the information from other languages. People finally do not

have to master many languages because translation has done the job.

Bell (1997:6) defines translation as the replacement of a representation of

a text in one language by a representation of an equivalent text in a second

(14)

paragraphs, or a whole text from a Source Language (SL) to a Target Language

(TL). Along with the developing technology, translation is also influenced by that

development. Currently, translation can be done both manually and automatically.

Manual translation is fully done by human meanwhile automatic translation is

done by computer system which is in practice, with or without human assistance

(translated from Nababan, 1999: 134). The latter which we can call as Machine

Translation (MT) have been the focus of research in translation since 1950s. From

the research, US, Canada, and European countries have developed several systems

of MT. The Systems are among others Météo, Systran, Eurotra, Ariane, and Susy

(Ramis, 2006: 2).

Indeed, the systems are not perfect tools to result satisfying translation

from one language to another language because of several limitation owned by

any kind of machine. In his Ph. D. paper, Gispert reports the Bar-Hillel analysis

that Fully Automatic High-Quality Translation (FAHQT) was an unreachable goal

and that the enthusiasm of the MT research is up and down. It shows scepticism in

viewing the existence of MT in contributing for the development of translation

itself. But looking at the facts of great demand of tools translating texts to help

people facing interlingua condition, the MT is still continuously improved more

and more.

In this study, the researcher focuses on the error analysis of Machine

Translation output because the researcher realizes that it is impossible to analyze

all aspects of Machine Translation whose scope spread from linguistic aspect to

(15)

output to repair. Particularly, the researcher chooses Google Translate, the

most-often-used Machine Translation. The researcher is going to investigate what errors

are found in a text translated by Google Translate. It is because the researcher

finds several errors when one text in one language is translated into English by

Google Translate and finds that it will translate a text differently at two different

period of time. Google Translate is one of Google-search-engine features. Google

Translate is a Machine Translation based on Systran system (Research at Google,

2012).

It is explained that Google Translate is statistically based machine

translation developed from Franz Joseph Och research. It can translate one text

into more or less fifty languages; one of them is from English to Indonesian and

vice versa. One of obvious examples is the translation of iPad user guide from

English to Bahasa Indonesia. "Keep all your app subscriptions in one convenient

place" is translated to "Jauhkan semua langganan aplikasi Anda di satu tempat

yang nyaman". The original meaning in Source Text shifts when it is translated

using Google Translate. Instead of "Jauhkan", "Keep" should be translated to

"Simpan". As said previously, Machine Translations including Google Translate

still have limitation. In this case, it is obviously shown that the translation is not

equivalent. The Target Text translated by Google Translate gives totally wrong

instruction for Target Readers.

According to the real example above, the researcher is interested to

analyze the errors done by Google Translate in translating an English text into

(16)

guide, National Geographic article, and the Ownership Agreement document. By

the result of analysis, the researcher hopes that users can minimize their misuse in

using Google Translate to translate one text and can use that Machine Translation

appropriately.

B. Problem Formulation

There are two problems formulated in this research, namely

1. What errors are found in the three texts of the Ownership Agreement

document, a National Geographic article, and iPad user guide which are

translated into Indonesian using Google Translate?

2. What suggestions are proposed to reduce errors in using Google-Translate?

C. Objectives of the Study

By the problem formulated above, the objectives of this research are to

find and analyze errors in the Bahasa Indonesia texts of the Ownership Agreement

document, National Geography’s article, and iPad user guide which are translated

by Google Translate. And the second is finding suggestions to reduce errors in

using Google-Translate.

D. Benefits of the Study

Theoretically the researcher expects that this research contributes as one of

(17)

Machine Translation and that this finding can be secondary data of errors found in

the translation done by Google-Translate.

Practically, the benefit of this study is not to create absolute solution

which can finish all problems since this study only focuses on linguistic analysis

in Machine Translation, but at least it is to suggest some ways which hopefully

can reduce errors done by Google Translate as a tool to produce raw translation.

E. Definition of Terms

In order to have the same perceptions and terminologies used frequently in

this research, the researcher defines the following terms.

1. Machine Translation

The term Machine Translation (MT) is the now traditional and standard

name for computerized systems responsible for the production of translations

from one natural language into another, with or without human assistance

(Hutchin, 1992: 3).

2. Error Analysis in Machine Translation

Error Analysis in machine translation is counting errors of texts translated

by Google Translate with a classification of errors. It is also an index of the

amount of work required to correct 'raw' Machine Translation output to a standard

(18)

3. Google Translate

Google Translate is an online machine translation system published by

Google which provides automatic translation for more or less 57 languages in the

world including Indonesian. (Research at Google, 2012).

Google Translate is produced with some approach: the computer is fed

with billions of words of text, both monolingual text in the target language, and

aligned text consisting of examples of human translations between the languages.

Then it applies statistical learning techniques to build a translation model.

(19)

7

CHAPTER II

THEORETICAL REVIEW

A.Review of Related Studies

1. Review of Error Analysis of Google Translate Translating Indonesian Text

into German Text

The researcher reviews the study on MT has been done by Iman Santoso.

On his paper entitled “Analisis Kebahasaan Hasil Terjemahan Google-Translate

Teks Bahasa Indonesia Ke Dalam Bahasa Jerman”, he investigated some errors of

Google-Translate in translating Indonesian texts into German texts. In his

research, he applied linguistic approach by using four aspects of linguistics, i.e.

Orthography, Morphology, Syntax, and Semantics. After he classifies the data, it

was shown that Google-Translate had errors in those four aspects. The errors are

simple ones which actually can be translated well by human translators. It

indicates that MT, in this case Google-Translate didn't have sufficient ability yet

to translate the texts equivalently.

According to the data, from the four aspects mentioned the most errors

belong to morphological errors. There are 25 morphological errors of 59 total

errors. An example of orthographical errors is mis-spelling of proper name.

"Gayus" and "Aburizal Bakrie atau Ical" in Indonesian text are translated into

"Gaius" and "iCal Bakrie" in German text. An example of morphological errors is

an Indonesian word "Pendiri" which is translated as "Gründer". The word is

(20)

example of syntactical errors is misstucturing of the sentence "Buku komiknya

sendiri akan beredar di AS mulai akhir Desember dengan harga 6,99 dollar AS".

By Google Translate it is translated "Comic Buch selbst wird in den USA

verbreitet werden ab Ende Dezember 2010 mit dem Preis von $ 6,99". The

translation contains errors in passiving, the appropriate translation must be "Das

Comicheft wird dann in den USA ab Dezember-Ende 2010 mit dem Preis von $

6,99 verkauft". In semantic, errors are found in translating the phrase "naik daun"

to be "stieg". "naik daun" is a metaphor which is synonymous with "being

famous". The appropriate translation must be "berühmt werden.

Based on his study Santoso concludes that Google-Translate translated

texts from Indonesian into German word by word by ignoring its context. Users

have to edit the text after translated by Google-Translate in order to get better

result. He also highlighted that the shorter the texts, the better the result.

2. Review of The Translation Analysis of English Imperatives Translated

into Indonesian by Google Translate

The analysis of Google-Translate errors also done by Kemala Meilinda

Putri. She analyzed the errors of Google-Translate in translating English

imperatives into Indonesian.

She finds that Google-Translate did two categories of errors, those are

function word and miss-selection of words with similar meaning

(21)

Putri states that Google-Translate cannot identify different markers of

English and Indonesian imperatives. Similar to Santoso, she said that the machine

merely translated the sentence word for word.

In her suggestions, she asked users to concentrate and to read the result

again.

According to the two previous study about error analysis on machine

translation, currently the researcher is going to investigate the errrors on machine

translation and tries to give suggestions to users to reduce the errors. This research

does not only concern to what errors occured, like the previous studies, but also

gives some alternatives to overcome the problem.

B.Review of Related Theories

1. Translation

There are many definitions of translation. Every expert certainly has his/her

own definition based on his/her own perspectives. Bell defined the translation

based on some resources as.

[...] the expression in another language (or target language) of what has been expressed in another, source language, preserving semantic and stylistic equivalences [...] Translation is the replacement of a representation of a text in one language by a representation of an equivalent text in a second language. (1997: 5-6)

An expression in one language should have its

translated-to-target-language expression. The expression then should be equivalent in term of semantic

(meaning) and stylistic (the style of language). It is apparent that the keyword

(22)

as similar as possible in case of delivering message in the source language (SL) to

the target language (TL) reader. Bell also adds a quite understandable definition

for “equivalence”.

Texts in different languages can be equivalent in different degrees (fully or partially equivalent, in respect of different levels of presentation (equivalent in respect of context, of semantics, of grammar, of lexis, etc.) and at different ranks (word-for-word, phrase-for-phrase, sentence-for-sentence) (1997: 6).

The equivalence which is varied shows that it is apparent, and has been for

a very long time indeed, that the ideal of total equivalence is impossible to achieve

(Bell, 1997: 6). Therefore there will be no absolute equivalence which can convey

meaning to the TL as similar as in the SL. But at least, it is sufficient to look for

the closest equivalence for each word, phrase, and sentence in the TL.

The following picture is the transformation of a source language text into a

target language text. The process takes place in the memory where it analyzes the

source language text into a universal semantic representation and synthesizes the

semantic representation into the target language text. The following page shows

(23)

Picture 2.1. Process of Translation

2. Machine Translation

Hutchin (1992: 1) explains that Machine Translation is a traditional call for

programs which can produce ‘raw’ translations of texts in relatively well-defined

subject domains, which can be revised to give good quality translated texts at. It is

a tool which can be used to translate automatically one text to another language,

with or without human assistance.

There are some system designs for Machine Translation. The first is the

direct translation approach: the Machine Translation system is designed in all

details specifically for one particular pair of languages in one direction. The

second is the interlingua approach, which assumes the possibility of converting

texts to and from “meaning” representations common to more than one language.

Translation is thus in two stages: from the source language to the interlingua, and

from the interlingua into the target language. The third type is the less ambitious

transfer approach. Rather than operating in two stages through a single interlingua

meaning representation, there are three stages involving, usually, syntactic

representations for both source and target texts. The first stage converts texts into Analysis

Synthesis Source

Language Text

Target

Language Text Semantic

(24)

intermediate representations in which ambiguities have been resolved irrespective

of any other language. In the second stage these are converted into equivalent

representations of the target language; and in the third stage, the final target texts

are generated.

3. Evaluation of Machine Translation

Hutchin (1992: 161) states that the translations produced by MT systems

are inadequate. The MT systems cannot result good translations which are

equivalent in term of semantic and stylistic. He added that Fully Automatic High

Quality Translation (FAHQT) which can produce good translation is not present

possible thus the translations produced by Machine Translation merely stops in

phase of ‘raw translations’ which still need revising or post-editing.

The way to increase the quality of the translation is by evaluating the MT

systems. Here Hutchin (1992: 162) shows the principal areas in which evaluation

can take place, what aspects should be taken into consideration, and some of the

methods may be employed. The types and stages of evaluation are divided into

five categories.

a. Prototype evaluations : Evaluation will be restricted to the testing of

processes alone, without consideration of potential operational environments.

b. Development evaluations : Assuring that the systems does what it is

claimed to do. Economic viability, such as a commercial product, potential

(25)

c. Operational evaluations : assessment of how much and what kind of

human input is required to produce acceptable translations, what technical

facilities are required, how improvements can introduced, etc.

d. Translator evaluations : Finding out all of problems undergone by

Machine Translation from translator’s perspective, e.g. how much work will

be involved in pre-editing, post-editing, or interactive operation, whether

productivity will be increased, how much time and effort is saved, and the

impact on working practices.

e. Recipient evaluations : the final stage of evaluation is concerned

primarily with quality, cost, and speed.

4. Linguistic Evaluation of Raw Output

Common to all the stages is the testing of the linguistic quality of the

output or the quality of raw translations (Hutchin, 1992: 163). There are two types

of the basic computer processes.

a. Glass-box evaluation

- Assessment by those who have access to all the workings of the system.

- Available only to the researchers and developers of prototype systems.

b. Black-box Evaluation

- Assessment by those who can work only with inputs and outputs.

- Potential purchasers and users are restricted to black-box evaluations.

In addition, in examinations of systems, it needs to distinguish further

between overall assessments of 'quality' and more detailed identifications of

(26)

a. Quality Assessment

To test the quality of a translation, there are three obvious objects which

can be tested, they are: (1) its fidelity or accuracy, the extent to which the

translated text contains the 'same' information as the original; (2) its intelligibility

or clarity, the case with which a reader can understand the translation; and (3) its

style, the extent to which the translation uses the language appropriate to its

content and intention. (Hutchin, 1992: 163)

However, applying the test on those three points is excessively subjective.

The readers who are asked to assess the text, either assessing how informative the

text was, evaluating isolated sentences, or measuring style, will have their own

judgments. Their judgments possibly will be different in comprehension and

perspective in which the result will be difficult to measure.

b. Error Analysis

In most instances the most useful practical information is obtained from

error counting. It is an index of the amount of work required to correct 'raw'

Machine Translation output to a standard considered acceptable as a translation

(Hutchin, 1992: 164).

Commonly, the error analysis is done by counting each addition or

deletion of a word, each substitution of one word by another, and each instance of

the transposition of words in phrases, and then calculating the percentage of

corrected words (errors) in the whole text. Similar to Quality Assessment,

however, the method cannot be completely objective because revisers differ in

(27)

which are dependent on the particular circumstances in which the revision is

taking place. Although it seems no better way to evaluate the output objectively,

the creation of operational evaluations can be a solution to end those problems.

Hutchin (1992: 164) says that what is needed is a classification of errors

by type of linguistic phenomenon and by relative difficulty of correction. He

stated that some lexical errors are easily resolved by simple changes to

dictionaries, while other may have implications for grammatical rules and for a

whole range of vocabulary items. Some grammatical mistakes may be corrected

by simple adjustments to a few lexical entries; others might involve alterations to

the basic design of whole translation modules.

In addition, there are still obvious difficulties although operational

evaluations are created. They are the selection of texts tested, what should be

tested, and how performance should be measured.

5. Koponen’s Classification in Assesing Machine Translation

Koponen (2010: 4-5) on her journal “Assessing Machine Translation

Quality with Error Analysis” proposed errors categories and the way the tested

texts are selected. He divides errors categories into two big classes, i.e. relation

between source and target concepts and relation between concepts.

a. Relation between source and target concepts (Koponen, 2010: 4-5)

i. Omitted concept: ST concept that is not conveyed by the TT.

ii. Added concept: TT concept that is not present in the ST.

iii. Untranslated concept: SL words that appear in TT.

(28)

v. Substituted concept: TT concept is not a direct lexical equivalent for ST

concept but can be considered a valid replacement for the context.

vi. Explicitated concept: TT concept explicitly states information left implicit in

ST without adding information.

b. Relations between concepts (Koponen, 2010: 6)

i. Omitted participant: ST relation not conveyed by the TT due to an omitted

head or dependent.

ii. Omitted relation: ST relation not conveyed by the TT due to

morpho-syntactic errors that prevent parsing the relation although both concepts are

present in the TT.

iii. Added participant: TT relation not present in ST introducing an added

concept.

iv. Added relation: TT relation not present in ST arises due to morpho-syntactic

errors.

v. Mistaken participant: Head or dependent of the relation different in ST and

TT, not same entity.

vi. Mistaken relation: relation between two concepts different in ST and TT,

changed role.

vii. Substituted participant: Head or dependent of the relation different in ST and

TT, same entity.

viii. Substituted relation: relation between two concepts different in ST and TT,

(29)

Then for selecting the text, Koponen (2010: 2) suggested to choose three

source texts which are different in types. These different genres are selected to

observe whether the machine translation accuracy varies with text type.

6. Semantics

O’Grady (2010: 203) states that there is more to languages than just form.

Sound pattern, morphological structure, or synactic organization are just the form

of utterances which is obviously heard and seen. However, in order for language

to fulfill its communicative function, utterances must also convey a message; they

must have content (O’Grady, 2010: 203). He adds that what is meant by content in

this context is meaning.

O’Grady explains that the meaning is on the relation between words and

sentences (2010: 204-208). In the first relation, words and phrases can enter into a

variety of semantic relations with each other such as synonymy (automobile =

car), antonymy (dark >< light), polysemy (bright shining, intelligent), and

homophony (bank=financial institution and bank=an edge of a river). Then in the

second relation, sentences also have meanings that can be analyzed in terms of

their relation to other meanings such as paraphrase, entailment, and contradiction

Meanwhile for the meaning itself, O’Grady gives four proposals of

approach to meaning. One of them is componential analysis or semantic

decomposition. This approach tries to analyze the meaning of certain types of

speech classes in terms of semantic feature. The example is man and boy could be

grouped together as [+human, +male] while man and woman could be put in a

(30)

7. Farrus’ Solutions in Optimizing The Use of Machine Translation

Farrus (2011) stated that to resolve the errors described in the previous

section, several techniques need to be applied according to the idiosyncrasy of the

problem. He classified it into two types:

a. Text Edition

Some of the errors need to be analyzed by processing the text before or

after the translation.

b. Grammatical Category-Based Approach

Grammatical categories have been successfully used in statistical

translation in order to deal with several problems like reordering (Crego and

Marino [2007] as quoted by Farrus [2011]) and automatic error analysis (Propovic

et al [2006] as quoted by Farrús [2011]). The aim is to add the grammatical

category, via a tag, corresponding to the word to translate, so that the statistical

model is capable of distinguishing the word depending on its category and to learn

from context.

C. Theoretical Framework

All these theories previously explained will help the analysis which later is

used to answer the two problem formulations.

The first theory to the fifth theory about the concept of translation and

machine translation, machine translation evaluation, and error categories are

applied to answer the first problem formulation. The theories is used to classify

(31)

Next, the theory of Farrus’ Solutions is applied to look for the most

effective method for reducing errors as the answer of the second problem

formulation. The suggestion leads the researcher to look for the most effective

(32)

20

CHAPTER III

METHODOLOGY

A. Object of the Study

The object of the study is sentences taken from three informative texts.

The texts are iPad user guide (the first chapter "At Glance"), National Geographic

article (If They Could Only Talk), and the Ownership Agreement document (The

Yacht Ownership Agreement). The researcher believes that the three texts are the

most truthful English text because they are published by offical company or

website, i.e. Apple Inc., National Geographic, and Limcharoen Hughes &

Glanville. Moreover most of them represent texts which are often accessed by

common readers. iPad user guide is instructions of how to use iPad, the National

Geographic article is information about the legend of walking statues in Easter

Island, and the Ownership Agreement document is about a contract letter of Yacht

ownership.

The forms of data are sentences which are vary in their length and

complexity. The Ownership Agreement document and the National Geographic

contain fairly long and complex sentences, meanwhile the iPad’s user guide

contain many short imperative sentences and sentence fragments. Then all of the

sentences are translated into Indonesian using Google Translate and in the end the

researcher has English sentences and Indonesian sentences from the three texts. In

detail the researcher has 309 English sentences and 309 Indonesian sentences

(33)

the National Geographic article, and 97 English sentences and 97 Indonesian

sentences from The Ownership Agreement document.

B. Method of the Study

This study uses library research. The method is conducted to gather the

information and theories of machine translation and linguistic aspects in order to

be able to analyze the data. It is also done to find the indicator of errors which will

be used to assess the Bahasa Indonesia translation of the three English texts.

The data in this study are primary data which means that the data are not

taken from the other studies or researchers. The data are collected by the

researcher from the English software user guide, the English magazine, and the

English legal document.

C. Research Procedure

1. Kinds of Data

The data in this research are objective data. They consist of whole

document of the first chapter of iPad user guide, of one article from National

Geographic magazine, and of the Ownership Agreement document.

2. Data Collection

In collecting the data for this research, the researcher took several steps in

order to get complete and sufficient data.

Firstly, the researcher chose three texts. They were selected to represent

(34)

ability varied with text type (Koponen, 2010: 2). The three texts were the first

chapter “At Glance” (iPad user guide), If They Could Only Talk (National

Geographic Magazine), and Fractional Yacht Ownership Agreement (legal

document). Then number of sentences from each text were counted and put in the

table. By applying simple purposive sampling, the researcher finally had 77

sentences from software user guide, 34 sentences from the magazine, and 24

sentences from the legal document. After that all sentences were translated using

Google Translate and the translation was also put in the table. The table was

arranged in such a way so that each sentence would be side by side with its

translation. In the table “No.” is number of each sentence in one article, Source

Text (ST) is the original sentence, and Target Text (TT) is the sentence translated

by Google Translate. The table was displayed as the following.

Table 3.1. Data No. 1

No. Source Text (ST) No. Target Text (TT)

184 Keep your calendar current on iPad, or sync it with your Mac OS X or Windows calendar

184 Jauhkan kalender Anda saat ini pada iPad, atau sync dengan Mac OS X atau Windows kalender

3. Population and Sample

The population of data in this research are 309 sentences from the user

guide, 148 sentences from the magazine, and 97 sentences from the legal

document. The researcher did not use all of the data because they were too many.

The researcher took samples by applying the method of simple random sampling

(35)

whole data in each text. Thus, the data used for the analysis are 77 sentences from

the user guide document, 37 sentences from the magazine, and 24 sentences from

the legal document.

4. Data Analysis

After the data had been collected the researcher one group of errors

categories, i.e. errors related to individual concepts. The errors category were

based on theory of omission, addition, explicitation, and substitution created by

Koponen (2010: 4-6). Here was the table used in the analysis.

Table 3.2. Errors in each datum

Code Errors related to individual concepts (a)

Omitted concept

(b) Added concept

(c) Untranslated

concept

(d) Mistranslated

concept

(e) Substituted

concept

(f) Explicitated

concept D/S X/√ X/√ X/√ X/√ X/√ X/√

D1/S184 X X √ √ X X

N1. .... N2. ....

The researcher arranged the data in form of table in order to put them

easily. "Code" in each table refered to the text in the table 1. The code consisted

of D (number of text) and S (number of sentences). The sign " X/√" was used to

confirm that there are/there are not error occurences in one sentence: "X" means

no error occurence and "√" means there are errors. The table 2 totaly has six

confirmation columns. In addition, the row below the confirmation row in each

table is for explaining in detail about what errors are found based on error

occurences in the confirmation column. In this row, N1 and N2 were code for

(36)

In order to answer the first problem the researcher analyzed what error

occurences were found in each sentence. This process took several steps. The first

was classifying errors of a sentence based on errors classification in the table 2.

Then, the researcher had to confirm the error occurences. The detail information

of what errors found were always put after confirming the error classifications

below the confirmation row. The last is analysing the errors using semantic

approach in which it still involves some linguistic aspects such as morphology,

syntax, and semantic itself. Those three steps are very important to identify errors

in the translation in order to help the finding suggestions.

Finally, in order to answer the second problem the researcher needed to

look at the table 2. The researcher took samples from each classification for the

finding suggestion. The finding suggestion took several steps. Firstly, the

researcher listed all of error types and put them in a table. Then every error was

tested with suggestions based on Farrus' proposed solutions whose results were

later shown in the confirmation columns. What were tested were samples for

every error type which were taken from the texts. After the testing, the researcher

kept trying the finding suggestion for errors which were not matched to Farrus'

proposed solutions. This process was done since this study of linguistic analysis in

technology was very open for every possibility. The table used for the finding

(37)

Table 3.3. Mistranslated Concept

No. Code SL TL Description of Errors

Translating in an isolated form

Text Edition

Information

26 37/D1/S1 84

Keep Jauhkan choosing an inappropriate equivalence

√_{Type the word and} search the appropriate equivalence of “keep”

(keep menyimpan)

010912

Each error category then would have its table as above. By showing like

(38)

D. Research Framework

The following was the map of how this research conducted.

ST 1

National Geographic's article

ST 2 Environmental Cooperation document

ST 3

iPad's manual user guide

Translated by GOOGLE TRANSLATE

TT 1

National Geographic's article

TT 2 Environmental Cooperation document

TT 3

iPad's manual user guide

Error Analysis with Koponen’s Errors