Language learning assessment: linking theory to practice

The Textbook and Reference Competition (KOBAR) of the Faculty of Tarbiyah and Teacher Education (FTK) in UIN Mataram in 2021 is an effort by the faculty to contribute to the implementation of the above law, where quantitative, research forms and publications of PTKI lecturers still need to be improved . On behalf of the faculty, I am grateful for the policy and support of the Chancellor of UIN Mataram and his staffs, the writers who have contributed to the 2021 book competition phase, and also the editors of the same field educators and publishers who, without a touch of zauq, would book appearance not be as interesting as this one.

Background

The present chapter is divided into several subsections that contain the background, the purpose of this book, and methods of investigation used in this book. Given these backgrounds, this book attempts to synthesize and provide various issues in language assessment.

Aims of the Book

Methods

Instructional decisions must be made with the intention of promoting learning for all students in the classroom. This is the purpose and an important “why” of the assessment process: to make decisions that are in the best interests of the students and their learning.10.

Figure 1: The Relationship between Concepts in Assessment and Teaching 4

NON-TEST

Checklist: The teacher makes a list of goals for students to achieve, and students can check off each skill as they master it. Portfolio: A compilation of student work that demonstrates the student's efforts, growth, and achievements in one or more areas of the curriculum.

PURPOSES OF ASSESSMENT

Teachers must be skilled in recognizing unethical and inappropriate assessment methods that lead to unfair grading. This means that prior to the selection of the assessment procedure, the learning objectives or outcomes must be clearly identified in advance.

Figure 5: Alignment of Learning Goals, Assessment and Classroom Activity 23

TECHNOLOGY ENHANCED LANGUAGE ASSESSMENT Nowadays the advent of technologies such as the presence of

This chapter presents the main principles of language assessment, which are the basis for designing and deciding what the assessment should look like. However, the test designer must think about how to balance the principles against the assessment objectives.

THE KEY PRINCIPLES

Students-related reliability
Rater reliability
Administration-related reliability
Test-reliability
Content-related evidence
Criterion-related evidence
Construct evidence
Face evidence
Consequential validity

This type of validity refers to the extent to which the assessment meets the established indicators or goals or standards. Face refers to the extent to which the assessment appears or appears to measure what it is supposed to measure.

Figure 6: Alignment between Curricular Aim and Educational Test

PUTTING THE PRINCIPLES INTO CLASSROOM PRACTICES PRACTICES

As illustrated above, classroom assessment is carefully planned and systematic and requires teachers' skills in assessment design. A classroom assessment also takes into account students' background knowledge and is appropriately selected to meet the learning outcomes and students' current level of knowledge and skills to help teachers improve teaching effectiveness.

Table 5: Criteria for of Classroom Assessment 48

THE MAIN APPROACHES

The approaches themselves have evolved and evolved over a period of time as more criticism of previously known approaches mounts. In this chapter, both strengths and weaknesses of most approaches are also discussed to shed light on the decision of which approach to consider.

DISCRETE APPROACH

Another characteristic of this approach is that it stems from a psychometric assumption that the test must be valid and reliable. A large range of materials to be put to the test can be protected by testing this method.

INTEGRATIVE APPROACH

The success of the test cannot be easily deduced from the test taker's ability to communicate in real-world conditions. Test takers must demonstrate not only their knowledge of structures, but also their ability to use them in context.

COMMUNICATIVE APPROACH

That is, the typical assessment underlying this approach uses our everyday communications, such as attending a lecture, seminar, conference, or the like. Because the communicative approach is based on authentic language use, psycholinguistic and sociolinguistic elements must be taken into account in the assessment.

PERFORMANCE-BASED APPROACH

To assess students' language use, assessments are most practical as students role-play as they interact in the real world/everyday life. 53 essential part of performance evaluation is the evaluation of the given task in relation to the performance event. Higher level cognitive processing skills are typically required at the completion of the performance event.

Achievement events are typically reviewed through an objective, systematic scoring of the quality of the student work based on listed standards or.

DIMENSIONS OF LANGUAGE ASSESSMENT

ASSESSMENT OF LEARNING

ASSESSMENT FOR LEARNING

57 from the process to design the next steps to provide them with information that will be useful for their planning and teaching.58 Marking is not intended to make comparative judgments among students, but rather to emphasize each student's strengths and weaknesses and give them comments that will help them learn more effectively. Although instructors play a major role in assessment for learning, they utilize their personal knowledge of students, as well as their awareness of the context of the assessment and curriculum goals, to identify specific learning needs.

ASSESSMENT AS LEARNING

ASSESSMENT IN LEARNING

FORMATIVE AND SUMMATIVE ASSESSMENT

In addition, summative assessment can be further used to determine whether someone can progress to the next level of education, make an award decision or provide careers support. Dixon and Worrell point out that the best form of summative assessment is performance-based assessment, in which teachers provide an opportunity for students to demonstrate what they have learned which can be further categorized into product, performance and process assessment. 60. Summative assessment results are often recorded as points or grades that are then included in a student's permanent academic record.

In contrast, the purpose of summative assessment is to evaluate student learning at the end of a teaching unit by comparing it to some standard or benchmark.

Table 8: Main Differences between Summative and Formative Assessment

INFORMATIVE ASSESSMENT

On the other hand, information from summative assessments can be used formatively when students or teachers use it to guide their efforts and activities in subsequent courses. On the other hand, from a learning perspective, effective students view themselves as owners of their learning process. Provide effective feedback to students; that is, feedback that helps students realize their next steps in learning and how to take them, as well as feedback that helps students become active in their own learning.

Table 10: Landscape of Assessment Practices 64 Assessment

TRADITIONAL ASSESSMENT VS ALTERNATIVE ASSESSMENT

FORMAL AND INFORMAL ASSESSMENT

STANDARDIZED AND NON-STANDARDIZED ASSESSMENT ASSESSMENT

Assessment tools are gadgets used to determine how well a student has acquired a particular subject and/or how well he or she will perform in future endeavors. On the other hand, non-standardized tests are tests that do not follow the rules of standardized tests and where the student's evaluation is not uniform.

SELF AND PEER ASSESSMENT

Understanding each type of test and its categories will shape your decision-making in the assessment process and allow you to construct a test that best suits your goals.

TYPES OF TEST ACCORDING TO THE PURPOSES a. Diagnostic Test

Although relatively similar to standardized achievement tests, especially in terms of clear instructions for students, the informal classroom achievement test differs somewhat from the standardized test when it comes to the nature of the learning outcomes and the content measured, reliability of the test, quality of the test items. the test administration and evaluation procedure as well as the interpretation of the test results. The purpose of a language proficiency test is to predict a person's success in exposure to a foreign language. A proficiency test is used to assess the ability or general ability to learn a foreign language a priori (before enrolling in a course) and the likelihood of success.

Thus, the difference between the two types lies in the types of learning measured and the types of predictions for which each test is most useful.

TYPES OF TESTS ACCORDING TO THE RESPONSE TYPES

75 is small and the test is not intended to be repeated, the test is intended to encourage and reward the test takers' skill development, the examiner is more interested in seeing the test takers' attitude rather than their performance.

TYPES OF TEST ACCORDING TO THE SCORE INTERPRETATION INTERPRETATION

This test, also known as a standardized test, measures the test taker's performance in comparison to other test takers of the same rating. Therefore, the test taker's score is interpreted with reference to the scores of other test takers. The scoring of a norm-referenced test is based on a bell curve, so only half of the test takers' scores are above the 50th percentile.

Similarly, the increasing trend shows the compromise of two types of tests; official test publishers these days have attempted to use both types of tests by providing more description for a norm-referenced test and norm-referenced interpretation for a test constructed for a criterion-referenced test.

TYPES OF TEST ACCORDING TO THE WAY OF ADMINISTRATION ADMINISTRATION

The development of processes designed to tap into the enabling capabilities that underlie the macro-capabilities leads to the indirect evaluation of the capacity in question. So if we assume that grammatical knowledge leads to the ability to write, a grammar test can be used as an indirect writing test. So it is wise to choose which test to use based on the category as there is no best test overall.

The types of test that are planned to be used should match the objective that the examiners want to investigate.

TEST DEVELOPMENT TEST DEVELOPMENT

THEORIES OF TEST DEVELOPMENT
STAGES IN TEST DEVELOPMENT
Measuring knowledge outcomes
Measuring the outcomes of understanding and application Many believe that multiple-choice items are ideal to use when

Create a meaningful stem that presents a definite problem
The stem should include the most relevant item and as much as possible of the item
Use stem in negative statement only when significant learning outcomes require it
All the options/alternatives should be consistent in terms of grammar with the stem
Each multiple-choice item should contain only one clear unambiguous answer
Items constructed for measuring understanding should boast the right amount of novelty
Avoid verbal associations between the stem and the correct answer
The relative length of the options should not provide clues to the correct answer for the test takers
The correct answer should be placed randomly in each item’s alternatives in equal numbers
Use special alternatives such as ‘none of the above’ or ‘all of the above’ carefully
Express the item so that it provides a brief and specific answer
Do not extract statements from a textbook as a basis of short answer item
Indicate the type of the answers required when the intended answers are expressed in numerical units
Blanks for the answers should be equally long and be placed next to the question
Do not include too many blanks when using completion items

UNDERSTANDING LISTENING
LANGUAGE KNOWLEDGE IN LISTENING
WORLD KNOWLEDGE IN LISTENING
FEATURES OF SPOKEN TEXT Phonological modification Phonological modification
TYPES OF LISTENING

One of the most basic types of outcomes that can be measured with multiple-choice questions is knowledge of terminologies. It is suggested that the length of the alternatives should be relatively equal to avoid providing clues to the correct answer. Ensure that the number of test items is proportional to the length of the introductory material.

The first limits the content usually shown to the scope of the topic and. First, test takers must listen to a text to assess the validity of the test. Several types of listening tests can also be used to measure different aspects of the test-taker's knowledge and skills.

Figure 11: Steps Language Test Development 85

ASSESSING READING ASSESSING READING

THE NUTS AND BOLTS OF READING
TEXT PROPERTIES
TASK DEMAND
SKILLS IN READING
TOP-DOWN VS BOTTOM-UP APPROACH TO READING PROCESS PROCESS
The Nature of Speaking
SITUATIONAL CONTEXTS OF SPEAKING
DESIGNING SPEAKING TASKS
INTERVIEW
DESCRIPTIVE TASK
NARRATIVE TASK
COMPARING AND CONTRAST

EXPLAINING AND PREDICTING

DECISION TASK
ROLE-PLAYS AND SIMULATION
REACTING IN SITUATION TASK
STRUCTURED-SPEAKING TASK
MORE SPEAKING TASKS Sentence/Dialogue Completion Sentence/Dialogue Completion

The results of tests should be carefully examined as the setting of the test may fail to minimize the extrinsic motivation of the test takers. The main disadvantage of the cloze procedure is that it is not clear what it measures, as the test designer has no control over the deletion of the word. It allows test takers to check the range of possible answers and determine how they think about the possible answers.

In the summary test, test takers are asked to read a text and then summarize the main ideas of either the whole passage or part of it. It is widely believed that one of the factors that influence one's speech is the situation in which the speech occurs. In a role-playing game, testers play a new role and communicate with partners instead of the tester.