widarini.aureola: Assessing Listening

Language Assessment:

Assessing Listening

I. Introduction

Listening is one of the receptive skills as we can observe neither the process of performing nor the product, but we are observing the result of listening itself. The process of listening performance is the invisible and inaudible process of internalizing meaning from the auditory signals being transmitted to the ear and brain. Invisible and inaudible process means that we cannot see or hear the process of listening; thus, it is impossible to observe the product. We observe only the result of the meaningful input in the form of spoken or written output, and it is usually made by inference. We have developed reasonably good assessment task to make the necessary jump through the process of inference, from unobservable reception to a conclusion about comprehension competence.

In standardized testing industry, it is rare to find just a listening test. One reason for this emphasis is that listening is often implied as a component of speaking. Every teacher of language knows that one’s oral production ability is only as good as someone’s listening comprehension ability. Aural comprehension far outstrips oral production in quantifiable terms of time, number of words, effort, and attention. Hence, we need to pay close attention to listening as a mode of performance for assessment in the classroom.

II. Test Items

In designing appropriate assessment test in listening, we begin with the specification of objectives or criteria. The objectives of assessing listening can be distinguished into:

Comprehending of surface structure elements
Understanding of pragmatic context
Determining the meaning of auditory input
Developing the gist, a global or comprehensive understanding.

Those objectives may be classified in terms of several types of listening performance. There are four commonly types of listening performance:

a. Intensive

Intensive listening is listening for perception of the components (phonemes, words, intonation, discourse, marker, etc.) of a larger stretch of language.

b. Responsive

Listening to a relatively short stretch of language (a greeting, question, command, comprehension check, etc) in order to make a short response.

c. Selective

Processing stretches of discourse such as short monologues for several minutes in order to “scan” for certain information. The purpose is to be able to comprehend designated information in a context of longer stretches of spoken language. For instance, assessment tasks could ask the students to listen for names, numbers a grammatical category, directions (in map exercise), or certain facts and events.

d. Extensive

Extensive listening is listening to develop a top-down, global understanding of spoken language. Listening for the gist, main idea and making inferences are part of extensive listening.

2.1. Designing Assessment Tasks

After we have determined objectives of the test, the next step is to design the tasks, including making decisions about how you will bring out performance and expect the test – taker to respond. There are different types of test that depend on the types of listening performance.

Intensive Listening

There are some test items that are used in intensive listening; those are recognizing phonological and morphological elements, and paraphrase recognition.

a. Recognizing phonological and morphological elements

A classic test task gives a spoken stimulus and asks test – takers to identify the stimulus from two or more choice. The test can be in the form of phonemic pair (consonants or vowels), morphological pair, stress pattern, and one word stimulus.

b. Paraphrase recognition

The next step up on the scale of listening comprehension microskills is words, phrases, and sentences, which are frequently assessed by providing a stimulus sentence and asking the test – taker to choose the correct paraphrase from a number of choices. Sentence paraphrase and dialogue paraphrase are the items of paraphrase recognition.

Responsive Listening

In responsive listening there are two items of test, those are appropriate response to a question and open – ended response to a question. A question – and – answer format can provide some interactivity in these lower – end listening tasks; in addition, the test – taker’s response is the appropriate answer to a question. In test item of appropriate response to a question is usually framed in a multiple – choice format. On the contrary, in open – ended test item the test – takers can write or speak the response. The weakness of open – ended is it will take more time for the teacher to check or read the students’ responses and judge their appropriateness.

Selective Listening

In selective listening, the test – taker listens to a limited quantity of aural input and must discern within it some specific information. Tests that are usually used in selective listening are listening cloze, information transfer, and sentence repetition.

a. Listening cloze

Listening cloze tasks, which also called cloze dictations and partial dictations, require the test –takers to listen to a story, monologue, or conversation, and simultaneously the written text in which selected words or phrases have been deleted. The weakness of listening cloze is it may simply become reading comprehension task. In listening cloze, deletions are governed by the objective of the test. Moreover, it should normally use an exact word method of scoring, in which the actual word or phrase as the correct response and consider other appropriate words as incorrect.

b. Information transfer

In information transfer technique, aurally processed information must be transferred to a visual representation, such as labeling a diagram, identifying an element in a picture, completing a form, or showing routes on a map. There are some test items of information transfer, such as picture-cued selection and chart-filling.

Picture-cued items are sometimes efficient rubrics for assessing certain selected information at the lower end of the scale of linguistic complexity. Picture-cued items are divided into multiple-picture-cued selection and single-picture-cued verbal multiple-choice. Multiple-picture-cued selection is the item in which the test-takers need to focus on just the relevant information. In single-picture-cued verbal multiple-choice, the test-takers are presented a single photograph and then hear four different statements and must choose one of the four to describe the photograph. Another genre of picture-cued tasks is a number of people and/or actions are presented in one picture, such as a group of people at a party. The assessment may take the form of questions, true/ false, and identification.

Information transfer tasks may reflect greater authenticity by using chart-filling. Such chart-filling tasks are good examples of aural scanning strategies; moreover, a listener must discern from a number or pieces of information which pieces are relevant.

c. Sentence repetition

In sentence repetition, the test-taker must retain a stretch of language long enough to produce it, and then must respond with an oral repetition of that stimulus. The mistake in repetition is scored as a mistake in listening. Sentence repetition is far from flawless listening assessment task. This task may test only recognition of sounds and it can be contaminated easily by lack of short-term memory ability. Hence, the teachers may never be able to distinguish a listening comprehension from an oral production error.

Extensive Listening

There are three types of tasks or test in extensive listening; those are dictation, communicative stimulus-response tasks, and authentic listening tasks.

a. Dictation

Dictation is a widely researched genre of assessing listening comprehension. The test-takers hear a passage of 50 to 100 words that recited three times. First reading at normal speed, no pauses, and the test–takers listen for the gist. Second reading is with long pauses between phrases or natural word groups, during which time test-takers write what they have just heard. Third reading is at normal speed so the test-takers can check their work and proofread.

The difficulty of a dictation task can be easily manipulated by the length of the word groups and the pauses, the speed of reading, and the complexity of discourse, grammar, and vocabulary used in the passage.

b. Communicative stimulus-response tasks

It is a popular genre of assessment task in which the test-taker is presented with a stimulus monologue or conversation and then is asked to respond a set of comprehension questions. Here are typical items of the test:

· Dialogue and multiple-choice comprehension items

The monologues, lectures, and brief conversations used in such tasks are sometimes a little unnatural, but we can create reasonably authentic stimuli with some care and creativity.

· Dialogue and authentic questions on details

To compensate for the potential inauthenticity of post-stimulus comprehension questions, we might be able to find contexts where questions that probe understanding are more appropriate with a little creativity.

c. Authentic listening tasks

Ideally, the language assessment field would have a stockpile of listening test types that are cognitively demanding, communicative, and authentic, not to mention interactive by means of integration with speaking. Nevertheless, the nature of test implies an equally limited capacity to mirror all the real-world contexts of listening performance; in addition, every task shares some characteristics with target-language tasks, and no test is completely authentic. Authentic listening tasks can be in the form of:

· Note-taking

In the academic world classroom lectures by professors are common features of a non-native English-user’s experience. One among several formats includes note-taking by the test-takers. This test is evaluated by the teacher on a 30-point system, as follow: 0-15 points for visual representation, 0-10 points for accuracy, and 0-5 points for symbols and abbreviations. The process of scoring is time consuming and it lacks some reliability.

· Editing

Editing provides written and spoken stimulus, and it requires the test-taker to listen for discrepancies or differences. Scoring achieves relatively high reliability as there are usually a small number of specific identified differences.

· Interpretive tasks

An interpretive task extends the stimulus material to a longer stretch of discourse and forces the test-taker to infer a response. The test takers are then directed to interpret the stimulus by answering a few questions in open-ended form.

· Retelling

In retelling tasks, the test-takers listen to a story or news event and then simply retell or summarize it, either orally or in writing. The test-takers must identify the gist, main idea, purpose, supporting points, and/ or conclusion to show full comprehension.

III. Implementation

The commonly used test items in assessing listening are dialogue and multiple-choice comprehension items and dialogue and authentic questions on details. Those test items are commonly used because they can create authentic stimuli and in some rare cases the response mode actually approaches complete authenticity. Moreover, these test items require the process of inference in which all assessment of receptive skills must be made by. The strengths of these test items are:

· can create authentic stimuli.

· the scoring process is not time consuming, so teacher as the assessor will be easy to check the students’ answers.

· use the comprehension questions that aim at assessing certain objectives that are built into the stimulus.

The weaknesses of these test items are:

· Any task following a one-way listening to a conversation is sometimes artificial.

· The test-takers might miss some of the questions if they hear the conversation only once and had no visual access to the items.

· Feedback for the students may be harmful.

widarini.aureola

Saturday, May 25, 2013

Assessing Listening

No comments:

Post a Comment

About Me