Model ANswer
Model ANswer
The text for this task is reproduced on pages 3 and 4. It is being used in the following situation:
M is a manager in a tourist hotel, with customers mainly from the UK. Her job involves
dealing with bookings and correspondence with hotel customers by email. Her overall
level is low intermediate (CEFR B1). Her company has sent her on a 2-week intensive
one-to-one course, focusing on her professional writing needs. At the end of the course
the company informs the teacher that it would like a report. In order to provide more
information for the report for the company, the teacher decides to administer this two-part
writing test from a public exam.
Using your knowledge of relevant testing concepts, evaluate how effectively this task fulfils its purpose
for this learner in this situation.
Make a total of six points. You must include both positive and negative points.
Positive
Level The level of the tasks is suitable for a low intermediate student.
Direct test It is a direct test of writing / it is clearly a writing test and so has face validity.
Integrative test It tests M’s ability to use both language and writing subskills.
Context/content The test is clearly work and business orientated, increasing face validity.
Writing sub-skills The writing will test M’s ability to use writing sub-skills such as paragraphing,
logical progression of ideas, layout, salutations etc.
Functional language The functions required in both tasks (explaining, acknowledging, offering,
requesting, giving directions) are appropriate.
Positive Applications
Relevance M/employer will see the relevance of the test in terms of M’s needs.
Usefulness The functional language tested is useful for M in terms of her work needs.
Appropriateness of task types M/employer will feel able to trust the outcome of the test.
Data from the test The teacher should be able to write a useful report on the basis of the writing.
41
Evidence for M/her employer The test should provide evidence of M’s overall writing ability in
terms of organisation / ability to select relevant content / use of fixed phrases / cohesion etc.
Appropriate style M will see the test as valid as she needs to write in a neutral/formal style at
work.
Format M will not be tested on her ability to think of ideas but rather on her language/writing
skills / she’ll be able to show her true abilities / she’ll know what to do.
Predictive validity The test will show M/employer how she will perform at work / what she needs
to continue working on after the course has finished.
Negative
Lack of email The test does not test M’s ability to write emails.
Lexis/Topic the vocabulary / topic required, e.g. job applications is not relevant to M’s
situation/hotel work.
Subjective marking There is not one correct answer and the marker will have to use their
judgement.
Negative applications
Needs/ lack of relevance M/her employer will not feel that the topics / task types are relevant to
M’s needs at work.
Not a full picture the results of the test will not give indication for M/employer of M’s ability to use
relevant lexis.
Feedback on the course Neither the teacher or M will be able to assess the success of the
course.
Lack of trust/reliability M may not have faith in the teacher’s ability to mark the test / trust the
results of the test / the marking may not be reliable.
42
many answers suffering from cramped or confusing layout. Rather than using columns, better answers
were laid out under the headings of Point and Application, which meant that candidates did not forget
to include both elements.
Weaker answers:
forgot to state the overall purpose of the test, i.e. achievement or they described it as being a
progress or diagnostic text, or they avoided the issue by not saying what kind of test it was
repeated the same point, particularly that it was a direct test or repeated the same application,
particularly that the test would be relevant or not relevant to the learner
linked to the above point, did not use a wide enough range of criteria with which to evaluate
the test
still continued to use testing terms such as content validity as the headings for their points
which resulted in the terminology not being related sufficiently to the point being made. As a
result, it was unclear whether the candidates understood the terms or not and they lost marks
did not refer to relevant testing concepts in terms of this particular test or used terminology
inaccurately to describe the test. For example, a high number of candidates used backwash to
describe an achievement test, showing yet again that they did not understand that this term
refers to the effect that the test has on a course rather than an effect in general. Candidates
clearly did not understand that backwash could not be used to describe an end-of-course
achievement test as the course had already been completed. Some candidates also described
the test as being an example of an indirect rather than a direct test
repeated pre-learnt points that they had seen in previous Guideline answers, particularly with
reference to fresh starts which was not relevant to this particular test
identified key points but then lost marks because they did not include applications
repeated the same application for different points (an application is only credited once) or
included more than one application for a point which resulted in repetition of applications over
the whole answer
did not refer explicitly to the learner and their stated needs/goals and how the test met or did
not meet these needs and goals
43
8.3 Sample Answers
8.3.1 The following sample answer gained a high number of the marks available for this task
POSITIVE
This is a writing test and a direct test of the writing skill so and this is what the learner wanted to work
on in the course. Therefore the test has face validity for the learner.
It is also an integrative test – it will test her knowledge of discourse features – especially in relation to
correspondence – and it tests lexis and grammar appropriate for the general genre of correspondence
too. The style in which the learner must write is formal in part 2 so this has content validity for the
learner who needs to correspond with clients (formal style).
The test has construct validity – the rubric is very specific & the learner will not have to invent
information (this is relevant to her needs because at work she presumably wouldn’t have to invent
information either.) It tests the candidate’s ability to write and not her imagination.
The test, by being divided into 2 parts, allows for “fresh tests” – so that if the learner is nervous or at
the start, she has a chance to settle in and can get be more rel relaxed for the second part of the
exam.
It’s a reliable test – it should give the teacher a good idea of the student’s overall writing ability in
controlled situations and under pressure. It will enable the teacher to make valid points on the report
about the student’s level, progress & achievement in the course.
NEGATIVE
There is a problem with content validity because the learner specifically wants to write emails to clients
and both pa neither parts of the test deal with email (Part 1: memo, Part 2: letter – so the genre is
slightly wrong)
There is another problem with the topics of the correspondence – neither is related to hotels or tourism
– this gives the test rather low content validity and possibly low face validity as the learner may
question what relevance the topics have to her needs.
The one-to-one course that M did may have addressed her specific writing needs and may not have
prepared her for this type of test where she has to write a very formal letter in reply to a person who is
applying for a job (part 2) – responding accurately to this task means requires a good knowledge of
genre, discourse conventions and appropriate lexis and grammar – M may not have studied these
things on the course and therefore the test could be unreliable – it may not be a good indicator of her
actual ability to write emails for her job.
44
8.3.2 The following sample answer gained half of the marks available for this task
1. The two tasks are examples of direct 1. The test does not test M’s use of emails
testing in that they can be applied to which is her principle use of English at work.
everyday language use. As a result, the test As a result, the test lacks validity.
has some content validity.
2. The test covers different skills in writing 2. The test is administered by the teacher
(e.g. note taking, acknowledging, requesting, (and presumably corrected by him/her as
etc.) which are integrative skills and are thus well). As a result, the test may lack some
valid for M. reliability as it is not totally objective.
3. The test items are appropriate for M’s level 3. It is a summative test and presumably
(Low – Intermediate). They should not be does not reflect the 2 – week course
seen by M as beyond her level and should content as M had expressed a desire or a
therefore be seen a containing a high level of course for her needs. As a result, it may not
construct validity. be seen as motivating and creates a negative
4. The tests are practical and easy to backwash effect for M.
administer requiring little expertise on the
part of the tester.
8.3.3 The following sample answer gained only a few of the marks available for this task
(+) The task requires the candidate to use a variety of language structures.
e.g. to talk about the future
to give instructions
Therefore, it is more valid than, for example, a gap-fill activity.
(+) The test would meet a learner’s expectations of what a test should look like i.e. It has face validity.
e.g. – quite formal structure
– bullet points.
– specifies appropriate word count.
(-) The level of the test may not be suitable for a low – intermediate learner.
45
Part 2, for example, requires topic lexis and grammatical structures which a student of this level might
struggle with.
e.g. acknowledging a letter in a formal letter requires quite complex structures
i.e I gratefully received your letter.
In neither question is she tested on her ability to communicate with customers or her ability to write
emails. It is therefore not an accurate gauge of her English in relation to her purpose in learning the
language.
(-) Both task types are very similar. This may be de-motivating to the candidate.
46