Contents
General
Steps in Test Construction
Elaborations
for some of these steps are presented below:
- Outline either a) the
unit learning objectives or b) the unit content major concepts to be covered
by the test.
- Produce a test blueprint,
plotting the outline from step 1 against some hierarchy representing levels
of cognitive difficult or depth of processing.
- For each check on the
blueprint, match the question level indicated with a question type appropriate
to that level.
- For each check on the
blueprint, jot down on a 3x5 card three or four alternative question ideas
and item types which will get at the same objective.
- Put all the cards with
the same item type together and write the first draft of the items following
the guidelines for the item types on the accompanying pages. Write these on
the item cards.
- Put all the cards with
the same topic together to cross check questions so that no question gives
the answer to another question.
- Put the cards aside for
one or two days.
- Reread the items from
the standpoint of a student, checking for construction errors.
- Order the selected questions
logically:
a) Place some simpler
items at the beginning to ease students into the exam;
b) group item types
together under common instructions to save reading time;
c) if desirable, order
the questions logically from a content standpoint (e.g. chronologically,
in conceptual groups, etc.).
- Put the questions away
for one or two days before rereading them or have someone else review them
for clarity.
- Time yourself in actually
taking the test and then multiply that by four to six depending on the level
of the students. Remember, there is a certain absolute minimum amout of time
required to simply physically record an answer, aside from the thinking time.
- Once the test is given
and graded, analyze the items and student responses for clues about well written
and poorly written items as well as problems in understanding of instruction.
More specific
ideas about:
Back
to top
The
Blueprint (Step 2)
Don't make
it overly detailed. It's best to identify major ideas and skills rather than
specific details.
Use a cognitive
taxonomy that is most appropriate to your discipline, including non-specific
skills like communication skills or graphic skills or computational skills if
such are important to your evaluation of the answer.
Weigh the appropriateness
of the distribution of checks against the students' level, the importance of
the test, the amount of time available. Obviously one can have more low level
questions in a given time period, for example.
A blueprint
looks something like this:
COGNITIVE
LEVEL
| Concept |
Basic
Facts
|
Application
|
Synthesis
|
Analysis/Evaluation
|
| Steps in test design |
X
|
X
|
|
X
|
| Item types |
X
|
X
|
|
|
| Errors in items |
|
X
|
|
X
|
| Item Analysis |
|
X
|
|
X
|
Back
to top
The
Alternative Item Types (Step 3)
The following
array shows the most common question types used at various cognitive levels.
It is possible to test almost all levels with all types, but some are simply
more efficient than others.
|
Factual
Knowledge
|
Application
|
Analysis
and Evaluation
|
Multiple Choice
True/False
Matching
Completion
Short Answer |
Multiple Choice
Short Answer
Problems
Essays |
Multiple Choice
Essays |
The
Item Cards (Step 4)
These can be
helpful in making up alternative forms of a test since each item idea on a given
card should test the same content at the same level, but in potentially different
ways.
Once the test
is over, these same cards can be used to keep a record of which ideas were used
and how the students responded. This practice aids in producing more effective
future exams.
Back
to top
Writing
Different Item Types (Step 5)
The next few
sections describe some suggestions to keep in mind in the construction of various
item types.
essay
questions
- Be sure the task is clearly
defined and you give the students some idea of the scope and direction you
intended for the answer to take. It helps to start the question with a description
of the required behavior (e.g. the verb such as "compare" or "analyze") to
put them in the correct mindframe to think about the rest of the question.
- Write the question at
a linguistic level appropriate to the students.
- Construct questions that
require a student to demonstrate command of background information, but are
not a simple repeat of that information.
- If you're going to ask
questions which call for opinions or attitudes from the students, be sure
that the emphasis is not on the opinion but on the way it is presented and
argued.
- Use a larger number of
shorter, more specific questions rather than one or two longer questions so
that more information can be sampled.
- The use of optional questions
results in different exams for different students and makes comparative grading
more difficult. They do, however, often reduce student anxiety. Both effects
must be weighed when considering their use.
Back
to top
Some
additional suggestions aside from constructing the item itself:
- Give the students a "key
word" list containing the verbs you most commonly use in your essay questions
and a short description of what you look for in questions using that verb.
- Give the students a pair
of sample answers to a question of the type you will give. Indicate why one
is good, one bad; be specific.
- In grading, sketch out
a grading scheme for each question before reading the papers OR randomly select
a few papers, read the answers and make up the grading scheme based on the
range represented by those answers.
- Detach identifying information
from a paper and use code numbers instead to avoid letting personality factors
influence you.
- Grade one question at
a time across all papers before moving to the next question.
- After grading all the
papers on one item, reread the first few papers over to be sure that you have
maintained consistent standards.
- Scramble the order of
the papers between grading successive questions.
- Be clear with yourself
and the students the extent to which factors other than content (such as grammar,
handwriting, etc.) will influence the grade.
Back
to top
multiple
choice questions
- Write the item stem first.
- Be sure the stem
asks a clear question.
- Stems phrased as
questions are usually easier to write.
- Stems should not
contain a lot of irrelevant information.
- Reading level should
be kept appropriate to the students.
- Be sure the stem
is grammatically correct.
- Try to avoid negatively
stated stems.
- Write the correct response
next.
- Be sure it is correct.
- Keep the reading
level of the stem and the response consistent.
- Avoid too many qualifiers.
- Avoid grammatical
clues like articles and tense.
- Avoid using stereotyped
responses as the correct response (e.g. catch phrases or very familiar
terms).
- Assign the response
to a random position in the answer sequence right away.
- Read the stem and the
correct response together to be sure they sound right.
- Assign the correct response
to a random position in the response list.
- Generate the distractors.
- Create a "truth table"
to assist in identifying plausible distractors. For example, some tables
for different items might look like this:
|
For a cause
and effect item:
|
Relationship
is:
|
|
True
|
False |
| Effect
is: |
True |
correct
answer |
effect
is true but unrelated to cause |
| False |
effect
is false, but related |
effect
is a false description and unrelated |
|
Example:
What is the effect of releasing a ball in positive gravity?
a) it will
fall "down" (correct
b) it will retain its mass (true but unrelated)
c) it will rise (false but related)
d) its shape will change (false and unrelated)
|
- Parallel the construction
of the correct response.
- Keep the statements
simple.
- Do use stereotyped
responses for distractors.
- Use common misconceptions.
- Make the distractors
different enough from one another to be discriminable.
- Avoid obscure words
or highly implausible distractors.
- Be cautious in the
use of "all" or "none" of the above.
- Put the distractors
in logical order.
- Put words which occur
in all the responses in the stem instead.
- Read the stem and the
distractors together to be sure they sound right together.
Back
to top
matching
questions
Think of this
type of question as a more efficient multiple choice item.
- Only homogeneous premises
(list on the left to be answered) and homogeneous responses (list on the right
consisting of possible answers) should be grouped in one item. For example,
in a question on common and scientific names of plants, don't include animal
names in the lists. Doing so in effect cuts the list of choices by however
many non-plant names there are.
- Relatively short lists
of responses should be used.
- Premises should be arranged
for maximum clarity and convenience.
- Response options should
be arranged alphabetically or numerically.
- Directions should clearly
indicate the basis for matching.
- Position of matches should
be varied.
- All of the choices of
each matching set should be on one page.
- More responses than premises
should be used in a set or a single response should be used to answer more
than one premise.
Back
to top
completion
questions
- Only significant words
should be omitted in incomplete statement items.
- When omitting words,
enough clues should be left so that the student who knows the answer can supply
the correct response.
- Grammatical clues to
the correct answer should be avoided.
- Blanks should occur at
the end of the statement, if possible.
- Limit the length of the
responses to single words or short phrases.
- Verbatim quotes from
the text should be avoided.
- Be clear to yourself
and the students the level of specificity required in this type of question.
Back
to top
true/false
questions
- Each statement should
be clearly true or clearly false.
- Trivial details should
not make a statement false.
- The statement should
be concise without more elaboration that necessary.
- Exact statements should
not be quoted from the text.
- Use quantitative terms
as opposed to qualitative terms if possible.
- Specific determiners
(always, never, etc.) which give a clue to the answer should be avoided.
- Negative statements are
often confusing and should be minimized.
- When a controversial
statement is used, authority should be quoted.
- A pattern of answers
should be avoided.
Back
to top
Doing
an Item Analyses (step 12)
The purpose
of the item analysis is to help you identify questions that can differentiate
between students who "know" the material or are "good" based on some other
criterion measure and those who don't "know" the material or aren't' "good."
It also helps identify poorly written items which mislead students.
The basic theory
requires that the class be divided into identifiable groups representing the
best and worst performers on some measure. One possible measure is the overall
score on the test itself. Theoretically students who get high scores "know"
the material while students who get low scores don't. Therefore you can divide
the class into thirds or quarters on the basis of overall test score and evaluate
the performance of the two extreme groups on each question.
You can use
the same idea, but some other criterion on which to split the class if there
is some other level of performance you're interested in predicting. For example,
you should use lab grades or subsequent test grades to divide the class into
extremes. The results of the item analysis would then tell you how well each
item on the test predicted or correlated with performance on the lab or the
subsequent unit.
The three pieces
of information which are used to evaluate the item are:
a) the difficulty level
- what overall percentage of the students in the analysis groups answered
correctly? It varies from 0.0 to +1.0 with numbers approaching +1.0 indicating
more students answering correctly.
b) the discrimination
level - How well does this item discriminate between the top students and
the bottom students? It varies from -1.0 to +1.0. The closer the number is
to either end, the better that item is in differentiating between those who
know the material and those who don't. The only difference is that when the
number is positive, the items will be answered correctly by the good students
and incorrectly by the poor students; when it is negative, the reverse is
true.
The formula
is:
|
H
- L
|
H = number
of top students correct |
|
N
|
L = number
of bottom correct |
| |
N = number
of students in either top or bottom (should be equal) |
c) the breakdown - How
do the students in each group who answered incorrectly actually answer the
question? This is done by tallying the choices made by those who miss the
question and looking for particularly popular incorrect answers, especially
among the top students. If one distractor is favored, the instructor should
make sure it is indeed incorrect.
Back
to top
Back
to Table of Contents
Home | Faculty
Services | TA/AI Services | Publications
| Resources
| Research | About
CTE
October 15, 2002
The University of Texas
at Austin
Copyright © 2002 Center for Teaching Effectiveness
Contact CTE