[etni] Fw: Reliability and validity of the Bagrut exams
- From: "Ask Etni" <ask@xxxxxxxx>
- To: <etni@xxxxxxxxxxxxx>
- Date: Wed, 24 May 2006 11:42:07 +0200
----- Original Message -----
From: Motti - motti46@xxxxxxxxxxxx
Subject: Reliability and validity of the Bagrut exams
Hi,
It seems clear that the Bagrut exams in English, in their current form, are
neither reliable nor valid forms of assessment of the students' achievements
in English.
It's about time the Szold Institute which is responsible for this faulty
product did something about this. I don't know of any research which found
the correlation between Bagrut results and other criteria of the students'
achievement.
As an experienced teacher I don't think the Bagrut results really reflect
the students' knowledge and the experiences of this week's exams only
accentuated this feeling.
Anyway, the whole idea of modules was that module G would be the most
challenging. It was not! Neither was it in the winter Bagrut.
Something should be done about it and the sooner the better.
If you care to read a few lines about validity and reliability of test read
the following quotes or read them in the original sites.
Motti Mendelkern
A test is valid when it measures what it's supposed to.
If a test is reliable, it yields consistent results.
http://www.georgetown.edu/departments/psychology/researchmethods/researchanddesign/validityandreliability.htm
Test/Retest
Test/retest is the more conservative method to estimate reliability. Simply
put, the idea behind test/retest is that you should get the same score on
test 1 as you do on test 2. The three main components to this method are as
follows:
1.) implement your measurement instrument at two separate times for each
subject;
2). compute the correlation between the two separate measurements; and
3) assume there is no change in the underlying condition (or trait you are
trying to measure) between test 1 and test 2.
http://seamonkey.ed.asu.edu/~alex/teaching/assessment/reliability.html
Type of Reliability How to Measure
Stability or Test-Retest Give the same assessment twice, separated by days,
weeks, or months. Reliability is stated as the correlation between scores at
Time 1 and Time 2.
Alternate Form Create two forms of the same test (vary the items slightly).
Reliability is stated as correlation between scores of Test 1 and Test 2.
Internal Consistency (Alpha, a)Compare one half of the test to the other
half. Or, use methods such as Kuder-Richardson Formula 20 (KR20) or
Cronbach's Alpha.
http://fcit.usf.edu/assessment/basic/basicc.html
Validity refers to the accuracy of an assessment -- whether or not it
measures what it is supposed to measure. Even if a test is reliable, it may
not provide a valid measure.
Type of Validity Definition Example/Non-Example
Content The extent to which the content of the test matches the
instructional objectives.A semester or quarter exam that only includes
content covered during the last six weeks is not a valid measure of the
course's overall objectives -- it has very low content validity.
Criterion The extent to which scores on the test are in agreement with
(concurrent validity) or predict (predictive validity) an external
criterion.If the end-of-year math tests in 4th grade correlate highly with
the statewide math tests, they would have high concurrent validity.
Construct The extent to which an assessment corresponds to other variables,
as predicted by some rationale or theory. If you can correctly hypothesize
that ESOL students will perform differently on a reading test than
English-speaking students (because of theory), the assessment may have
construct validity.
--------------------------------------------
Visit the ETNI site - http://www.etni.org
Send a message to the ETNI list: etni@xxxxxxxxxxxxx
Send queries and questions to: ask@xxxxxxxx
--------------------------------------------
Other related posts:
- » [etni] Fw: Reliability and validity of the Bagrut exams
http://fcit.usf.edu/assessment/basic/basicc.html