Open Access. Powered by Scholars. Published by Universities.®

Education Commons

Open Access. Powered by Scholars. Published by Universities.®

Selected Works

Prof Geoff Masters AO

Testing

Articles 1 - 7 of 7

Full-Text Articles in Education

Adaptive Testing For Psychological Assessment: How Many Items Are Enough To Run An Adaptive Testing Algorithm?, Michaela Wagner-Menghin, Geoff Masters Dec 2012

Adaptive Testing For Psychological Assessment: How Many Items Are Enough To Run An Adaptive Testing Algorithm?, Michaela Wagner-Menghin, Geoff Masters

Prof Geoff Masters AO

Although the principles of adaptive testing were established in the psychometric literature many years ago (e.g., Weiss, 1977), and practice of adaptive testing is established in educational assessment, it is not yet widespread in psychological assessment. One obstacle to adaptive psychological testing is a lack of clarity about the necessary number of items to run an adaptive algorithm. The study explores the relationship between item bank size, test length and measurement precision. Simulated adaptive test runs (allowing a maximum of 30 items per person) out of an item bank with 10 items per ability level (covering .5 logits, 150 items …


Naplan And My School : Shedding Some Light On A Work In Progress, Geoff Masters Aug 2012

Naplan And My School : Shedding Some Light On A Work In Progress, Geoff Masters

Prof Geoff Masters AO

Debate about NAPLAN and the My School website has generated plenty of heat. Geoff Masters casts some light on what is essentially a work In progress.


Computer Adaptive Testing : A Feasibility Study, Siek Khoo, Geoff Masters, Ray Adams Aug 2012

Computer Adaptive Testing : A Feasibility Study, Siek Khoo, Geoff Masters, Ray Adams

Prof Geoff Masters AO

The Australian National Office of Overseas Skills Recognition (NOOSR) commissioned ACER to investigate the feasibility of implementing computer adaptive testing (CAT) in NOOSR’s screening examinations for overseas trained professionals. The National Office administers screening examinations in seven professions (Dentistry, Dietetics, Occupational Therapy, Podiatry, Pharmacy, Physiotherapy and Veterinary Science) at venues throughout Australia and in up to fifty centres around the world. Surveys were conducted to gain an overview of the current methods and procedures used by NOOSR in the screening examinations. Issues related to the application of CAT to NOOSR’s screening examinations and the possible improvement of NOOSR’s assessment program …


Assessing Science Learning, Geoff Masters Aug 2012

Assessing Science Learning, Geoff Masters

Prof Geoff Masters AO

A new school assessment resource provides teachers with information about individual students’ achievement and progress in science. Geoff Masters details the development of the Progressive Achievement Test in Science.


Anchor Tests, Score Equating And Sex Bias, Geoff Masters Dec 1987

Anchor Tests, Score Equating And Sex Bias, Geoff Masters

Prof Geoff Masters AO

This paper discusses the use of anchor tests (scaling tests) to bring two or more sets of scores to a common scale. Particular attention is given to the rescaling of school based assessments against an external test or examination and to potential sources of bias in this procedure. The need for routine validity checks is emphasised, and a latent trait approach to constructing a statistical framework for tests and examination score equating is described and illustrated. Bias caused by rescaling school assessments against an inappropriate anchor test is illustrated using a 1984 attempt to rescale students assessments in English against …


Item Discrimination: When More Is Worse, Geoff Masters Dec 1987

Item Discrimination: When More Is Worse, Geoff Masters

Prof Geoff Masters AO

High item discrimination can be a symptom of a special kind of measurement disturbance introduced by an item that gives persons of high ability a special advantage over and above their higher abilities. This type of disturbance, which can be interpreted as a form of item bias, can be encouraged by methods that routinely interpret highly discriminating items as the best items on a test and may be compounded by procedures that weight items by their discrimination. The type of measurement disturbance described and illustrated in this paper occurs when an item is sensitive to individual differences on a second, …


Banking Non-Dichotomously Scored Items, Geoff Masters, John Evans Dec 1985

Banking Non-Dichotomously Scored Items, Geoff Masters, John Evans

Prof Geoff Masters AO

A method for constructing a bank of items scored in two or more ordered response categories is described and illustrated. This method enables multistep problems, rating scale items, question 'clusters', and other items using partial credit scoring to be calibrated and incorporated into an item bank, and it provides a mechanism for computer adaptive testing with items of this type. Procedures are described for calibrating an initial set of items, for testing the fit of items to the underlying measurement model, and for linking new items to an existing item bank. The method is illustrated using items from the Watson-Glaser …