Web-Teaching

Preface Acknowledgements Contents 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 URLs References

 

 


Chapter 15

Automated Testing *

MASTERY LEARNING *

AUTOMATED TESTING *

EXAMPLE TESTING SYSTEMS *

eGrade *

Test Pilot *

Chemistry Tests *

ADMINISTERING TESTS *

GLOSSARY *

REFERENCES *

URLs *

 


 

CHAPTER 15

Automated Testing

 

 

MASTERY LEARNING

Of the research in learning, no strategies have led to better outcomes than mastery learning. In mastery learning, the learner is required to master material to a specified level before they can go forward. In Human Characteristics and School Learning, Bloom [1976] makes an excellent case for mastery learning.

Several school systems have attempted to implement mastery learning. There is substantial agreement about the success of mastery learning, particularly in college teaching settings [Kulik & Kulik, 1987; Kulik et al., 1990]. Many modern, structured learning programs include strong mastery learning components. In colleges, the most serious, systematic implementation of mastery learning was through so-called Keller Plans, also called personalized systems of instruction (PSI). These are named after Fred Keller, who formulated this approach to mastery learning [Keller, 1968]. Among the key elements to Keller's plan were self-pacing, repeatable testing, and peer tutors. These plans flourished during the decade around 1970. Kulik et al. [1979] {U15.01} reviewed this effort, and provided an excellent summary of the evidence supporting Keller Plan instruction. In one review, they suggested that, "the future will bring even wider use of the plan." Keller Plan testing, however, drove both students and teachers nearly to the point of distraction. Silberman [1978] wrote an excellent summary of a teacher's view of Keller Plan impacts. He reported administering, grading, and filing 2500 five-question quizzes for 90 students in a sophomore organic chemistry class. Today, testing can be made available on demand. Much of this testing can be automated. Under these circumstances, it should be possible to implement modernized Keller Plan courses using technology that makes testing much easier for teachers, and more accessible for students. Elements of the Keller Plan, especially repeatable testing, continue to find favor among college educators [Moore et al., 1975].

If you are interested in automated testing, you'll need more than casual computer support. The testing support available within courseware packages (Chapter 3) is still too limited to support mastery learning.

Mastery learning involves having students demonstrate mastery of material at a defined, high level before they proceed with new material. Two essential elements of this instructional strategy are corrective, enriching feedback and congruence among the instructional components. In Keller Plan courses, quizzing is used for the corrective feedback. Mastery learning is reviewed in Implementing Mastery Learning [Guskey, 1996]. Bloom [1984] suggests that one-to-one tutoring is the most effective strategy known, generally yielding two standard deviations better performance than traditional instruction. He suggests further that mastery learning approaches one-to-one instruction in terms of measured learner gains. When students adopt self-regulatory strategies aimed at mastery learning, these have been associated with positive, achievement oriented activities [Elliot and Dweck, 1988]. Incorporating mastery systems into computer-based instruction has led to excellent learning outcomes [Montazemi & Wang, 1995]. When objective assessments are employed, prior knowledge is an exceptionally strong predictor of learning [Dochy et al, 1999].

Should students make their own decisions about monitoring their progress, or should control be retained by computerized testing? As noted in Chapter 11, Garbin finds that offering lots of information gives students with different styles the opportunity to learn in their own way. Penn & Nedeff [2000] describe a Web-based system for organic chemistry. They demonstrate higher earned test scores for those students making the largest number of attempts on Web-based practice system. Garhart & Hannafin [1986] report that students who control their own decisions about when to terminate instruction often do so prematurely.

 

AUTOMATED TESTING

It has been possible to test students using computers for many years. Certainly, with the advent of desktop computers. Many teachers have developed testing programs of a wide variety. For the most part, when all other things are equal, computer tests give similar outcomes to conventional written tests [Zandvliet & Farragher, 1997.] The Graduate Record Examinations, produced by the Educational Testing Service, are given entirely online. Those tests now use a strategy that involves matching items to test takers such that fewer items are presented, but those presented are selected so as to maximize discrimination for the test taker. This strategy generally is called computer adaptive testing. If the student gets the first item correct, the next item will be harder. On the other hand, if the student misses the first item, the next item will be easier. This approach is based upon item response theory, a remarkably powerful innovation in testing.

The concern that many faculty have about Web testing relates to ensuring who actually completes the test items. UNL has built a testing laboratory. Students can practice test over the Web but, in many courses, they must come to the testing laboratory in order to take tests that count. Testing laboratories are becoming a part of the modern campus landscape.

In a graduate statistics course, Charles Ansorge has students identify a proctor who meets certain criteria. Distant students are then tested in the proctor's presence. Ansorge’s procedure includes having the proctor both sign on and sign off for the test. Faculty report using the strategy of phoning students during the test but this has serious limitations. How do you handle calling into the one-line household in which the one line is connecting the student to the Web?

Assessment and Evaluation of Distance Learners was a PBS Teleconference (12/99), addressed this issue. There is tremendous interest in this area among educators. It often is discussed in journals that deal with online teaching [for example, see Carlson, 2000].

 

EXAMPLE TESTING SYSTEMS

eGrade

To accomplish automatic testing, one needs some powerful computer engines. John Orr's eGrade {U15.02} package, commercially available from John Wiley & Sons, makes use of Java programs written by Orr. This software runs on several platforms.

Orr's software is not restricted to any particular discipline; it is being used widely throughout the University of Nebraska—Lincoln campus. Orr responds to collegial suggestions and, as a result, the power of his system improves incrementally.

 

Test Pilot

Another commercial package, developed at Purdue University, is Test Pilot by ClearLearning {U15.03}. Test Pilot also is authored in the Java language. Faculty at UNL also have very satisfactory experience with this software.

 

Figure 15.01. Web-based testing software developed by John Orr and distributed by John Wiley & Sons, is a powerful Web-based testing tool. This material is used by permission of John Wiley & Sons, Inc.

 

 

Figure 15.02. Test Pilot, a Java-based, platform-independent, Web-server based, online assessment and survey software. With permission.

 

Chemistry Tests

We have done research on automatic testing on the Web. This research is less concerned with traditional testing for assigning grades to students, and more with mastery learning by practice. The testing is designed to keep learners working on a question topic until they "get it right." The questions are generated by the computer, as opposed to pulling them from a bank of questions. The items have great variety. The information is stored in a database, and the questions are generated with encrypted codes. Students respond to quiz items that may be presented in any of ten formats. Upon receipt, the feedback provided to a student’s response is not only context-related, but it may be shaped by features in the student’s response. In a few cases, responses are checked for common misconceptions, and tutoring to address those misconceptions is provided in the student's feedback.

 

Figure 15.03. Response to an item for which the respondent sent in a blank answer.

 

The verification strategy to ensure academic integrity is to sample small portions of the required material after students have completed the entire package. If the small sample indicates that successful learning has occurred, the assumption is the student has learned all of the material as well.

ADMINISTERING TESTS

Although Web-based testing is possible at remote sites, many faculty want to administer proctored tests. UNL has established a testing laboratory. In this laboratory, students are proctored while they take tests. The testing student sits at a terminal, indicates the test to be taken, and logs in to the test. A proctor then verifies identity using the student's photo ID. Upon identification of the testing student, the proctor completes the log-in procedure. At the end of the test, a proctor logs the student out before the test grade and full results are made available. The laboratory shown in Figure 15.04 is being expanded to include about 60 stations.

 

Figure 15.04. Testing lab for administering Web-based tests at UNL.

 

GLOSSARY

computer adaptive testing: the test-taker's ability level relative to a norm group is estimated iteratively during the testing process. Items are selected based on the current ability estimate. The procedure is thought to maximize the information about the test-taker's ability level. Test-takers receive few items that seem very easy or very hard. Testing ceases at a given level that is the test-taker's highest sustainable performance.

item response theory: item response theory (IRT) was developed to overcome shortcomings in the "true score" testing model wherein learners were assigned a score on a test, and that score represents the person's true ability. IRT looks at items, and expects persons of similar ability to perform similarly on an item. In this terminology, a test is unbiased when all testers having the same skill level have an equal probability of getting the item correct. If persons of similar ability but different gender perform differently on the item, then the item is said to be biased.

 

REFERENCES

Bloom, B. S., (1976) Human Characteristics and School Learning, New York : McGraw-Hill.

Bloom, B.(1984) The 2-sigma problem: The search for methods of group instruction as effective as one-to-one tutoring, Educational Researcher, 13, 4-16.

Carlson, R.,(2000) Assessing your students: Testing in the online course, Syllabus, 13, 16-18.

Dochy, P.; Segers, M. & Buehl, M. M.(1999) The relation between assessment practices and outcomes of studies: The case of research on prior knowledge, Review of Educational Research, 69, 145-186.

Elliot, E. & Dweck, C., (1988). Goals: An approach to motivation and achievement, Journal of Personality and Social Psychology, 54, 5-12.

Garhart, C. & Hannafin, M. (1986). The Accuracy of cognitive monitoring during computer-based instruction, Journal of Computer-Based Instruction, 13, 88-93.

Guskey. T. R. (1996). Implementing Mastery Learning, 2nd edition. Belmont CA: Wadsworth

Keller,F. S. (1968). Goodbye, teacher... J. Applied Behavioral Analysis, 1, 79-89.

Kulik, J. A., Kulik, C., & Carmichael, K. (1974). The Keller Plan in science teaching, Science, 183, 379-383. {U15.01}

Kulik, C. C. & Kulik, J. A., (1987). Mastery testing and student learning: A meta-analysis, J.Educational Technology Systems, 15, 325-345.

Kulik, J. A., Kulik, C. C., & Cohen, P. A., (1979). A meta-analysis of outcome studies of Keller's personalized system of instruction, American Psychologist, 34, 307-318.

Kulik, J. A., Kulik, C. C., & Bangert-Drowns, R. L. (1990). Effectiveness of mastery learning programs: A meta-analysis, Review of Educational Research, 60, 265-299.

Montazemi, A. R. & Wang, F. (1995). An empirical Investigation of CBI in support of mastery learning, J. Educational Computing Research, 13, 185-205.

Moore, J. W., Brooks, D. W., Fuller, R. G., & Jensen, D. D. (1977). Repeatable testing, J. Chem. Educ., 54, 276.

Penn, J. & Nedeff, V. M., (2000) Organic chemistry and the Internet: A Web-based approach to homework and testing using the WE_LEARN System, J. Chem. Educ., 77, 227-231.

Silberman, R (1978). The Keller Plan: A personal view, J. Chem. Educ., 55, 97-98.

Zandvliet, D. & Farragher, P. (1997). A comparison of computer-administered and written tests, Journal of Research on Computing in Education, 29, 423-438.

URLs

U15.01. The Keller Plan in Science Teaching, http://plaza.powersurfr.com/kegj /k.html (accessed 7/17/00).

U15.02. Wiley: Technology That Powers Education, http://www.wiley.com/college/egrade/about/index.html (accessed 7/17/00).

U15.03. Test Pilot Online Assessments, http://www.clearLearning.com/ (accessed 5/18/00).