Are These Tests Any Good? Part 2

This is the second entry in a series that examines the test quality of the New York State Math Regents Exams.  In the on-going debate about using student test scores to evaluate teachers (and schools, and the students themselves), the issue of test quality rarely comes up.  And the issue is crucial:  if the tests are ill-conceived, poorly constructed, and erroneous, how legitimate can they be as measures of teaching and learning?

In Part 1 of this series I looked at three questions that demonstrated a significant lack of mathematical understanding on the part of the exam writers.  Here, in Part 2, I will look at three examples of poorly designed questions.

The first is from the 2011 Integrated Algebra Regents:  how many different ways can five books be arranged on a shelf?

This simple question looks innocent enough, and I imagine most students would get it “right”.  Unfortunately, they’ll get it “right,” not by answering the question that’s been posed, but by answering the question the exam writers meant to ask.

How many different ways are there to arrange five books on a shelf?  A lot.  You can stack them vertically, horizontally, diagonally.  You can put them in different orders; you can have the spines facing out, or in.  You could stand them up like little tents.  You could arrange each book in a different way.  The correct answer to this question is probably “as many ways as you could possibly imagine”.  In fact, exploring this question in an open-ended, creative way might actually be fun, and mathematically compelling to boot.

But students are trained to turn off their creativity and give the answer that the tester wants to hear.  A skilled test-taker sees “How many ways can five books be arranged on a shelf?” and translates it into  “If I ignore everything I know about books and bookshelves, stand all the books upright in the normal way, don’t rotate, turn or otherwise deviate from how books in math problems are supposed to behave, then how many ways can I arrange them?”

This question is only partly assessing the student’s ability to identify and count permutations.  This question mostly tests whether the student understands what “normal” math problems are supposed to look like.

This problem is an ineffective assessment tool, but there’s something even worse about it.  Problems like this, of which there are many, teach students a terrible lesson:  thinking creatively will get you into trouble.  This is not something we want to be teaching.

Here’s a question from the 2011 Algebra II and Trigonometry exam:

Solving equations is one of the most important skills in math, and this question pertains to a particular method (completing the square) used to solve a particular kind of equation (quadratic).  But instead of simply asking the student to solve the problem using this method, the question asks something like “if this procedure is executed normally, what number will be written down in step four?”.

This is not testing the student’s ability to do math; instead, it’s testing whether or not they understand what “normal” math looks like.  There are many ways to solve equations, and there are many ways a student might use this method.  Whether it looks exactly like what the teacher did, or what the book did, isn’t especially relevant.  So why is that being tested?  And like the question above, this reinforces the idea that thinking creatively can be dangerous by insisting that students see the “normal” solution as the only correct one.

Finally, here’s a problem from the 2011 Geometry Regents:

Once again, the student is not being tested on their knowledge of a concept or on their ability to perform a task.  Instead, they’re being tested on whether or not they recognize what “normal” math looks like, and that’s just not something worth testing.  There are lots of legitimate ways to construct a perpendicular bisector:  why are we testing whether the student recognizes if the “normal” way has been used?

These three problems showcase some of the dangers inherent in standardized testing.  Questions like these, and the tests built from them, discourage creative thinking;  they send students the message that there is only one right way to do things; they reinforce the idea that the “correct” answer is whatever the tester, or teacher, wants to hear; and they de-emphasize real skills and understanding.

At their worst, these tests may not just be poor measures of real learning and teaching; they may actually be an obstacle to real learning and teaching.

Related Posts

A Good Math Story

This video lecture from the Princeton Public lecture series features psychologist Stanislas Dehaene and mathematician and author Steven Strogatz discussing math, learning, and teaching.

Dehaene, a leading researcher in how the brain acquires and processes mathematical knowledge, has some interesting things to say about whether mathematics is innate or learned.  He also describes some fascinating research conducted on number sense and geometric understanding in primitive societies.

What I enjoyed most in this lecture, however, are the two stories from Steven Strogatz about his personal experiences learning math.  At around the 23:30 mark, Strogatz tells the story of when he first became “really interested” in math.  Strogatz is a wonderful storyteller, and the tale should resonate with anyone who has discovered, or is discovering, their passion for math or science.

Strogatz’s second story is about his first experience being “weeded out” in an advanced math course in college.  This is a story I wish I had heard as a student, and it’s something I’ll definitely be sharing with my students from now on.

And for more on math and teaching from Professor Strogatz, check out how his book “The Calculus of Friendship” is a great read for advanced math students.

Fun With Fractals

I love spending time with my niece and nephew.  They are inquisitive, thoughtful, energetic, fun, and always up for anything.  As math is often on my mind, mathematical ideas often come up when we’re together.  I love exploring math with them:  we talk about numbers, ponder questions, play with shapes, estimate things.  They usually enjoy it, and if they don’t, they don’t mind telling me so.  Just like the rest of my students.

The last time I was in the neighborhood, I stopped by to see if they were around.  They weren’t home, but a I saw a piece of chalk lying on the ground.  I scribbled a quick note in the driveway letting them know I’d been by.  Before I knew it, a little triangle had appeared under my message.

When I came back later, there were many questions to answer about the mysterious diagram (it’s not the first time such triangles have appeared unexpectedly).  I told them a chilling tale about Waclaw Sierpinski, and then showed them how to make their own Sierpinski Triangle:  draw any triangle; find the midpoints of the sides; connect; repeat!  Soon, we were drawing triangles all over the neighborhood.

 An unprompted pentagon even appeared!

After lots of drawing, lots of questions, and a triangle scavenger hunt, it was time to go.  Another fun afternoon spent with my niece and nephew, and I left impressed, as I usually do.  And hopeful, too, that maybe one day my niece will actually admit that she likes math!

Are These Tests Any Good? Part 1

The test is a staple of modern education, and not just at the classroom level.  Today, tests can determine which public schools a child can attend, whether or not a student graduates, which districts get state aid, and of course, which colleges might want you.

There is a movement afoot which seeks to legally tie teacher job performance to student test scores.  There’s a simple argument at the core of this movement (“If students are doing poorly on tests, then the teacher must be doing a poor job”), and a simple counterargument (“There many factors beyond a teacher’s influence that affect student test performance”), but it’s a complex issue, and it has generated much controversy.

As the debate rages on in educational, political, and media circles, one particular aspect of this issue rarely gets discussed:  test quality.  If the tests being used to evaluate students, schools, and now teachers, are ill-conceived, sloppy, and erroneous, how legitimate a measure of teaching and learning could they possibly be?

In short, few people connected to this issue seem interested in the rather important question “Are these tests any good?”

I will address this question in a series of posts that examine the New York State Math Regents Exams.  I’ll begin this series by looking at three questions from the multiple choice section of the 2011 Algebra II and Trigonometry exam.  The official test and scoring guide can be found here.

First, an algebra question:  which answer is equivalent to the given expression?

The “correct” answer, according to the scoring guide, is (1).  However the real answer is that none of these are equivalent to the original expression.  For two expressions to be equivalent, they must agree for every possible value of their variables.  Let  x = -1 and y = 1; the original expression evaluates to 2; the “correct” answer evaluates to undefined(or, if you prefer, to 2i).  The two expressions, therefore, are not equivalent.

Now consider this question about graphs:  which graph is not a function?

A simple way to determine if a graph is the graph of a function is to use the vertical line test:  if a vertical line can be drawn through the graph so that it intersects the graph more than once, then the graph is not the graph of a function.  The “correct” answer according to the scoring guide, is (3), which is indeed not a function.  But take a closer look at (2):

As the red vertical lines suggest, this graph also appears to fail the vertical line test.  Therefore it is not a function.  This question has two correct answers, only one of which was awarded credit.

Lastly, consider this question, again about graphs.

As it turns out, none of these graphs is the graph of cos^{-1}(x).  The graph in (3), the “correct” answer, is only part of the correct graph.  It is not the entire graph.  The actual graph of cos^{-1}(x) extends infinitely up and down.  (If you feel that the notation cos^{-1}(x) implies a restriction, note that none of these restrictions are correct, either).

While it may seem that some of the issues raised here are merely technicalities, keep in mind that technicalities play an important role in mathematics.  Furthermore, students who truly understand the relevant issues here might actually be at a disadvantage on these questions, wasting time sorting through these poorly-conceived problems and worrying about which answer to give.

A lot could be riding on this test: student graduations, teacher jobs, schools closings. With the stakes so high for so many, it seems like we should be paying closer attention to the question: Are these tests any good?

Math and Art: Curvefitting With Geogebra

Here is some student work from a recent project I conducted on fitting curves to images in Geogebra.  The details of the assignment can be found here, and more examples of student work can be seen on my Facebook page.

Students were asked to find pictures and use Geogebra to fit trigonometric curves to the images using transformations. Here are some of the results.

Smart Water = Smart Curves

Geogebra.Curvefit.Water.Bottle

My Good-Looking Windowsill

Geogebra.Curvefit.Windowsill

Sine of Camel Humps

Geogebra.Curvefit.Camel

Overall, I was really impressed with the creativity the students showed, and their facility with fitting these curves to the forms!  A mathematical and artistic success in my book.

Related Posts

Follow

Get every new post delivered to your Inbox

Join other followers: