Beyond the Bubble Test: How 3D Assessment is Revolutionizing Science Learning

For decades, science education emphasized memorization—reciting vocabulary, listing steps in a process, or selecting the right answer from a multiple-choice test. But today, the landscape is changing. Guided by A Framework for K–12 Science Education and the Next Generation Science Standards (NGSS), classrooms are shifting toward helping students think, reason, and solve problems like scientists. As instruction changes, so must assessment. Enter: three-dimensional (3D) assessment.

Traditional vs. 3D Assessment: A Shift in Purpose

Traditional assessments often focused on isolated facts and procedures. In contrast, 3D assessments evaluate how students apply knowledge across three dimensions:

  • Science and Engineering Practices (SEPs): How students investigate, model, and explain scientific phenomena.

  • Disciplinary Core Ideas (DCIs): The essential science content students should understand.

  • Crosscutting Concepts (CCCs): Big-picture ideas like systems, cause and effect, and patterns that connect across science disciplines.

Instead of asking, “What do you remember?” 3D assessments ask, “How can you use what you know to explain what’s happening?”

What Do 3D Assessment Tasks Look Like?

3D assessment tasks are designed to mirror how science works in the real world. They:

  • Present a phenomenon or problem that students must make sense of.

  • Involve multi-step tasks that combine explanation, modeling, and reasoning.

  • Require students to apply knowledge rather than just recall it.

  • Use constructed-response formats (e.g., short answers, essays, models) that allow students to show their thinking.

Although multiple-choice questions can be written to assess all three dimensions, they are rare and harder to design. Open-ended tasks are better suited to capturing the depth of student understanding.

Assessment That Supports Learning

3D assessment isn't just about grading—it's a tool for instruction. Teachers use formative assessment throughout a unit to check for understanding and adjust their teaching. These informal or written checks (like notebook entries, classroom discussions, or exit tickets) help track how students’ ideas evolve over time.

Interpreting 3D Responses

Scoring 3D assessments requires a deeper look at student thinking. Teachers need rubrics that describe not just correct answers, but how students engage with practices, use core ideas, and connect across concepts. Some responses may be evaluated holistically, while others may be broken down to see where support is needed.

Resources like NGSS Evidence Statements and learning progressions can help teachers know what to expect at different grade levels.

The Challenge—and the Payoff

Designing and interpreting 3D assessments is complex and time-intensive. Teachers need time, tools, and training to:

  • Develop strong tasks and prompts,

  • Understand how practices, core ideas, and concepts work together, and

  • Give feedback that moves learning forward.

Despite the challenges, the benefits are clear: 3D assessment gives students the opportunity to demonstrate real scientific thinking and helps teachers support lasting understanding.

3 Resources to Dig Deeper…

2 Questions to Ponder & Discuss

  • How well does my instructional approach align with the idea of 3D assessment?

  • How can I providing opportunities through "Engineering or Science Challenges" for students to apply their knowledge and skills in novel contexts, preparing them to use their learning in future situation?

1 Action to Take

Transform Your Summative Assessments for Three-Dimensional Science Learning

To truly support and measure students' deep understanding and application of science knowledge, it is crucial to evolve our assessment practices. Therefore, we challenge you to select one of your current summative science assessments (e.g., an end-of-unit test or a final exam) and critically evaluate its alignment with three-dimensional learning using the Four Assessment Criteria. Based on your evaluation, select specific test items that fall short of the three-dimensional criteria and attempt to rewrite them. Focus on transforming items that currently assess only recall or single dimensions into tasks that require integrated sense-making.