Acadience Reading K-6 (aka DIBELS Next®)
Maze

Summary

Descriptive Information

Acadience Reading Maze uses standardized maze procedures for measuring reading comprehension. The purpose of a maze procedure is to measure the reasoning processes that constitute comprehension. Specifically, Maze assesses the student's ability to construct meaning from text using word recognition skills, background information and prior knowledge, familiarity with linguistic properties such as syntax and morphology, and reasoning skills. Maze can be given to a whole class at the same time, to a small group of students, or individually. Students are given a passage where approximately every seventh word has been replaced by a box containing the correct word and two distractor words. Using standardized directions, students are asked to read the passage silently and circle their word choices. The student receives credit for selecting the word that best fits the omitted word in the reading passage. The scores that are recorded are the number of correct and incorrect responses. The Maze Adjusted Score, which compensates for guessing, is calculated based on the number of correct and incorrect responses. Half the number of incorrect responses are subtracted from the total correct responses, and the difference is rounded up to the nearest whole number.

Acquisition & Cost

Where to Obtain:: Acadience Learning Inc. and Voyager Sopris Learning; info@acadiencelearning.org; Acadience Learning: 859 Willamette Street, Suite 320, Eugene, OR 97401; Voyager Sopris: 17855 Dallas Parkway, Suite 400, Dallas, TX 75287-6816; Acadience Learning: (541)4316931, (888) 943-1240; Voyager Sopris: (888) 399-1995; Acadience Learning: https://acadiencelearning.org/; Voyager Sopris: http://voyagersopris.com

Initial Cost:: Free

Replacement Cost:: Free

Included in Cost:: Acadience Learning: All materials are available for free download at https://acadiencelearning.org/acadiencereading.html, including progress monitoring student materials for each grade, assessor directions and keys for each grade, the Acadience Reading K-6 Assessment Manual, and the Acadience Reading Technical Manual. Large print materials are also available. Voyager Sopris: There are three purchasing options for implementing progress monitoring materials: 1) Progress monitoring via online test administration and scoring; 2) Progress monitoring materials as part of the purchase of classroom sets, which also include benchmark materials; and 3) Individual progress monitoring materials (i.e., Assessor Materials, Student Booklets). Classroom sets contain everything needed for one person to conduct the benchmark assessment for 25 students and the progress monitoring assessment for up to five students.; Approved accommodations are any accommodations that will not alter the standardization of the assessment. Approved Accommodations: 1. The use of colored overlays, filters, or lighting adjustments for students with visual impairments. 2. The use of student materials that have been enlarged or with larger print for students with visual impairments. 3. The use of assistive technology, such as hearing aids and assistive listening devices (ALDs), for students with hearing impairments. 4. The use of a marker or ruler to focus student attention on the materials for students who are not able to demonstrate their skills adequately without one.

Training & Technical Support

Training Requirements:: Approximately 1-2 hours of training to cover foundations of Acadience Reading, as well as administration and scoring of Maze.

Qualified Administrators:: Paraprofessional-level training and adequate training on administration and scoring of Maze.

Access to Technical Support:: Acadience Learning: Customer support is available from 8:00am to 5:00pm PT, Monday through Friday by phone, email, or through Acadience Learning's website; Voyager Sopris: Customer support is available 8:00am to 6:00pm CT, Monday through Friday by phone, email, or through the Voyager Sopris website.

Administration

Assessment Format:

Individual
Small group
Large group
Computer-administered

Scoring Time:

Scoring is automatic OR
1 minutes per worksheet

Scores Generated:

Raw score
Percentile score
Developmental benchmarks
Developmental cut points

Administration Time:

3 minutes per student or worksheet

Scoring Method:

Manually (by hand)
Automatically (computer-scored)

Technology Requirements:

Tool Information

Descriptive Information

Please provide a description of your tool:: Acadience Reading Maze uses standardized maze procedures for measuring reading comprehension. The purpose of a maze procedure is to measure the reasoning processes that constitute comprehension. Specifically, Maze assesses the student's ability to construct meaning from text using word recognition skills, background information and prior knowledge, familiarity with linguistic properties such as syntax and morphology, and reasoning skills. Maze can be given to a whole class at the same time, to a small group of students, or individually. Students are given a passage where approximately every seventh word has been replaced by a box containing the correct word and two distractor words. Using standardized directions, students are asked to read the passage silently and circle their word choices. The student receives credit for selecting the word that best fits the omitted word in the reading passage. The scores that are recorded are the number of correct and incorrect responses. The Maze Adjusted Score, which compensates for guessing, is calculated based on the number of correct and incorrect responses. Half the number of incorrect responses are subtracted from the total correct responses, and the difference is rounded up to the nearest whole number.

Is your tool designed to measure progress towards an end-of-year goal (e.g., oral reading fluency) or progress towards a short-term skill (e.g., letter naming fluency)?: End-year goal
Short-term skill

The tool is intended for use with the following grade(s).

Preschool / Pre - kindergarten
not selected

Kindergarten
not selected

First grade
not selected

Second grade
selected

Third grade
selected

Fourth grade
selected

Fifth grade
selected

Sixth grade
not selected

Seventh grade
not selected

Eighth grade
not selected

Ninth grade
not selected

Tenth grade
not selected

Eleventh grade
not selected

Twelfth grade

The tool is intended for use with the following age(s).

0-4 years old
not selected

5 years old
not selected

6 years old
not selected

7 years old
not selected

8 years old
not selected

9 years old
not selected

10 years old
not selected

11 years old
not selected

12 years old
not selected

13 years old
not selected

14 years old
not selected

15 years old
not selected

16 years old
not selected

17 years old
not selected

18 years old

The tool is intended for use with the following student populations.

Students in general education
selected

Students with disabilities
selected

English language learners

ACADEMIC ONLY: What dimensions does the tool assess?

Reading

Global Indicator of Reading Competence
not selected

Listening Comprehension
not selected

Vocabulary
not selected

Phonemic Awareness
not selected

Decoding

Passage Reading
not selected

Word Identification
selected

Comprehension

Spelling & Written Expression

Global Indicator of Spelling Competence
not selected

Global Indicator of Writting Expression Competence

Mathematics

Global Indicator of Mathematics Comprehension
not selected

Early Numeracy
not selected

Mathematics Concepts
not selected

Mathematics Computation
not selected

Mathematics Application
not selected

Fractions

Algebra

Other

Please describe specific domain, skills or subtests:

BEHAVIOR ONLY: Please identify which broad domain(s)/construct(s) are measured by your tool and define each sub-domain or sub-construct.

BEHAVIOR ONLY: Which category of behaviors does your tool target?

Acquisition and Cost Information

Where to obtain:

Email Address: info@acadiencelearning.org
Address: Acadience Learning: 859 Willamette Street, Suite 320, Eugene, OR 97401; Voyager Sopris: 17855 Dallas Parkway, Suite 400, Dallas, TX 75287-6816
Phone Number: Acadience Learning: (541)4316931, (888) 943-1240; Voyager Sopris: (888) 399-1995
Website: Acadience Learning: https://acadiencelearning.org/; Voyager Sopris: http://voyagersopris.com

Initial cost for implementing program:

Cost: $0.00
Unit of cost: Acadience Learning: Download for free. Minimal costs associated with printing. Voyager Sopris: $6.49 for Assessor Materials and $10.95 for 5-Pack Student Booklets ($2.19/student)

Replacement cost per unit for subsequent use:

Cost: $0.00
Unit of cost: Acadience Learning: Download for free. Minimal costs associated with printing. Voyager Sopris: $10.95 for 5-pack Student Booklets ($2.19/student)
Duration of license: Voyager Sopris: Number of forms/booklets.

Additional cost information:

Describe basic pricing plan and structure of the tool. Provide information on what is included in the published tool, as well as what is not included but required for implementation.: Acadience Learning: All materials are available for free download at https://acadiencelearning.org/acadiencereading.html, including progress monitoring student materials for each grade, assessor directions and keys for each grade, the Acadience Reading K-6 Assessment Manual, and the Acadience Reading Technical Manual. Large print materials are also available. Voyager Sopris: There are three purchasing options for implementing progress monitoring materials: 1) Progress monitoring via online test administration and scoring; 2) Progress monitoring materials as part of the purchase of classroom sets, which also include benchmark materials; and 3) Individual progress monitoring materials (i.e., Assessor Materials, Student Booklets). Classroom sets contain everything needed for one person to conduct the benchmark assessment for 25 students and the progress monitoring assessment for up to five students.

Provide information about special accommodations for students with disabilities.: Approved accommodations are any accommodations that will not alter the standardization of the assessment. Approved Accommodations: 1. The use of colored overlays, filters, or lighting adjustments for students with visual impairments. 2. The use of student materials that have been enlarged or with larger print for students with visual impairments. 3. The use of assistive technology, such as hearing aids and assistive listening devices (ALDs), for students with hearing impairments. 4. The use of a marker or ruler to focus student attention on the materials for students who are not able to demonstrate their skills adequately without one.

Administration

BEHAVIOR ONLY: What type of administrator is your tool designed for?

General education teacher
not selected

Special education teacher
not selected

Parent

Child

External observer
not selected

Other

If other, please specify:

BEHAVIOR ONLY: What is the administration format?

Direct observation
not selected

Rating scale
not selected

Checklist

Performance measure
not selected

Other

If other, please specify:

BEHAVIOR ONLY: What is the administration setting?

General education classroom
not selected

Special education classroom
not selected

School office
not selected

Recess

Lunchroom

Home

Other

If other, please specify:

Does the program require technology?

If yes, what technology is required to implement your program? (Select all that apply)

Computer or tablet
not selected

Internet connection
not selected

Other technology (please specify)

If your program requires additional technology not listed above, please describe the required technology and the extent to which it is combined with teacher small-group instruction/intervention:

What is the administration context?

Individual

Small group If small group, n=

Large group If large group, n=

Computer-administered
not selected

Other

If other, please specify:

Note- computer administered available from Voyager Sopris.

What is the administration time?

Time in minutes

per (student/group/other unit)

student or worksheet

Additional scoring time:

Time in minutes

per (student/group/other unit)

worksheet

How many alternate forms are available, if applicable?

Number of alternate forms

per (grade/level/unit)

grade level

ACADEMIC ONLY: What are the discontinue rules?

No discontinue rules provided
not selected

Basals

Ceilings

Other

If other, please specify:

BEHAVIOR ONLY: Can multiple students be rated concurrently by one administrator?

If yes, how many students can be rated concurrently?

Training & Scoring

Training

Is training for the administrator required?: Yes

Describe the time required for administrator training, if applicable:: Approximately 1-2 hours of training to cover foundations of Acadience Reading, as well as administration and scoring of Maze.

Please describe the minimum qualifications an administrator must possess.: Paraprofessional-level training and adequate training on administration and scoring of Maze.; No minimum qualifications

Are training manuals and materials available?: Yes

Are training manuals/materials field-tested?: Yes

Are training manuals/materials included in cost of tools?: Yes
If No, please describe training costs:

Can users obtain ongoing professional and technical support?: Yes
If Yes, please describe how users can obtain support:: Acadience Learning: Customer support is available from 8:00am to 5:00pm PT, Monday through Friday by phone, email, or through Acadience Learning's website; Voyager Sopris: Customer support is available 8:00am to 6:00pm CT, Monday through Friday by phone, email, or through the Voyager Sopris website.

Scoring

BEHAVIOR ONLY: What types of scores result from the administration of the assessment?

Score
Observation	Behavior Rating
Frequency Duration Interval Latency	Raw score

Conversion
Observation	Behavior Rating
Rate Percent	Standard score Subscale/ Subtest Composite Stanine Percentile ranks Normal curve equivalents IRT based scores

Interpretation
Observation	Behavior Rating
Error analysis Peer comparison Rate of change	Dev. benchmarks Age-Grade equivalent

How are scores calculated?

Manually (by hand)
selected

Automatically (computer-scored)
not selected

Other

If other, please specify:

Do you provide basis for calculating performance level scores?

Yes

What is the basis for calculating performance level and percentile scores?

Age norms

Grade norms
not selected

Classwide norms
not selected

Schoolwide norms
not selected

Stanines

Normal curve equivalents

What types of performance level scores are available?

Raw score

Standard score
selected

Percentile score
not selected

Grade equivalents
not selected

IRT-based score
not selected

Age equivalents
not selected

Stanines

Normal curve equivalents
selected

Developmental benchmarks
selected

Developmental cut points
not selected

Equated

Probability
not selected

Lexile score
not selected

Error analysis
not selected

Composite scores
not selected

Subscale/subtest scores
not selected

Other

If other, please specify:

Please describe the scoring structure. Provide relevant details such as the scoring format, the number of items overall, the number of items per subscale, what the cluster/composite score comprises, and how raw scores are calculated.: Maze is a group or individually administered measure. The assessor asks students to read a passage and circle the word that makes the most sense in the story. The assessor scores the Maze worksheet after the student has completed it. The assessor corrects the worksheet and calculates the student's number of correct and incorrect responses. If a student completes the assessment before the allotted time (3 minutes) is up, the assessor does not prorate the score. The student receives 1 point for each correct word, minus half a point for each incorrect word. A response is correct if the student circled or otherwise marked the correct word. The assessor will mark a slash (/) through any incorrect responses. Incorrect responses include errors, boxes with more than one answer marked, and items left blank (if they occur before the last item the student attempted within the 3-minute time limit). Items left blank because the student could not get to them before time ran out do not need to be slashed and do not count as incorrect responses. If there are erasure marks, scratched out words, or any other extraneous markings, and the student’s final response is obvious, the assessor should score the item based on that response. Assessors record both scores (correct and incorrect) on the cover sheet. On the cover sheet, “C” designates correct responses and “I” designates incorrect responses. For progress monitoring, there is no scoring booklet for Maze, but there is a progress monitoring chart to record the scores. The Adjusted Score is a modified score that compensates for student guessing and is calculated using the following formula: Adjusted Score = number of correct responses – (number of incorrect responses ÷ 2). The result of the formula should then be rounded to the nearest whole number. Half-points (0.5) should be rounded up. The minimum Adjusted Score is 0. Negative numbers are not recorded.

Do you provide basis for calculating slope (e.g., amount of improvement per unit in time)?: Yes

ACADEMIC ONLY: Do you provide benchmarks for the slopes?: No

ACADEMIC ONLY: Do you provide percentile ranks for the slopes?: No

What is the basis for calculating slope and percentile scores?

Age norms

Grade norms
not selected

Classwide norms
not selected

Schoolwide norms
not selected

Stanines

Normal curve equivalents

Describe the tool’s approach to progress monitoring, behavior samples, test format, and/or scoring practices, including steps taken to ensure that it is appropriate for use with culturally and linguistically diverse populations and students with disabilities.: The Acadience Reading K-6 measures were designed to be economical and efficient indicators of a student's progress toward achieving a general outcome such as reading or phonemic awareness, and to be used for both benchmark assessment and progress monitoring. Progress monitoring refers to the more frequent testing of students who may be at risk for future reading difficulty on the skill areas in which they are receiving instruction, to ensure that they are making adequate progress. Progress monitoring can be conducted using grade-level or out-of-grade materials, depending on the student's needs. Decisions about the skill areas and levels to monitor are made at the individual student level. Students who are receiving additional support should be monitored for progress more frequently to ensure that the instructional support being provided is helping them get back on track. Monitoring may occur once per month, once every two weeks, or as often as once per week. In general, students who need the most intensive instruction are monitored for progress most frequently. Progress monitoring materials contain alternate forms of the same measures administered during benchmark assessment. Each alternate form is of equivalent difficulty. Not all students will need progress monitoring. Progress monitoring materials are organized by measure, since students who need progress monitoring will typically be monitored on specific measures related to the instruction they are receiving, rather than on every measure for that grade. Material selected for progress monitoring must be sensitive to growth, yet still represent an ambitious goal. The standardized procedures for administering an Acadienc Reading K-6 measure may apply when using Acadience Reading K-6 for progress monitoring. Progress monitoring data should be graphed and readily available to those who teach the student. An aimline should be drawn from the student's current skill level (which may be the most recent benchmark assessment score) to the goal. Progress monitoring scores can then be plotted over time and examined to determine whether they indicate that the student is making adequate progress (i.e. fall above or below the aimline). The Acadience Reading K-6 assessments were designed to support students of varied backgrounds. Passages were written with names that represent diverse cultural, racial, and ethnic groups. Acadience Reading K-6 is appropriate for most students for whom an instructional goal is to learn to read in English. For English language learners who are learning to read in English, Acadience Reading K-6 is appropriate for assessing and monitoring progress in acquisition of early reading skills.

Rates of Improvement and End of Year Benchmarks

Is minimum acceptable growth (slope of improvement or average weekly increase in score by grade level) specified in your manual or published materials?: Yes; If yes, specify the growth standards:; Using Acadience Reading Pathways of Progress, the growth standards depend on the student's beginning of year performance relative to students with similar levels of initial skills, i.e., student performance is only compared to other students who have the same beginning of year score. Student scores above the 80th percentile are considered Well Above Typical progress. Student scores between the 60th and 79th percentile are considered Above Typical progress.Student scores between the 40th and 59th percentile are considered Typical progress. Student scores between the 20th and 39th percentile are considered Below Typical progress. And student scores below the 20th percentile are considered Well Below Typical progress.

Are benchmarks for minimum acceptable end-of-year performance specified in your manual or published materials?: Yes; If yes, specify the end-of-year performance standards:; Three primary end-of-year performance standards are specified: Well Below Benchmark, Below Benchmark, and At or Above Benchmark. These standards are used to indicate increasing odds of achieving At or Above benchmark status at the next benchmark administration. End of year benchmarks goals and cut points for risk: Grade 3 benchmark goal: 19, cut point: 14; Grade 4 benchmark goal: 24, cut point: 20; Grade 5 benchmark goal: 24, cut point: 18; Grade 6 benchmark goal: 21, cut point: 15.

What is the basis for specifying minimum acceptable growth and end of year benchmarks?

Norm-referenced
selected

Criterion-referenced
selected

Other

If other, please specify:

True

If norm-referenced, describe the normative profile.

National representation (check all that apply):

Northeast:

New England

Middle Atlantic

Midwest:

East North Central

West North Central

South:

South Atlantic

East South Central

West South Central

West:

Mountain

Pacific

Local representation (please describe, including number of states)

The percentile ranks for the Acadience Reading national norms are based on a large national sample of school children across the United States. Data from the 2014-2015 school year were exported from three separate data management systems and combined into one data set. The final combined sample included approximately 2,765,000 students from 8,805 schools within 2,211 school districts in all 50 states and the District of Columbia, representing every census region in the United States. Thirty five percent of schools were located in cities, 26% were located in suburbs, 12% were located in towns, and 27% were located in rural areas.

Date: 2018
Size: 2,748,243

Gender (Percent)

Male: 52%
Female: 48%
Unknown: 0%

SES indicators (Percent)

Eligible for free or reduced-price lunch: 60%
Other SES Indicators

Race/Ethnicity (Percent)

White, Non-Hispanic: 45.63%
Black, Non-Hispanic: 16.32%
Hispanic: 29.25%
American Indian/Alaska Native: 1.69%
Asian/Pacific Islander: 3.22%
Other: 3.53%
Unknown: 0%

Disability classification (Please describe)
First language (Please describe)
Language proficiency status (Please describe)

Do you provide, in your user’s manual, norms which are disaggregated by race or ethnicity? If so, for which race/ethnicity?

White, Non-Hispanic
not selected

Black, Non-Hispanic
not selected

Hispanic

American Indian/Alaska Native
not selected

Asian/Pacific Islander
not selected

Other

Unknown

If criterion-referenced, describe procedure for specifying criterion for adequate growth and benchmarks for end-of-year performance levels.

The Acadience Reading K-6 benchmark goals provide targeted levels of skill that students need to achieve by specific points in time in order to be considered to be making adequate progress. The Group Reading Assessment and Diagnostic Evaluation (GRADE; Williams, 2001), a high- quality, nationally norm-referenced assessment, was used as an external criterion in the Benchmark Goal Study. In the Benchmark Goal Study, the 40th percentile at or above the GRADE Total Test Raw Score was used as one approximation of adequate reading skill. The intent is to develop generalizable benchmark goals and cut points that are relevant and appropriate for a wide variety of reading outcomes, across a wide variety of states and regions, and for diverse groups of students. The principle vision for Acadience Reading K-6 is a step-by-step vision. Student skills at or above benchmark at the beginning of the year put the odds in favor of the student achieving the middle-of-year benchmark goal. In turn, students with skills at or above benchmark in the middle of the year have the odds in favor of achieving the end-of-year benchmark goal. Finally, students with skills at or above benchmark at the end of the year have odds in favor of having adequate reading skills on a wide, general variety of external measures of reading proficiency. The fundamental logic for developing the benchmark goals and cut points for risk was to begin with the external outcome goal and work backward in that step-by- step system. We first obtained an external criterion measure (the GRADE Total Test Raw Score) at the end of the year with a level of performance that would represent adequate reading skills (the GRADE Total Test Raw Score at the 40th percentile rank). Next, we specified the benchmark goal and cut point for risk for end-of-year Maze with respect to the end-of-year external criterion. Then, using the Maze end-of-year goal as an internal criterion, we established the benchmark goals and cut points for risk for middle-of-year Maze. Finally, we established the benchmark goals and cut points for risk for beginning-of-year Maze using the middle-of-year Maze goal as an internal criterion (see pp. 44-78) of the Acadience Reading K-6 Technical Manual.

Describe any other procedures for specifying adequate growth and minimum acceptable end of year performance.

Acadience Reading Pathways of Progress offers a means of indexing student progress that can be used to evaluate the effectiveness of instruction, establish meaningful, attainable, and ambitious goals, and provide feedback on progress to students and educators. Pathways of Progress is based upon student growth percentiles. Student growth percentiles provide a measure of “how (ab)normal a student’s growth is by examining their current achievement relative to their academic peers—those students beginning at the same place” (Betebenner, 2011, p. 3). Pathways of Progress is based on an analysis of Acadience Reading scores from students across grades K-6 (N ≈ 1.8 million students). Pathways are calculated in a three-step process: 1. At each grade level, students were grouped by their beginning-of-year Acadience Reading Composite Score (BOY RCS) for scores between the first and the 99.5th percentile rank. For each unique BOY RCS, the 20th, 40th, 60th, and 80th quantiles were calculated for the end-of-year Acadience Reading measure (e.g., ORF Words Correct, Maze Adjusted Score, NWF-CLS) or RCS. 2. A stiff spline quantile regression model was fit to each quantile using BOY RCS as the predictor (mean RMSE = .99 for all grades). 3. The predicted quantile scores from the regression model corresponding to each unique BOY RCS were rounded to the nearest one, forming the end-of-year pathway borders. After end-of-year benchmark administration, each student’s score will fall into a single pathway based on the expectation of progress from their beginning-of-year score (Pathway 3 = Typical Progress). Educators may use our Pathways of Progress goal setting utility to establish meaningful, attainable and ambitious progress monitoring goals for individual students. When used in conjunction with the Acadience Reading benchmark goals, Pathways of Progress further empowers educators to set goals that are meaningful, ambitious, and attainable. The Acadience Reading benchmark goals are the same for all students in a grade, regardless of their starting skill level, and represent the lowest score for which a student is likely to still be on track to reach future reading outcomes (e.g., to be on track for fourth grade, every third- grade students should reach a Reading Composite Score of 330 by the end of the year). While benchmark goals are meaningful, there may be some students for whom they are not ambitious enough, and others for whom they are unattainable. Pathways of Progress helps increase decision-making precision with respect to goal setting and evaluating progress. Pathways of Progress allows teachers to use a normative context, in addition to the benchmark goals, when setting goals and evaluating progress. Pathways of Progress clarifies what rate of progress is Typical, Above Typical, or Well Above Typical. Pathways of Progress also informs educators when the rate of progress is Below Typical or Well Below Typical. Teachers can use the Pathways of Progress goal-setting utility available in Acadience Data Management to see the target scores for each pathway and set end-of-year goals for students. These features will assist teachers when tracking students’ progress toward their goals throughout the year. Setting goals is particularly important for students who are performing Below or Well Below Benchmark and in need of additional instructional support. Goal setting is a professional decision that should be made with several considerations in mind. Student goals should represent a professional judgment about a goal that is simultaneously meaningful, ambitious, and attainable. When setting goals, consider the following: 1. What is a meaningful goal? • The big idea is to increase a student’s odds of achieving important literacy outcomes in the future. Therefore, goals should be set with the intention of students exceeding, achieving, or coming as close as possible to their Acadience Reading grade-level benchmark goals. • Moving a student from Below Benchmark to At or Above Benchmark or moving a student from Well Below Benchmark to either Below Benchmark or to At or Above Benchmark represents a meaningful goal. 2. What is an ambitious goal? • Above Typical Progress (Pathway 4) and Well Above Typical Progress (Pathway 5) represent ambitious goals. Below Typical Progress (Pathway 2) and Well Below Typical Progress (Pathway 1) are not considered ambitious goals. • Typical Progress (Pathway 3) may be sufficient for students who are already At or Above Benchmark. • Typical Progress may not be adequate for students who are likely to need additional support to achieve benchmark goals. 3. What is an attainable goal? • Goals in the Well Above Typical range may not always be attainable. • Typical and Above Typical Progress are likely attainable. Well Below Typical and Below Typical Progress may be attainable, but are not ambitious or meaningful. Appropriate goals are both attainable and ambitious. • It is important to consider what might be possible with a very effective, research-based intervention. As progress monitoring data are collected and plotted on a graph, educators can determine where those data fall relative to the Pathways. To make decisions about student progress, we advocate an approach that uses a moving median of the three most recent progress monitoring data points to determine what Pathway a student's data are following. For students who are well below benchmark and likely in need of intensive intervention, at least above typical progress represents acceptable and meaningful growth. Please see the following supporting documents for additional information about the goal setting utility and the technical characteristics of Pathways of Progress: • PathwaysOfProgress_GoalSettingUtility • 2015-02-04 PCRC 2015 Poster handout final • PCRC Pathways Handout_2016-02-03 • Final_PCRC Handout_2018-01-31 • NASP2019UnlockingPotential_2-2019_Final_Handout • 2019-07-11 Pathways of Progress Part A • 2019-07-11 Pathways_of_Progress_Part • 2019-07-02 Pathways of Progress Activities Acadience

Performance Level

Reliability

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

*Offer a justification for each type of reliability reported, given the type and purpose of the tool.: Reliability refers to the relative stability with which a test measures the same skills across minor differences in conditions. Three types of reliability are reported in the table below, alternate form reliability, alpha, and inter-rater reliability. Alternate form reliability is the correlation between different measures of the same early literacy skills. The coefficient reported is the correlation between two forms of the measure. High alternate-form reliability coefficients suggest that these multiple forms are measuring the same construct. Coefficient alpha is a measure of reliability that is widely used in education research and represents the proportion of true score to total variance. Alpha incorporates information about the average inter-test correlation as well as the number of tests. Inter-rater reliability indicates the extent to which results generalize across assessors. The inter-rater reliability estimates reported represent the reliability of the directions and scoring procedures of the measures themselves as interpreted by the assessors administering the measure.

*Describe the sample(s), including size and characteristics, for each reliability analysis conducted.: The data used for assessing reliability came from third through sixth grade. The total sample size is 674 students from 13 schools within 5 school districts. The sample was drawn from two census regions (Pacific and North Central Midwest).

*Describe the analysis procedures for each reported type of reliability.: Alternate form reliability is reported as the correlation between two alternate forms of the same test. Coefficient alpha treats the two tests as separate indicators and is calculated using the alternate form reliability, where the number of tests is equal to two. For inter-rater reliability, pairwise correlations were performed on a data set that included scores from the administrator of the measure and a shadow scorer, resulting in two scores of the same student performance on the Maze measure.

*In the table(s) below, report the results of the reliability analyses described above (e.g., model-based evidence, internal consistency or inter-rater reliability coefficients). Include detail about the type of reliability data, statistic generated, and sample size and demographic information.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published reliability studies:: Yes

Provide citations for additional published studies.: Dewey, E. N., Latimer, R. J., Kaminski, R. A., & Good, R. H. (2011). DIBELS Next Development: Findings from Beta 2 Validation Study (Tech. Report No. 10). Eugene, OR: Dynamic Measurement Group. Available: https://acadienclearning.org. Powell-Smith, K. A., Good, R. H., Latimer, R. J., Dewey, E. N., & Kaminski, R. A. (2011). DIBELS Next Benchmark Goals Study (Tech. Report No. 11). Eugene, OR: Dynamic Measurement Group. Available: https://acadienclearning.org. Dewey, E. N., Powell-Smith, K. A., Good, R. H., & Kaminski, R. A. (2015). Acadience Reading K–6 Technical Adequacy Brief. Eugene, OR: Acadience Learning. Available: https://acadiencelearning.org. Please note that that Dynamic Measurement Group is now Acadience Learning and Acadience Reading K-6 is also published as DIBELS Next. Some historical documents retain the original assessment name and company name.

Do you have reliability data that are disaggregated by gender, race/ethnicity, or other subgroups (e.g., English language learners, students with disabilities)?: No

If yes, fill in data for each subgroup with disaggregated reliability data.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published reliability studies:: No

Provide citations for additional published studies.

Validity

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

*Describe each criterion measure used and explain why each measure is appropriate, given the type and purpose of the tool.: The Group Reading Assessment and Diagnostic Evaluation (GRADE) is an untimed, group-administered, norm-referenced reading achievement test appropriate for children in preschool through grade 12. The GRADE is comprised of 16 subtests within five components. Not all 16 subtests are used at each testing level. Various subtest scores are combined to form the Total Test composite score. The GRADE Total Test score is comprised of scores across subtests of the GRADE that vary by grade level. In kindergarten, the GRADE Total Test score is comprised of measures that assess phonics and phonemic and phonological awareness. In first and second grade, the GRADE Total Test includes word meaning, passage (or sentence) reading, and comprehension measures. In third grade, the GRADE Total Test is comprised of measures assessing word reading, vocabulary, and comprehension. In fourth, fifth, and sixth grade, the GRADE Total Test includes scores from measures of vocabulary and comprehension. The AzMERIT includes a number of different types of questions, including performance tasks that are multi-step assignments that ask students to apply their knowledge and skills to address real-world problems. In English Language Arts (ELA), the subtest examined in our analyses, students apply their research and writing skills. The test also includes traditional multiple choice questions, as well as interactive questions that require students to drag and drop their answers into a box, create equations, and fill in the answer. The California Standards Test (CST) is a statewide achievement test produced for California public schools and was designed to assess the California content standards for English/language arts (ELA), mathematics, history–social science, and science in grades 2-11. According to a technical report from ETS (2011), the CST items were developed and designed to conform to principles of item writing defined by ETS (ETS, 2002). In addition, the items selected underwent an extensive item review process designed to provide the best standards-based tests possible. The Reading cluster of the ELA portion of the CST was examined in our analyses.

*Describe the sample(s), including size and characteristics, for each validity analysis conducted.: The GRADE data set included scores for students in third and fifth sixth grade. The total sample size is 382 students from 13 schools within 5 school districts. The sample was drawn from two census regions (Pacific and North Central Midwest). The AzMERIT data set included scores for students in third and fourth grade. The total sample size was 1,253 students from 16 schools in 1 large-city school district in Mountain West US state. 54% of students were Hispanic/Latino, 23% were White, 11% were Black/African American, 8% were American Indian/Native Alaskan, 5% were Multiracial, and 4% were Asian/Native Hawaiian/Pacific Islander. The CST data set included 2,986 students in fourth through sixth grade from 14 schools in 1 large-suburban school district in 1 Pacific West US state. Approximately 46% of students were White and 38% were Hispanic/Latino. Thirty one percent of students in the district qualified for free/reduced lunch and 20% were English Language Learners.

*Describe the analysis procedures for each reported type of validity.: Predictive validity is the correlation between the Maze Adjusted Score at the beginning of the year and the GRADE, AzMERIT, or CST (as indicated) score at the end of the school year. This coefficient represents the extent to which Maze can predict later reading outcomes. Concurrent validity is the correlation between the Maze Adjusted Score and the GRADE, AzMERIT, or CST (as indicated) measure both at the end of the year. This coefficient represents the extent to which Maze is related to important reading outcomes.

*In the table below, report the results of the validity analyses described above (e.g., concurrent or predictive validity, evidence based on response processes, evidence based on internal structure, evidence based on relations to other variables, and/or evidence based on consequences of testing), and the criterion measures.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of validity analysis not compatible with above table format:

Manual cites other published reliability studies:: No

Provide citations for additional published studies.: Dewey, E. N., Powell-Smith, K. A., Good, R. H., & Kaminski, R. A. (2015). Acadience Reading K–6 Technical Adequacy Brief. Eugene, OR: Acadience Learning. Dewey, E. N., Latimer, R. J., Kaminski, R. A., & Good, R. H. (2011). DIBELS Next Development: Findings from Beta 2 Validation Study (Tech. Report No. 10). Eugene, OR: Dynamic Measurement Group. Available: https://acadiencelearning.org. Powell-Smith, K. A., Good, R. H., Latimer, R. J., Dewey, E. N., & Kaminski, R. A. (2011). DIBELS Next Benchmark Goals Study (Tech. Report No. 11). Eugene, OR: Dynamic Measurement Group. Available: https://acadiencelearning.org. Please note that that Dynamic Measurement Group is now Acadience Learning and Acadience Reading K-6 is also published as DIBELS Next. Some historical documents retain the original assessment name and company name.

Describe the degree to which the provided data support the validity of the tool.: Both the concurrent and predictive correlations are high. These strong correlations suggest that Acadience Reading Maze assesses skills relevant to broad reading outcomes.

Do you have validity data that are disaggregated by gender, race/ethnicity, or other subgroups (e.g., English language learners, students with disabilities)?: No

If yes, fill in data for each subgroup with disaggregated validity data.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of validity analysis not compatible with above table format:

Manual cites other published reliability studies:: No

Provide citations for additional published studies.

Bias Analysis

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating	Not Provided	Not Provided	Not Provided	Not Provided

Have you conducted additional analyses related to the extent to which your tool is or is not biased against subgroups (e.g., race/ethnicity, gender, socioeconomic status, students with disabilities, English language learners)? Examples might include Differential Item Functioning (DIF) or invariance testing in multiple-group confirmatory factor models.: Yes

If yes,
a. Describe the method used to determine the presence or absence of bias:: Bias was conceptualized as different classification accuracy between different groups. This was assessed using a Cleary model with the dichotomous outcome of status on the criterion, where the Maze Adjusted Score, subgroup, and the interaction between the two were used as predictors. If a model with the subgroup and interaction term do not add significantly to model fit, there was evidence that Maze is not biased. Model fit was assessed using the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and the likelihood ratio test (LRT). The effect size for bias was assessed using the difference in AUC for the ROC curves for the different groups. These models were tested for each grade, at each time of year.

b. Describe the subgroups for which bias analyses were conducted:: Bias was assessed across genders and among white and non-white students.

c. Describe the results of the bias analyses conducted, including data and interpretative statements. Include magnitude of effect (if available) if bias has been identified.: Of the 9 models examining bias across ethnicities the AIC and LRT favored a model without bias eight times, while the BIC favored a model without bias all nine times. Of the 21 models examining bias across genders, the AIC favored a model without bias 17 times while the BIC favored a model without bias 20 times. Likewise, the likelihood ratio test favored a model with bias only three times out of 21 models. The results show that the rate of preferring model with bias is near the global Type I error rate of .05, suggesting a lack of bias on the Maze measure.

Growth Standards

Sensitivity: Reliability of Slope

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

Describe the sample, including size and characteristics. Please provide documentation showing that the sample was composed of students in need of intensive intervention. A sample of students with intensive needs should satisfy one of the following criteria: (1) all students scored below the 30th percentile on a local or national norm, or the sample mean on a local or national test fell below the 25th percentile; (2) students had an IEP with goals consistent with the construct measured by the tool; or (3) students were non-responsive to Tier 2 instruction. Evidence based on an unknown sample, or a sample that does not meet these specifications, may not be considered.: The sample consisted of students who were identified as being "Well Below Benchmark" using the benchmark assessment of Acadience Reading at the beginning of year. Being Well Below Benchmark corresponds to being below the 19th, 18th, 19th, and 10th percentiles for third, fourth, fifth, and sixth grades, respectively. Students were only selected if they had a minimum of 15 observations.

Describe the frequency of measurement (for each student in the sample, report how often data were collected and over what span of time).: Progress monitoring data were collected throughout the school year at the discretion of the administering school, but not more frequently than once per week. Any student who had fewer than fifteen progress monitoring assessments was excluded from the analysis.

Describe the analysis procedures.: Reliability of slope was calculated as the ratio of true score variance to observed total variance. The true score variance estimate came from a hierarchical linear model based estimate of the variance in progress monitoring slopes (using the R package lme4), the observed score variance was calculated as the variance of the ordinary least squares slopes created for each student that met the aforementioned inclusion criteria. Confidence intervals were calculated using bootstrap estimation.

In the table below, report reliability of the slope (e.g., ratio of true slope variance to total slope variance) by grade level (if relevant).

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published reliability studies:: No

Provide citations for additional published studies.

Do you have reliability of the slope data that is disaggregated by subgroups (e.g., race/ethnicity, gender, socioeconomic status, students with disabilities, English language learners)?: No

If yes, fill in data for each subgroup with disaggregated reliability of the slope data.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published reliability studies:: No

Provide citations for additional published studies.

Sensitivity: Validity of Slope

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

Describe each criterion measure used and explain why each measure is appropriate, given the type and purpose of the tool.: For the Acadience Maze progress monitoring assessment, we used the Acadience Oral Reading Fluency Words Correct at the end of the subsequent year as the outcome measure. For instance, to calculate the validity of grade 3 progress monitoring slopes, we used the Oral Reading Fluency Words Correct score at the end of grade 4. While the criterion is internal in the sense that both the progress monitoring assessment and the criterion are Acadience measures, the criterion is external in the sense that it is distinct and separate from the Maze progress monitoring system. Indeed, there is no shared method variance between the two: (a) the Maze assessment requires students to read a passage silently and fill in blanks for approximately every 7th word by selecting from a choice of three words the word that makes the most sense in the passage, (b) The Oral Reading Fluency Words Correct assessment requires a student to read a passage aloud accurately and fluently. In addition, there is no overlap of item samples: The passages used for the Maze assessment are completely different and share no overlap with the passages used for the Oral Reading Fluency Words Correct assessment. These requirements (external measures, no shared method variance, no overlap of item samples) serve to ensure a conceptual distance between the slope of Maze and the criterion. In the reported analysis we increased the length of time between the slope of Maze and the criterion measure by examining outcomes to the end of the subsequent academic year. So, for example, the validity of slope of progress on third-grade Maze assessment was examined with respect to end of fourth grade Oral Reading Fluency Words Correct. In sum, we believe that using both an alternative measure of reading skills (Maze vs. Oral Reading Fluency Words Correct), and the length of time between the end of progress monitoring and the criterion (an entire year between the last progress motioning occasion and the criterion) provides a sufficiently powerful examination of the validity of slope.

Describe the sample(s), including size and characteristics. Please provide documentation showing that the sample was composed of students in need of intensive intervention. A sample of students with intensive needs should satisfy one of the following criteria: (1) all students scored below the 30th percentile on a local or national norm, or the sample mean on a local or national test fell below the 25th percentile; (2) students had an IEP with goals consistent with the construct measured by the tool; or (3) students were non-responsive to Tier 2 instruction. Evidence based on an unknown sample, or a sample that does not meet these specifications, may not be considered.: The sample consisted of students who were identified as being "Well Below Benchmark" using the benchmark assessment of Acadience Reading at the beginning of year. Being Well Below Benchmark corresponds to being below the 19th, 18th, 19th, and 10th percentiles for third, fourth, fifth, and sixth grades, respectively. Students were only selected if they had a minimum of 15 observations.

Describe the frequency of measurement (for each student in the sample, report how often data were collected and over what span of time).: Progress monitoring data were collected throughout the school year at the discretion of the administering school, but not more frequently than once per week. Any student who had fewer than fifteen progress monitoring assessments was excluded from the analysis.

Describe the analysis procedures for each reported type of validity.: Validity of slope was assessed using the partial correlations between the students' ordinary least squares slope and the criterion, while controlling for the students' ordinary least squares intercept.

In the table below, report predictive validity of the slope (correlation between the slope and achievement outcome) by grade level (if relevant).
NOTE: The TRC suggests controlling for initial level when the correlation for slope without such control is not adequate.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published validity studies:: No

Provide citations for additional published studies.

Describe the degree to which the provided data support the validity of the tool.: The moderate to strong partial correlations that the OLS slopes have with a criterion that is separated by an entire year and a conceptually different measure of reading skills provides strong evidence for validity.

Do you have validity of the slope data that is disaggregated by subgroups (e.g., race/ethnicity, gender, socioeconomic status, students with disabilities, English language learners)?: No

If yes, fill in data for each subgroup with disaggregated validity of the slope data.

Type of	Subscale	Subgroup	Informant	Age / Grade	Test or Criterion	n (sample/ examinees)	n (raters)	Median Coefficient	95% Confidence Interval Lower Bound	95% Confidence Interval Upper Bound

Results from other forms of reliability analysis not compatible with above table format:

Manual cites other published validity studies:: No

Provide citations for additional published studies.

Alternate Forms

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

Describe the sample for these analyses, including size and characteristics:

What is the number of alternate forms of equal and controlled difficulty?

If IRT based, provide evidence of item or ability invariance

If computer administered, how many items are in the item bank for each grade level?

If your tool is computer administered, please note how the test forms are derived instead of providing alternate forms:

Decision Rules: Setting & Revising Goals

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

In your manual or published materials, do you specify validated decision rules for how to set and revise goals?
If yes, specify the decision rules:

What is the evidentiary basis for these decision rules? NOTE: The TRC expects evidence for this standard to include an empirical study that compares a treatment group to a control and evaluates whether student outcomes increase when decision rules are in place.

Decision Rules: Changing Instruction

Grade	Grade 3	Grade 4	Grade 5	Grade 6
Rating

Legend

Convincing evidence

Partially convincing evidence

Unconvincing evidence

Data unavailable

^dDisaggregated data available

In your manual or published materials, do you specify validated decision rules for when changes to instruction need to be made?
If yes, specify the decision rules:

What is the evidentiary basis for these decision rules? NOTE: The TRC expects evidence for this standard to include an empirical study that compares a treatment group to a control and evaluates whether student outcomes increase when decision rules are in place.

Data Collection Practices

Most tools and programs evaluated by the NCII are branded products which have been submitted by the companies, organizations, or individuals that disseminate these products. These entities supply the textual information shown above, but not the ratings accompanying the text. NCII administrators and members of our Technical Review Committees have reviewed the content on this page, but NCII cannot guarantee that this information is free from error or reflective of recent changes to the product. Tools and programs have the opportunity to be updated annually or upon request.

Summary

Tool Information
Descriptive Information
Administration
Training & Scoring
Benchmarks

Performance Level
Reliability
Validity
Bias Analysis

Growth Standards
Sensitivity
Alternate Forms
Decision Rules

Data Collection Practices

Acadience Reading K-6 (aka DIBELS Next®)Maze

Summary

Tool Information

Descriptive Information

Acquisition and Cost Information

Administration

Training & Scoring

Training

Scoring

Rates of Improvement and End of Year Benchmarks

Performance Level

Reliability

Validity

Bias Analysis

Growth Standards

Sensitivity: Reliability of Slope

Sensitivity: Validity of Slope

Alternate Forms

Decision Rules: Setting & Revising Goals

Decision Rules: Changing Instruction

Data Collection Practices

Acadience Reading K-6 (aka DIBELS Next®)
Maze