Research - Education Next

Can Banning Cellphones Save Student Learning?

David Figlio — Tue, 12 May 2026 09:00:32 +0000

A 9th grader at Delta High School in rural Utah places his cellphone into the pocket of a phone holder before entering class in early 2024. Utah is one of almost two dozen states that have legislated bell-to-bell bans on student cellphone use during the school day. The state’s policy becomes official on July 1 and goes into effect in the 2026–27 school year.

Schoolwide cellphone limits are quickly becoming the norm across the United States. In the past few years, 22 states and the District of Columbia have passed “bell-to-bell” laws, which prohibit students from accessing cellphones throughout the entire school day (see Figure 1). Another 19 states have passed more flexible laws, which leave the rulemaking up to local districts or allow cellphones during noninstructional time. Legislation banning or restricting phone use is currently under consideration in Illinois, Massachusetts, and Pennsylvania.

These policies are popular with educators (see “Take Away Their Cellphones,” features, Fall 2022). A 2024 survey by the National Education Association found that 90 percent of teachers favor banning cellphones during class and 83 percent support stricter, bell-to-bell restrictions. A 2025 survey by the RAND Corporation found that 70 percent of school principals—and 81 percent of middle and high school principals—believe cellphone bans have a positive impact on school climate. However, bans are less popular with students. (Perhaps not surprisingly, that same RAND survey found that just one in 10 students supports banning cellphones in school.) They are also controversial among parents, many of whom cite safety concerns. In a 2024 survey by the National Parents Union, 56 percent of parents said students should sometimes be allowed to use their phones at school.

Missing from these debates is evidence of a ban’s impacts on student learning. We bring such evidence to bear from Florida’s first-in-the-nation statewide cellphone ban, adopted in May 2023. That law banned cellphones during instructional time but allowed local districts to impose additional restrictions according to their needs; a 2025 Florida law instituted a bell-to-bell ban in elementary and middle schools. Our study looks at one of the nation’s 10 largest districts, where local leaders imposed a “bell-to-bell” ban that prohibits using phones, earbuds, and smartwatches throughout the entire school day, including noninstructional time. We track changes in test scores, attendance, and disciplinary incidents and use geolocated cellphone-use data to observe changes in student cellphone use during the school day. Finally, we compare outcomes at schools that had the highest and lowest level of pre-ban student cellphone use to estimate the causal effects of the ban.

We find that prohibiting cellphone use works: Among high school students, daily cellphone visits fell by more than 80 percent after the ban was imposed. However, we also find a temporary spike in suspension rates in the ban’s first year: The overall suspension rate jumps by 25 percent, with the biggest impacts on Black students. At schools with high levels of pre-ban cellphone use, the rate of in-school suspensions for Black students increases by 30 percent, while rates for white and Hispanic students remain steady. However, disciplinary rates return to pre-ban levels in the second year, and student performance on reading and math tests improves. By the end of the second year, scores are up by about 3.5 percentiles compared to scores from May 2023, the end of the pre-ban year. Schools with the highest pre-ban cellphone use experience the largest positive impacts.

Finally, we look at trends in student attendance and find significant reductions in the number of unexcused absences in both the first and second years after the ban. The changes are especially large at middle and high schools, providing suggestive evidence that improved student engagement and school climate could be important factors behind the observed test score benefits.

Overall, our findings reveal that cellphone bans improve student outcomes, but these benefits come at the cost of temporarily elevated suspension rates when the new rules are first enforced. As states and school districts nationwide seek a reset on student cellphone use, the challenge is to minimize short-term adverse effects until a new cellphone-free status quo is established.

A “Teachers’ Bill of Rights”

In 2023, Florida Governor Ron DeSantis signed House Bill 379, one of a set of state education laws known as the “Teachers’ Bill of Rights.” The measure established new rules for Internet safety, required districts to educate students about the risks of social media and block access on school devices, and prohibited students from using wireless communication devices, including phones, watches, and earbuds, during instructional time unless directed to do so by a teacher for educational purposes.

The large, urban district that we study took an even stricter approach, setting a bell-to-bell policy requiring that wireless communications devices be silenced and put away in students’ bags during the entire school day, including lunch and while transitioning between classes. In keeping with the statewide law, students with special needs were allowed to use their devices to monitor a documented health condition. In the case of a schoolwide emergency, students were allowed to take out and use their devices.

The new rules were in effect at the start of the 2023–24 school year and enforced after a short grace period. Starting after Labor Day, if a student violated the rules, their device was to be confiscated and returned at the end of the day. In addition, the student could be punished, including being suspended from school.

This new ban occurred in a context in which cellphones—and in particular, smartphones—have become pervasive in American middle and high schools. In 2024, 95 percent of teenagers and 57 percent of children aged 11–12 had their own smartphone. Those figures have risen rapidly over the last decade; in 2015, just 67 percent of teenagers owned a smartphone.

During that time, incidents of depression and anxiety among adolescents have soared. The share of high school students who report experiencing persistent feelings of sadness and hopelessness increased to 40 percent in 2023 compared to 30 percent in 2013. These trends have triggered public debates about the causal link between the rise in smartphone use among adolescents and decline in their wellbeing. Rigorous evidence about this causal link remains scant. Still, many argue that the adverse effects of smartphones on social isolation, sleep deprivation, and attention fragmentation are responsible for the observed declines in adolescent outcomes (see “No Simple Answer for Kids and Screens,” reviews, September 2025). There is indeed descriptive evidence suggesting that prolonged use of smartphones in children and adolescents are associated with higher rates of anxiety and depression, body dissatisfaction and eating disorders–especially among girls—sleep issues, and cyberbullying.

The potential link between smartphones and student achievement is also a growing area of concern (see “Anxiety, Depression, Less Sleep … and Poor Academic Performance?” what next, Winter 2024). In 2024, 12th-grade reading scores on the National Assessment of Educational Progress (NAEP) fell to a 30-year low, with nearly one-third of students scoring in the lowest “Below Basic” category. Scores had peaked in 2009. And on NAEP student surveys in 2023, the share of 13-year-olds who say they read for fun “almost every day” was about half of what it was a decade earlier: 14 percent compared to 27 percent in 2012.

Estimating Impacts of a Ban

Our study focuses on a three-year period: one year before the ban (2022–23) and the first two years under the new rules (2023–24 and 2024–25). We look at detailed student-level data, including test scores in grades 3–10, disciplinary incidents, and absences, and track outcomes over time relative to the start of the ban. We also investigate differences by student demographics, gender, and grade level.

While the ban was in place at the start of the 2023–24 school year, disciplinary enforcement did not begin until September. Our analysis tracks changes in student performance and suspensions throughout each school year. Florida administers state tests in August/September, December, and May, which allows us to observe student achievement three times a year. Further, we observe the date of each disciplinary incident, which reveals how disciplinary incidents and suspensions changed immediately after the district started referring students for disciplinary action for cellphone-use infractions and whether those changes persisted throughout the year.

In addition to tracking changes descriptively over time, we also compare outcomes at schools based on their relative levels of student cellphone use before the ban. We use detailed smartphone activity data from Advan, a private research firm that tracks foot traffic using point-of-interest coordinates. We look at building-level data from January 2023 to December 2024, focusing on the average number of unique smartphone visits (pings) between 9am and 1pm on school days in the last two months of the 2022–23 school year (right before the ban took effect) and the first two months of the 2023–24 and 2024–25 school years. To disentangle student activity from the smartphone activity of teachers and staff, we subtract the average number of unique smartphone visits between 9am and 1pm on teacher in-service workdays in the same school year, when students are not present, from the average when school is in session. We then sort schools into three groups based on student cellphone use and compare outcomes for those in the highest and lowest groups, allowing us to estimate the causal effects of a cellphone ban.

Results

After a “bell-to-bell” cellphone ban takes effect, students are dramatically less likely to pick up and use their phones at school. Looking at cellphone-use data during school days, we find steep declines in the average daily phone visits per 100 students in year one (see Figure 2). In high school, cellphone use drops by more than 80 percent, from 46 daily visits per 100 students to 10. In middle school, cellphone use falls by half, from 62 daily visits to 31. In year two, middle school visits continue to fall another 23 percent, to 24 daily visits per 100 students, while high school use ticks slightly upward to 13 visits per 100 students.

Discipline

In the short term, we find a significant spike in the rates of disciplinary incidents and suspensions (see Figure 3). Once a new ban is enforced, the suspension rate jumps by 25 percent compared to the same month one year earlier and then remains high throughout the first year of the ban. In the second year, discipline and suspension rates return to pre-ban levels.

The temporary increase in disciplinary sanctions districtwide as the cellphone ban was implemented provides suggestive evidence that the ban led to more students being punished. To strengthen the case that the ban was responsible for the jump, we investigate differences in disciplinary actions and suspensions at schools that had relatively high and low cellphone use before the ban. We make these comparisons separately for students grouped by race and gender. This shows that Black students experience the most substantial increases in disciplinary referrals immediately after a cellphone ban. In the first year, a cellphone ban increases in-school suspensions by roughly 30 percent for Black students but has no significant effect for white and Hispanic students or on out-of-school suspensions. These effects disappear in year two.

Academic achievement

Our analysis also looks at student performance on standardized tests, which students in Florida take three times each year. We average results across subjects and report changes in nationally normed percentiles. Scores remain stable in the first year of a cellphone ban but improve modestly in year two (see Figure 4). Overall, at the end of the second year of a cellphone ban, scores are up by about 4 percentiles compared to the final tests administered during the pre-ban year.

EdNext in your inbox

We then again compare changes at schools with especially high and low pre-ban cellphone activity, which confirms that students in high-use schools made the largest improvements. We look at spring tests, which are used for school accountability and high-stakes student-level decisions such as grade promotion and future course placement. Compared to improvements in schools where cellphone use was less frequent before the ban, we find a cellphone ban in high-use schools increases test scores by 1.1 percentiles overall, 1.2 percentiles for Black students, 1.4 percentiles for white students, 1.4 percentiles for male students, and 1.3 percentiles for middle and high school students. We do not find any significant effects for female students or for students in elementary school.

Attendance

To what extent can these positive effects on test scores be driven by the potential effects of the ban on school climate and student engagement? We use student attendance as a proxy and focus on unexcused absences. The results indicate that the ban reduced the number of unexcused absence days by 5 percent to 10 percent, driven primarily by students in middle and high school. We find no significant effect on student absences in elementary schools. The ban’s effect on unexcused absences in middle and high schools is large enough to explain nearly half of the positive effect on student test scores, though it’s important to note that the change in absences may reflect many other unmeasured dimensions of school climate.

Governor Ron DeSantis signed several pieces of legislation affecting public education in Florida in May 2023, one of which was the “Teacher’s Bill of Rights” that effectively banned cellphone use during instructional time.

Potential and Pitfalls for Phone-Free Schools

Many frustrated parents and teachers have wondered what would happen if cellphones disappeared overnight. Our study looks at the most likely realistic version of that fantasy, and one that will be in place in half of all U.S. states in coming years: a bell-to-bell cellphone ban enforced through disciplinary referrals. We find that this type of ban can work—students in the Florida district we study were far less engaged with their phones during the school day, with phone visits among high school students plunging by 80 percent in the first year. Students also experienced benefits from this major change in behavior. Within two years, they earned higher test scores and were less likely to be absent.

These findings paint a picture of a policy that improved the student learning climate, at least after a period of transition. However, enforcing the ban also led to a spike in disciplinary actions, and the largest effects were for Black students. While this effect disappeared by the second year, these findings should inform states and districts introducing cellphone bans and the schools tasked with implementing and enforcing them. Amid a nationwide surge of policies and soon-to-follow wave of implementation, decisionmakers should investigate practices to ensure a smooth and equitable implementation period.

David N. Figlio is the Gordon Fyfe Professor of Economics and Education at the University of Rochester. Umut Özek is a senior economist at the RAND Corporation.

Suggested citation format:

Figlio, D. N. and Özek, U. (2026). “Can Banning Cellphones Save Student Learning? Evidence from Florida, home of the first statewide mandate.” Education Next, 26(2), 12 May 2026.

The post Can Banning Cellphones Save Student Learning? appeared first on Education Next.

Colleges Are Closing. Who Might Be Next?

Robert Kelchen — Thu, 09 Apr 2026 05:00:53 +0000

Northland College in Ashland, Wisconsin, held its final commencement ceremony in 2025 before closing permanently due to financial challenges and declining enrollment. The college had been in operation since 1892.

“How did you go bankrupt?” one American expat asks another in Ernest Hemingway’s The Sun Also Rises. “Two ways,” his friend responds. “Gradually and then suddenly.”

The most dramatic college closures tend to follow a similar arc. A century-old, beloved liberal arts institution abruptly shuts its doors, sending students and staff scrambling as it succumbs at last to the financial challenges dogging schools that rely on tuition dollars. But the forces creating those challenges—declining enrollment, punishing operating costs, and anemic endowment returns—typically accumulate over a period of years if not decades. In recent years, these forces have all been featured players in the final chapters of small nonprofit schools like Wells College (New York), Northland College (Wisconsin), Iowa Wesleyan University (Iowa), Finlandia College (Michigan), and more than three dozen others. This growth in closures has followed a massive reorganization of the for-profit college sector, where enrollment fell by more than half in the 2010s.

Research and policy discussions often focus on the negative effects of college closures on students, but that’s only part of the story. Colleges also serve as anchor institutions—local economic and cultural engines whose sudden disappearance can leave regions flat-footed. Communities with colleges have higher levels of educational attainment, employment in human capital-intensive industries, economic mobility, and local economic output. Colleges also function as cultural hubs, by supporting civic engagement and the arts, and by providing entertainment and a range of educational and enrichment opportunities for their neighbors. Closures affect entire communities, far beyond campus borders.

With higher education facing daunting financial conditions—not to mention a “demographic cliff” as the number of traditional-age students is set to decline starting with the class of 2026—more schools are likely to close. But which institutions are at the highest risk? To identify common characteristics of schools that close, we assembled the most comprehensive data set to date on colleges and universities. To measure institutions’ risk of closure, we also designed and tested a novel approach that uses machine learning to overcome data and methodological challenges. This method can predict closures with 83 percent average accuracy compared with 77 percent when applying the metrics currently used for federal accountability measurement. It also simultaneously enables the analyst to generate predictions for nearly twice as many institutions, as traditional methods typically discard many institutions due to missing data. Our methods thereby provide a more reliable glimpse into the future that can help local and regional leaders plan for and perhaps prevent potential disruptions.

In looking at closures of four-year schools between 1996 and 2023, we find they are three times as likely at for-profits than at private nonprofits, while public institutions hardly ever close. Some 21 percent of for-profit colleges close compared with 7 percent of nonprofits and less than 1 percent of public schools. Our analysis also shows that closures are much more likely at two-year than at four-year colleges. Among all two-year schools, 21 percent of private nonprofits close compared with 3 percent of public programs. For-profit two-year schools have by far the highest closure rates among two-year programs, at 33 percent.

We investigate the extent to which institutional characteristics and financial metrics might foretell college closures by assessing their impact on the predictive power of traditional risk assessment and on our machine-learning model. This analysis shows that the federal government’s current measures of financial distress are informative; however, we also find that recent changes in enrollment, staffing, or revenue—metrics that are not currently part of federal accountability measurement—increase our model’s predictive power. We then look to the future and provide some back-of-the-envelope calculations to estimate the range of impacts on colleges from the “demographic cliff.”

While the precise effects of changing demographics, along with growing questions about the value of higher education are not yet fully known, it’s clear that more change is coming to American colleges and universities. With sound data and methods, the communities they shape can be better prepared for what’s ahead.

Financial Headwinds

Financial challenges have long played an important role in the history of American postsecondary education. For example, Harvard University relied on fundraising one-quarter of a bushel of corn (“colledge corne”) from each local family in the 1640s and 1650s. For the most part, institutions have proved fairly resilient: A 2016 analysis of nearly 500 private nonprofit colleges identified as having limited resources in 1972 found that nearly 85 percent continued to operate in some form more than four decades later.

But today, colleges are facing particularly unfavorable conditions in enrollment, tuition, and costs. The number of American 18-year-olds is projected to fall by 13 percent between 2026 and 2041, while the share of high-school graduates enrolling in college right away has already shrunk from 70 percent to 62 percent over the last decade. And when we look at enrollment among adult learners, defined as age 25 or older, the trends are even starker: Federal data show annual adult enrollment has fallen by nearly half since 2008. These declines are accompanied by growing skepticism among the public about the value of higher education (see “Apprenticeships on the Rise,” features, Summer 2023) and by the federal government’s expanded student-loan collection efforts.

Meanwhile, a growing number of states are limiting tuition increases at public colleges, and tuition growth has been at or below inflation at private nonprofit schools since 2018. Tuition discount rates at nonprofit schools reached a record-high average of 51 percent in 2022. At the same time, operating costs have risen quickly owing in part to pandemic-era inflation and a longer trend of rising operating expenses. While many colleges survived during the pandemic era due to timely federal support and emergency actions, more than four dozen nonprofit colleges closed between 2022 and 2024. Many of these closures came as a shock to students, staff, and the broader community, with some schools continuing to admit new applicants and coordinate next year’s courseloads weeks or even days before an announcement.

How can we better understand a college’s risk of shutting down? To date, research has identified several institutional characteristics statistically related to closure, such as being a Historically Black College or University (HBCU) or a women’s college, as well as financial characteristics like lower tuition, smaller endowments, and higher shares of instructional spending. Yet many of the factors do not necessarily cause financial distress in and of themselves; rather, they are correlated with other characteristics and indicators predictive of institutional challenges.

To better understand schools’ financial predicament, we look across the higher education industry to assess trends in revenues and expenditures. It is a vast sector: American higher education directly produces approximately $700 billion in expenditures, enrolls nearly 25 million students, and employs approximately 4 million people. It includes 5,686 institutions nationwide that are eligible to receive federal financial aid under Title IV of the Higher Education Act, along with a number of very small colleges, particularly in the for-profit sector, that operate without federal financial aid. Those non-eligible schools are not included in the descriptions and analyses that follow because of a lack of available data.

We first examine revenue sources across types of institutions and find broad differences (see Figure 1). For-profit schools are almost entirely reliant on tuition dollars, which account for about 93 percent of revenues compared with 37 percent at private nonprofits. Other major revenue sources at nonprofits include grants and contracts (18 percent), gifts (12 percent), and profitable auxiliary enterprises like sports (about 8 percent). Tuition dollars are less central at public colleges, which receive more government revenue (22 percent) than net tuition and fees (17 percent).

We also look at trends over time. Public colleges and universities experienced a clear upward trend in inflation-adjusted revenue, from $333 billion in 2002 to $472 billion in 2022, although revenue from both tuition and auxiliaries declined in real terms beginning in 2020 owing to the Covid-19 pandemic and enrollment declines. Revenues at private nonprofit colleges are more sensitive to investment returns due to school endowments; however, the long-term trend has been toward increased revenues for the sector, with tuition and auxiliary revenue generally following the same path as public institutions. By contrast, the finances of for-profit colleges grew sharply and then fell. From 2002 to 2011, revenue tripled from $15 billion to $46 billion amid dramatic growth. Following enrollment declines, a stricter regulatory environment, and the collapse of some large for-profit chains, total revenue declined to about $20 billion by 2018.

During those same years, expenditures grew. Operating costs have increased faster than general inflation for decades, driven by rising expenses for line items such as health insurance and administrative support. As a labor-intensive industry, expenses related to personnel are by far the largest expenditure category in most institutions’ budgets. Federal data from 2021–22 show that nationwide, instruction costs account for about 26 percent of expenditures at public colleges, compared with about 28 percent at nonprofits and 30 percent at for-profit schools.

Assessing Risk of Closure

The balance sheets and financial health of individual institutions vary widely within these broader trends. To take a closer look, we assemble the most comprehensive data set to date on the characteristics of 8,633 American colleges and universities, including dates of operation, institutional setting, student body, staff, and financial data from 2002 to 2023, primarily from the Department of Education’s Integrated Postsecondary Education Data System (IPEDS) data. We focus on variables that could potentially be associated with college closures based on prior research, economic theory, and our experiences in the field of higher education finance. These variables include enrollment, staff, revenues, expenses, assets and debt, financial metrics such as liquidity and leverage, and measures of economic health such as cash on hand and major swings in enrollment. We also document when data is missing—most frequently, institutional data lacks measures of debt, assets, and leverage. Finally, we look at the characteristics of the college’s local population, including rates of employment, poverty, and per-capita income.

We then identify schools that closed between 1996 and 2023 based on Closed School Weekly Reports from the Federal Student Aid’s Postsecondary Education Participants System (PEPS) database. Our analysis only counts schools where the main campus (not a branch or satellite) closed; overall, a total of 1,671 colleges closed during the analysis period.

The vast majority of closures are among private for-profit colleges, which have the highest closure rates (see Figure 2). This is intuitive—whether they are “nimble critters or agile predators,” for-profit colleges are much more likely to exit the marketplace if they do not see the opportunity to make a profit in the near future. Nearly three-fourths of closures in the data set are two-year for-profit colleges, and almost one-third of the 3,732 institutions observed in this sector closed at some point between 1996 and 2023. On the other hand, while closures at private nonprofit four-year colleges get the lion’s share of attention, rates are relatively modest at about 7 percent over the same period. Public schools, in turn, almost never fully close, but rather reorganize.

Comparing schools that close with those that don’t reveals immediate contrasts. Colleges that close tend to be smaller, more tuition-driven, and experience larger declines in enrollment and revenue than colleges that remain open. Among schools that never close, the median operating margin is about 9 percent, and tuition accounts for 45 percent of revenue; at schools that close, the median margin is 3 percent, and tuition makes up 86 percent of revenue two years prior to closure. Our data also show median year-over-year enrollment declines of 58 percent among colleges that close two years later, while those that remain open experience no or smaller enrollment declines. Further, more than one-fourth of colleges that close post operating losses in at least three of the five years prior to closure—twice the rate among colleges that remain open.

However, none of these factors on its own is a reliable predictor of closure, and overall, our comparisons show substantial overlap between open and closed colleges on individual metrics. Current federal accountability metrics mainly rely on a school’s financial responsibility score to assess risk, but achieving a better understanding of a school’s risk of closure requires more than a single metric—even a composite one.

Predicting the Past

To gain additional insight, we turn to decision trees, an “if . . . then” method of exploring how variables relate to an eventual outcome and to one another, and we use a machine learning algorithm called XGBoost to mine our rich data set. Such classification algorithms are designed to work with large amounts of incomplete data and can handle complex interactions and nonlinear relationships, thus they are better suited for predicting rare events like college closures or financial distress compared with score-based accountability metrics or more traditional statistical approaches.

Marissa Schilling receives the last degree conferred by Finlandia University in May 2023. The Hancock, Michigan, school closed after 127 yea

The focus of our analysis is the predictive accuracy of decision trees and traditional probability models rather than the individual effect of a particular factor or characteristic on the risk of closure. We limit our sample to private schools (nonprofit and for-profit) between 2002 and 2023, when the data is most consistent, and compare each model’s predictions with data on whether schools actually closed.

This is not a causal analysis, as factors and characteristics are both interconnected and correlated with closures and with one another. However, the output from our models can be used in at least two commonly accepted ways: First, to identify relative risks, similar to how Financial Responsibility Composite Scores can sort institutions into “zones of danger” and prioritize additional auditing data collection or examination accordingly; and second, as the basis for binary classifiers to serve as a warning sign, such as Heightened Cash Monitoring status in which the government may limit advanced federal aid payments in favor of reimbursement.

In comparing actual and predicted closure probabilities across all schools and models, we find that the XGBoost machine learning method significantly outperforms other approaches. In evaluating the 100 institutions identified as at highest risk of closure, 84 actually close within three years when using the XGBoost model, compared with 47 for the federal metrics model and 61 for traditional econometric methods.

We also look at the relative predictiveness of individual factors, though the fact that many variables are highly correlated with, or even functions of one another, makes any interpretation of magnitude very difficult. For example, the Financial Responsibility Composite Score is a function of several financial metrics, such as a school’s primary reserve ratio. These metrics themselves are functions of other key variables, such as cash reserves and operating margin.

However, this does not mean that we cannot provide evidence on the relative importance of different factors in terms of their effect on predictive power. We find that, reassuringly, the variables that should have a strong theoretical impact on the likelihood of closure (e.g., measures of financial distress) do, in fact, significantly increase the predictive power in our model and traditional estimates of probability. For example, a school’s Financial Responsibility Composite Score is associated with a 4 percent gain in predictive power in the XGBoost model. We also find that including recent and medium-term changes in key factors in prediction models is particularly informative. For example, recent changes in enrollment increase predictive power by 2.4 percent. This stands in contrast to assessing risk based on a benchmark or set minimum level or ratio of key metrics, and it argues for the inclusion of past trajectory as an important factor that should be considered by monitoring agencies in addition to absolute levels. This is not currently a significant part of federal accountability measurement.

Over the “Cliff”

We then turn to a critical question. How many and what types of institutions may be at risk of financial distress in the future given reasonable and extreme scenarios on enrollment, revenue, and expense trends? Still on the horizon for many schools is the so-called “demographic cliff,” which could take the form of a gradual descent in enrollment over 15 years or a sudden drop by as much as 15 percent from 2025 to 2029.

We use 2019 data as a baseline and investigate various scenarios, assuming that revenues and expenses scale with enrollment and that institutions maintain the same revenue and expense shares when this occurs. This likely results in a conservative estimate of the number of closures because of the presence of fixed costs such as facilities and tenured faculty at many institutions. We use parameters from the traditional econometric models to estimate the impacts of no change in current enrollment patterns, a one-time enrollment drop of 15 percent, and a decline in enrollment of 15 percent over five years.

If the enrollment declines that colleges have experienced since 2019 persist into the future, we can expect to see one additional closure per year, which is an increase of 2 percent over the current average. Assuming the worst-case scenario predictions come to pass and higher education experiences a 15 percent enrollment drop overnight, there could be as many as 80 additional closures in a single year. A more gradual 15 percent decrease in enrollment over five years would result in 4.6 additional closures per year (i.e., 23 closures over five years).

EdNext in your inbox

These simulations point to the precarious potential situation facing postsecondary education in the coming years, especially if the demographic cliff materializes in a moderate to severe fashion. While some of the estimated increases might seem small at the national level, they would be significant for the handful of localities predicted to experience college closures in a given year. It is important to reiterate that most institutions that close are somewhat smaller than average, with the median closed school enrolling a student body of about 1,389 full-time equivalent students several years prior to closure. That said, for institutions located in small towns, these colleges are still one of the largest employers in the region. This means that many (if not all) of these additional predicted closures are likely to be at the sorts of local institutions that are significant economic engines and act as community anchors.

Even ignoring the potential negative effects due to reduced training capacity in a county that loses a college, the immediate employment effects as a share of the labor force might be large. This includes not only the loss in employment coming directly from the college but also the immediate spillovers from establishments that provide goods/services to schools (most notably, retail, health care, and food services). Moreover, most students work while attending college, so any working students who are either attracted to or kept from leaving the community because of the presence of the educational institution will also contribute to local economic effects.

Discussion

With the data we now have at hand, college financial distress and closures can be predicted more accurately, and a new method using machine learning techniques is significantly more accurate than extant warning systems. This information will be extremely helpful in the months and years ahead. The “demographic cliff” has just begun.

As future research, it would be valuable to estimate the impact of college closures and severe financial stress on county-level measures of employment, wages, and population. We are particularly interested in the effect of college-induced disruptions on temporary or permanent reallocations of human capital and employment within and across local and regional economic areas.

However, our focus on the negative effects of college closures does not necessarily mean that sector observers or localities should always seek to prevent them from happening. Institutions of higher education (and particularly those in the for-profit sector) do not close randomly or without cause. If they are unable to produce outcomes that students, employers, or society at large find valuable, then they should not be artificially sustained by governments absent evidence of significant positive externalities. Extending the existence of an educational institution destined for failure may actually compound the locality’s fiscal problems if the college cannot ultimately be sustainable on its own, as well as create a negative externality on other universities that could benefit from that enrollment.

While our predictive models of college financial distress and closure may not be able to predict the eventual failure of every institution, they are certainly effective at identifying those facing the greatest risk. The methods we outline can help governments, local communities, and sector observers anticipate labor market and infrastructure disruptions if more college closings appear imminent and be better prepared to support affected community members and businesses during the transition.

Robert J. Kelchen is professor at the University of Tennessee, Knoxville. Dubravka Ritter is special adviser at the Consumer Finance Institute at the Federal Reserve Bank of Philadelphia. Douglas A. Webber is principal economist at the Board of Governors of the Federal Reserve. This article is based on a working paper published by the National Bureau of Economic Research and the Federal Reserve Bank of Philadelphia, “Predicting College Closures and Financial Distress.” The views expressed here are those of the authors and do not necessarily reflect the views of the Board of Governors of the Federal Reserve, the Federal Reserve Bank of Philadelphia, or the Federal Reserve System.

This post has been updated to remove a numerical error in the estimated percentage change of college closures under various conditions.

Suggested citation format:

Kelchen, R. J., Ritter, D., and Webber, D. A.. (2025). “Colleges Are Closing. Who Might Be Next? How machine learning can fill data gaps and help forecast the future.” Education Next, 25(4), 25 November 2025

The post Colleges Are Closing. Who Might Be Next? appeared first on Education Next.

School Enrollment Shifts Five Years After the Pandemic

Joshua Goodman — Tue, 22 Jul 2025 09:00:53 +0000

In spring 2020, the Covid-19 pandemic caused school closures throughout the United States—a seismic disruption with immediate effects on enrollment. By fall 2020, early research found that K–12 public school enrollment had dropped by 3 percent nationwide, as growing numbers of families opted to homeschool or transfer their children to private schools. This was the largest annual decline since 1943, when World War II led large numbers of teenagers to leave high school.

What happened in the years since? Did those trends hold? Or did students return to public schools in the same numbers as before, once the widescale pandemic-related disruptions were behind them?

We conduct the first analysis of the pandemic’s longer-term impact on school enrollment patterns and find a substantial shift away from public schools. Our study focuses on the state of Massachusetts, where fall 2024 enrollment counts are readily available, but we show that trends similar to those we document for the Bay State are evident in national data through fall 2023.

To estimate the pandemic’s effects on school enrollment patterns five years later, we compare actual enrollments to predictions based on trends in the four years leading up to Covid’s onset, or 2015 to 2019. This is a critical step, as looking only at raw differences between fall 2024 and fall 2019 will misstate the pandemic’s impact when enrollments were already trending in a given direction. We make these comparisons against prior trends for all students and separately by student race, school district income, and grade level.

Five years after the pandemic’s onset, there has been a substantial shift away from public schools and toward non-public options. In fall 2024, we find that local public school enrollment in Massachusetts is 4.2 percent lower than it was in fall 2019, which is 1.9 percent lower than predicted based on pre-pandemic trends (see Figure 1). By contrast, enrollment in private schools is 0.7 percent below fall 2019 but 15.6 percent above the predicted decline based on pre-pandemic trends. We also find that homeschooling, which more than doubled from fall 2019 to fall 2020, has had some staying power. In fall 2024, it is 56 percent higher than in fall 2019, which is 50 percent higher than predicted.

Charter school enrollment had been rising rapidly pre-pandemic but then leveled off starting in fall 2020. While fall 2024 enrollment is nearly unchanged relative to 2019, it is 18.9 percent lower than those steep pre-trends had predicted. Some of this may have to do with charter expansion being restricted by Massachusetts law, which caps both the number of charter schools statewide and the share of each district’s funds that can flow to charters.

Overall, our results suggest that the enrollment shifts due to the pandemic have had lasting impacts, especially on the appeal of private and home schools relative to public schools.

Enrollment Differences by Income and Race

Beyond these overall numbers, the demographics of families opting to homeschool or enroll their children in private schools have shifted.

To study differences by income, we categorize local school districts based on their fall 2019 share of economically disadvantaged students and investigate differences between districts in the top 20 percent of income and the rest. Public school enrollment losses are substantially larger in high-income school districts, with fall 2024 enrollment 5.7 percent below predicted levels compared to 1 percent lower in the other 80 percent of school districts (see Figure 2). These top-income districts lost nearly 50 percent more students than the lower-income four-fifths combined.

These enrollment losses are substantially concentrated among white and Asian students: Fall 2024 enrollment is 3.1 percent lower than predicted for white students and 8.1 percent lower for Asian students. In contrast, Black and Hispanic enrollment has recovered and even exceeded pre-pandemic trends. Relative to predicted levels, fall 2024 Black enrollment is 7.7 percent higher and Hispanic enrollment is 0.8 percent higher than expected. In short, the pandemic has substantially shifted the racial and ethnic composition of public schools.

Income and racial differences in the persistence of changing enrollment patterns are likely related to differences in education concerns throughout the pandemic. In many areas, white and Asian families and families in high-income communities were more likely in the early days of the pandemic to switch to private schools, which were far more likely to offer in-person instruction than traditional K–12 public schools (see “Pandemic Parent Survey Finds Perverse Pattern: Students Are More Likely to Be Attending School in Person Where Covid Is Spreading More Rapidly,” survey, Spring 2021). Those dis-enrollments appear to have persisted. Meanwhile, enrollment has largely recovered in middle- and low-income communities and among Black and Hispanic families. These families were more likely to support remote schooling or switch to homeschooling, which for many was a temporary shift.

Differences by Grade Level

Post-pandemic changes in local public school enrollment vary substantially by grade level as well (see Figure 3). Elementary grades have largely recovered to predicted levels. While prekindergarten and kindergarten enrollment in fall 2024 is 3.4 percent below predicted levels, enrollment in grades 1–4 is 2.7 percent higher than predicted.

However, enrollment in middle school, or grades 5–8, has not recovered. Middle school enrollment is 7.7 percent below predicted levels—a decline that far exceeds the total public school enrollment losses across all grades. Census population estimates for Massachusetts do show larger declines in the number of children of middle grade ages than other ages, suggesting that changing migration patterns may be part of the story. The enrollment decline is, however, substantially larger than the estimated 5.9 percent population decline, implying that some portion of this change must be driven by enduring shifts to homeschooling or private schools. Meanwhile, high school enrollment has barely budged since the pandemic’s onset, with enrollment now up 0.1 percent relative to predicted levels.

EdNext in your inbox

Though the enrollment patterns documented here cannot explain why public middle grades have seen a particular exodus of students, our results suggest public schools may need to pay close attention to the mismatch between parental demand and the middle grades experience currently provided. These are students whose formative elementary school experiences were disrupted. Also, post-pandemic increases in challenging student behavior may be particularly prevalent or salient in middle grades, a sensitive developmental time. Some parents may begin to take academic rigor more seriously in middle grades relative to earlier ones, with remote schooling and other disruptions revealing or generating concerns about a lack of such rigor. Private school and homeschool options are likely better substitutes for public middle schools than they are for public high schools, whose economies of scale allow a variety of course and extracurricular offerings that are hard to replicate in smaller settings.

A National Story

To explore whether the enrollment patterns observed in Massachusetts by fall 2024 are representative of the nation more broadly, we compare our data from the Bay State to the most recent data available at the national level from the National Center for Education Statistics’ Common Core of Data for fall 2023. We find remarkable similarities (see Figure 4).

Fall 2023 public school enrollment nationwide was 2.8 percent below predicted levels compared to a 2.6 percent drop for Massachusetts by fall 2024. Both in in Massachusetts and across the U.S., enrollment drops were substantially larger for white and Asian students than for Hispanic and Black students. High school enrollment experienced little change, and the elementary grades recovered, while preschool/kindergarten and middle school grades experienced major drops. These patterns suggest that Massachusetts’s experience is typical of the nation more broadly.

The sustained decline in public school enrollment observed here is consistent with evidence that Americans, including K–12 parents, remain less satisfied with public schools even years after school closures ended. Between 2019 and 2025, the fraction of Americans reporting satisfaction with public education dropped by 12 percentage points, as did the fraction of K–12 parents reporting satisfaction with their oldest child’s school. The fraction of parents saying K–12 education is heading in the wrong direction was fairly stable from 2019 to 2022 but rose in 2023 and then again in 2024 to its highest level in a decade, suggesting continuing or even growing frustration with schools.

Concerns about the learning environment and behavior of their children’s peers may partly explain increasing parental concerns. For example, chronic absenteeism among public school students is a stubborn problem. In 2024, 20 percent of Massachusetts students were chronically absent compared to 13 percent in 2019, an increase that is again mirrored in national data.

Negative student behaviors within schools are also a growing worry. In 2022, a national sample of school leaders attributed a host of challenges to the pandemic and its lingering effects, including acts of disrespect toward teachers and rule-breaking use of electronic devices. Though leaders of all school levels experienced such increases, those in middle schools reported the steepest growth in post-pandemic behavioral problems, particularly physical fights between students, hate crimes, bullying, rowdiness in hallways, and classroom disruptions due to misconduct and unsanctioned cell phone use.

Survey evidence suggests that, if anything, parents’ and school leaders’ perceptions of public school learning environments may be worse now than in the first year or two after the pandemic’s onset. The fraction of K–12 parents who said they fear for their child’s physical safety at school rose by 10 percentage points between 2019 and 2024. And in late 2024, 72 percent of surveyed teachers, principals, and district leaders reported that student behavior was worse than it had been in 2019, a higher percentage than in 2021 and 2023. The share of educators reporting that students were misbehaving “a lot more” than before the pandemic jumped sharply, to 48 percent in late 2024 from 33 percent in early 2023.

Changes in traditional K–12 public school enrollment have been, and will continue to be, influenced by many factors, such as the growth of charter schools and expansion of publicly supported school choice programs. But the disruption of the pandemic and persistent concerns about student behavior are particularly acute in middle schools, consistent with enrollment declines concentrated in such grade levels. The subset of parents turning to private schools and homeschooling may be doing so in hopes of finding their children a safer and less disrupted learning environment.

Our analysis of Massachusetts data through fall 2024 provides the first systematic examination of how these concerns have translated into sustained enrollment shifts, offering insights into whether the initial disruptions to school choice patterns represent temporary adjustments or more fundamental changes in parental preferences for schooling options. Our findings also raise important questions about the long-term implications for public education, given a sustained exodus of higher-income, white, and Asian families.

Joshua Goodman is an associate professor at Boston University, where Abigail Francis is a doctoral student.

Suggested citation format:

Goodman, J., and Francis, A. (2025). “School Enrollment Shifts Five Years After the Pandemic: Public education sees shrinking middle schools and an exodus of wealthy, white, and Asian students.” Education Next, 25(4), 22 July 2025.

For more, please see “The Top 20 Education Next Articles of 2025.”

The post School Enrollment Shifts Five Years After the Pandemic appeared first on Education Next.

College Counseling in the Classroom

Joshua Hyman — Tue, 08 Jul 2025 09:00:16 +0000

As they approach the end of high school, students face a major life decision. Should they go to college?

In many high-income families, the college conversation occurs early and often—and no wonder, because parents probably attended college themselves. These families tap professional networks for recommendations, arrange campus visits, investigate gap years and other postsecondary alternatives, and engage consultants to suggest best-fit schools and polish application essays.

By contrast, students from economically disadvantaged families, whose parents may not have attended college, tend to rely on in-school information and guidance from school counselors. While research shows that effective counselors can boost student outcomes (see “Better School Counselors, Better Outcomes,” research, Summer 2020), they typically carry heavy caseloads that limit individual support: The national average is 470 students per counselor and upward of 1,000-to-1 at schools that serve large numbers of low-income students.

This relative lack of information substantially impacts low-income students’ postsecondary success. Many high-achieving low-income students do not enroll in college at all, and those who do often “undermatch,” enrolling in less-selective, under-resourced schools where they have a greater probability of dropping out (see “Expanding College Opportunities,” research, Fall 2013). At the same time, after decades of expecting “college for all,” many low-achieving, low-income students enroll in a less-selective college without sufficient information about whether it’s the right fit, or if they are prepared to succeed, and quickly drop out—with dire economic effects. Within eight years of leaving high school, 66 percent of students from high-income families earn a degree or credential compared to 26 percent of economically disadvantaged students, federal data show. Meanwhile, one in four U.S. adults under age 40 carries education debt, including a median of between $10,000 and $14,999 for college dropouts.

How can high schools better support low-income students through this high-stakes decision-making process? Expanding the number of in-school counselors is unlikely given the cost, but what if college planning were part of a school’s curriculum and therefore taught by teachers? I designed an experiment that compared post-graduation outcomes among students at high schools randomly assigned to teach, or not to teach, an 18-week college-planning curriculum, either as a standalone class or part of a senior-year humanities course. I find a range of benefits, at a cost of about $8 per student.

The main impact isn’t that more students enroll in college; in fact, the initial college-going rate remains about the same. Instead, the course influences which students go to college: high-achieving students, defined as having above-median GPAs and scores on the SAT, are 4 percent more likely to enroll in either a two- or four-year college, while low-achieving students are 9.5 percent less likely to enroll.

Students also have higher rates of persistence and are more likely to earn an associate degree within six years of high school graduation. The effects are largest for low-income high achievers, who are 6 percent more likely to enroll in college and 11 percent more likely to earn a two- or four-year degree. At the same time, I also find that while enrollment among low-income, low-achieving students falls by 9.5 percent, there is no decline in the share of those students earning a degree. In other words, offering a college-planning curriculum nudges a greater share of academically prepared students to enroll and succeed in college, while some of the students who would be most likely to drop out opt not to enroll in the first place.

Deciding whether, where, and what to study in college is complex, with uncertain costs and returns and substantial implications for degree attainment, lifetime earnings, and education debt. With a very low cost and estimated benefit of up to $5,410 per student, classroom instruction in college planning is a promising intervention to guide young adults toward their best next steps.

An Experiment in Michigan

To estimate the effects of a college-planning curriculum, I designed a randomized control trial and worked with partners in Michigan to carry it out. The state superintendent of schools invited every Michigan high school to take part; ultimately 62 schools opted to participate. The nonprofit Michigan College Access Network developed the curriculum and trained course instructors.

The curriculum covers a range of topics and is taught from September through mid-January, when students typically prepare and submit college applications. The first three weeks of the class focus on the costs and benefits of attending college, different school types, and the match between students’ qualifications and preferred colleges. Weeks 4–9 guide students through the application process with the goal of completing at least three applications—one reach school, one safety, and one match—in time for submission deadlines. Weeks 10–14 teach how to apply for financial aid, budgeting, and managing finances in college. The final four weeks cover career exploration, résumé building, and the final steps to enroll and succeed in school, including accepting an offer of admission, registering for orientation and placement exams, choosing a smart first-year course schedule, and deciding on a major. While much of the curriculum focuses on application steps to four-year colleges, the curriculum also emphasizes community college enrollment and the process of transferring from two-year to four-year schools.

My experiment took place during the 2016–17 school year. Each participating high school chose how the curriculum was delivered, with the goal of two or more sessions per week for a minimum of 90 total minutes. About half of schools taught the college-planning program during an existing 12th grade class (most commonly English), one in four created a new, standalone class, and 21 percent taught the curriculum during homeroom or a senior advisory period. Schools also could choose which staff taught the class. More than half of instructors were English teachers, and just 7 percent were school counselors. The rest were a mix of classroom teachers and other school staff.

My study includes all 6,704 12th grade students at the 62 schools. This sample is 56 percent white, 36 percent Black, and 5 percent Hispanic, and 53 percent qualify for free or reduced-price school meals. Some 33 percent of students live in a suburb, 27 percent are from rural communities, and 23 percent live in a city. Just four percent of study participants attend a charter school. Study participants have an average 10th grade GPA of 2.49 compared to 2.59 statewide. All students take the SAT, which is required in Michigan, with an average score of 917 for students in my study compared to 996 statewide.

I divided the schools into two equal groups and assigned half to enroll a portion of their grade 12 students in the college-planning curriculum during fall 2016. The class was not required, but schools were asked to enroll at least 50 percent of 12th graders, and ultimately 63 percent of eligible students took part. The other half of schools, the control group, did not offer the curriculum in fall 2016, but instead offered it in fall 2017.

A comparison of postsecondary outcomes for all 2016–17 seniors across both groups of schools provides the causal effects of a school offering the program. I examine effects on college enrollment, persistence, and degree receipt, as well as on school type and declared major. I use administrative data from the Michigan Department of Education and Center for Educational Performance and Information that include postsecondary enrollment and degree-receipt data from the National Student Clearinghouse and state Student Transcript and Academic Record Repository. I look at whether students enroll in college within four years of high school graduation, and for those who do, whether they persist and earn a degree within six years.

My analysis focused on school-level effects; that is, I compare outcomes for all students at schools that did or did not offer the program, rather than comparing outcomes for individual students who did or did not take the course. This accounts for “spillover effects” across students within a school, where students who take the class share their increased college knowledge with students who don’t. I view this as a positive aspect of the program, and one that I want to capture as part of its effects. This scenario also reflects the likely real-world situation where a curriculum is made available but not forced on every student.

Impacts on College-Going

Offering a college-planning class has negligible effects on overall college enrollment but significant impacts on which students attend and persist in their studies. When high schools offer a college-planning class, high-achieving students are 4 percent more likely to enroll in postsecondary schooling and low-achieving students are 9.5 percent less likely to enroll (see Figure 1). This upward shift in the achievement level of college enrollees is accompanied by positive findings for persistence. Low-achieving students are just as likely to persist through their second year of college and more likely to persist to year three despite the large enrollment reduction. High-achieving students are 7 percent more likely to persist to year three.

In terms of degree receipt, high-achieving students are 8 percent more likely to earn a degree within six years. I find no significant change in the likelihood of low-achieving students earning a degree. Taken together with the reduction in postsecondary enrollment for low-achieving students, these findings imply that the students who the intervention caused not to enroll would have quickly dropped out.

I then compare student outcomes based on both high-school performance and economic disadvantage. Achievement in high school is positively correlated with student economic advantage, raising the possibility that the effects are concentrated among advantaged students, with little or no improvements for economically disadvantaged students.

I find the opposite: Increases in enrollment, persistence, and degree receipt are concentrated among low-income, high-achieving students (see Figure 2). Low-income high achievers are 6 percent more likely to enroll in college, 13 percent more likely to persist to their third year, and 11 percent more likely to earn a degree within six years of high school graduation. This is substantial, given the importance of boosting the postsecondary attainment rates of high-achieving, economically disadvantaged students. I find no such impacts for high-achieving students who are not economically disadvantaged—their rates of enrollment, persistence, and degree receipt do not change with access to a college-planning class at school.

Overall, the presence of a college-planning class does not boost enrollment among economically disadvantaged students, but I find evidence of improved persistence. Low-income students who enroll in college are about 18 percent more likely to persist to a third year if their high school offers a college-planning class. Low-income, low-achieving students are less likely to enroll in college, but more likely to persist to years two and three. I find no negative effects on degree receipt.

Effects on School Choice and Major

One channel through which the intervention could increase persistence is by affecting where students enroll. Persistence rates vary dramatically across institutions, with the increased college dropout rate and slowing time-to-degree in the United States over the last few decades due in part to differences across colleges in characteristics such as instructor quality, resources for student support, and peer effects.

I look at whether students enroll in a two- or four-year school, as well as how many students transition between the two. The college-planning curriculum emphasizes both the opportunity to apply community-college credits toward a bachelor’s degree and the value of an associate degree in the labor force. I find that it increases the fraction of students enrolling in both a two-year and four-year institution within four years of graduating high school by 27 percent. This impact is largest for low-income, low-achieving students, who are 16 percent less likely to enroll in only a two-year school but 94 percent more likely to enroll in both types of institutions.

While some of this effect is actually due to students attending a four-year and then two-year institution, it is clear that the intervention causes a substantial number of low-achieving, economically disadvantaged students—who otherwise would have enrolled only in a community college—to successfully transition from a community college to a four-year institution.

EdNext in your inbox

I next examine effects on the match between student academic preparation and college quality. I consider a “safety” college for a student to be either a two-year college, regardless of a student’s SAT score, or a four-year college where the student’s SAT score is above the 75th percentile of enrolled students at that school. I consider a “match” college to be a four-year institution where the student’s SAT score is between the 25th and 75th percentile. Finally, I consider a “reach” school one where the student’s SAT score is below the 25th percentile.

The college-planning curriculum increases the fraction of students who enroll in both a safety and non-safety (either a match or reach) college by 23 percent and decreases the fraction of students who enroll only in a safety college by 6 percent, though that difference is not quite statistically significant. Still, this suggests that students who in the absence of the intervention would have only enrolled in a community college or a low-quality four-year institution, instead also enroll at a better-fit college.

The curriculum emphasizes avoiding “undermatch” in school choice, because community colleges and non-selective four-year institutions tend to have fewer resources and lower graduation rates. However, I find that low-income, high-achieving students are 12 percent more likely to enroll at a safety school. It seems this messaging succeeded in preventing low-achieving students who would have enrolled at these types of institutions from doing so, but not in inspiring low-income, high-achieving students to enroll in more competitive schools.

I also look at students’ choice of major. The curriculum covers career exploration, with the goal of identifying high-growth occupations of interest and aligned courses of study. I specifically measure whether more students major in subjects related to high-earning careers: science, technology, engineering, mathematics, economics, and business. Low-income, high-achieving students are nearly 11 percent more likely to enroll in college and pursue one of these majors. There are no increases in the number of students enrolling and majoring in a low-earning field.

An Efficient Intervention

School counselors are the main source of college advising for low-income high school students but are woefully understaffed in high-need schools. I find that when schools offer a college-planning curriculum for high school seniors taught by teachers, students have greater exposure to four-year institutions, are more likely to enroll in college full time, and additionally, have increased “college knowledge.”

While the college-planning curriculum does not increase the number of students earning bachelor’s degrees, students are 15 percent more likely to earn an associate degree within six years. The benefits of these credentials far outweigh the program’s financial cost.

On average in Michigan, an associate degree increases an individual’s annual earnings by $9,753 (in 2022 dollars). In weighing the contribution of the program and assuming a 40-year working life, I find that the net present value of this increase for every student enrolled in a participating school ranges from $2,176 to $2,931. For high-achieving students, who are 19 percent more likely to earn an associate degree, the estimated benefit is nearly twice as large, between $4,016 and $5,410. These net present benefits in lifetime earnings are arguably modest in size, yet they dwarf the program’s minimal cost of about $8 per student.

This near-zero financial cost is an important strength of the intervention. Schools serving large numbers of economically disadvantaged students are rarely in the financial position to hire additional counselors or implement a new college-going intervention, even if it is relatively inexpensive on a per-pupil basis. Yet these students are underrepresented on college campuses and arguably most in need of direct support in navigating admissions and enrollment decisions. A college-planning curriculum delivered by classroom teachers represents a promising alternative.

Joshua Hyman is associate professor at Amherst College and research associate at the National Bureau of Economic Research (NBER).

This article appeared in the Summer 2025 issue of Education Next. Suggested citation format:

Hyman, J. (2025). College Counseling in the Classroom: A low-cost approach improves postsecondary planning and outcomes. Education Next, 25(3), 62-67.

The post College Counseling in the Classroom appeared first on Education Next.

The Predictive Power of Standardized Tests

Darrin DeChane — Tue, 01 Jul 2025 09:00:20 +0000

Standardized tests form the bedrock of school accountability systems and are a primary source of information for the public and policymakers alike. Over the past two decades, these tests also have come to define whether students are on track to being “college and career ready” at the end of high school, in line with state standards for what students should know and be able to do by the spring of each school year.

But many parents and educators have grown skeptical of standardized testing and the relevance of a student’s scores to their long-term success—especially tests given when children are still in elementary or middle school. Some question the typical practice of classifying students into different proficiency levels based on the scores they earned—such as below basic, basic, proficient, and advanced—to help parents and the public understand the results. What can a 14-year-old’s test scores and proficiency levels tell us about college readiness? We decided to find out and designed a study to assess the degree to which middle-school test performance and proficiency level predicts postsecondary success.

Middle-school test scores tell us quite a lot. Students with high scores on reading, math, and science tests in 8th grade are dramatically more likely to earn a bachelor’s degree within five years of finishing high school. We analyzed nine years of data for 260,000 students in Missouri, starting with their 8th-grade scores and following them through high school and the next five years to see which students graduated high school, attended college, and earned a degree. We looked at each subject test separately and in combination, and we looked at students as a whole and grouped by race and gender. Every analysis found the same trend: The higher a student’s middle-school test scores, the more likely they are to graduate high school, attend college, and earn a college degree.

The differences are especially stark among students in the highest- and lowest-score categories. Fewer than 1 percent of students who score below basic in 8th-grade reading go on to earn a four-year college degree compared with almost 23 percent of students who score proficient and 43 percent of students with advanced scores. Put another way, the strongest readers in 8th grade are about 62 times more likely to earn a bachelor’s degree than the students with the lowest reading scores.

We also investigate what we would expect to happen if every student had earned at least a proficient score on all three tests—a goal set by the federal Every Student Succeeds Act and mandated under its predecessor, No Child Left Behind. We simulate these test-score improvements and find substantial, positive impacts on postsecondary outcomes. Overall, the number of students earning four-year college degrees would jump by 55 percent. The number of Black students who earn four-year degrees would nearly triple, with increases of 189 percent for males and 182 percent for females. Among Hispanic students, four-year degree holders would almost double, with increases of 94 percent for males and 86 percent for females.

Standardized testing invites controversy; however, our analysis shows that test results provide relevant and predictive insights about academic achievement and the likelihood of a student’s postsecondary success. On one hand, states typically use test-score data along with other information, including attendance and grades, to identify students “at risk,” and this is important to prevent failure and support students. On the other hand, we focus on predicting students’ long-run success based on academic achievement before high school. Our simulation illustrates the potential impacts of ensuring all students finish middle school with grade-level skills and knowledge in reading, math, and science. A stronger start in high school gives students a far greater chance of earning a college degree.

A Longitudinal Approach

Tracking student outcomes over the longer term is typically the province of dogged and well-resourced academics. Increasingly, state agencies can ask and quickly answer these sorts of questions as well. About three dozen states have built statewide longitudinal data systems, or SLDS, which connect individual-level data from agencies in charge of early childhood and preschool programs, K–12 education, higher education, and the workforce. These so-called “P20W” data systems have grown with the support of a federal grant program and philanthropic investment and as a result of the relatively lower-cost capabilities of open-source code and cloud-based computing. Policymakers are in the early days of using SLDS to assess program effectiveness at the systems level.

That includes Missouri, which has an SLDS dating back to 2006 that is maintained by the state’s Department of Elementary and Secondary Education. Our study uses this system to examine the postsecondary outcomes of 264,590 Missouri students who started high school as first-time freshmen between fall 2009 and fall 2012. Each cohort includes approximately 70,000 students attending 545 public high schools. In all, our study sample is 78 percent white, 18 percent Black, and 4 percent Hispanic.

We track these students over nine years until their fifth year after leaving high school—roughly from ages 14 to 23. We start with each student’s 8th-grade scores on the Missouri Assessment of Progress, which assesses reading, math, and science. We then match that information with high-school graduation data, also housed at the education department. Finally, we use a data link that connects to the National Student Clearinghouse, which provides three other outcomes of interest: college attendance, whether a student earns any postsecondary degree (which includes two- and four-year degrees), and whether they earn a four-year college degree.

Our analysis groups students by 8th-grade test performance, based on the four score categories reported to families and used in state accountability systems: “below basic” and “basic,” which are below grade level, and “proficient” and “advanced,” which meet or exceed grade-level standards. Roughly half of students earn passing scores on the three tests: in reading, 52 percent score proficient or advanced compared with 51 percent in math and 49 percent in science.

We look at test scores by race and gender and find broad differences. For example, in math, 56 percent of white males earn proficient or advanced scores compared to 42 percent of Hispanic males and 23 percent of Black male students. In reading, girls are more likely to score proficient or advanced than boys in all racial groups. Some 25 percent of white females score advanced in reading compared with 13 percent of Hispanic females and 7 percent of Black female students. Black male students have the largest shares of scores in the below basic category: 37 percent in math and 15 percent in reading.

The College Connection

We then link test performance to our target postsecondary outcomes and calculate rates of high-school graduation, college attendance, earning any postsecondary degree, and earning a four-year degree for each score category. We find clear and positive associations between student performance in 8th grade and whether they successfully graduate from high school and college. Across the board, students’ likelihood of postsecondary success lines up with the strength of their reading and math scores at the end of middle school.

In looking at students who earn advanced math scores in 8th grade, 74 percent go on to attend college, 51 percent earn any degree, and 45 percent earn a four-year degree (see Figure 1). Those who score proficient in math also attend and graduate from college, but at far lower rates: 58 percent attend, 32 percent earn any degree, and 22 percent earn a four-year degree. Student performance in reading is similarly predictive. Some 43 percent of 8th graders who score advanced go on to earn a four-year degree compared with 23 percent of proficient readers, 6 percent of students who score at the basic level, and fewer than 1 percent of students in the below basic category.

The differences in college attainment are driven by lower rates of college attendance and weaker persistence to degree completion by students with lower 8th-grade scores. For example, about one in three students who score basic in 8th-grade reading go on to attend college. Of those, just 35 percent earn any postsecondary degree and about 19 percent earn a bachelor’s within five years of leaving high school. Students who score proficient in reading are more likely both to attend and to complete college: 58 percent attend and 48 percent of those students earn a four-year degree. Those rates jump again when looking at students who score in the advanced category on the 8th-grade reading test: Almost three-quarters attend college and 60 percent of those students earn a bachelor’s degree within five years.

Another way to illustrate these differences is by comparing the likelihood of attaining each milestone by middle-school score category. For example, students with advanced math scores are about four times as likely to attend college as students who score below basic. When it comes to earning a four-year degree within five years of leaving high school, advanced math students are 30 times as likely to succeed as students with below basic math scores in 8th grade. Those comparisons are even starker when looking at 8th-grade reading. Compared to students with below basic scores, students with advanced scores are 6 times as likely to attend college, 24 times as likely to earn a postsecondary degree, and 62 times as likely to earn a bachelor’s degree.

EdNext in your inbox

We conduct these same analyses by student race and gender, which show that on average, female and white students are more likely to attend college and earn a degree than male and Black and Hispanic students at all score-category levels, even among the highest-scoring middle schoolers. For example, among girls with advanced math scores in 8th grade, 54 percent of white students earn a four-year degree compared with 39 percent of Hispanic students and 38 percent of Black students. Among boys with advanced math scores, 39 percent of white students earn a four-year degree compared with 27 percent of Hispanic students and 23 percent of Black students.

These differences in outcomes by race and gender among students with similar test scores are an important reminder of the many factors beyond test scores that influence students’ postsecondary success. However, they do not imply that test scores are less predictive for certain subgroups. Our study separately calculates outcomes for every student subgroup, and we find that in every subgroup and on every test, students in higher-score categories have far better long-term education outcomes than students in lower-score categories. Across the board, strong standardized test scores in 8th grade are associated with much higher rates of postsecondary success. And with each downward shift in score category, the likelihood of a student graduating from high school and completing a college degree sharply declines.

Pushing for Proficiency

We then explore what would happen if students with basic or below basic scores had instead scored proficient. What would the potential impacts be if more students finished middle school with grade-level knowledge and skills?

We first look at students in the middle of the pack, who score either proficient or basic, and are therefore either at or just below grade level. A comparison of likely postsecondary outcomes for these students shows consistent, positive impacts from moving to proficient from basic for students of all races and genders.

Overall, the odds of earning a postsecondary degree roughly double for a student who moves from basic to proficient on any of the exams. The odds of earning a four-year degree roughly triple for the same achievement gain. In particular, white males experience slightly larger gains: They are 1.7 times as likely to go to college if they score proficient instead of basic and 3.1 times as likely to earn a four-year degree. Black females are relatively less affected: They are 1.4 times as likely to attend college and 2.6 times as likely to earn a four-year degree.

Finally, we investigate the potential for improving academic achievement such that every student scored at least proficient on every subject test. Getting all students to at least the proficient level is a major goal of state and federal accountability systems and, as evidenced above, a powerful lever to expand postsecondary success. Given the positive associations we find in our analysis, what might the outcomes be?

We simulate this scenario and estimate its impact across the entire student group and by race and gender. If test scores for all students below grade level were to improve and reach proficient, our analysis shows the number of students earning a postsecondary degree growing by 52 percent and number of students earning a four-year degree growing by 55 percent (see Figure 2). We find very large impacts for Black and Hispanic male students, who have higher rates of students scoring at basic or below basic. We show that the number of Hispanic males earning bachelor’s degrees within five years of finishing high school would nearly double, with 94 percent growth. Our calculations also show the number of Black males with four-year degrees would nearly triple, with 189 percent growth.

Testing for the Future

We find a clear and consistent link between middle-school test scores and postsecondary outcomes in early adulthood. The 8th graders with the highest test scores are far more likely to attend and successfully complete a college degree, and the lower a student’s test score, the less likely they are to earn a degree.

Our analysis also shows the power of moving students in the middle of the pack over the grade-level finish line, from scoring basic to the proficient level. Compared to scoring basic, a student scoring proficient on any of the 8th-grade assessments is roughly twice as likely to earn a postsecondary degree and three times as likely to earn a four-year degree. Interventions targeted to “at risk” students, which is a common application of accountability data of this kind, can have major, lifelong impacts by supporting all students below grade level.

There is considerable controversy about the role of standardized tests in state accountability systems, and critics have argued that the tests are inadequate measures of school performance. However, our examination of more than a quarter-million Missouri students finds that federally mandated assessments in 8th grade can provide vital insights for individual students and their families, in addition to more traditional audiences of researchers and policymakers looking at the state or systems level.

Consider the relatively large shares of low-scoring students who attend college but do not earn a degree. Many students who do not score proficient in middle school go on to attend college—for example, among students who score basic in math, 37 percent attend college. But far too many students struggle and do not complete a degree—a phenomenon that has left upward of 40 percent of enrollees with debt but no degree within six years. Amid widespread questions about “college for all,” understanding the clear through-line from middle-school performance to postsecondary education could help families and schools intervene and support students to reach their post-high-school goals.

Our findings also highlight the critical importance and real-world relevance of data from standardized tests. While every student is unique and no one test can predict the whole of the future, a student’s scores shine a light on a likely pathway. Across race and gender, we find that test scores in middle school line up clearly with that student’s odds of education success after high school. Our study amounts to a nine-year sneak peek at what may be ahead—empowering students, families, teachers, and school leaders with the power to make positive change.

Darrin DeChane is data analyst at the Sinquefield Center for Applied Economic Research at Saint Louis University, where Takako Nomi is interim director. Michael Podgursky, a research fellow and former director at the center, is the Chancellor’s Professor of Economics at the University of Missouri.

This research first appeared with CALDER and is available unabridged here.

This article appeared in the Summer 2025 issue of Education Next. Suggested citation format:

DeChane, D., Nomi, T., and Podgursky, M. (2025). The Predictive Power of Standardized Tests: Middle school scores preview college and career outcomes. Education Next, 25(3), 56-61.

For more, please see “The Top 20 Education Next Articles of 2025.”

The post The Predictive Power of Standardized Tests appeared first on Education Next.

Catholic Schools Can’t Compete

Shaun M. Dougherty — Tue, 04 Mar 2025 10:00:07 +0000

During the first half of the 20th century, enrollment at Catholic schools was on the rise as growing numbers of families sought an affordable education alternative. By 1960, about one in seven American schoolchildren attended a Catholic school.

That number has been in broad decline ever since, including a 30 percent drop in enrollment since 2000. A common explanation is the overall decline of religious participation and shrinking white European ethnic enclaves where Catholic school was the norm, as well as lasting negative impacts of the Church’s institutional corruption and sexual abuse scandals.

But that ignores a big part of the story. Over the last 25 years, the school choice landscape in which Catholic schools compete has changed dramatically. In particular, there was explosive growth in the availability of public charter schools, which offer a tuition-free alternative to a family’s zoned public school. Today, about 7 percent of the nation’s K–12 students attend charters compared to about 3.5 percent enrolled in Catholic schools (see Figure 1).

These trends have occurred in plain sight of one another—for example, it’s not uncommon for charter schools to purchase or lease former Catholic school buildings as they open or expand. But little is known about the relationship between charter-school growth and Catholic-school decline. Does one explain the other? Can we predict which Catholic schools will close based on where new charter schools are opened? While earlier studies have examined this relationship in individual states or regions, no research to date has assessed how the expanding charter sector has affected Catholic schools across the United States.

That’s the focus of our analysis, which uses geolocated enrollment data for Catholic and public charter schools nationwide to estimate the impacts of opening a new charter within a five-mile radius of an existing Catholic school. We look at enrollment shifts and school closures in K–8 schools over time and find that within two years of a new charter opening, a local Catholic school’s enrollment drops by 2 percent and its likelihood of closing increases by 1.5 percentage points. These trends are most pronounced in states that don’t cap charter-school growth.

Our findings suggest that public investments in charter schools directly affect Catholic school enrollment and closures. The arrival of a new charter school does not necessarily expand school choice given the increased likelihood that a longstanding alternative, the local Catholic school, will close within a few years.

A Changing School Choice Landscape

Since the early 2000s, researchers have investigated the impact of public charter schools on existing education systems, particularly traditional public schools. Many studies have found that the expansion of charters significantly reduced enrollment in traditional public schools. Research looking at charters’ effects on segregation has found a mix of positive and negative effects (see “Do Charter Schools Increase Segregation?” research, Fall 2019). Meanwhile, a study of enrollment trends at private schools found growing segregation by family income, with steep drops in the share of private school students from the middle class (see “Who Goes to Private School?” research, Fall 2018).

Our analysis looks at the impact of new charter school openings on pre-existing Catholic schools. We review enrollment and location data for every charter and Catholic school in the United States that has at least two years of enrollment data from 1998–99 through 2019–2020, which includes 8,369 Catholic schools and 15,592 charter schools. We focus on K–8 schools, which mirrors the typical grade-band structure of a Catholic school.

To assess the impact of a new charter school, we focus on enrollment shifts at Catholic schools within a five-mile radius that serve the same grade levels. Because we are predominately interested in K–8 Catholic schools that did not previously face competition from a local charter, the schools in our analysis are most often in smaller cities and urban-fringe settings, where defining proximity as within a five-mile radius makes sense. (In comparison, the average distance between existing charters and K–8 Catholic schools during the study period, when charters were concentrated in dense urban communities, is 2.08 miles.)

We use data from National Longitudinal School Database, which includes the Private School Survey, and estimate three outcomes: changes in Catholic school size, whether a school remains open or closes, and the racial diversity of student enrollment. We calculate a new charter school’s impact on these outcomes by comparing data for affected Catholic schools within a five-mile radius of a new charter during the study period to data for Catholic schools serving the same grade levels that do not face competition from a new charter during that time. Further, our estimates include two county-level factors to account for why founders may have chosen the new charter’s location: median household income and local K–12 enrollment in the year 2000.

Results

After a new charter school opens nearby, enrollment at local Catholic schools falls by 2 percent, or about six students in the average size school, within two years (see Figure 2). The negative impact on enrollment increases over the next decade. The risk of a Catholic school closing increases after a charter school opens, as well. In K–8 schools, a Catholic school’s likelihood of closing rises to about 1 to 3.5 percentage points in each year beyond the second year after a charter school opening. Together with the enrollment impacts, this suggests that across five years, enrollment at a K–8 Catholic school newly located within a five-mile radius of a charter school will drop by nearly 10 percent and the risk the school will close grows by more than 4 percentage points.

We also look at how a charter opening affects the racial composition of nearby Catholic schools. After a charter opens, the share of white students in nearby Catholic schools shrinks by 1.4 percent, while a school’s percentages of both Black and Hispanic students increase slightly.

Our analysis relies on the assumption that both affected and unaffected Catholic schools had parallel enrollment trends before a charter opens and would have continued on the same parallel path had the new school not come to town. However, another complicating factor was at play during the study period. The rapid charter-school expansion of the early 2000s occurred just as revelations of clergy sex abuse were most prominently in the news, which may have influenced families’ decisions to enroll or continue in Catholic schools. Indeed, a study by Ali Moghtaderi found that reports of abuse after 2002, when media coverage expanded dramatically nationwide, had a pronounced and lasting negative impact on Catholic school enrollment declines and closures.

We do not believe that the timing and impact of the scandal affect our findings. Because all Catholic schools were potentially associated with the church sex abuse scandal, every school experienced the negative impact regardless of whether a charter school opened nearby. As an extra check, we separately analyze enrollment and closure impacts from 1998 through 2002—the four years before the most novel and substantial reports of abuse were made public—and find the same general results as those for the full study period.

Changing attitudes and state policies to limit the growth of charter schools could also influence impacts on Catholic schools. We compare impacts in states that cap charters to those that do not limit growth; in states without caps, Catholic schools experience stronger and more pronounced enrollment declines and increased risk of closure. On average in a state without a charter cap, Catholic school enrollment drops by 2.5 percent two years after a new charter opens nearby compared to 2 percent in a state that limits charter growth.

EdNext in your inbox

The Role of Family Finances

We look at another state policy that shapes family decision-making: vouchers. Currently, 14 states and Washington, D.C. allow vouchers. To analyze this issue, we took a closer look at nine states, selected based on how much the expansion of charter schools overlapped with the availability of vouchers in those states. This limited sample did not allow us to distinguish the impact of charter openings in states that had vouchers from those that did not. However, theory and more recent observational evidence from the expansion of vouchers in Indiana suggests costs, rather than preferences for explicit quality measures, may dominate enrollment decisions. In the presence of vouchers, stable enrollment in Catholic schools may reflect a true preference for Catholic education or the desire to not change schools when the financial costs of remaining are neutral or at least partially offset.

What about the role of school quality? When comparing traditional and public charter schools with similar academic outcomes, preferences for charters could be motivated by specific learning environments or the ability to innovate. However, Catholic schools generally do not have performance data comparable to that available for traditional public schools, and neither do charter schools at the time that they open. If families choose to withdraw their children from an existing Catholic school and enroll them in a new charter school nearby, the absence of standardized data for both school types suggests that they are likely motivated by a tuition-free alternative or non-religious learning environment.

We look at standardized test score performance in charters and traditional public schools near Catholic schools affected by charter openings. On the whole, there is not much evidence charter schools were so obviously higher performing that this feature would be the reason families might choose them over the nearby Catholic school. At the time of opening, there would be no available signal of quality based on test score performance (unless charters were part of a national network touting its record of performance, but this does not describe the bulk of new openings). Moreover, subsequent performance data does not suggest that the charter schools opening close to Catholic schools were especially high performing.

A Portrait of Family Preferences

Our findings confirm that public investments in charter schools directly and negatively impact the enrollments and persistence of K–8 Catholic schools in the same areas. Second, charter school openings in areas that previously did not have one may not always increase the total number of choice options for families, though they reduce the private cost of attending a school of choice since charter schools are tuition free.

Whereas for much of the last 50 years Catholic schools were the primary low-cost alternatives to public schools, our research demonstrates that in many instances, particularly K–8 schools, family preferences for freely available alternatives exceed their preference for an independent school or religiously centered education. All else equal, families will choose the lower-cost alternative to save money, particularly given that Catholic schools had increasingly served non-Catholic student populations, thus reducing any specific faith-based explanation for enrolling. Further, family choices are less likely to be driven by school attributes for two reasons. First, there is no widespread, obvious evidence that new charters would have stronger school performance than pre-existing Catholic schools. And second, the highly structured, strict, college-focused models of many charter schools are analogous to Catholic schools. Thus, to the extent that some families may have historically chosen Catholic schools as preferred alternatives, charter schools may offer families a reasonable substitute without the financial cost.

Recent developments may shift the school choice landscape once again and, potentially, eliminate or minimize the cost difference between charters and Catholic schools. The U.S Supreme Court will soon hear arguments in two Oklahoma cases regarding St. Isidore of Seville Catholic Virtual School, a planned religious charter school operated by the dioceses of Tulsa and Eastern Oklahoma. One possible outcome is that the court could require all 45 states that grant school charters to also allow religious charter schools. In the meantime, President Trump has issued a broad order for states to prioritize and expand vouchers and other school choice programs, including through new guidance on using federal funds and revamping discretionary grant spending to do so. Depending on what form specific policies take, the competitive landscape may be shifting in Catholic schools’ favor, and soon.

Shaun M. Dougherty is professor of education and policy at Boston College, where Andrew Miller is an assistant professor, and Yerin Yoon is a doctoral candidate.

This article appeared in the Spring 2025 issue of Education Next. Suggested citation format:

Dougherty, S.M., Miller, A., and Yoon, Y. (2025). Catholic Schools Can’t Compete: Tuition-free charter schools dominate school choice. Education Next, 25(2), 56-61.

For more, please see “The Top 20 Education Next Articles of 2025.”

The post Catholic Schools Can’t Compete appeared first on Education Next.

The Power of Performance Pay

Eric A. Hanushek — Tue, 25 Feb 2025 10:00:17 +0000

Natasha Boone, a 3rd-grade reading teacher, high fives a student at Titche Elementary School in Dallas in 2019. Boone was one of 400 teachers who participated in Dallas ISD’s successful ACE program to turn around the district’s lowest-performing schools.

Teacher evaluation reform dominated education policy throughout the 2010s when new performance-based ratings were mandated in 44 states and Washington, D.C. Though high-stakes evaluation has since receded from the headlines, improving teacher quality remains a critical strategy to boost student outcomes and respond to new challenges, such as pandemic learning loss. So, it’s worth taking a close look at the evidence on how performance-based evaluations can affect teacher quality and student achievement. What does the research show, and what can we learn by looking at districts that made the biggest changes?

One oft-cited 2021 study finds that high-stakes evaluations did not change teachers’ paychecks and were, for the most part, a dud. Joshua Bleiberg and co-authors looked across the United States and found negligible effects from new evaluations that included multiple measures of teacher performance, including student test scores. Just like the perfunctory evaluations they replaced, many new systems rated less than 1 percent of teachers “unsatisfactory.” In most states and districts, these systems also were disconnected from pay scales, which maintained traditional step-and-lane schedules that base teacher salaries on experience and education. Despite federal funding for incentives, evaluation reforms were too weak on their own to inform or induce meaningful changes in the quality of states’ teacher workforces.

But that wasn’t the case everywhere. Several large, urban districts implemented sweeping changes that linked performance-based evaluations with new, merit-based pay schedules. In Washington, D.C., for example, the IMPACT system rated teachers based on a variety of outcomes, including student test scores and professional observations, and triggered boosts in pay, targeted supports, or dismissal notices for educators at the ends of the spectrum. A long-running study by Thomas Dee and James Wyckoff found substantial improvement in teacher quality after IMPACT began in 2009, with greater retention of high performers and quick exits or improvements among teachers with lower performance rankings (see “A Lasting Impact,” research, Fall 2017). Student achievement accelerated, particularly in math.

Over the past several years, we have investigated an even more comprehensive effort in Texas that, to date, has received far less attention. Starting in 2013, the Dallas Independent School District completely replaced its traditional pay scales for principals and teachers with an evaluation and compensation system based on multiple measures of effectiveness, including student achievement and student survey responses. The district also established new, robust definitions of educator excellence, performance-based reviews for school principals, and cash incentives to encourage highly rated teachers to move to low-performing schools.

We conducted multiple analyses to track the impact of these efforts. The results show the district’s reforms had a large and durable positive impact on teacher quality and student learning.

In the four years after Dallas adopted new performance-based teacher evaluation and compensation systems, student performance on standardized tests improved by 16 percent of a standard deviation in math and 6 percent in reading, while scores for a comparison group of similar Texas schools remained flat. Teacher turnover in the wake of these reforms was concentrated among lower-rated teachers. And a program that offered sizable financial incentives to reassign top-rated teachers to struggling elementary campuses immediately improved teacher quality and student achievement and had dramatic, lasting, positive effects on student learning through middle school.

Evaluation and Pay Reform in Dallas

A large, urban school district in north central Texas, Dallas ISD enrolls roughly 139,000 students in 240 schools. Some 72 percent of students are Hispanic, about 20 percent are Black, and about 6 percent are white. Approximately 90 percent of students are eligible for free or reduced-price school lunch, and the four-year graduation rate is around 80 percent, which is below the statewide average.

Local efforts to change educator evaluation and compensation began in earnest in 2011, after new state rules empowered Texas districts to develop their own ways of rating teacher performance. In Dallas, the district board of trustees adopted a pay-for-performance compensation system proposed and developed by then-Superintendent Mike Miles. Over about three years, the district established a new multiple-measures evaluation system based on classroom observations, growth in student test scores when available, and student surveys.

The evaluations, adopted in 2015 as part of the Teacher Excellence Initiative, or TEI, are based on detailed rubrics defining excellence and on aligned professional development for teachers and principals. A parallel reform for principals, the Principal Excellence Initiative, uses a similar method to assess and categorize principals by performance, including their use of the rich information created by TEI evaluations to help teachers improve. Pay for teachers and principals is based on their evaluation scores averaged over two years. In combination, these structures aim to support educator growth, to strengthen incentives to improve instruction and leadership practices, and to attract and retain strong teachers and school leaders in Dallas ISD.

Teacher evaluations include 10 classroom observations (some unannounced) each year by the same observer, evidence of student progress toward established learning objectives, test-based measures of achievement growth relative to comparable students, and schoolwide achievement. The district also surveys students in grades 3 through 12 each spring and incorporates responses into eligible teachers’ performance ratings.

Each year, teachers receive an evaluation score that is used to assign them to one of nine performance ratings: unsatisfactory, progressing I and II; proficient I, II, and III; and exemplary. Performance-based salaries in the first year of TEI ranged from $45,000 to $90,000, with the largest share of teachers paid $54,000 at the proficient I level. The system maintained fixed proportions of teachers in each performance category; for example, the exemplary category is targeted for teachers in the top 2 percent by evaluation score, while the unsatisfactory rating is targeted for teachers in the bottom 3 percent. A teacher cannot move up or down more than one effectiveness level per year, and a teacher’s salary can only be adjusted downward after they score at a lower level for three consecutive years.

In 2016, the district built on this work through the Accelerating Campus Excellence program, or ACE, which offers up to $10,000 in additional pay for the highest-rated teachers to work in the lowest-performing schools and smaller amounts to teachers rated less effective. ACE teachers also are required to use data-driven instruction and pass ongoing, rigorous screenings to remain in the program, which resulted in the rapid and voluntary reassignment of most ACE educators in a single year.

We assess the impacts of the Dallas ISD reforms by looking at overall student performance data on state tests in math and reading during a four-year period from 2015 to 2019. We conduct a second analysis focused on schools included in the ACE program. We also look at rates of differential teacher retention based on performance ratings and estimate the degree to which a more effective teaching force contributed to changes in student achievement.

Our analyses are based on student enrollment and demographic data; teacher and principal data such as role, experience, salary, education, class size, grade, population served, and subject taught; and student performance on annual statewide tests in grades 3 through 8. Unique student and educator identifiers enable us to follow students and teachers across districts and schools as long as they remain in a Texas public school. We also create a comparison group from elementary and middle schools in the Texas districts with above-median poverty rates.

Impacts on Student Achievement

After Dallas ISD implemented the new, multiple-measure system of teacher evaluations and performance-based compensation system, students did significantly better on statewide math and reading exams. By 2019, student achievement in math improved by 16 percent of a standard deviation; reading achievement improved by 6 percent of a standard deviation (see Figure 1).

These results come from looking at student performance over time compared to a synthetic comparison group of schools drawn from other high-poverty Texas districts. In tracking the impacts of evaluation and compensation reform over time, we find no difference between Dallas ISD and the comparison group until 2016, the second year of the teacher evaluation and compensation reforms. After that, Dallas scores steadily rise through 2019 (the last year before the Covid-19 pandemic). The initial lag in impact is not surprising given the design of the reforms, which were built on incentivizing, supporting, and rewarding high performance in the classroom. Since evaluations began in 2015, any resulting difference in overall teacher quality would not begin until 2016.

Was differential retention of high- and low-performing teachers the driving force behind these improvements? The Dallas reforms involved simultaneous changes in the strength of incentives, information available for mentoring and professional development, and myriad aspects of school operations and educator composition, complicating efforts to disentangle the contributions of each. That said, if the much closer alignment between effectiveness and salary altered the composition of entrants to and exits from Dallas ISD, educator composition could have been an important channel through which the reforms improved student outcomes in the district. A first-order issue, therefore, is understanding the impact of the reforms on educator selection.

The Role of Teacher Turnover

We focus on teacher departures from Dallas ISD to understand the effects of evaluation and compensation reform on the district’s workforce rather than looking at new arrivals for a practical reason: No other district uses comparable measures of effectiveness. Even estimates of teacher value-added to student test scores, which is a common measure, are available only for the small fraction of entrants who previously taught in a tested grade in another district. No effectiveness measures are available for new entrants to teaching.

The rate of teacher turnover rose sharply after 2012, when the district’s controversial reform efforts were highly publicized but still in the development stage. This increase produced major shifts in the shares of teachers with minimal experience. The share of novice teachers with no prior experience quadrupled within three years, from 3 percent in 2012 to 13 percent in 2015. The share of early-career teachers with zero to two years of experience grew sharply from 12 percent in 2012 to 32 percent in 2016 and then declined modestly until 2019. Because new teachers’ effectiveness improves rapidly in their first few years in the classroom, this influx of teachers with little or no prior experience to Dallas ISD likely had a negative effect on achievement that temporarily dampened achievement growth relative to the synthetic control.

However, the implications of higher turnover depend on whether exiting teachers are above or below average. Although a low rating didn’t trigger dismissal, it did come with a potential negative impact on pay and could have led poor performers to leave on their own accord. We turn our attention to 2015, when TEI took effect, and the years immediately after and then compare the average evaluation scores for teachers who left the district and those who stayed on the job. This comparison reveals pronounced negative selection out of the district (see Figure 2). The average evaluation score for teachers who remained in Dallas ISD exceeds those who leave by more than 50 percent of a standard deviation starting in 2016.

Whether the departure of less effective educators translates into better instruction depends on the quality of their replacements. The absence of a measure of effectiveness for teachers prior to their entry into Dallas ISD precludes the direct estimation of the change in teacher effectiveness; however, we perform a separate analysis to estimate the overall contribution of changes in the composition of the teacher workforce to the district’s student achievement gains. Composition of the teaching force is estimated to contribute more than half of the impact on student learning, in combination with other factors including strengthened performance incentives, enhanced support based on detailed classroom observations and evaluation data, and more effective instructional and school leadership.

EdNext in your inbox

Attracting Effective Educators to Hard-to-Staff Schools

In 2016, Dallas ISD built on its innovations in measuring and rewarding teacher performance to address the challenge of attracting and retaining effective teachers in hard-to-staff, chronically low-performing schools. The path-breaking ACE program focused on selectively retaining and recruiting very high-performing teachers and used large pay increases to reshape instructional staff at schools serving disadvantaged students. It was launched at the district’s four lowest-scoring elementary schools in 2016 and expanded to nine schools in 2018.

At the program’s outset, less than 20 percent of existing staff in ACE schools met ACE performance standards and were retained. The remaining positions were filled by highly rated teachers who transferred from other schools. Teachers who applied and were selected to work at ACE campuses received signing bonuses of $2,000 and annual stipends between $6,000 and $10,000 depending on their position and effectiveness rating from the previous year. Principals, counselors, and instructional coaches received stipends that ranged from $6,000 to $13,000 annually.

We look at the ratings of teachers in ACE schools before and after the program’s start in 2016, and the shift is transformational. Before ACE, the vast majority of teachers were rated in the bottom three categories of performance; after ACE, more than half were rated in the top three performance categories (see Figure 3).

Changing school staffs is not the only focus of the program. Under ACE, educators use data-driven instructional practices and are subject to rigorous performance screenings to retain their roles. Students at ACE schools are provided with three meals a day, afterschool enrichment, and other developmental supports. These interventions and teacher stipends remain in place until student achievement improves and the school no longer qualifies for the program.

To assess the impact of ACE on student learning, we compare scores on standardized reading and math tests at ACE schools with a similar group of Dallas ISD elementary schools with 2014 test scores in the lowest 15 percent of the district. We focus our analysis on three elementary schools in the first wave of the ACE program, from 2016 to 2018. (Scoring problems precluded looking at the fourth ACE school.)

ACE schools show an immediate, large increase in achievement while scores at comparison schools are flat (see Figure 4). Scores at ACE schools increase by almost 50 percent of a standard deviation in math and 25 percent of a standard deviation in reading in the first year and continue to improve in years two and three, when ACE stipends and supports remained in place. Performance in comparison schools improves in those years as well, in line with overall district improvement, but the increase is less steep than for the ACE schools.

In 2019, student achievement at all but one ACE school had improved such that the schools were removed from the program and teacher stipends and additional instructional time ended. After that occurred, teacher quality and student achievement experienced sharp declines after that occurred: more than 40 percent of teachers rated proficient 1 or higher left the ACE schools, and average test scores fell by 23 percent of a standard deviation in math and 17 percent of a standard deviation in reading. Achievement at comparison schools was largely unchanged.

Importantly, students who attended ACE elementary schools during that time experienced lasting positive effects seen in subsequent middle school performance. Students who were in 3rd grade when the program began and received three years of ACE supports score 39 percent of a standard deviation higher in math and 23 percent of a standard deviation higher in reading in 6th grade than similar students in comparison schools (see Figure 5). The prior score gains are not just the result of “teaching to the test” but represent true learning gains.

Implications

The Dallas reforms prove what’s possible when teacher evaluation and compensation reforms are part of a comprehensive reset of districtwide personnel policies and practices. The district virtually eliminated the dependence of salary on experience and postgraduate degrees, radically altering the traditional systems of evaluation and pay found throughout the United States. As a result, both teacher quality and student achievement improved.

The ACE program shows how reforms can be targeted to address the needs of chronically low-performing schools. The information produced by Dallas ISD’s evaluation and compensation reforms provided the basis for effectiveness-adjusted hiring and pay in hard-to-staff schools. Teachers respond to incentives. Our analysis shows that the ACE program remade school staffs virtually overnight and boosted student learning, though that success ultimately resulted in the removal of schools from the program. The poorest-performing schools moved close to the district average in just two years. Students who experienced the ACE reforms continued to benefit into middle school.

While such sweeping changes may appear blunt from a distance, a close look at the Dallas reforms shows they were carefully planned to guard against evaluation inflation, the arbitrary treatment of teachers, and strategic responses such as teaching to the test. Aligning the relationship between educator effectiveness and pay dramatically strengthened performance incentives, while the development of a multiple-measure evaluation system that includes evidence of student learning, supervisor observations, and student-survey feedback recognized the pitfalls of a singular reliance on either test scores or subjective evaluations by supervisors. Importantly, focusing on teachers’ value-added rather than absolute performance measures like passing rates or achievement benchmarks made it clear that the district sought to account for factors outside of educators’ control. As a result, these systems survived controversy and contributed to substantial gains in teacher quality and student learning.

Indeed, this experiment in improved personnel policies continues and has expanded. The State of Texas introduced a grant program designed to induce other districts to follow Dallas’s lead, and some 400 districts have begun such a transformation. And in 2023, the state took over Houston ISD and appointed Mike Miles—the architect of the Dallas system—as superintendent. The largest district in Texas is now undergoing similar evidence-based changes in personnel policies.

Eric A. Hanushek is the Paul and Jean Hanna Senior Fellow at the Hoover Institution of Stanford University; Minh Nguyen is an assistant professor of economics at Ball State University; Ben Ost and Steven G. Rivkin are professors of economics at University of Illinois Chicago. This article is based on two working papers published by the National Bureau of Economic Research: “The Effects of Comprehensive Educator Evaluation and Pay Reform on Achievement” by Hanushek and co-authors and “Attracting and Retaining Highly Effective Educators in Hard-to-Staff Schools” by Andrew J. Morgan, Hanushek, and co-authors.

This article appeared in the Spring 2025 issue of Education Next. Suggested citation format:

Hanushek, E.A., Nguyen, M., Ost, B., and Rivkin, S.G. (2025). The Power of Performance Pay: Smarter teacher retention and accelerated student achievement in Dallas. Education Next, 25(2), 48-55.

The post The Power of Performance Pay appeared first on Education Next.

What’s a Special Education Aide Worth? A $9,607 Raise, to the Average Teacher

Virginia S. Lovison — Tue, 12 Nov 2024 10:00:37 +0000

When it comes to attracting and retaining teachers, public attention is typically focused on pay. The driving assumption is that many teachers are underpaid relative to the challenges and importance of their work, and that by improving teacher compensation, we can solve staffing challenges and improve student outcomes.

It sounds reasonable, but it doesn’t match up with exiting teachers’ feedback about their jobs. Teachers rarely cite dissatisfaction with salary when leaving the job. While compensation certainly matters to teachers, working conditions may matter just as much—or more. To figure this out, we designed a survey that asks teachers to choose which school features they prefer when comparing a pair of hypothetical job offers, including salary, school support personnel, class-size reductions, coaching, and childcare subsidies. We then determined the average costs of those features and conducted an analysis to show how much teachers value them in terms of cuts or boosts to their own pay, to compare the relative values of these roles to teachers with their costs to schools and districts.

Teachers’ preferences are clear: they want to work where they will have the support of full-time experts in special education and pediatric physical and mental health. An overwhelming majority describe these supports as “beneficial” or “extremely beneficial” when asked to rate special-education co-teachers (93 percent) and paraprofessionals (92 percent), as well as counselors (89 percent) and school nurses (88 percent).

These roles are so important that teachers are willing to forgo salary increases when asked to choose between the two. Our analysis shows the average teacher is willing to trade a 21 percent raise for the full-time support of a special-education co-teacher and an 18 percent raise for a full-time special-education aide.

While hiring full-time in-class personnel is expensive, we also find examples of support that would cost districts less to provide than what teachers are willing to trade off in additional salary. Teachers value working at a school with a full-time nurse at $7,041 in additional salary compared to the per-teacher cost of a nurse of $2,045. They also would trade a $6,734 raise to work at a school with a full-time counselor, which costs $2,475 per teacher. Interestingly, teachers are less willing to trade off pay raises for smaller class sizes, which is a common area of focus in union negotiations and state legislation. Our analysis shows teachers would trade $1,819 in additional salary for having three fewer students in class compared to a cost of $7,290. By contrast, teachers also assign a relatively low value to instructional coaching at

$2,245 in additional salary, but that is nearly 50 percent more than the annual cost of $1,512 per teacher.

These insights suggest that school and district leaders should prioritize the hiring and retention of support staff that make classroom jobs more attractive and should consider benefits beyond pay raises to attract and retain teachers. Union leaders and policymakers should consider broadening their efforts to enable teachers to work alongside more school counselors, nurses, and special-education specialists.

Surveying Teacher Preferences

Our work builds on prior research, which has shown that teachers are more likely to stay on the job when they work with strong school leaders. What other school staff shape teachers’ preferences about where they want to work? Although personnel costs are the single largest line item in a school budget, accounting for 79 percent of expenditures in the average public school, we know very little about which staff investments are meaningful from teachers’ perspectives. And while many studies have examined teachers’ preferences regarding tenure policies, performance incentives, health insurance, and retirement benefits, far too little attention has been paid to preferences for school support personnel.

Our experiment uses what’s known as a discrete choice survey, which presents teachers with a pair of hypothetical teaching jobs at different schools, defined by seven factors (see “Surveying Teachers’ Preferences”). These factors map back to the three most cited reasons teachers leave schools: working conditions, family or personal reasons, and pay. They are: reducing class size; the presence of a full-time school nurse, school counselor, and in-class support for special-education students; individual instructional coaching; a 10 percent increase or decrease in salary; and childcare subsidies worth $1,500 or $3,000 per child (capped at two children). We limit each school profile to seven factors to guard against decision fatigue while also giving teachers enough information to choose.

In each choice task, teachers receive the following prompt: “If two schools were otherwise identical in every other way—same building, same principal, same teaching assignment, same students—which school would you prefer?” Teachers then review the two school profiles and indicate their preferred choice. Respondents repeat this choice exercise five times.

The types of comparisons vary. For the salary and class-size attributes, the baseline condition was “same as your current position.” For all other attributes, the baseline condition was the absence of the workplace support (i.e., “no nurse,” “no childcare benefits”). The differences in features are designed to present a substantial, but realistic, difference. For example, we defined the salary difference to be a 10 percent increase or decrease, which is a substantively meaningful change in pay but not so large as to be inconceivable. Counselor choices varied from zero to one or two counselors and the nurse choice varied from zero to one, to most closely resemble the distribution of full-time counselors and nurses across American schools. We did not include part-time staffing models in any choice set.

Data and Method

We worked with an online survey sampling platform to invite teachers to participate in the survey and create a national sample of 1,030 respondents. The sample is 75 percent female, 81 percent white, 85 percent employed at a public school, and evenly split between primary and secondary teachers, in line with the national teaching workforce. On average, respondents have 10 years of teaching experience. Survey respondents, all of whom were currently working as teachers, participated in the survey between November 2020 and January 2021. In all, they rated 10,300 unique school profiles.

To account for the potential influence of pandemic-related disruptions on teachers’ responses, we included a question asking whether the respondent’s choices would have been different before the pandemic. Some 90 percent of respondents said they would be the same. In addition, in late 2022, we repeated the original survey with new respondents and administered a newly expanded survey, including five additional attributes related to administrator support, student discipline policy, and student characteristics related to income, race, and achievement. The 2022 study showed that teachers’ responses regarding school support personnel did not substantially change after pandemic-related conditions had largely ended, even when additional school attributes were featured.

We assess teachers’ responses in two ways. First, we estimate the probability that a teacher would want to work at a school when the school offered a specific benefit, such as one full-time nurse, relative to a school without that benefit, with all other characteristics unchanged. Then we estimate teachers’ willingness to pay for each specific benefit by looking at teacher pay based on 2019 data from the Bureau of Labor Statistics. We estimate the per-teacher cost of schoolwide support staff, like counselors and nurses, based on salary averages divided by 33, which is the average number of teachers per school nationwide. We then compare that cost against a 10 percent change in the 2019 median teacher pay of $54,000, which comes out to a raise or salary cut of $5,400.

Results

Teachers overwhelmingly prefer to work at schools with expert staff support in special education, nursing, and school counseling and are willing to trade off substantial raises to do so (see Figure 1). The 10 most attractive school profiles all have at least one counselor on staff and at least two additional sources of support, from either a nurse, instructional coach, or full-time special-education co-teacher or aide. Teachers place lower values on class-size reductions, and those with young children treat a $3,000 per-child subsidy that expires at age 12 as a near-match for an increase in pay.

We investigate differences by school type and years of experience and find that elementary school teachers hold slightly stronger preferences for working at a school with a full-time nurse and care more about class size and instructional coaching. Meanwhile, secondary teachers are somewhat more averse to taking a reduction in salary and hold slightly stronger preferences for working at a school that employs school counselors. Novice teachers more strongly prefer smaller classes compared to more experienced teachers. They also have a marked preference for schools with more support staff, which is consistent with the notion that early-career teachers require more support.

Special Education

Of all the features we study, teachers place the highest value on special education staffing support. The average teacher is willing to trade a 21 percent increase in pay, or $11,345, for full-time support from a special-education co-teacher. For a full-time special-education paraprofessional, the average teacher is willing to forgo an 18 percent raise, or $9,607.

Special-education teachers are consistently in demand, and the share of U.S. students who qualify for special-education services has grown to 15 percent. Some 95 percent of those students attend traditional public schools, and two-thirds spend most of the school day in general education classrooms. A Massachusetts study found that students in co-taught classrooms scored higher on standardized tests than those in classrooms led by a single teacher (see “Are Two Teachers Better Than One?,” research, Winter 2023).

Providing this level of support is expensive, and the value teachers place on it does not match the full cost. The average cost for a full-time special-education co-teacher with benefits is $82,350. A full-time paraprofessional aide, typically hired without benefits, costs approximately $28,000 per year.

It is therefore noteworthy that, although teachers strongly value in-class support for special-education students, they do not overwhelmingly prefer a co-teacher to a paraprofessional. Recent research from North Carolina suggests teaching assistants, who meet the same credential requirements as paraprofessionals, improve student outcomes. With salaries for full-time paraprofessionals about one-third of those for full-time co-teachers, this may be a compelling staffing option for school districts and policymakers to support.

That said, system-level decisions to hire special-education specialists should depend first and foremost on student need. These results highlight strong positive externalities for teachers where these investments are made.

School Nurses

Teachers are willing to trade a 13 percent increase in salary, or $7,041, to work at a school with a nurse. That is more than triple the per-teacher cost of employing a full-time nurse at an average salary of $67,500, or $2,045 per teacher. Yet the best available data shows that only three-fourths of U.S. schools have a nurse onsite at least part of the time. According to the Civil Rights Data Collection, 67 percent of elementary schools, 63 percent of middle schools, and 57 percent of high schools employ a full-time nurse.

Prior research has found that teachers believe nurses are vital because they address and mitigate health barriers that interfere with student learning. For example, an estimated 14 million students—about 20 percent of all U.S. enrollment—have a chronic health condition like asthma and type 1 diabetes, many of which require daily visits to the school nurse. In addition, a study of North Carolina public schools found better health and learning outcomes in schools with lower school nurse-to-student ratios.

EdNext in your inbox

School Counselors

Much like school nurses, teachers strongly value school counselors and are willing to trade off additional income to work at a school with a counselor. Teachers are willing to trade a 12.5 percent increase in pay, or $6,734, to work in a school with one full-time counselor—more than double the per-teacher cost of $2,475 at a school of average size, based on an average counselor salary of $81,689. Working at a school that employs two full-time counselors is worth trading off $8,959 in additional salary to teachers, which is almost 1.8 times the per-teacher cost of $4,950.

School counselors are in relatively short supply in American schools: just 65 percent of elementary schools, 71 percent of middle schools, and 79 percent of high schools have a full-time counselor on staff, and 11 percent do not employ any counselor at all. Four out of five schools do not meet the recommended counselor-to-student ratio of 250:1.

School counselors are trained to support students’ mental health, including during personal challenges and global emergencies. A Massachusetts study found that effective school counselors also boost college readiness and educational attainment (see “Better School Counselors, Better Outcomes,” research, summer 2020).

Instructional Coaches

An estimated two-thirds of U.S. schools offer teachers access to instructional coaching, either from a dedicated coach or school leader, which is a growing area of focus in research and reform efforts in recent years (see “Taking Teacher Coaching to Scale,” research, fall 2018). Prior research has found that teachers value opportunities for professional growth. On the whole, however, teachers in our sample strictly prefer investments in counselors, nurses, and special-education specialists to investments in instructional coaching. Teachers are willing to trade a 4.2 percent raise, or $2,245 in additional salary, for one hour of individual coaching per month.

Although the offer of coaching does not appear to influence teachers’ employment preferences as much as the availability of special-education specialists, nurses, and school counselors, the value teachers place on coaching exceeds its cost. Based on the average full-time salary and benefits package of $89,100 and assuming three hours of worktime per coaching session, including observation, preparation, delivery, and administrative support, a full year of monthly coaching would cost $1,511 per teacher—about two-thirds of what teachers are willing to forgo in salary increases for that support.

Childcare Subsidies

To date, research on teachers’ preferences has largely overlooked the question of whether offering teachers childcare benefits would be a fruitful strategy to recruit or retain teachers. Districts rarely offer this benefit, which may be a missed opportunity—a robust body of evidence suggests childcare benefits increase women’s participation in the labor market and the challenge of juggling family and professional responsibilities without institutionalized, family-friendly workplace supports has long been a top reason women exit the teaching profession.

We therefore include two childcare subsidy amounts in our survey, both of which are capped at two children: $1,500 and $3,000 per child, for maximum subsidies of $3,000 and eligible expenses like daycare and after-school programs and expire when a child turns 12.

Teachers treat the smaller subsidy as a nearly one-to-one swap in pay, whether or not they have qualifying children at home: the average teacher is willing to trade off a 5.8 percent increase in salary, or $3,121, for a $1,500 per-child benefit that is capped at $3,000. For the larger benefit capped at $6,000, teachers would trade an 8.2 percent pay increase, or $4,411 in additional salary.

Some 43 percent of teachers in our sample had at least one child under 12 at the time of the survey; 57 percent did not and would be ineligible for these benefits. We compare their responses and find that, intuitively, the size of the childcare benefit matters most to eligible teachers, who would trade a $3,148 raise for a $3,000 subsidy and a $5,924 raise for a $6,000 subsidy. Ineligible teachers are willing to trade off about the same amount, approximately 6 percent or $3,200 in additional salary, for both subsidy sizes. This suggests that even teachers who would not immediately benefit from a childcare subsidy still value it.

For a hypothetical teacher making $60,000 with two children under 12, providing the larger childcare subsidy in lieu of a 10 percent raise would be far less expensive in the long run. Districts can cap subsidies at a fixed amount, and only a subset of the teaching workforce is eligible. A childcare subsidy expires as children age out, unlike a pay raise.

Implications

No school or district has unlimited resources, so choosing how to spend a finite budget that supports students and teachers alike is an urgent responsibility for every system leader. Similarly, state departments of education face trade-offs in how to allocate taxpayer dollars to best support district needs. During budget season, union leaders are tasked with representing teachers’ points of view. However, we find that actual teacher preferences may differ somewhat from what is discussed in typical negotiations.

First, policies that exclusively focus on salary increases or class size as incentives to attract and retain teachers are poorly aligned with teachers’ preferences. Other benefits, such as childcare subsidies, can influence where teachers want to work.

Second, noninstructional staff like nurses, counselors, and special-education aides are critically important to teachers. Our analysis highlights both the substantial expense of noninstructional staff members and the highly valued services they provide.

This insight is relevant to an ongoing debate about the value of such staff. One view holds that investments in support staff are investments in the teaching workforce, since these colleagues relieve teachers of peripheral responsibilities and enable them to prioritize core instructional tasks. On the other hand, funds dedicated to noninstructional staff could otherwise be allocated to increasing teacher compensation, either through salary or benefits like childcare subsidies.

Our work points to sources of support that are both valued by teachers and may be cost-effective for school districts, such as full-time school nurses and counselors. We also show that, compared to any other factor in our survey, teachers place the highest values on full-time, in-class special-education colleagues and would trade off raises of up to 21 percent for this support. While these are the most expensive factors in our survey, we also find that teachers value paraprofessionals nearly as much as co-teachers, who are three times as expensive to hire.

Our study casts a new light on staff shortages—including the novel insight that shortages in support staff may aggravate teacher shortages. In a 2022 nationally representative federal survey, 48 percent of principals said they were hard-pressed to fill vacant teaching positions, while 60 percent indicated they were struggling to fill non-teaching positions. The worst shortages were for specialists in special education and mental health.

The substantial shares of students who need nursing, counseling, and special-education services clearly benefit when specialized staff are onsite at their schools. Classroom teachers strongly value their contributions as well. Policymakers focused on supporting the teaching workforce should address the critical need to increase the supply of individuals who can serve in these roles.

Virginia S. Lovison is an associate director at Deloitte Access Economics and Cecilia Hyunjung Mo is an associate profes- sor of political science and public policy at University of California, Berkeley.

This article appeared in the Winter 2025 issue of Education Next. Suggested citation format:

Lovison, V.S., and Mo, C.H. (2025). What’s a Special Education Aide Worth? A $9,607 Raise, to the Average Teacher. Survey evidence shows teachers would trade additional salary for expert support. Education Next, 25(1), 54-61.

The post What’s a Special Education Aide Worth? A $9,607 Raise, to the Average Teacher appeared first on Education Next.

Zooming to Class Slows Student Learning

Michael S. Kofoed — Tue, 17 Sep 2024 09:00:28 +0000

After years of steady growth and a pandemic-related explosion, online learning has become a common format for college courses. A decade ago, just 28 percent of all U.S. college students took at least one of their classes online. By the 2021–22 school year—after widespread pandemic-related lockdowns had ended—that had doubled to 54 percent, or 10.1 million students nationwide.

This shift has helped institutions by keeping the virtual door open during emergencies, broadening their pool of potential students, and decreasing brick-and-mortar operating costs. It also has benefitted students by expanding access to advanced coursework for degree-seekers who live in remote areas or juggle coursework with other responsibilities.

We know less, however, about how online learning affects student learning. While prior research has found negative or mixed effects, these are based on self-selected groups of students. College students typically choose their institution and course of study, including whether they take some or all of their classes online.

Our study is the exception. We are, variously, officers in the United States Army and graduates of and former civilian faculty at the United States Military Academy West Point, a unique institution established to educate future Army officers. To apply, aspiring students face a demanding battery of admissions requirements, including outstanding academic and athletic records, evidence of leadership potential, and a nominating letter, typically written by a member of Congress. To graduate, they must complete academic, military, and athletic training, pass several physical fitness tests, and earn 120 college credits. Unlike other institutions, students have little control over their class schedules. More than 80 percent of the academic program is standard across all majors, and coursework is identical across instructors, who follow a unified curriculum with the same materials and graded events.

These unique attributes allowed us to conduct a randomized controlled experiment during the fall 2020 semester, when West Point brought students back to campus but also used online instruction to limit class sizes, maintain social distancing, and slow the spread of Covid-19. We compare coursework, grades, and results of a post-course survey among 551 students who were randomly assigned to online and in-person sections of Principles of Economics, a required class that most students take their sophomore year.

We find that student learning, focus, and engagement suffer when instruction moves online. Online instruction reduces a student’s final grade by 22 percent of a standard deviation, or about 1.7 percentage points—equivalent to declining to a B+ from an A–. The impact is larger for males, at 27 percent of a standard deviation, compared to 9 percent of a standard deviation for females. There is virtually no impact for Black students, while grades for white students are 28 percent of a standard deviation lower when they take the class online.

We see negative effects from online learning across the entire grade distribution but find the biggest impacts on the least academically prepared students. Compared to their in-person counterparts, grades are lower by 45 percent of a standard deviation for students with prior military service, 38 percent of a standard deviation for students who attended a stepping-stone preparation school prior to being admitted, and 27 percent of a standard deviation for students with the lowest scores on an admissions exam. We also find that online students are less likely to report feeling connected and focused compared to in-person students.

These findings do not indicate that all online learning is detrimental. The classes in our study were identical except for their setting, so students did not experience some unique attributes of some online courses, such as self-paced study, that could positively affect outcomes. In addition, because West Point classes are smaller and feature more individualized attention than the large lecture halls used in previous experiments comparing in-person and online courses, the negative impacts on instructor-student relationships may be especially large.

However, when we consider other ways that West Point differs from traditional colleges and universities, our findings are concerning. West Point students are among the nation’s most disciplined young people. They have met stringent, multidimensional requirements to enroll in a structured military leadership program that accepts about 12 percent of applicants. They attend class in uniform, maintain peak physical fitness, and commit to at least five years of active-duty military service after graduation. If online learning has negative effects at West Point, what does that mean for the typical student?

A Pandemic Experiment

Like many colleges and universities during the Covid-19 pandemic, West Point pivoted to online learning to finish the 2019–20 school year. But while many institutions remained entirely online in fall 2020, West Point students—who are known as “cadets”—were required to return to campus to continue their physical and military training as well as their academic studies. To limit classroom capacity and incorporate social-distancing measures, the academy offered both online and in-person classes. As in years past, students were expected to follow schedules set by the academy and did not have the opportunity to request changes. Attendance was mandatory, regardless of modality.

Our experiment builds on the academy’s regimented coursework and existing practice of random assignment. That semester, 551 students were slated to take Principles of Economics, a core course similar to introductory courses at other universities. We received permission to randomize students across 12 instructors in 36 class sections, half of which were online. Each class section included between 12 and 18 students. In all, 61 percent of students were in online sections and 39 percent attended classes in person.

The syllabus, graded events, homework assignments, and midterms were the same in all classes, but instructors used different final exam versions on different days. Both online and in-person class sections were taught in each class hour the course was available, and we randomly assigned students within the hour into each modality. The only exception was for student-athletes, who are prioritized for morning classes to accommodate team schedules. Within those morning class hours, we randomized student-athletes into either an in-person or online class section.

Each instructor agreed to teach their four-section teaching load half online and half in person. West Point classes meet roughly every other day, so instructors taught no more than two class sections per day. We chose which class sections were online, and the registrar’s office randomly assigned students to class sections. This environment created counterfactual classrooms where one could see a given instructor teach both in person and online, which allows us to control for instructor talent, experience, quality, and familiarity with the course material.

Data and Method

Our analysis is based on three sources of data: student demographics and pre-West Point academic achievement data; grades for daily homework, problem sets, midterms, and a final exam; and a voluntary post-course survey, which 73 percent of students completed for extra credit.

Student data show that our sample is representative of the West Point student body: 23 percent female, 14 percent Black, 3 percent Hispanic, and about 6 percent Asian. In keeping with the academy’s focus on physical fitness, 29 percent of students are NCAA Division 1 athletes. About 17 percent are enlisted soldiers who enrolled to earn a college degree and commission as officers, and another 14 percent are students who previously attended an onsite prep school, which offers a yearlong academic development program for high school graduates to remediate weaknesses and prepare to apply to West Point. We also look at these characteristics by course section enrollment and find no statistically significant differences in students being assigned to online versus in-person classes.

During the semester, online software published and executed all graded events. Daily homework assignments were graded for completion, not accuracy. Midterms and final exams contained multiple-choice questions and software-generated graphing exercises. The software graded all assignments and exams. Because instructors had no discretion over grades, we eliminate any potential differences in grading between modalities.

After the class, a post-course survey asked students to rate their ability to concentrate and feelings of connection to their classmates and instructor on a five-point scale. The students who took the survey are representative of the whole group in terms of demographics and whether they were online or in-person. Researchers had sole access to their responses.

We use these data to conduct a straightforward comparison of grades for students enrolled in online and in-person sections, controlling for instructor, time of day, and student characteristics. Because instructors taught the same class both online and in person, this simulates a hypothetical experiment comparing a student who learned from an instructor in person to an identical student who only saw the instructor online.

Impacts on Student Performance

Overall, West Point students did reasonably well in Principles of Economics in fall 2020—the average grade is an 83, or a B. But in comparing outcomes for students assigned to online versus in-person course sections, we see that grades for online students are 22 percent of a standard deviation lower overall (see Figure 1).

The largest negative effects are for the least-prepared students. Students in the bottom quartile based on their academic records from high school, including grades and scores on the SAT and ACT, experience declines that nearly twice as large as those for students with above-average high school performance: 27 percent of a standard deviation compared to 15 percent. Grades for students who came to West Point from the military are lowered by 45 percent of a standard deviation, nearly triple the effect for students without prior military experience, whose grades are 16 percent of a standard deviation lower. The negative impact for students who attended a preparatory program out of high school is twice as large as students who did not attend the prep school, at 38 percent of a standard deviation compared to 18 percent.

Males are more negatively affected than females, with declines of 27 percent of a standard deviation compared to 9 percent. And while there is virtually no effect on Black students, grades for white students are 28 percent of a standard deviation lower when instruction moves online.

Online education adversely affected students across nearly the full range of performance in the course. While students at the extremes—those who earned either As or Fs—performed similarly in online and in-person sections, we find that at every other point on the distribution, student grades in the in-person sections dominate the online sections.

We also look at how online learning affects different graded events and again find lower student performance across the board. Daily homework assignments, worth five points each, are usually viewed as a measure of student engagement and persistence as students earn points solely for completing the assignment. The learning software grades homework automatically, so there is no instructor discretion. Online instruction lowered a student’s homework grade by 21 percent of a standard deviation, a result similar to the overall course grade.

Instructors in the course distributed exams via online learning software regardless of teaching modality. There were two versions of the final exam that were randomly assigned to students without regard to instructor or teaching modality. All students took the final online in their dorm rooms using the instructional software provided by the textbook publisher. We control for the version of the exam students saw when comparing their scores and find online learning lowers a final exam grade by 13 percent of a standard deviation.

Impacts on Student Experiences

In a post-course survey, online students reported lower levels of concentration and feelings of connectedness compared to in-person students (see Figure 2). On a five-point scale, the average student rated their ability to concentrate in class at 3.5 compared to 3.0 for online students, a decline of about three-fourths of a standard deviation. Online students also reported spending 2.3 additional minutes per day studying for the class, a finding that is not statistically significant but interesting nonetheless, since their grades were lower.

When asked how connected they felt to their instructor, with 5 being the strongest connection, the average student response was 3.8 compared to 3.5 for online students, a decline of two-thirds of a standard deviation. When asked if they felt that their instructor cared about them, the average response was 4.2 compared to 4.0 for online students, a difference of about a quarter of a standard deviation. Finally, we asked students to rank their connection to their peers. The average student response was 2.7 compared to 2.2 for online students, a decrease of roughly a half a standard deviation.

These estimates are based on comparisons of students taught by the same instructor in different teaching modalities and therefore control for instructor attributes, teaching styles, and personalities. It seems that the many costs of online education during the pandemic included student satisfaction, concentration, and many of the intangible benefits a professor provides in an in-person class.

Taken together, our quantitative and qualitative findings illustrate some of the limitations of online learning. In this experiment—and as experienced at colleges and universities across the United States during Covid-19—instructors had little time to prepare and adjust teaching styles and pedagogy and thus ported a traditional classroom environment online.

This approach negatively affected student outcomes. But our experiment and analysis informed an immediate response. While it is West Point policy to randomly assign students to classes, the academy prioritized students with weaker high school academic records for in-person classes during the 2021 spring semester. In addition, final course grades for students who had been assigned to online sections of Principles of Economics were adjusted upward by about 1.7 percentage points, in line with our findings.

EdNext in your inbox

A Cautionary Tale

In reflecting on the limitations of our findings, we return to the unique attributes of West Point that made this experiment possible. The rigor and structure of a military leadership institution certainly mean that a West Point cadet’s experience differs from that of the average college student, and the extraordinary requirements for admission mean these students are more disciplined than the typical young adult.

We believe that our findings represent the lower bound of the negative impact of online instruction—that is, in the best possible scenario, where high-performing students lived on a closed campus, did not have financial responsibilities or jobs, and were somewhat less exposed to a disruptive virus, online learning negatively affected student outcomes and experiences. Many other college students, particularly those from disadvantaged backgrounds, may have fared far worse, and we find that even at West Point, less-academically-prepared students experienced greater negative effects.

While instructors can develop new pedagogy and thoughtfully adapt coursework and instruction to an online environment, the potential for widespread learning loss as we observe at West Point should give policymakers and college administrators pause. Online learning may be popular, but it’s not clear that increasing online instruction is in students’ best interests.

Michael S. Kofoed is an assistant professor at the University of Tennessee, Knoxville and was formerly a faculty member at the United States Military Academy. Lucas Gebhart, Dallas Gilmore, and Ryan Moschitto are officers in the United States Army. The views expressed herein are those of the authors and do not reflect the position of the United States Military Academy, the Department of the Army, and the Department of Defense.

This article appeared in the Fall 2024 issue of Education Next. Suggested citation format:

Kofoed, M.S., Gebhart, L., Gilmore, D., and Moschitto, R. (2024). Zooming to Class Slows Student Learning: New evidence from West Point. Education Next, 24(4), 60-65.

The post Zooming to Class Slows Student Learning appeared first on Education Next.

Resolved: Debate Programs Boost Literacy and College Enrollment

Beth Schueler — Tue, 11 Jun 2024 09:00:48 +0000

Shahed Ananzeh and Gustavo Dos Santos, students at the Boston International Newcomers Academy, work together to prepare for an upcoming speech during a Boston Debate League tournament at Suffolk University Law School in February. Ananzeh and Dos Santos are among the novice level of policy debaters.

In a stereotypical image of a high-school debate tournament, straight-A students compete to see which renowned prep school team comes out on top. Increasingly, this is no longer the case: in recent decades, nonprofit organizations have been working to expand access to debate in public school systems that serve large concentrations of low-income students and students of color. More than 10,000 students from 20 cities participated in debate tournaments last year, according to the National Association for Urban Debate Leagues.

That includes the Boston Debate League, which was founded in 2005 to “develop critical thinkers ready for college, career, and engagement with the world around them.” The league supports teachers to launch and coach debate teams and runs monthly after-school debates for middle- and high-school students, among other initiatives (see “Making the Case for Student Debate Leagues,” features). While the immediate virtues of debate are easy to spot—teenagers research real-world topics, practice public speaking, and use evidence in support of their arguments—we wanted to know whether that translates into better academic achievement and attainment. Does participation in formal debate programs improve student outcomes?

First, we look at individual debaters’ reading and math test scores over time and compare students to themselves in years when they do and do not participate in debate. When students are on a debate team, their reading scores improve by 13 percent of a standard deviation, or about two-thirds of a typical year of learning. We find the biggest gains are for students with the lowest elementary-school test scores and reflect improvements in literacy skills related to critical thinking and reading comprehension. The impacts on math scores are minimal.

We also examine how debate affects high-school graduation and postsecondary enrollment by comparing debaters to similar peers who attended schools that did not offer debate. We find positive impacts on graduation and postsecondary enrollment, mainly driven by increased enrollment in four-year colleges. Debaters are 17 percent more likely to graduate high school within five years and 29 percent more likely to enroll in a postsecondary institution.

While many reading interventions target younger students, our results reveal a high-impact strategy to boost literacy skills and post-secondary outcomes for teenagers—particularly those whose low test scores and socioeconomic status typically pose high barriers to college success. Our results provide policymakers with a rare promising strategy for reducing inequality in reading achievement, analytical thinking skills, and educational attainment during students’ high-school years.

Potential Benefits of Policy Debate

Policy debate is an interscholastic, competitive, extracurricular activity for which teams of students engage in structured argumentation about public policy issues. Participants focus on a single topic for an entire academic year, such as arms sales, criminal-justice reform, or immigration policy, and work in two-person teams to research and develop policy proposals and arguments that support them. In tournaments, teams take on affirmative or negative positions, present their proposals, and cross-examine one another in a fast-moving sequence lasting 75 to 90 minutes. Policy debate students rely on their knowledge, effective use of evidence, ability to speak persuasively, and how well they can think on their feet.

Why might we expect all of this to pay off academically? First, successful debaters construct and deliver compelling arguments that are well-supported by both reasoning and evidence. In addition, the research aspect of policy debate includes reading and interpreting advanced non-fiction texts and social science research, while competitive debating includes quickly reading, analyzing, and refuting unfamiliar texts that opponents submit as evidence. Debaters are trained to consider both the content and relative credibility and objectivity of source materials. These skills are assessed on state reading tests and support advanced coursework in high school, including writing papers and participating in class discussions.

Debate also may provide a mechanism for motivating academic engagement. Rather than passively listening to an adult deliver a lecture, debaters are at the front of the room, creatively engaging with content they have mastered. The topics are directly related to high-interest current events and invite students to pair academic work with questioning authority, by recommending what the government should and should not do. And because timed tournament play moves quickly, is designed to engage the audience, and involves competition with other schools, debate teams and leagues can energize a school population as a whole, much like interscholastic sports. These events call on an array of softer skills, such as time management, independent organization, and teamwork. Competition also exposes students to a college-going culture, as tournaments are often held on college campuses and judged by current or former college-level debaters.

Trophies are ready for distribution at the awards ceremony for the Boston Debate League’s qualifying championship tournament. Apart from the hardware, student debaters are found to gain substantial benefits in reading achievement, graduation, and college enrollment.

Assessing Impacts in Boston

Our study focuses on the Boston Debate League, which supports 40 school-based teams in public middle and high schools in Boston, Chelsea, and Somerville, MA. We look at 10 years of individual students’ league participation data, from 2007–08 to 2016–17, and match that with demographic and academic-achievement data from the Boston Public Schools. We also use data from the National Student Clearinghouse, which shows students’ high school graduation status, postsecondary enrollment status, and whether they enrolled in a two-year, four-year, public, or private institution.

Our sample includes 3,515 students who ever participated in a debate team. These students attend schools that serve disproportionate shares of low-income families and where students’ average elementary-school reading and math scores are more than one-quarter of a standard deviation lower than schools not in the league. Some 82 percent of students at debate schools qualify for free or reduced-price school lunch and 36 percent are English language learners compared to 68 percent and 26 percent of students, respectively, at non-debate schools. The group of debaters we study is 42 percent Black, 39 percent Hispanic, 9 percent white, and 8 percent Asian. The typical debater began in the ninth grade, and a large majority only participated for a single academic year. Twenty eight percent participated in middle school.

Debaters are a self-selected group—the team is a voluntary, after-school activity, and tournaments are held offsite on evenings and weekends. We examine baseline characteristics of debaters and students at debate schools who never join a team and find notable differences. Debaters have higher elementary-school reading scores, better attendance rates, and are less likely to receive special-education services than their classmates who choose not to join the team. They also are more likely to be female, Black, and economically disadvantaged.

Because of these non-trivial contrasts and the opt-in nature of the teams, it is likely that debaters and their non-debating classmates differ from one another in ways unrelated to debate. Therefore, for part of our analysis, we look to another group of students to serve as a comparison group: students attending schools that were not in the league and therefore could not choose whether to join a team. These students are more similar to debaters in terms of baseline test scores and are likely non-debaters because the program was not available to them.

Effects on Academic Performance

First, to assess the impact of debate on academics, we compare debaters to themselves over time. Our analysis looks at individual students’ test scores, attendance, and suspension records to test whether performance is different in years when students did and did not participate in debate.

Debaters earn higher scores on reading tests in the years when they participate in debate, and those benefits increase the longer students spend on the debate team (see Figure 1). Among all students who ever debated in school—who spend an average of 1.4 years on the team—reading scores increase by 13 percent of a standard deviation in the years they participate. Scores for students who spend just one year on the team increase by 10 percent of a standard deviation compared to 14 percent for students who spend four years on the team. Among the very small group of students who start in middle school and debate for five years, reading scores are 36 percent of a standard deviation higher.

In math, we do not find strong evidence that debate has a positive impact, although we see no evidence of harm. However, the math results do provide another insight: the much smaller math impacts relative to reading gives us confidence that our reading impact estimates are not simply an artifact of selection.

We also investigate which literacy skill gains drive the increase in debaters’ reading scores by looking at which test items exhibit the biggest differences in student performance. We compare performance on “language” items, which test grammar, vocabulary, and punctuation knowledge, with performance on “reading” tasks, which focus on comprehension and analysis, such as identifying the main idea of a passage or supporting evidence for a claim. The positive impacts for debaters are nearly twice as large in more sophisticated reading tasks, at 10 percent of a standard deviation, than in language, at 6 percent of a standard deviation.

Interestingly, although debaters are generally higher performing than students in the same schools who never join debate, our analysis shows that the largest gains from debate are among students who had the lowest reading scores at the start of sixth grade (see Figure 2). When they participate on a debate team, students who were in the bottom quartile in elementary-school reading experience gains of 24 percent of a standard deviation compared to 10 percent of a standard deviation for students with the best elementary-school performance.

Finally, we also assess the impacts of debate participation on student attendance and behavior, as measured by how many days students are suspended from school. Overall, students have slightly better attendance in years they participate in debate, with an increase of 1.7 percent in days present. The impact on suspensions is minimal. However, in looking at the small group of students who start in middle school and spend five years on a debate team, we find the number of days present grows by 4 percent and the number of days suspended falls by about one-fifth.

Most likely, these comparisons produce conservative estimates of the impacts of debate because every student in our sample has participated at least once. Even after a student leaves a debate team, they may carry those experiences and learning gains with them for some unknown length of time. Therefore, our comparison between participating and non-participating years may understate the true impact of debate participation on academic achievement, since our non-participant group includes students who have already benefitted from debate.

On the other hand, these estimates may camouflage other factors contributing to the impacts of debate, such as students choosing a high school in order to join the debate team. Therefore, we also analyze our data by excluding students who debate for multiple years and by excluding students who started debate in grade 9. We do not see meaningful changes to our results, indicating that our preferred estimates capture the impact of debate participation itself.

Effects on Graduation and College Enrollment

To study the impacts of policy debate on students’ postsecondary outcomes, we use a different comparison group: demographically similar students at schools that do not offer debate. We find that debate has substantial effects on both high-school graduation and college enrollment (see Figure 3). Some 80 percent of debaters graduate high school in five years compared to 68 percent of non-debaters, an increase of 17 percent. In addition, 53 percent of debaters enroll in a postsecondary institution within two years of their expected high-school graduation date compared to 41 percent of non-debaters, an increase of 29 percent. As with the impacts on academic outcomes, we find large differences when comparing debaters by their baseline reading performance at the start of middle school. Debaters with low elementary-school reading scores experience the greatest gains in post-secondary outcomes: they are 25 percent more likely to graduate high school in five years and 55 percent more likely to enroll in a postsecondary institution, based on gains of 16.4 and 20.5 percentage points, respectively.

We also find big increases in the share of students enrolling in four-year institutions after graduating high school, with the largest gains for students with the lowest elementary-school reading scores (see Figure 4). Overall, debaters are 38 percent more likely to enroll in a four-year school and 28 percent more likely to enroll in a two-year school, based on gains of 12 and 4 percentage points, respectively. Students in the lowest quartile are 16 percentage points more likely to enroll in a four-year college after graduation compared to 9 percentage points for students with the highest baselines scores.

Policy Implications for Policy Debate

Most reading interventions are focused on the early elementary years, and third grade reading proficiency is viewed as a bellwether for success in adulthood. But what about the nine years of school that follow? We find substantial positive impacts for teenage students, the majority of whom are low-income students of color, when they participate in a competitive high-school policy debate team. Debaters make outsized progress in mastering sophisticated literacy skills and are more likely to graduate high school and enroll in college—and the biggest gains are among the students the farthest behind at the end of fifth grade. It’s never too late to accelerate student progress.

The average improvement in debaters’ reading scores is comparable to two-thirds of a year of learning and about 20 percent of the gap in 8th-grade reading between students who do and do not qualify for subsidized school lunch. Prior research has uncovered few interventions that generate literacy impacts of this magnitude for secondary school students.

Further, the positive impacts on reading scores from participating in debate are twice as large for students with the lowest baseline levels of proficiency than for students with average scores, and we find a similar pattern of results for postsecondary outcomes. Debate programs therefore have the potential to reduce educational inequality by accelerating improvement most dramatically for the students who struggle most.

These programs also are inexpensive relative to other interventions. For example, the current per-pupil cost of the Boston Debate League is about $1,360 compared to about $2,800 for high-dosage tutoring, such as the well-regarded Match Education program. Prior research has found that students’ reading performance improves by 15 percent to 25 percent of a standard deviation after tutoring. Therefore, policy debate programs appear to generate up to double the impact on reading test scores per dollar compared to state-of-the-art high-dosage tutoring.

Our study is not without limitations. Only a small subset of Boston students, all of them volunteers, participate in debate, and we can’t speak to what would happen if students were required to join. We also can’t fully rule out the possibility that some or all of the estimated effects on postsecondary outcomes are driven by selection bias, particularly because the postsecondary impact estimates are quite large.

However, our finding that the gains in reading scores are concentrated on analytical thinking competencies rather than foundational language rules and conventions strengthens our confidence that our results reflect the impact of debate participation, not some other unobserved factor. This finding also suggests that policy debate develops students’ critical thinking skills, another goal for which evidence-based strategies are in short supply. Future research should probe this finding further with better measures of critical thinking, argumentation skills, and other competencies needed for academic and civic participation such as social perspective taking, media literacy, the ability to distinguish fact from opinion, and engagement with the policy process.

Beyond highlighting the value of formal debate programs, we believe these findings also have implications for classroom instruction. A handful of organizations, including the Boston Debate League, have developed and implemented professional development programs to help teachers infuse debate pedagogy into regular classrooms. Often called “debate-centered instruction,” the goal is to give more students the opportunity to benefit from debate-like learning opportunities, not just those who can choose to take part in an intensive out-of-school program. The potential for such instruction to accelerate reading development, particularly for students far behind grade level, is an important subject for future research. While our study demonstrates exciting results for extracurricular debate participants, there may be even greater dividends to incorporating some of these practices into regular classroom-based instruction, to reach all students.

Beth E. Schueler is an assistant professor at the University of Virginia. Katherine E. Larned is a doctoral candidate at the Harvard Graduate School of Education.

This article appeared in the Summer 2024 issue of Education Next. Suggested citation format:

Schueler, B.E., and Larned, K.E. (2024). Resolved: Debate Programs Boost Literacy and College Enrollment: How debaters become better students. Education Next, 24(3), 52-59.

The post Resolved: Debate Programs Boost Literacy and College Enrollment appeared first on Education Next.