Opening the door at 3pm in August 2020 felt like opening an oven.
My family rationed AC before the heat peaked, then toughed it out through the evening.
The message from our utility was clear: cut back on usage or expect blackouts.

I thought I understood why: heat wave → too many AC units → overloaded grid.
But while researching for my EDS 242 Ethics and Bias class, I found something different.
Part of the blackout problem came from decisions made years earlier, based on climate models that overestimated how much solar power California would have on hot summer evenings.
Following the Thread
The California Independent System Operator’s root cause analysis (California Independent System Operator (CAISO), California Public Utilities Commission (CPUC), and California Energy Commission (CEC) 2021) pointed to unprecedented heat, inadequate planning for evening “net peak” demand, and market forecasting errors. But embedded in those findings was a quieter issue: resource planning hadn’t kept pace with California’s changing energy mix.
In the 2010s, California made massive investments in solar infrastructure based on downscaled CMIP5 climate model projections. The problem? The models systematically overestimated solar generation in regions influenced by California’s coastal meteorology.
The Marine Layer Problem
California’s coast has a distinctive climate feature: the marine layer. Cold Pacific waters create a stable layer of cool, moist air trapped under warmer air above. That marine layer produces fog and low stratus clouds that cut incoming solar radiation by 30–50% (Iacobellis and Cayan 2013).
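To get a feel for what a 30–50% cut in irradiance means for generation, here is a minimal back-of-the-envelope sketch in Python. The plant size, efficiency, and irradiance values are hypothetical, chosen only to show the arithmetic, not to describe any real facility.

```python
# Rough effect of marine-layer cloud cover on PV output.
# All numbers are illustrative assumptions, not measurements.

clear_sky_irradiance = 800.0   # W/m^2, assumed clear-sky value
panel_area_m2 = 500_000.0      # hypothetical utility-scale array
system_efficiency = 0.20       # hypothetical module + inverter efficiency

def pv_output_mw(irradiance_w_m2: float) -> float:
    """Very simplified PV output: irradiance x area x efficiency."""
    return irradiance_w_m2 * panel_area_m2 * system_efficiency / 1e6

clear = pv_output_mw(clear_sky_irradiance)
for reduction in (0.30, 0.50):  # the 30-50% range cited above
    shaded = pv_output_mw(clear_sky_irradiance * (1 - reduction))
    print(f"{reduction:.0%} irradiance cut: {clear:.0f} MW -> {shaded:.0f} MW")
```

The point is only the scale: a fog bank sitting over one array can erase tens of megawatts that a clear-sky projection assumes will be there.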

These clouds often push inland through gaps like the Golden Gate and shape microclimates, like in the redwood forests of northern California, that climate models struggle to resolve (Johnstone and Dawson 2010).
- What the models see: 100–200 km grid cells (roughly the size of the SF Bay Area)
- What actually matters: marine layer dynamics at <10 km (the Golden Gate, coastal valleys)
Global Climate Models (GCMs) in CMIP5 operate at coarse resolution—typically 100–200 km grid cells. Marine layer dynamics happen at much finer scales, driven by sea surface gradients, topographic channeling, diurnal cycles, and upwelling.
Even “downscaled” versions refined to 10–20 km resolution struggle to capture marine layer formation, persistence, and inland penetration.
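One way to see the resolution problem is to average a fine-scale cloud field onto a single coarse grid cell. The sketch below is a toy illustration with made-up numbers, not output from any climate model: a narrow coastal fog bank that heavily shades near-coast solar sites nearly disappears once it is smeared across a 100 km cell.

```python
import numpy as np

# Toy 1-D coast-to-inland transect at 1 km resolution (hypothetical values).
km = np.arange(100)                      # 0 km = coastline, 99 km = far inland
cloud_fraction = np.where(km < 15, 0.9,  # dense marine layer over the first ~15 km
                          0.05)          # mostly clear farther inland

# A CMIP5-scale grid cell only "sees" the 100 km average.
coarse_cloud_fraction = cloud_fraction.mean()

# Crude relative irradiance, scaled as (1 - cloud fraction).
coastal_site = 1 - cloud_fraction[5]     # a solar site 5 km from the coast
coarse_cell = 1 - coarse_cloud_fraction

print(f"Coarse-cell cloud fraction: {coarse_cloud_fraction:.2f}")
print(f"Relative irradiance at the coastal site: {coastal_site:.2f}")
print(f"Relative irradiance implied by the coarse cell: {coarse_cell:.2f}")
```

In this toy setup the coarse cell implies roughly eight times the irradiance the fog-covered site actually receives; the sub-grid structure is simply gone.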
A 2024 CEC report (Pierce et al. 2024) states:
> “Coastal Low Clouds… are a persistent, seasonal feature… Accounting for these clouds improves solar energy forecasting.”
CMIP5-era models systematically underestimated marine layer frequency, inland fog extent, and afternoon/evening cloud cover. The result: solar irradiance projections that were too optimistic, especially for afternoon and evening hours, exactly when California’s “net peak” demand occurs.

The Path From Models to Blackouts
So how does a bias in cloud cover modeling contribute to blackouts years later?
The timeline might have looked something like this:
2010–2015: Overoptimistic generation forecasts
- Solar developers use downscaled CMIP5 projections. Models underestimate afternoon cloud cover → projected generation 10-15% higher than reality.
2012–2018: Investment decisions
- Utilities and solar developers commit capacity and capital based on those overoptimistic generation projections.
2015–2020: Grid planning
- CAISO plans for solar providing an increasing share of supply, assuming generation levels matching the overoptimistic projections.
2016–2020: The “net peak” emerges
- Managing the evening “ramp” becomes critical. CAISO assumes more solar during the 5-8pm window than materializes.
August 2020: Heat wave
- Historic heat wave hits California → evening demand spikes.
- Solar performs well mid-day but drops off earlier and faster than expected, partly due to cloud patterns the models hadn’t captured.
- The evening supply gap forces rolling blackouts (a rough sketch of that gap follows below).
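Here is that sketch: a hedged illustration of how a modest overestimate of late-afternoon solar flips a thin planning margin into a shortfall. The hourly megawatt figures are invented for illustration; they are not CAISO data.

```python
# Hypothetical evening (5-8pm) supply-demand snapshot, in MW. Not real CAISO figures.
hours            = ["5pm", "6pm", "7pm", "8pm"]
demand           = [44000, 45000, 45500, 44500]   # heat-wave evening demand
planned_solar    = [6000, 4000, 2000, 500]        # what planning assumed
actual_solar     = [4800, 3000, 1200, 200]        # earlier, faster drop-off
other_generation = [39000, 41500, 44000, 44500]   # gas, imports, hydro, batteries

for h, d, plan, act, other in zip(hours, demand, planned_solar,
                                  actual_solar, other_generation):
    planned_margin = plan + other - d   # margin the plan expected
    actual_margin = act + other - d     # margin with less solar than projected
    print(f"{h}: planned margin {planned_margin:+6d} MW, "
          f"actual margin {actual_margin:+6d} MW")
```

On paper the plan clears every hour with a thin positive margin; with slightly less evening solar than projected, the margin goes negative for exactly the hours when the rolling blackouts hit.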
CAISO’s root cause analysis (California Independent System Operator (CAISO), California Public Utilities Commission (CPUC), and California Energy Commission (CEC) 2021) states:
> “Resource planning targets have not kept pace with the evolving power mix, wherein demand during peak hours outpaces the supply of solar-produced power.”
Translation:
We thought we’d have more power available at this time of day than we actually do.
Why This Matters Beyond One Heat Wave
The marine layer bias wasn’t the primary cause of California’s blackouts—CAISO’s analysis shows it was a perfect storm. But the model bias was one link in a chain of assumptions that created vulnerability. It also shows how invisible these assumptions become once they leave the modeling world.
Connections to EDS 242 Course Concepts
This case demonstrates several types of bias we studied:
Model specification bias: systematic errors in how marine layer physics are represented in the model structure itself (Konno et al. 2024; Lecture 2’s Catalogue of Bias).
Historical bias: The LOCA downscaling method assumes future relationships will match past patterns, which breaks down under climate change (Lecture 2).
Reporting bias: model limitations were documented in technical reports but didn’t make it into the summary documents that informed billion-dollar decisions. This is what Cori Lopazanski called the “problem of too much information” in her guest lecture: scientists have caveats, but those get lost in translation to policy (Lecture 2, Cori Lopazanski guest lecture).
Data invisibility: By 2020, the connection back to 2015 cloud modeling choices was completely invisible to everyone affected. As we learned in Lecture 9, these are “known unknowns” that become “unknown unknowns” as they propagate through decision systems (Lecture 1, Barrowman reading).
The bias propagates through what Olteanu et al. (2019) call “decision chains,” compounding at each step. This connects to Environmental Data Justice themes from Lecture 8—who gets hurt when data systems fail? Not the modelers or grid planners, but families who lost groceries, elderly folks without AC, people on medical equipment.
The Technical Challenge
Climate modelers know about these biases. Marine boundary layer clouds remain one of the largest sources of uncertainty in climate models (Klein et al. 2017). Understanding different types of bias is crucial for interpreting these limitations (Konno et al. 2024).
Marine layer physics operates at scales GCMs can’t resolve. Coastal fog and low stratus depend on boundary layer processes, cloud microphysics, and topographic features at sub-grid scales (Klein et al. 2017).
Downscaling tries to bridge that gap, and each approach has tradeoffs. Statistical downscaling (LOCA): uses historical relationships between coarse and fine scales but assumes the future will match past patterns—questionable under climate change (Ekström, Grose, and Whetton 2015).
Dynamical downscaling (WRF): Simulates physics at finer scale but inherits GCM biases.
California’s approach (LOCA2): Combines both methods (Pierce et al. 2024). A 2024 technical report notes improvements but still recommends changing planning assumptions to account for limitations.
Different bias correction methods can change answers significantly (Maraun 2016). The choice of method can shift your projection as much as the climate signal itself (Ekström, Grose, and Whetton 2015).
Common approaches: quantile mapping (assumes stationary bias), delta methods (assumes correct trends), deep learning (overfitting risk), and ensemble averaging (masks uncertainties).
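As one example of what that machinery looks like, here is a minimal sketch of empirical quantile mapping, the first approach in that list. The data are synthetic, and the method bakes in the stationarity assumption flagged above: it corrects the future run using the bias observed in the historical period.

```python
import numpy as np

rng = np.random.default_rng(42)

# Synthetic daily solar irradiance (W/m^2); a model that runs "too bright".
observed       = rng.normal(loc=250, scale=60, size=3650)  # station measurements
modeled_hist   = rng.normal(loc=290, scale=50, size=3650)  # model, historical period
modeled_future = rng.normal(loc=300, scale=50, size=3650)  # model, future projection

def quantile_map(future, model_hist, obs, n_quantiles=100):
    """Map future values through the model's historical quantiles onto observed quantiles."""
    q = np.linspace(0, 1, n_quantiles)
    model_q = np.quantile(model_hist, q)
    obs_q = np.quantile(obs, q)
    # Locate each future value in the model's historical distribution,
    # then replace it with the observed value at that same quantile.
    return np.interp(future, model_q, obs_q)

corrected = quantile_map(modeled_future, modeled_hist, observed)

print(f"Raw future mean:      {modeled_future.mean():.1f} W/m^2")
print(f"Bias-corrected mean:  {corrected.mean():.1f} W/m^2")
print(f"Observed hist. mean:  {observed.mean():.1f} W/m^2")
```

The correction is only as good as the assumption behind it: if marine layer behavior shifts under climate change, the historical bias no longer describes the future bias, and the mapping quietly drifts wrong.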
This pattern shows up across environmental data science—flood risk, crop yields, hydropower, fire risk. Model biases propagate through decision chains (Olteanu et al. 2019). The projections look authoritative, but limitations don’t make it into summary documents as often as needed.
What We Can Do
Climate models are our best tools for understanding future conditions, but we need to work smarter with their limitations.
1. Bake Uncertainty Into Decisions
Design systems that adapt if reality diverges. For California’s grid: maintain flexible generation capacity, plan for higher evening peaks than models predict.
2. Cross-Validate With Multiple Methods
- Historical records (past heat wave outcomes)
- Empirical relationships (actual solar vs. weather)
- Expert opinions (grid operator observations)
- Stress testing (20% lower generation scenarios; a sketch of this check follows below)
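The last item can be as simple as rerunning the supply-demand balance with solar scaled down, as sketched here. The capacity numbers are hypothetical, meant only to show the structure of the check.

```python
# Hypothetical evening net-peak balance, in MW. Illustrative only.
projected_solar = 3000
firm_generation = 43000   # gas, hydro, imports, batteries, etc.
expected_demand = 45500

for haircut in (0.0, 0.10, 0.20):  # solar at 0%, 10%, 20% below projection
    supply = firm_generation + projected_solar * (1 - haircut)
    margin = supply - expected_demand
    status = "OK" if margin >= 0 else "SHORTFALL"
    print(f"Solar {haircut:.0%} below projection: margin {margin:+.0f} MW ({status})")
```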
3. Create Feedback Loops
Compare actual vs. projected generation. Feed learning back into models and planning. California is doing this now, but earlier detection would have helped.
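A basic version of that feedback loop is just comparing forecast and metered generation on a schedule and tracking the running bias. A minimal sketch, with made-up numbers standing in for a real telemetry feed:

```python
import numpy as np

# One week of 6pm solar forecasts vs. metered generation (MW). Hypothetical values.
forecast = np.array([3200, 3100, 3300, 3000, 3150, 3250, 3050])
actual   = np.array([2700, 2900, 2600, 2750, 2800, 2650, 2850])

error = actual - forecast
bias = error.mean()         # persistent over- or under-forecasting
mae = np.abs(error).mean()  # typical size of the miss, regardless of sign

print(f"Mean bias: {bias:+.0f} MW (negative = forecasts run high)")
print(f"Mean absolute error: {mae:.0f} MW")

# A crude feedback: shade tomorrow's forecast by the observed bias.
tomorrow_forecast = 3100
print(f"Adjusted 6pm forecast: {tomorrow_forecast + bias:.0f} MW")
```

The adjustment itself matters less than the trail it creates: a persistent negative bias at 6pm, logged month after month, is exactly the kind of signal that could have flagged the evening solar assumptions years before 2020.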
4. Invest in the Unglamorous Work
Bias correction and regional model improvement don’t make headlines, but we under-invest badly. We need better observations, higher-resolution models, long-term monitoring, and transparent documentation of biases.
5. Train Decision-Makers
Engineers, planners, policy-makers need to understand what models can and can’t tell them. A projection isn’t a prediction. Uncertainty is information. Limitations should be upfront, not buried in appendices.
Why I Think About This
For my family, the 2020 blackouts were uncomfortable but manageable. For others—people dependent on medical equipment, elderly folks without AC—the stakes were higher.
California’s grid planning documents still show tight supply margins during evening peaks. The marine layer isn’t going anywhere, though research suggests California coastal fog has declined 33% since 1950 (Johnstone and Dawson 2010), with uncertain causes and future trajectory (Torregrosa, O’Brien, and Faloona 2014).
What stays with me is how easily biases slip through. Smart people, rigorous methods, peer-reviewed projections, and still, systematic errors propagate until systems fail under stress.
I see this pattern everywhere in environmental data science. Models of crop yields, water availability, species distributions, fire risk—all rest on climate projections with known limitations. Those limitations exist, they’re documented in technical reports, but they don’t always make it into decision documents.
We can build better decision processes, ones that explicitly account for uncertainty, validate projections against reality, and adapt when mismatches appear. That starts with recognizing that environmental data always carries the fingerprints of choices and assumptions baked into its creation (Walker et al. 2018).
The point isn’t to chase perfect models. It’s to build systems that don’t fall apart when the models miss something.
References
Citation
@online{miller2024,
  author = {Miller, Emily},
  title = {The {Fog} the {Models} {Missed}},
  date = {2024-12-06},
  url = {https://rellimylime.github.io/posts/eds242-final/},
  langid = {en}
}