HOME ABSTRACT MAP MODEL CONCLUSION CREDITS
ABSTRACT MAP MODEL CONCLUSION CREDITS
THE BURNING
FATE
OF CALIFORNIA
THE BURNING FATE OF CALIFORNIA
CLICK ANYWHERE TO START

How good is your intuition?

In the past decade alone, California has experienced 9 of the 10 largest fires...
More than 11 million acres have been burned since 2010.

8 MILLION
FOOTBALL FIELDS

6X LARGER
THAN THE U.S. STATE OF MARYLAND

14 DAYS
OF NON-STOP DRIVING AT 60MPH

One wildfire, or rather tragedy, that you should be familiar with is the Palisades Fire.

It was the series of January 2025 wildfires that ravaged Los Angeles.

Let this sink in.

525K
ACRES BURNED

16.5K
STRUCTURES DESTROYED

31
HUMAN DEATHS

Entire neighborhoods were ERASED OVERNIGHT.
We personally know some friends and families who evacuated to escape this disaster because the smoke was unbreathable.

As climate change leads to longer and dryer seasons, wildfire frequency and intensity are expected to continue skyrocketing across the state.

This is unacceptable.

Understanding WHERE wildfires tend to occur is now more important than ever for safety.

Fortunately, we have access to decades of historical wildfire data from the NASA MODIS Satellite.

We plan to approach this problem by utilizing the power of
machine learning
to predict future breakout locations and SAVE LIVES.

This would help California policymaking, resource allocations, hazard prevention, evacuation efforts, and other forms of aid.

There is something important to understand first. Our INTUITION about wildfires are often wrong.

Take a look at this map. What do you see? Click the map to read OUR first thoughts below.

"Dense clutters appear along the Sierra Nevada boundary, the northern forests, and parts of Southern California, whereas other regions are more sparse. At the county level, regions like Shasta, Butte, and Los Angeles consistently appear among the most fire-prone areas, while regions along the Central Valley floor or coastal corridors see far fewer breakouts."

Was our analysis completely correct with just a first glance?

Of course not.

In order to truly grasp which areas need aid the most, we need to approach this with hard data.

Intuition alone can be misleading.

Patterns that look obvious at first glance might hide deeper trends.
To prove our point, let's test YOUR INTUITION before we see the real data.

We made a game that shows a timelapse of California wildfires to help illustrate the severity of this issue.

This isn't just a guessing challenge.

It is the reason why DATA-DRIVEN PREDICTION MODELS are needed to protect California.

SCROLL DOWN TO START THE CHALLENGE ↓
MAP

WELCOME TO THE CALIFORNIA TIMELAPSE.


INSTRUCTIONS


🗺️
01

Hover over the map
to view county names.
(Desktop Only)

🤔
02

Choose a county
you think will have the most wildfires in the past decade.

⏱️
03

Use the time slider
to view wildfire activity and counts on a specific date.

⏯️
04

Press the play button
to animate cumulative wildfire activity over time.

05

Adjust the playback speed
with the speed toggle.
(Default is set to Fast).

🤯
06

Watch the bar chart update
and discover if your prediction was correct.


1st Place

2nd Place

3rd Place

Current Rank
⚠️ SELECT A COUNTY ON THE MAP BEFORE PRESSING PLAY! ⚠️

Click the button after you have completed the game to reveal the conclusions.

MODEL

Can we do better than intuition? Of course! This is where machine learning comes in.

EXPLORE

our model below.

"Our mission is to evolve beyond mere human judgment."

CURRENT MODEL

County + Date → Fire Prediction

Outline

Here, our model is trying to predict a binary output (True/False) whether or not at least one fire will occur on a given county and date. Our model takes two inputs. The first is an integer that represents a county. The second is a date that is split into 3 distinct features (year, month, day). These four features are combined into a feature vector. We have a list of y values that are either True or False. Our y values represent whether a county on a given date actually had one or more fires that day. We used logistic regression with the Python module statsmodel, to get a decimal value. We evaluated this decimal value to be above or below a threshold we set while we were tuning hyperparameters, which is 0.018. Above means True, below means False. We then compared those predicted y values to the actual y values to evaluate our accuracy.

Interested in how our model works?

DEMO

our model below.

Pick a county and date and see if a fire will occur there on that day.

CONCLUSION

WHAT HAVE WE LEARNED?


California Wildfires are a threat that cause unimaginable death, destruction, and suffering. Unfortunately, our assumptions and intuitions about them are usually inaccurate.

If only there was a way to accurately predict wildfires, so California can have critical information to prepare effectively. Luckily, we can utilize machine learning to approach this task.

We made a baseline of where to start, and we hope this inspires you to take action. For more information, check out our github repo. With better modeling, better data, and better preparation...

We can build a safer future to protect California and save lives.