Probability Problem for Large Data Set

justindigo

New member
Joined
Apr 7, 2021
Messages
1
Hello,

I am trying to solve the answer to two probability questions with multiple variables as they pertain to a large data set. I am sure after assistance with the formula/method for solving one question and a portion of data, I will be able to solve the rest. The problem and questions are as follows and it appears I am not able to post the excel spreadsheet with the data so I have converted to PDF for reference.

Problem background
The 2013-16 drought prompted us to secure additional water supplies through transfers from other agencies. This water could then be stored in Lake Oroville, a major reservoir with a capacity of 3.5 million acre-feet (MAF).
However, storing water in Oroville carries a risk: if a wet winter causes the lake level to get too high, operators must release (spill) excess water to avoid the risk of catastrophic flooding. The maximum volume of water that Oroville can hold without exceeding this level is known as the Flood Control Limit, and it varies by month (in order to accommodate wet season runoff). Any stored water that causes the lake to rise above the Flood Control Level will therefore be lost. In this exercise, you will help decide whether to approve a proposed transfer of 200 thousand acre-feet (TAF) by estimating the probability of spilling this water, given a range of likely future hydrologic conditions.

The CALSIM model estimates the volume of water stored in Oroville over the next 24 months by simulating multiple future precipitation and runoff conditions (hydrologic sequences). These hydrologic sequences come from historical data for the years 1952—2001. Each of the 50 model runs uses a different historical year as the starting point. For example, model run #1 assumes the next 24 months will look like 1952-53, run #2 assumes they will look like 1953-54, etc. Together, the 50 model runs constitute a range of possible futures from which probabilities can be determined.
The results of these model runs are shown in the excel workbook, in the ‘Oroville profile’ tab. For each month, the estimated Oroville Storage and Flood Control Limit is given in TAF. If the 200 TAF of transfer water is added on top of the modelled storage, any excess water over the flood control limits would be subject to spilling.

Questions
1. Assuming we place 200 TAF (200,000 acre ft) of transfer water in storage immediately, what is the probability of having 180 TAF or more (90%) remaining after 24 months?

2. What is the probability of having 100 TAF (50%) or more remaining in storage after 24 months?
 

Attachments

  • 1757-2020 - Storage data.pdf
    66.5 KB · Views: 2
As our name implies, we provide help, not answers.

If you assumed that the data for each month are independent, what would be one of your first steps?

How would you test that assumption?
 
Top