Reach Scale Hydrology

Home >> Models & Tools >> Statistical Tools >> Sparse CDF Matching

CDF Matching against discrete reference CDF quantiles for bias correction

Sparse CDF Matching

Sparse CDF Matching performs bias correction against a known reference climatology (in the form of CDF) in a same way as a traditional CDF Matching procedure except that only a number of discrete points on the reference CDF are known instead of a complete CDF. The two methods are identical over the known points of reference CDF and the bias correction ratios in the gaps between the known points where interpolated with a log-normal function, i.e., linear interpolation of the logarithm of correction ratio.

How it works

To deal with the model biases that may still be present after LSM calibration (e.g., biases related to errors from precipitation forcing, LSM structures/parameterizations, and underrepresented routing processes), we propose a new BC approach that corrects the VIC runoff biases referenced against nine Q_c maps also delivered by Beck et al. (2015). The problem is very similar to the CDF matching used in a traditional BC (Reichle & Koster, 2004), except here no full CDF is available except for some sparse percentile values (Q_c). Our assumption is that these Q_c maps trained from ML can potentially offer useful information on runoff signatures beyond our limited knowledge of model processes and parameters, which is in line with the increasing recognition that considers ML as a powerful approach to understand hydrology in ungauged basins (e.g., Zhang et al., 2018).

At each VIC grid cell (Figure below), the model‐simulated daily runoff (35‐year, 12,784 samples) is used to construct the empirical CDF (blue line), with runoff values at different exceedance probabilities computed as R_99,m, R_95,m, R_90,m, R_80,m, R_50,m, R_20,m, R_10,m, R_5,m, and R_1,m (blue dots). The corresponding nine runoff characteristics, denoted as R_99,o, R_95,o, R_90,o, R_80,o, R_50,o, R_20,o, R_10,o, R_5,o, R_1,o, respectively, are used as reference points for adjustment (red dots). To use the sparse reference information, the ratio correction factor C_i is calculated (Equations below) at all available reference points. Assuming the intermediate ratio correction factors (C_ij) between C_i and C_i₊₁ follow a loglinear relationship (where j and N stand for the jth point and the total number of points between i and i+1, respectively; i+1stands for the next available runoff characteristics), C_ij can be written as equation 2. For modeled runoff values (R_m) greater than R_1,_o and those less than R_99,_o, a simple extrapolation technique is applied by taking the correction factor as C₉₉ and C₁. The bias‐corrected values are eventually computed by multiplying the original runoff time series by C_ij.

C_i = R_i_,o / R_i_,_m

C_ij = (C_i) ^1-^j^/^N (C_i)^j^/^N , if R₉₉_,_o < R_i_,_m < R₁_,_o

C_ij = C₉₉ , if R_i_,_m < R₉₉_,_o

C_ij = C₁ , if R_i_,_m > R₁_,_o

Sample code

Download link at github.com

Reference

The method is described in this paper:

Lin, P., M. Pan, H. E., Beck, Y. Yang, D. Yamazaki, R. Frasson, C. H. David, M. Durand, T. M. Pavelsky, G. H. Allen, C. J. Gleason, and E. F. Wood, 2019: Global reconstruction of naturalized river flows at 2.94 million reaches. Water Resources Research, https://doi.org/10.1029/2019WR025287.

Contact Peirong Lin peironglinlin@pku.edu.cn or Ming Pan m3pan@ucsd.edu for questions.

Page updated

Google Sites

Report abuse