 Research
 Open Access
 Published:
An SGBMXVA demonstrator: a scalable Python tool for pricing XVA
Journal of Mathematics in Industry volume 10, Article number: 7 (2020)
Abstract
In this work, we developed a Python demonstrator for pricing total valuation adjustment (XVA) based on the stochastic grid bundling method (SGBM). XVA is an advanced risk management concept which became relevant after the recent financial crisis. This work is a followup work on Chau and Oosterlee in (Int J Comput Math 96(11):2272–2301, 2019), in which we extended SGBM to numerically solving backward stochastic differential equations (BSDEs). The motivation for this work is basically twofold. On the application side, by focusing on a particular financial application of BSDEs, we can show the potential of using SGBM on a realworld risk management problem. On the implementation side, we explore the potential of developing a simple yet highly efficient code with SGBM by incorporating CUDA Python into our program.
Introduction
Backward stochastic differential equations (BSDEs) have been a popular research subject ever since the work of Pardoux and Peng, [2] and [3]. There are many papers regarding for their applications in mathematical finance and stochastic control. In terms of numerical analysis, there is also significant research on the efficient calculation or approximation for BSDEs. This includes Monte Carlobased research, like [4], chaos decomposition method [5], cubature methods [6] or Fourier and waveletbased methods, like in [7] and [8]. However, there are relatively few studies on the practical application of BSDEs. As far as we know, there is not yet an industrial software package for solving general BSDEs. This work is our first step to address this issue.
The goal of this study is basically twofold. This work can be seen as a followup of our theoretical research on the stochastic grid bundling method (SGBM) for BSDEs, see [1]. SGBM is a Monte Carlobased algorithm which was introduced in [9], and extended its application to BSDEs in [1]. Here, we will study the practical side by developing a demonstrator in Python, where we shall also make use of computing on a Graphics Processing Unit (GPU) in order to improve the scalability, and make use of the CUDA Python package. CUDA Python is a recently open to public programming tool, which was developed by Anaconda. It has become freely available at the end of 2017, being previously commercial software. This programming tool carries the promise of combining fast development time for Python coding with the high efficiency of GPU computing.
The second focus of this work is the application of BSDEs in financial risk management, where we would like to demonstrate the practical opportunities for efficient BSDE solving software. We choose the modeling of the variation margin and the closeout value within risk management, in the form of BSDEs, as the main test problem. In this work, we work under a complete market assumption which includes counterparty risk and margin requirements, and develop a numerical solver for XVA.
A Python demonstrator for solving XVA pricing problems with the SGBM algorithm with GPU computing is the main result of this study. The strength of this package is its scalability with respect to the dimensionality of the underlying stock price processes. While the demonstrator is designed for a specific problem setting, because of the general framework with BSDEs and SGBM, the package could easily be transformed into a general solver for BSDEs and used for other financial problems.
This article is organized as follows. In Sect. 2, we introduce the basic mathematical setting of BSDEs, the fundamental properties of the SGBM algorithm and the application of parallel computing with SGBM. Section 3 describes the programming language, the financial setting for our SGBMXVA demonstrator, and other technical details, while some numerical tests are performed in Sect. 4. Concluding remarks, possible extensions and outlook are given in Sect. 5.
Methodology
We begin this section with a brief review of the mathematical background of BSDEs and of the Monte Carlobased simulation method SGBM. For further details, the readers are referred to our previous work [1]. Furthermore, we will describe a parallel computing version of SGBM, which is a followup on [10], in which parallel SGBM was applied to the pricing of multidimensional Bermudan options.
Backward stochastic differential equations
We use a standard filtered complete probability space \((\varOmega , \mathcal{F},\mathbb{F},\mathbb{P})\), where \(\mathbb{F}:=(\mathcal{F} _{t})_{0\leq t\leq T}\) is a filtration satisfying the usual conditions for a fixed terminal time \(T > 0\). The process \(W:=(W_{t})_{0\leq t \leq T}\) is a ddimensional standard Brownian motion, adapted to the filtration \(\mathbb{F}\) and the socalled decoupled forwardbackward stochastic differential equation (FBSDE) defines a system of equations of the following form:
where \(0\leq t\leq T\). The functions \(\mu : [0,T] \times \mathbb{R}^{d}\rightarrow \mathbb{R}^{d}\) and \(\sigma : [0,T] \times \mathbb{R}^{d} \rightarrow \mathbb{R}^{d \times d}\) are referred to as the drift and the diffusion coefficients of the forward stochastic process S, respectively, and \(s_{0}\in \mathcal{F}_{0}\) is the initial condition for S. The function \(g: [0,T] \times \mathbb{R}^{d} \times \mathbb{R} \times \mathbb{R}^{1\times d}\) is called the driver function of the backward process and the terminal condition \(V_{T}\) is given by \(\mathfrak{N}(S_{T})\), for a function \(\mathfrak{N}: \mathbb{R}^{d}\rightarrow \mathbb{R}\). All stochastic integrals with Wiener process W are of the Itô type.
For both \(\mu (t,s)\) and \(\sigma (t,s)\), we assume standard conditions such that a unique strong solution for the forward stochastic differential equation exists,
This process also satisfies the Markov property, where \(\mathbb{E}[S _{\tau }\mathcal{F}_{t}] = \mathbb{E}[S_{\tau }S_{t}]\), for \(\tau \geq t\), where \(\mathbb{E}[\cdot ]\) denotes the expectation operator with respect to probability measure \(\mathbb{P}\).
A pair of adapted processes \((V,Z)\) is said to be the solution of the FBSDE, if V is a continuous realvalued adapted process, Z is a realvalued predictable row vector process, such that \(\int ^{T}_{0}\Z _{t}\^{2}\,dt < \infty \) almost surely in \(\mathbb{P}\), where \(\\cdot \\) denotes the Euclidean norm, and the pair satisfies Equation (1).
Our goal is to find \((V_{0},Z_{0})\) by solving the problem backward in time. We do this by first discretizing Equation (1) along the timewise direction, \(\pi : 0=t_{0}< t_{1}< t_{2}<\cdots <t_{N} = T\). We assume a fixed, uniform timestep, \(\Delta t = t_{k+1}t_{k}, \forall k\), and let \(\Delta W_{k+1, q} := W_{t_{k+1}, q}W_{t_{k}, q} \sim \mathcal{N}(0,\Delta t)\), a normally distributed process, for \(q = 1, \ldots , d\). The vector \(\Delta W_{k+1}\) is defined as \((\Delta W_{t_{k+1}, 1}, \ldots , \Delta W_{t_{k+1}, d})^{\top }\). The discretized forward process \(S^{\pi }\) is defined by
It is defined by the classical Euler–Maruyama discretization.
We define a discretetime approximation \((Y^{\pi },Z^{\pi })\) for \((Y,Z)\):
The notation ∇ denotes the gradient of a function. Note that various combinations of \(\theta _{1}\) and \(\theta _{2}\) give different approximation schemes. We have an explicit scheme for \(V^{\pi }\) if \(\theta _{1}=0\), and an implicit scheme otherwise.
Stochastic grid bundling method (SGBM)
We now introduce the Monte Carlobased algorithm SGBM for BSDEs. The main difference between SGBM and other commonly known Monte Carlo algorithm, for example the one in [11], include a socalled regresslater regression stage and a socalled equalsized bundling localization.
Due to the Markovian setting of \((S^{\pi }_{t_{k}},\mathcal{F}_{t_{k}})_{t _{k}\in \pi }\) and the terminal processes \(V^{\pi }_{t_{N}}\) and \(Z^{\pi }_{t_{N}}\) being deterministic with respect to \(S^{\pi }_{t _{N}}\), there exist functions \(v^{(\theta _{1}, \theta _{2})}_{k}(s)\) and \(z^{(\theta _{1}, \theta _{2})}_{k}(s)\) such that
for the given approximation in Equations (2a)–(2c). Our algorithm estimates these functions, \((y^{(\theta _{1}, \theta _{2})} _{k}(s), z^{(\theta _{1}, \theta _{2})}_{k}(s))\), recursively, backward in time, by a local leastsquares regression technique onto a function space with basis functions \((p_{l})_{0\leq l \leq Q}\).
As a Monte Carlobased algorithm, our algorithm starts with the simulation of M independent samples of \((S^{\pi }_{t_{k}})_{0 \leq k \leq N}\), denoted by \((S^{\pi ,m}_{t_{k}})_{1 \leq m \leq M, 0 \leq k \leq N}\). Note that in this basic algorithm, the simulation is performed only once. Therefore, this scheme is a nonnested Monte Carlo scheme.
The next step is the backward recursion. At initialization, we assign the terminal, \(t_{N}=T\), values to each path for our approximations, i.e.,
The superscript R is used to distinguish the simple discretization and the discretization with SGBM, while I denotes the number of Picard iterations. The following steps are performed recursively, backward in time, at \(t_{k}\), \(k=N1,\ldots , 0\). First of all, we bundle all paths into \(\mathcal{B}_{t_{k}}(1),\ldots ,\mathcal{B}_{t_{k}}(B)\) equalsized, nonoverlapping partitions based on a sorting result of a bundling function defined on \((S^{\pi , m}_{t_{k}})\). This is the partition step.
Next, we perform the local regresslater approximation separately within each bundle. The regresslater technique we are using combines the leastsquares regression with (analytically determined) expectations of the basis functions to calculate the necessary expectations.
Generally speaking, for any target function f and M Monte Carlo paths, a standard regressnow algorithm for a dynamic programming problem approximates a function ι within the space spanned by the regression basis such that it minimizes the value \(\frac{1}{M}\sum^{M}_{i=1}(f(S^{\pi ,i}_{t+\Delta t})  \iota (S^{\pi , i}_{t}))^{2}\) and approximates the expectation \(\mathbb{E}_{t}[f(S^{\pi }_{t+\Delta t})]\) by \(\mathbb{E}_{t}[\iota (S^{\pi }_{t})] = \iota (S^{\pi }_{t})\). As a projection from a function of \(S^{\pi }_{t+\Delta t}\) to a function of \(S^{\pi }_{t}\) is performed, it will introduce a statistical bias to the approximation.
Instead, the regresslater technique, as we employ in SGBM, finds a function κ which minimizes the functional \(\frac{1}{M}\sum^{M}_{i=1}(f(S^{\pi , i}_{t+\Delta t})  \kappa (S^{\pi , i}_{t+ \Delta t}))^{2}\) and approximates the expectation \(\mathbb{E}_{t}[f(S ^{\pi }_{t+\Delta t})]\) by \(\mathbb{E}_{t}[\kappa (S^{\pi }_{t+\Delta })]\). By using functions on the same variable, at \(t+\Delta t\), in the regression basis, we can avoid the statistical bias in the regression. However, the expectation of all basis functions must preferably be known in closedform, in order to apply the regresslater technique efficiently.
In the context of our BSDE SGBM algorithm, we define the bundlewise regression parameters \(\alpha _{k+1}(b)\), \(\beta _{k+1}(b)\), \(\gamma _{k+1}(b)\) as
Note that as we actually apply the equal partition bundling, \(\sum^{M}_{m}=1{\mathbf{_{\mathcal{B}_{t_{k}}(b)}}} =N\) for some constant N and for all bundle. Therefore we won’t face the problem of dividing by zero.
The approximate functions within the bundle at time k are defined by the above parameters and the expectations \(\mathbb{E}^{x}_{t_{k}}[p(S ^{\pi }_{t_{k+1}})]\) and \(\mathbb{E}^{x}_{t_{k}} [p(S^{\pi }_{t _{k+1}})\frac{\Delta W_{k, q}}{\Delta t} ]\) read,
As stated before, a Picard iteration is performed at each time step within each bundle if the choice of \((\theta _{1}, \theta _{2})\) results in an implicit scheme. For further details on the application of the Picard iteration, readers may refer to [12] or [7] and the references therein.
Finally, the full approximations for each time step are defined as:
Parallel SGBM
One way to improve the efficiency of the SGBM algorithm is by means of GPU acceleration, which has been successfully implemented for the SGBM algorithm for earlyexercise options in [10]. The framework of parallel computation from [10] can also be used for the SGBM solver of BSDEs. In this framework, we divide the SGBM algorithm in two stages, namely, the forward simulation stage and the backward recursion approximation stage. In this subsection, we briefly describe how parallel GPU computing is used in each stage.
In the forward simulation stage, we simulate independent samples of the stock price models from the initial time \(t_{0}\) to the expiration date T as defined by the problem. Moreover, we can already precompute all values of interest for the backward step that are related to the stock prices and store them in the memory. These values of interest include the sorting parameter, the basis function values, the expectations of the basis functions and the terminal values. Since the generation of each sample is independent of the other samples and a huge amount of samples may be needed for an accurate Monte Carlo simulation, this stage is particularly suitable for parallel computing. Also note that after the calculation of the values of interest, the actual stock paths can be discarded in the GPU, as they are not required anymore in the backward step.
The second stage is the backward approximation stage. From the discretization of the BSDE, we notice that the calculation in timewise direction should process sequentially, i.e., starting from the terminal time and going backward along the time direction. However, within each time step, there are many independent processes that are well suitable for parallel computing. Within each time step, the data (i.e. the values of interest) is separated into different nonoverlapping bundles and the computations in the different bundles are independent of each other. Within a bundle, multiple regression steps on different variables (V, Z, g, …) need to be completed. For a graphical representation of the computation within each time step, we refer to Fig. 1. As each regression within the time step is independent, we can also perform these computation simultaneously. Finally, in order to reduce the overall volume of memory transfers, only the information for the current time point is transferred to the GPU.
The SGBMXVA demonstrator
In order to test and analyze the applicability and practicality of the SGBM algorithm in the BSDEbased financial model framework, we have created an SGBMXVA demonstrator which computes XVA, i.e. a value of great interest in modern risk management, with the algorithm introduced in the previous section. We make use of the Python programming language and the CUDA Python package for the development of this SGBMXVA demonstrator. In this section, we address the programming tools that we have used, the basic financial setting for this problem and the design of our code.
Programming language
Python is the programming language of choice for this project as it is a popular tool in the financial industry nowadays. Being a highlevel programming language, Python is easy to develop and particularly useful for scripting because of its easy to write syntax and grammar. These properties are especially useful for the financial industry, as practitioners need to constantly monitor and adapt their models to the changing markets. Moreover, Python has been one of the dominating programming languages in data science with its widely available packages. Therefore, we develop our algorithm under the Python framework. However, Python is an interpreted programming language, so, its performance is not as strong as a compiled language. One of our focusses of study is therefore the balance between rapid development and actual computation performance which can be achieved by Python.
One of the effective techniques for improving the computational efficiency of Python is to precompile parts of the code, for example, with the help of the Numba package. This technique has been adopted in our SGBMXVA demonstrator. In order to further improve the efficiency of our algorithm, we apply parallel GPU computing as stated in the last section. The use of GPUs has been a major development in scientific computing. Along with the computing platform CUDA, the GPU provides a high potential for computational speedup. With more than hundred threads (the basic operational units within CUDA) in a typical GPU, repetitive function computations can be dedicated to various treads to be run in parallel. It will greatly reduce the computational time.
In this work, we use the CUDA Python packages to incorporate GPU programming into Python. CUDA Python is made out of Python packages from Continuum Analytics, that allow a user to make use of CUDA within the native Python syntax. The tool consists of two main parts: a Numba compiler and the pyculib library. The Numba compiler transforms a native Python code with only supported features into a CUDA kernel or a device function with only a function decorator. This feature enables GPU computing in Python without the need for the programmer to learn a new language. The pyculib library [13] is a collection of several numerical libraries, that provide functions for random number generation, sorting, linear algebra operation and more. This set of tools has been previously included in commercial software, but it has become open source since 2017.
Another benefit of using CUDA Python is in terms of an automatic memory transfer. In a GPU computing framework, it is often necessary to transfer data between the CPU and GPU memory space. In this tool, a programmer can either manage the transfer of memory for better control of GPU memory usage and the bandwidth of memory transfer between CPU and GPU, or let the platform handle it automatically, again simplifying the code development.
The main downside of this tool is that so far it only supports certain functions from the native Python and Numpy packages. Some of the welloptimized packages in linear algebra are not available for the GPU, and some of the Python code has to be rewritten into a supported version. However, using the package still requires less adaptation effort as compared to the incorporation of other compiled programming languages, like C, into a Python code. As we will discuss later, this tool delivers a great improvement in efficiency in some areas.
Financial test case: total valuation adjustment (XVA)
Next, we shall introduce the test problem for our SGBMXVA demonstrator. The main goal here is to show that BSDEs and SGBM can be used to solve financial total valuation adjustment XVA problems in risk management.
In short, we consider a financial market where the bank is selling derivatives to a counterparty, but either the bank or the counterparty may default before expiry. Therefore, a variation margin has to be posted as collateral which will be used to settle the account when one party defaults. In this market, the funding rate for each financial product may be different, as well as the deposit rate for a riskfree account and the funding rate through bonds. The goal of this model is to compare the price of a financial portfolio with and without counterparty risk. The difference between these two prices is called the total value adjustment.
We use a simplified version of the dynamics for our financial market, as in [14]. Within this setting, we take into account the possibility that both parties, in an overthecounter financial contract, may default, also we include the exchange of collateral, a closeout exchange in the case of a default and the usage of a repurchasing agreement (repo) for position funding. While this model is definitely more realistic than the classical financial derivative pricing theory (where one can borrow and lend freely with negligible cost) by taking into account the counterparty risk, this model still leaves out some parts of the financial deals, like regulatory capital or haircuts that apply to collateral. As already mentioned, we use the standard multidimensional Black–Scholes model for the asset dynamics. We select this model as a balance between a relatively realistic model and the tractability for the equations involved. The specific SGBM algorithm can be easily generalized to other models, as described in the outlook section.
Next, we introduce the mathematical model for our asset prices and the default events, as well as the notations that we use. Subsequently, we present the fundamental equations that we are solving in our demonstrator. Detailed financial interpretation as well as the model derivation will be left out and readers are referred to [14] for the description of this XVA model.
The XVA model
In this model, we take on a bank’s perspective onto risk management. This perspective focuses on a noncentrally cleared financial derivative on underlying assets S, which are traded between the bank and its client and both parties may default. However, the default will not affect the underlying assets S.
Before we proceed to the BSDE description, which we are going to compute in the SGBMXVA demonstrator, we introduce the meaning and financial background of the different terms involved. \(S_{i}\) denotes the underlying ith asset, which follows the standard Black–Scholes model under our assumptions. Its dynamics are defined by the SDE:
where \(B_{t}\) is a correlated ddimensional Wiener process, where
The constant vector \(\bar{\mu } = (\bar{\mu }_{1}, \ldots , \bar{ \mu }_{d})^{\top }\) and \((\bar{\sigma }_{1}, \ldots , \bar{\sigma } _{d})^{\top }\) represent respectively the drift rate and the standard deviation of the financial assets. The parameters \(\rho _{ij}\) form a symmetric nonnegative correlation matrix ρ,
which is invertible under our assumptions. We can relate the correlated Brownian motion B to a standard, independent ddimensional Brownian motion W by performing a Cholesky decomposition on ρ. They satisfy the equality
where \(\mathfrak{C}\) is a lower triangular matrix with real and positive diagonal entries, and \(\mathfrak{C} \mathfrak{C}^{\top }= \rho \).
The processes \(J^{\mathcal{B}}\) and \(J^{\mathcal{C}}\) are used to model the events of default for each party in the transaction. Mathematically, they are defined as counting processes,
and
where \(\tau ^{\mathcal{B}}\) and \(\tau ^{\mathcal{C}}\) are stopping times, denoting the random default times of the bank and the counterparty, respectively. The processes are assumed to have stochastic, timedependent intensities, \(\lambda ^{\mathcal{B}}\), \(\lambda ^{\mathcal{C}}\), i.e.
and
Next, we discuss the financial derivatives traded within this model, where we use \(\mathfrak{N}\) to denote the terminal payoff for the portfolio. To mitigate counterparty risk, the variation margins, X, need to be computed for the two parties. As in [14], the values X are based on the market value of the financial product, and they are computed and readjusted frequently. When \(X>0\), the counterparty is posting collateral with the bank.
When one party in the financial contract defaults, the contract position needs to be closed. We denote the portfolio value at default (at time \(\tau = \tau ^{\mathcal{B}} \wedge \tau ^{\mathcal{C}}\)) by \(\theta _{\tau }\), and it is given by
where \(R^{\mathcal{C}}\), \(R^{\mathcal{B}} \in [0,1]\) are the recovery rates in the case the counterparty and the bank default, respectively. The variable M denotes the closeout value of the portfolio when any party defaults. We will give more details regarding M in a later section.
Next, we introduce the notation for the financial quantities in our model. The adapted stochastic vector processes \(q^{S}\) and \(\gamma ^{S}\) are respectively the repo rate and the dividend yield of the underlying assets. The process r is the stochastic riskless interest rate. The processes \(r^{\mathcal{B}}\) and \(r^{\mathcal{C}}\) are the yields of the risky zero coupon bonds of the bank and the counterparty, respectively. The process \(q^{\mathcal{C}}\) is the repo rate for the bonds of the counterparty. The interest rate for the variation margin is given by \(r^{X}\), and, finally, \(r^{F}\) is the cost of external funding. In order to simplify the expression of the demonstrator, we assume all the above processes to be constant.
Fundamental BSDE and reduced BSDE
In [14], a fundamental BSDE is derived through a hedging argument based on a replicating portfolio for the financial derivative. For the hedging portfolio V̂, including the counterparty credit risk, the dynamics of posting collateral and holding counterparty bonds for hedging, are given by,
and the driver function is defined as
The notation \(\operatorname{diag}(\mathfrak{m})\) denotes a diagonal matrix with the terms of vector \(\mathfrak{m}\) on the main diagonal. We compute the price of the derivative by approximating the values of the stochastic processes \((\hat{V}, \hat{Z}, U^{\mathcal{B}}, U^{\mathcal{C}})\) that solve the above BSDE. The price of the contract at time 0 is given by \(\hat{V}_{0}\).
The main difficulty of dealing with this BSDE is that the terminal time is random, as it depends on the time of default. Therefore, this BSDE has to be transformed into a standard BSDE, to reduce the computational complexity, following the techniques used in [15]. The authors in [15] showed that we may recover the solution of (4) by solving the BSDE,
and using
Proposition 1
(Theorem A.1 in [14])
If the pair of adapted stochastic processes\((\hat{\mathcal{V}}, \hat{\mathcal{Z}})\)solves Equation (5), then the solution\((\hat{V}, \hat{Z}, U^{\mathcal{B}}, U^{\mathcal{C}})\)of Equation (4) is given by (6).
Idea of the proof
This technique considers the three possibilities of termination, i.e. no default, the bank’s default and the counterparty default, and shows that the results in (6) solve (4) under all three situations. □
When we consider the total value adjustment to the financial option values, we compute an alternative price for the same financial contracts, however under the assumption of riskfree dynamics (meaning no counterparty default). The value adjustment is defined as the difference between the two portfolios values (risky minus riskfree option values). The defaultfree portfolio dynamics, in a repo funding setting, can be expressed as
Discrete system
Here, we explain the setting for the variation margins X and closeout value M. The actual discrete system used in our demonstrator will be introduced as well.
In [16], the variation margin has been mandated to be “the full colleteralised marktomarket exposure of noncentrally cleared derivatives”. Therefore, at any time t, either the marktomarket riskfree portfolio value V or the counterparty risk adjusted value V̂ appear reasonable choices for the variation margin X. There are also two possible conventions for the portfolio closeout value, namely \(M = V\) and \(M = \hat{V}\). In this demonstration, the variation margin and the closeout value are assumed to be the marktomarket price V. In this case, the valuation problem results in a linear BSDE that contains an exogenous process V. This means that both the variation margin and the closeout value are marked to the market, and are thus not dictated by the bank’s internal model. Therefore, there is an additional stochastic process V in the driver for V̂. If the expression of V is not readily available, the numerical simulation of the V̂ system poses extra challenges, because we have to simulate V and V̂ simultaneously.
The driver for the reduced BSDE with counterparty credit risk is now given by
while the counterparty riskfree price V can be expressed as
where
and
Indeed, the elementary expression of V may be available in closedform for some basic financial derivatives under the simple 1D Black–Scholes model. However, in order for our demonstrator to cover a wide range of products, we solve V by SGBM instead.
Recall the discretization scheme in Equations (2a)–(2c), we can approximate the reduced BSDE with the following numerical scheme:
where \(0 \leq \theta _{1} \leq 1\) and \(0 < \theta _{2} \leq 1\).
Furthermore, if we include the simulation of V and include explicitly the driver functions, we have to solve the following system of discretized BSDEs:
This system is the one that we have implemented in our demonstrator. Note that the notation \((\frac{\bar{\mu } + \gamma ^{S}q^{S}}{\bar{ \sigma }} )\) is a shorthand for the vector \((\frac{\bar{ \mu }_{i} +\gamma ^{S}_{i} + q^{S}_{i}}{\bar{\sigma }_{i}} )_{1 \leq i\leq d}\).
To closeout this section, we would like to mention that there is an analytic expression for the reference price under the current setting, which is given by
There is however no elementary expression or simple way to evaluate this quantity.
Function descriptions
There are two parallel versions of our algorithm, both are written in Python. One of them is based on generic Python code and the Numpy, Scipy and Numba packages for performance and efficiency, which we would refer to as Version 1 from now on for simplicity. The second one makes use of the CUDA toolkit, the CUDA Python portion of the Numba package and the pyculib library and shall be referred to as Version 2. A numerical_test script has been provided for running basic comparison tests between the two versions under a predefined test setting.
Next, we list the main functions within our implementation. Since the two versions have similar architecture, we would only present Version 2. We have:
the main function for the whole algorithm: cuda_sgbm_xva;
the forward simulation function for the basis values and partition ordering: cuda_jit_montecarlo
the main function for the backward approximation stage: cuda_sgbminnerblock
the parallel regression function: cuda_regression
an example class to store all the information related to the test case, including Equation (7): ArithmeticBasketPut
All of the above functions form the complete demonstrator but each individual component can be used or replaced separately, which gives us flexibility for future development.
Version 1 only depends on Python and the Numba library. In addition to that, Version 2 also requires the CUDA driver and the pyculib library.
Numerical experiments
In this section, we present the tests we have implemented in our demonstrator. In these tests we assume that the bank sells a portfolio consisting of an arithmetic put option, whose payoff is given by
where K is the strike price. The detailed setting for the numerical test is presented in Table 1. This set of parameters is based on the one used in [10] and it has been adopted here as an easy to scale up problem. The main purpose of the tests is to investigate whether the code can be easily extended to solving highdimensional problems, so we choose a set of parameters whose properties do not change when we change the dimensionality of the problem. For a more practical problem, a test based on the German stock index model from [17] has been conducted in [1].
Note that here we have adopted a test case with the borrowing rate and lending rate being equal for the purpose of comparing our defaultfree prices to previously known result. Additional tests have been performed to assure that the convergence behavior will not be affected by different sets of parameters.
The numerical test has been conducted on a common desktop computer with an Intel(R) Core(TM) i57400 processor and a GeForce GTX 1080 graphic card.
Performance of GPU computing
The first set of tests is performed to check whether there is any benefit of GPU computing for our demonstrator. We perform the same test 10 times separately with the two versions and take the averaged computing time over these 10 runs to compare the efficiency between them. Moreover, in accordance with the work in [1], we use 5 sets of parameters for testing the consistency of this SGBM algorithm. The parameters are presented in Table 2. We have conducted tests with the stock price dimensionality being 1, 5 and 10. The results are shown in the Tables 3, 4 and 5.
As we expected from the theoretical work, the approximate values converge with respect to the progressively increasing grid sizes and the standard deviations decrease as we progress through the parameters sets. More importantly, there is a clear speedup of Version 2 over Version 1. The biggest improvement comes from the forward simulation stage where we run each independent sample in parallel, and greatly reduce the computational time. There still seems to be room for improvement regarding the backward simulation stage as even if we perform the regression in parallel on the GPU in Version 2 instead of sequentially in Version 1, the speedup is not obvious. The main reason may be that some part of the code is not performed on the GPU to keep some flexibility of our demonstrator. Some optimization is only available for the basic Python code but not for CUDA Python.
Nevertheless, our tests show that CUDA Python improves the efficiency and may provide a good balance between the speed of execution and the speed of development. In particular, as the workload increases due to using more time steps, samples and bundles, the speedup get higher due to GPU computing.
Finally, whereas the buildin floating point format of the GPU is 32 bits, 64 bits format is needed for ensuring high accuracy with our result.
Scalability
Next, we would like to check if our algorithm can be used for highdimensional test cases. The setting is the same as in the last section. However, Version 1 is too timeconsuming for a common desktop, so the focus is on Version 2. We perform tests up to 40 dimensions, as shown in Table 6.
It can be seen that with the help of the GPU, our method can be generalized to higher dimension. The application of GPU computing definitely expands the applicability of our method. However, we also notice that the time ratio increase is higher then the dimensional ratio. This is probably related to the architecture and the buildin hardware of the GPU. It maybe worthwhile to tune the GPU setting within our demonstrator corresponding to the GPU at hand, but this is hardwaredependent.
To sum up, the application of GPU computing not only speeds up our algorithm, but it also allows us to deal with more demanding problems with about the same hardware.
Conclusion and outlook
Conclusion
Although we just developed a demonstrator, our study gives us interesting insight in the performance of the SGBM algorithm for BSDEs. We have shown that we can apply the SGBM algorithm to solve highdimensional BSDEs with the help of GPU computing, which is an important feature for using BSDEs in practice. With a suitable BSDE model, we can deal with complicated financial risk management problems with our solver and since our code is based on a general framework for BSDEs, our demonstrator can be adopted to different pricing and valuation situations. We also demonstrated that we can incorporate GPU computing into a native Python code. We believe that this work serves as a basis for developing BSDEbased software for financial applications.
The authors encourage readers to download and test our demonstrator, which solves the problem which was stated in this work. The aim is to developed this tool further in terms of financial applications and computational efficiency.
Next, we will mention some possible generalizations for our algorithm as a guideline to future use. We will divide the outlook into two parts, a financial and a computational part.
Financial outlook
In this work, we used the classical Black–Scholes model for the stock dynamics. It is of interest to generalize the stock model by other diffusion SDEs, of the form:
This would already result in a different implementation regarding the SGBM regresslater component.
We may also alter the model dynamics to better fit the market, for example, with the inclusion of initial margins and capital requirement. Recall that in Sect. 3.2.3 we have made some assumptions about the variation margin X and the closeout value M. Simply by adjusting these assumptions, a completely different BSDE driver may result. There are four possible combinations for the variation margin and the closeout value. Each choice gives rise to a different BSDE driver, which can be seen in Table 7.
When X and M are the adjusted values V̂, it results in a linear BSDE for V̂. This means that the variation margin and the closeout value are marked to the model. As all the values are marked in V̂ and both values match, the equation reduces to an option pricing equation, with different interest rates. When both of them are the riskfree interest rates, a linear BSDE results that contains an exogenous process V. This means that both the variation margin and the closeout value are marked to the market and are not dictated by the internal model. Then there is an additional stochastic process V in the driver for V̂, which is the case we have used in this work.
Finally, when there is a mismatch between M and X, the resulting BSDE is no longer linear. Take case 4 as an example, it implies that the variation margin is collected according to an internal model while the closeup value is given by the market. The mismatch between the variation margin and the closeup value results in nonlinear terms in the driver. These nonlinear terms come from the cash flow when one party defaults.
We should notice that since we have a general BSDE algorithm in the form of SGBM, we may adapt our code to these new models by simply changing the model setting in the example class without changing the actual function. This is one of the advantages of using BSDEs in pricing and risk management.
Computational outlook
As we have mentioned before, one possible way of improvement is to sacrifice some flexibility and move the remaining solver parts of the code also to the GPU. Alternatively, one may further optimize the code within the GPU as the GPU set of routines seems to be still developing.
Furthermore, other features may be included in the software, for example, different payoff functions or a different SGBM regression basis. Finally, to have a fully independent software, a user interface and modular code are recommended.
Abbreviations
 XVA:

Total Valuation Adjustment
 SGBM:

Stochastic Grid Bundling Method
 SDE:

Stochastic Differential Equation
 BSDE:

Backward Stochastic Differential Equation
 FBSDE:

ForwardBackward Stochastic Differential Equation
 GPU:

Graphics Processing Unit
 repo:

Repurchasing Agreement
References
 1.
Chau KW, Oosterlee CW. Stochastic grid bundling method for backward stochastic differential equations. Int J Comput Math. 2019;96(11):2272–301.
 2.
Pardoux E, Peng SG. Adapted solution of a backward stochastic differential equation. Syst Control Lett. 1990;14(1):55–61.
 3.
Pardoux E, Peng SG. Backward stochastic differential equations and quasilinear parabolic partial differential equations. In: Rozovskii BL, Sowers RB, editors. Stochastic partial differential equations and their applications: proceedings of IFIP WG 7/1 international conference University of North Carolina at Charlotte, NC. June 6–8, 1991. Berlin: Springer; 1992. p. 200–17.
 4.
Lemor JP, Gobet E, Warin X. Rate of convergence of an empirical regression method for solving generalized backward stochastic differential equations. Bernoulli. 2006;12(5):889–916.
 5.
Briand P, Labart C. Simulation of BSDEs by Wiener chaos expansion. Ann Appl Probab. 2014;24(3):1129–71.
 6.
Crisan D, Manolarakis K. Solving backward stochastic differential equations using the cubature method: application to nonlinear pricing. SIAM J Financ Math. 2012;3(1):534–71.
 7.
Ruijter MJ, Oosterlee CW. A Fourier cosine method for an efficient computation of solutions to BSDEs. SIAM J Sci Comput. 2015;37(2):A859–A889.
 8.
Chau KW, Oosterlee CW. On the waveletbased SWIFT method for backward stochastic differential equations. IMA J Numer Anal. 2018;38(2):1051–83.
 9.
Jain S, Oosterlee CW. The stochastic grid bundling method: efficient pricing of Bermudan options and their greeks. Appl Math Comput. 2015;269:412–31.
 10.
Leitao A, Oosterlee CW. GPU acceleration of the stochastic grid bundling method for earlyexercise options. Int J Comput Math. 2015;92(12):2433–54.
 11.
Bender C, Steiner J. Leastsquares Monte Carlo for backward SDEs. In: Carmona AR, Del Moral P, Hu P, Oudjane N, editors. Numerical methods in finance; Bordeaux; June 2010. Berlin: Springer; 2012. p. 257–89.
 12.
Gobet E, Lemor J, Warin X. A regressionbased Monte Carlo method to solve backward stochastic differential equations. Ann Appl Probab. 2005;15(3):2172–202.
 13.
Pyculib; [cited 26 June 2019]. Available from: http://pyculib.readthedocs.io/en/latest/#.
 14.
Lesniewski A, Richter A. Managing counterparty credit risk via BSDEs. ArXiv eprints. 2016.
 15.
Kharroubi I, Lim T, Ngoupeyou A. Meanvariance hedging on uncertain time horizon in a market with a jump. Appl Math Optim. 2013;68(3):413–44.
 16.
Basel Committee on Banking Supervision. Margin requirements for noncentrally cleared derivatives; 2015.
 17.
Reisinger C, Wittum G. Efficient hierarchical approximation of highdimensional option pricing problems. SIAM J Sci Comput. 2007;29(1):440–58.
Acknowledgements
The authors would also like to thank all the supportive consultants and programmers from VORtech, BV, for their help and advice and valuable discussion for this work.
Availability of data and materials
The platform independent Python 3 code generated during the current study are available at https://github.com/kwchau/sgbm_xva under the MIT license.
Funding
This work is supported by EU Framework Programme for Research and Innovation Horizon 2020 (H2020MSCAITN2014, Project 643045, ‘EID WAKEUPCALL’).
Author information
Affiliations
Contributions
All authors were involved in planning the work, drafting and writing the manuscript. Discussion and comments on the results and the manuscript as well as a critical revision have been made by all the authors at all stages. KWC is the main author for the software mentioned in this manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chau, K.W., Tang, J. & Oosterlee, C.W. An SGBMXVA demonstrator: a scalable Python tool for pricing XVA. J.Math.Industry 10, 7 (2020). https://doi.org/10.1186/s13362020000735
Received:
Accepted:
Published:
Keywords
 SGBM
 XVA
 CUDA Python