
In this paper, properties of a recently proposed mathematical model for data flow in large-scale asynchronous computer systems are analyzed. In particular, the existence of special weak solutions based on propagating fronts is established. Qualitative properties of these solutions are investigated, both theoretically and numerically.
Citation: Cory D. Hauck, Michael Herty, Giuseppe Visconti. Qualitative properties of mathematical model for data flow[J]. Networks and Heterogeneous Media, 2021, 16(4): 513-533. doi: 10.3934/nhm.2021015
[1] | Cory D. Hauck, Michael Herty, Giuseppe Visconti . Qualitative properties of mathematical model for data flow. Networks and Heterogeneous Media, 2021, 16(4): 513-533. doi: 10.3934/nhm.2021015 |
[2] | Tong Li . Qualitative analysis of some PDE models of traffic flow. Networks and Heterogeneous Media, 2013, 8(3): 773-781. doi: 10.3934/nhm.2013.8.773 |
[3] | Giuseppe Maria Coclite, Lorenzo di Ruvo, Jan Ernest, Siddhartha Mishra . Convergence of vanishing capillarity approximations for scalar conservation laws with discontinuous fluxes. Networks and Heterogeneous Media, 2013, 8(4): 969-984. doi: 10.3934/nhm.2013.8.969 |
[4] | Christophe Chalons, Paola Goatin, Nicolas Seguin . General constrained conservation laws. Application to pedestrian flow modeling. Networks and Heterogeneous Media, 2013, 8(2): 433-463. doi: 10.3934/nhm.2013.8.433 |
[5] | Helge Holden, Nils Henrik Risebro . Follow-the-Leader models can be viewed as a numerical approximation to the Lighthill-Whitham-Richards model for traffic flow. Networks and Heterogeneous Media, 2018, 13(3): 409-421. doi: 10.3934/nhm.2018018 |
[6] | Frederike Kissling, Christian Rohde . The computation of nonclassical shock waves with a heterogeneous multiscale method. Networks and Heterogeneous Media, 2010, 5(3): 661-674. doi: 10.3934/nhm.2010.5.661 |
[7] | Mauro Garavello, Roberto Natalini, Benedetto Piccoli, Andrea Terracina . Conservation laws with discontinuous flux. Networks and Heterogeneous Media, 2007, 2(1): 159-179. doi: 10.3934/nhm.2007.2.159 |
[8] | Raimund Bürger, Stefan Diehl, M. Carmen Martí, Yolanda Vásquez . A difference scheme for a triangular system of conservation laws with discontinuous flux modeling three-phase flows. Networks and Heterogeneous Media, 2023, 18(1): 140-190. doi: 10.3934/nhm.2023006 |
[9] | Dong Li, Tong Li . Shock formation in a traffic flow model with Arrhenius look-ahead dynamics. Networks and Heterogeneous Media, 2011, 6(4): 681-694. doi: 10.3934/nhm.2011.6.681 |
[10] | Maria Laura Delle Monache, Paola Goatin . Stability estimates for scalar conservation laws with moving flux constraints. Networks and Heterogeneous Media, 2017, 12(2): 245-258. doi: 10.3934/nhm.2017010 |
In this paper, properties of a recently proposed mathematical model for data flow in large-scale asynchronous computer systems are analyzed. In particular, the existence of special weak solutions based on propagating fronts is established. Qualitative properties of these solutions are investigated, both theoretically and numerically.
The increasing number and diversity of processing units in modern supercomputers presents a significant challenge to the understanding of how data is globally distributed in these systems. This issue is critically important in emerging computer architectures for which the cost of data movement is becoming a critical driver for system design, affecting hardware and software choices and even basic algorithms [12, 17, 18]. At the same time, these supercomputers are becoming increasingly difficult to model. Indeed, simulating the detailed state of a supercomputer requires an even larger computer, thereby making such approaches prohibitive for the largest systems.
With the help of new mathematical models, a more efficient utilization of available processing units might be possible. Such models are necessarily coarse-grained approximations, used to understand the gross behavior of the system in order to make design, control, and optimization more tractable. In this spirit, a coarse-grained mathematical model has been recently derived in [5] to describe the movement of processed data in an extreme-scale computer system. This model focuses on a idealized setting, in which a lattice of processors perform operations in parallel and communicate with nearest neighbors. Although very simple, the model allows for local variations in processor speed, due to differences in hardware performance or local problem complexity, as well as asynchronously operation, meaning that processors manipulate and share data with their neighbors as they are able, as opposed to a staged process in which processors communicate only after they have all completed a step of assigned work.
The model proposed in [5] takes the form of a PDE for the density of data at a particular processor location and stage in the global computation. It is derived formally from a microscopic ODE description. The notion of coarse-graining discrete and semi-discrete model of networks and related systems to derive PDE models is common mathematical strategy that is used in many applications, including vehicular [4, 11, 16] and pedestrian traffic flow [1, 6, 8, 15], supply chains [2, 3], bacterial movement [14] and machine learning applications [7].
In the current work, we establish some qualitative properties of the coarse-grained PDE model in [5]. We focus specifically on the phenomena of front propagation and present simulation results that highlight the theoretical findings. In the context of the current application, these fronts describe the movement of data through the different stages of a computational task. The reason for this analysis is two-fold. First, it was observed in [5] that, after a transient phase, many initial conditions eventually relax to a front whose shape depends on local variations in the processor speed. Second, these fronts can be used to characterize three important features for application: the first time at which some portion of the data reaches a state of completion, the rate at which the rest of the data reaches this state, and the time at which all data reaches this stage. Moreover, the relatively simple form of these fronts enables the investigation of optimization and control strategies based on some application-specific metrics. As a first step in this direction, we explore how variations in the processor speed affect front behavior.
In this section, we briefly summary the microscopic ODE model from [5] and the macroscopic model that is obtained from it. The microscopic model describes a computer system as a lattice of processors, the dimension of which can be arbitrary, but finite. For application purposes, only one-, two-, and three-dimensional lattices are realistic, and here we focus the one-dimensional case for simplicity. It is assumed in [5] that each processor performs the same task, which is broken into discrete stages. The rate at which a given processor moves data from stage to stage depends not only on its intrinsic processing rate, but also on the availability of usable data in the processor and its neighbors.
Mathematically, the amount of data at time
For each
˙qi,k(t)=fi,k−1(t)−fi,k(t),qi,k(0)=q0i,k,fi,0(t)=fini(t), | (1) |
where
fi,k=aiv1(v2(qi,k,Qi+1,k−Qi,k+qi,k,Qi−1,k−Qi,k+qi,k);q∗), | (2) |
where
Qi,k(t)=kmax∑j=kqi,j(t)+∫t0fi,kmax(s)ds, | (3) |
and
v1(q;q∗)=max{0,min{1,qq∗}}, | (4) |
v2(q,Δ+,Δ−)=min{qi,k,max{Δ+,0},max{Δ−,0}}. | (5) |
The function
1 In [5], there is another parameter
The macroscopic model in [5] is a continuum approximation for (1) that is derived in the limit of infinitely many processors (
∂tρ(t,x,z)+∂zΦ(ρ(t,x,z),∂xP(t,x,z))=0,(t,x,z)∈R+×T×(0,1), | (6a) |
ρ(0,x,z)=ρ0(x,z),(x,z)∈T×(0,1), | (6b) |
Φ(ρ(t,x,0),∂xP(t,x,0))=Fin(t,x),(t,x)∈R+×(0,1), | (6c) |
where
P(t,x,z)=∫1zρ(t,x,y)dy+∫t0Φ(ρ(s,x,1),∂xP(s,x,1))ds, | (7a) |
Φ(ρ,σ)=αw1(w2(ρ,σ,−σ)); | (7b) |
and
w1(u)=min{max{uρ∗,0},1} | (8a) |
w2(u,v1,v2)=min{u,max{u+ηv1,0},max{u+ηv2,0}}. | (8b) |
The model (6) is a conservation law that describes the movement of data entering the system at
At first glance, (6) appears to include a recursive definition of the flux function
P(t,x,z)=∫t0Fin(s,x)ds+∫10ρ0(x,z)dz−∫z0ρ(t,x,z)dz. | (9) |
Alternatively, since
∂tP−Φ(−∂zP,∂xP)=0. | (10) |
The reformulation of (6a) in terms of (10) is the basis for the analysis and simulation performed in [5]. In the current work, we proceed by analyzing (6) directly in terms of
In this section, we investigate qualitative properties of (6). We begin by simplifying the expression for
W2(s)=min{¯w2(s),¯w2(−s)}, | (11) |
where
¯w2(s)=min{1,max{1+ηs,0}}; | (12) |
(see Figure 1). In terms of
Φ(ρ,σ)={αw1(ρW2(σρ)),ρ≠00,ρ=0. | (13) |
Next for simplicity, we artificially extend the domain in
Assumption:P(t,x,z)=∫∞zρ(t,x,y)dy∀t≥0,x∈T,z∈[0,∞). | (14) |
Hence, in the following we consider the model (6) on the extended domain
D:=R+×T×R+ | (15) |
and introduce the auxiliary variable
σ(t,x,z):=∂x∫∞zρ(t,x,y)dy,(t,x,z)∈D. | (16) |
The resulting system for
∂tρ+∂zΦ(ρ,σ)=0,(t,x,z)∈D, | (17a) |
∂xρ+∂zσ=0,(t,x,z)∈D, | (17b) |
ρ(0,x,z)=ρ0(x,z),(x,z)∈T×R+, | (17c) |
ρ(t,x,0)=ρb(t,x),σ(t,x,0)=σb(t,x),(t,x)∈R+×T. | (17d) |
The relation of the quantities
σb(t,x)=∂x∫∞0ρ(t,x,y)dy, | (18a) |
Φ(ρb(t,x),σb(t,x))=Fin(t,x), | (18b) |
limz→∞σ(t,x,z)=0. | (18c) |
Definition 3.1 [Weak solution] Given initial data
∫Dρ∂tφ+Φ(ρ,σ)∂zφdzdxdt+∫T×R+ρ0(x,z)φ(0,x,z)dzdx+∫R+×TΦ(ρb(t,x),σb(t,x))φ(t,x,0)dxdt=0, | (19a) |
∫Dρ∂xψ+σ∂zψdzdxdt+∫R+×Tσb(t,x)ψ(t,x,0)dxdt=0. | (19b) |
We do not know whether a solution in the sense of Definition 3.1 exists for (17) with general initial and boundary data or whether such a solution is unique and depends continuously on the data.2 However, for the particular initial and boundary data given in (28), we establish the existence of special solutions below.
2In [5], a continuous vanishing viscosity solution is established using known results from the mathematical literature on Hamilton-Jacobi equations. However, translating such a result to an
The weak formulation (19) gives rise to a Rankine–Hugoniot jump condition [10]. Consider the surface
S:={(t,x,z)⊂D:z=ζ(t,x)} | (20) |
for a differentiable function
Φ(ρℓ,σℓ)−Φ(ρr,σr)=∂tζ(t,x)(ρℓ−ρr), | (21) |
where
ρℓ=ρ(t,x,ζ(t,x)−),σℓ=σ(t,x,ζ(t,x)−), | (22) |
ρr=ρ(t,x,ζ(t,x)+),σr=σ(t,x,ζ(t,x)+). | (23) |
In this section, we investigate solutions to (17) that take the form of fronts; see Figure 2. As discussed in the introduction, such solutions are important from the point of view of applications.
In the case of a single front, the initial and boundary conditions for
ρ0(x,z)=rH(ζ0(x)−z)andρb(t,x)=r, | (24) |
where
σb(t,x)=r∂xζ(t,x). | (25) |
Theorem 3.2. Let
0<r<ρ∗ | (26) |
and
∂tζ(t,x)=α(t,x)ρ∗W2(∂xζ(t,x)),ζ(0,x)=ζ0(x). | (27) |
Then there pair (
ρ(t,x,z)=rH(ζ(t,x)−z),σ(t,x,z)=rH(ζ(t,x)−z)∂xζ(t,x) | (28) |
is a weak solution for (17) in the sense of Definition 3.1 with initial and boundary conditions given by (24) and (25).
Before proving this result, some remarks are in order.
1.
Φ(ρ,σ)={αρρ∗W2(σρ),ρ≠0,0,ρ=0, | (29) |
This formula and the Rankine–Hugoniot condition in (21) together form the key components of the existence proof.
2.
3.
4.
ρ0(x,z)=r1H(ζ1,0(x)−z)+r2[H(ζ2,0(x)−z)−H(ζ1,0(x)−z)] | (30) |
with positive constants
ρ(t,x,z)=(r1−r2)H(ζ1(t,x)−z)+r2H(ζ2(t,x)−z) | (31a) |
σ(t,x,z)=(r1−r2)∂xζ1(t,x)H(ζ1(t,x)−z) | (31b) |
+r2∂xζ2(t,x)H(ζ2(t,x)−z), | (31c) |
provided
∂tζ1(t,x)=α(t,x)ρ∗(r2−r1)[r2W2(∂xζ2(t,x))−r1W2((r1−r2r1)∂xζ1(t,x)+r2r1∂xζ2(t,x))], | (32a) |
∂tζ2(t,x)=α(t,x)ρ∗W2(∂xζ2(t,x)). | (32b) |
Proof of Theorem 3.2. With the
We first verify that
∫D[rH(ζ(t,x)−z)∂xψ(t,x,z)+rH(ζ(t,x)−z)∂xζ(t,x)∂zψ(t,x,z)]dzdxdt=0. | (33) |
By definition of the Heaviside function,
D[rH(ζ(t,x)−z)∂xψ(t,x,z)+rH(ζ(t,x)−z)∂xζ(t,x)∂zψ(t,x,z)]dzdxdt=r∫R+0×T∫ζ(t,x)0[∂xψ(t,x,z)+∂xζ(t,x)∂zψ(t,x,z)]dzdxdt=r∫R+0×T[∫ζ(t,x)0∂xψ(t,x,z)dz+∂xζ(t,x)ψ(t,x,ζ(t,x))]dxdt=r∫R+0×T∂∂x(∫ζ(t,x)0ψ(t,x,z)dz)dxdt=0, | (34) |
where the integral in the last line vanishes due to periodicity of the domain with respect to
We next verify that
σ(t,x,z)ρ(t,x,z)=∂xζ(t,x). | (35) |
Hence according to (29),
Φ(ρ(t,x,z),σ(t,x,z))={α(t,x)rρ∗W2(∂xζ(t,x))(t,x,z)∈S−0(t,x,z)∈S+. | (36) |
Let
Dρ∂tφ+Φ(ρ,σ)∂zφdzdxdt=∫S−r∂tφ+αrρ∗W2(∂xζ)∂zφdzdxdt=∫T∫{(t,z):z=ζ(t,x)}φ(−r∂tζ+αrρ∗W2(∂xζ))1‖(1,∂tζ)‖dA(t,z)dx, | (37) |
where
It is the equation for the front
∂tζ(t,x)=α(t,x)ρ∗×{1−η|∂xζ(t,x)|,|∂xζ(t,x)|<η−1,0,|∂xζ(t,x)|≥η−1. | (38) |
In general, we do not know if there exists a solution to (38), due to the discontinuous flux. However, we may consider particular solutions related to special initial conditions
● If for all
∂t(∂xζ)(t,x)+αηρ∗∂x|(∂xζ)(t,x)|=0,(∂xζ)(0,x)=∂xζ0(x) | (39) |
with flux function
● If
ζ(t,x)=ζ0(x−ηαρ∗t)+αρ∗t. | (40) |
Similarly if
ζ(t,x)=ζ0(x+ηαρ∗t)+αρ∗t. | (41) |
We use these formulas to make comparisons with numerical results in Examples 2-4 in Section 4.
● If locally
In this section, we construct a numerical discretization for the system. Results of numerical simulations using this discretization are presented to illustrate the behavior of special front-type solutions discussed in the previous section.
Our strategy for approximating (17) is based on the relaxed system of equations
∂tρ+∂zΦ(ρ,σ)=0, | (42a) |
ε∂tσ+∂xρ+∂zσ=0, | (42b) |
where
Before discretizing (42), we investigate hyperbolicity in the simplified case that
3 Hyperbolicity will not change if
∂tU+Bx(U)∂xU+Bz(U)∂zU=0, | (43) |
where
Bx(U)=(001ϵ0),Bz(U)=(∂ρΦ∂σΦ01ϵ). | (44) |
Under the assumption (26) and provided that
∂ρΦ(ρ,σ)=αρ∗(W2(s)−sW′2(s)),∂σΦ(ρ,σ)=αρ∗W′2(s), | (45) |
where
The system (43) is hyperbolic if for any
B(U,ξ)={(00ξ1ϵξ2ϵ),|s|>1η,](ξ2αρ∗−ξ2sgn(s)ηαρ∗ξ1ϵξ2ϵ),0<|s|<1η, | (46) |
and the eigenvalues of
(λ1,λ2)={(0,ξ2ϵ),|s|>1η,(λ∗1,λ∗2),0<|s|<1η, | (47) |
where
p(λ)=λ2−ξ2(αρ∗+1ϵ)λ+ξ2αϵρ∗(ξ2+sgn(s)ξ1η). | (48) |
For
dϵ(ξ1,ξ2)=(ξ2ϵρ∗)2[(ϵα−ρ∗)2−4sgn(s)ϵαρ∗ηξ1ξ2] | (49) |
which, for
sgn(s)ξ1ξ2>(ϵα−ρ∗)24ϵαρ∗η. | (50) |
For any
To derive a numerical scheme for (42), we first discretize in time using a first-order method. We are eventually interested in the
ρn+1=ρn−Δt∂zΦ(ρn,σn), | (51a) |
σn+1=σn−Δtε(∂xρn+1+∂zσn+1). | (51b) |
To discretize (51) in
¯Rnij≃1ΔxΔz∫xi+1/2xi−1/2∫zj+1/2zj−1/2ρn(x,z)dxdz | (52) |
and
¯Snij≃1ΔxΔz∫xi+1/2xi−1/2∫zj+1/2zj−1/2σn(x,z)dxdz, | (53) |
respectively.
We use a Lax-Friedrichs approximation of
¯Rn+1ij=¯Rnij−ΔtΔz(Fni,j+1/2−Fni,j−1/2)+14(¯Rni+1,j−2¯Rnij+¯Rni−1,j), | (54a) |
¯Sn+1ij=¯Snij−Δt2ϵΔx(¯Rn+1i+1,j−¯Rn+1i−1,j)−ΔtϵΔz(¯Sn+1i,j+1−¯Sn+1ij), | (54b) |
where
Fni,j+12=12(Φ(¯Rni,j+1,¯Sni,j+1)+Φ(¯Rnij,¯Snij)−a(¯Rni,j+1−¯Rnij)). | (55) |
The monotonicity of the numerical flux
max(R,S)‖∂ρΦ(R,S)‖=:a≤ΔzΔt. | (56) |
This condition motivates a dynamic choice of
an=max(i,j):1≤i≤Nx,1≤j≤Nz‖∂ρΦ(¯Rnij,¯Snij)‖. | (57) |
The right-biased stencil in the discretization of
The final scheme used in Section 4.2 is given by the formal limit of (54) in the limit
¯Rn+1ij−¯Rnij=−ΔtΔz[Fni,j+1/2−Fni,j−1/2]+14(¯Rni+1,j−2¯Rnij+¯Rni−1,j), | (58a) |
¯Sn+1ij=Δz2Δx(¯Rn+1i+1,j−¯Rn+1i−1,j)+¯Sn+1i,j+1. | (58b) |
This scheme above must be accompanied by boundary and initial conditions. The initial values
¯R0ij=1ΔxΔz∫xi+1/2xi−1/2∫zj+1/2zj−1/2ρ0(x,y)dxdz. | (59) |
The cell averages
¯S0ij=1ΔxNz∑ℓ=jρ0(xi+1/2,zj)−ρ0(xi−1/2,zj). | (60) |
For
¯Sni,Nz=0. | (61) |
For
¯Rni,1=r. | (62) |
Due to the central discretization of
¯Rni,Nz=¯Rni,Nz−1. | (63) |
Example 1: validation with the microscopic model. We consider a test problem that has been used to to validate the macroscopic model (17) and the numerical method (58) by comparing with a simulation of the microscopic model introduced in [5]. For this problem,
ρ0(x,z)=1.5(sin(2πz))6χ[0,0.5](z),ρb(t,x)=0,σb(t,x)=0, | (64) |
and the processor speed is
α(x)=1−0.4(sin(πx))2. | (65) |
Both models are simulated up to a final time
The
In Figure 4 we show the initial density and the final density profiles for both models at time
In the remaining examples of this subsection, we explore the behavior of front-type solutions. Unless otherwise stated the following parameters are used in all simulations:
r=0.5,ρ∗=0.8,α=0.1,Tfin=2.0. | (66) |
The value of
0≤ρ(t,x,z)<ρ∗,forall(t,x,z)∈D, | (67) |
in which case the flux
Example 2: front with constant profile. We consider the case of an initial constant front, namely
ζ0(x)=ζ0=0.2,∀x∈T, | (68) |
which provides a trivial example for a piece-wise constant solution
ζ(t,x)=ζ0+αρ∗t. | (69) |
In this example
Numerical results are shown in Figure 5. Since the processing rate
Example 3: V-shaped front with small
ζ0(x)=(1−2z0)|x−0.5|+z0 | (70) |
where
Based on the theoretical findings in [9] for (39), we expect that the front at later times will be piecewise linear. Moreover, the formulas in (40) and (41) can be used to generate the local solution analytically. Indeed, under the assumption
ζloc(t,x)=ζ0(x)+αρ∗(1−η|∂xζ0|)t. | (71) |
where
ζ(t,x)=max{ζloc(t,x),z0+αρ∗t,}. | (72) |
In Figure 6 we show numerical results for this test problem using the algorithm in (58).. We use a grid with
Example 4: Smooth front with small
ζ0(x)=14cos(2πx)+0.3. | (73) |
In order to fulfill the condition
In Figure 7 we show numerical results for this test problem using the algorithm in (58). We use a grid with
Example 5: V-shaped front with large
ζ(t,x)=max{ζ0(x),z0+αρ∗t}. | (74) |
In Figure 8 we show numerical results for a grid with
The discrete model (1) can be used to predict how variations in the processor rate affect computer performance, and to understand whether these rates can be controlled to achieve a prescribed objective, such as faster throughput. For the continuum model in (17), such a control is specified via the function
In order to normalize the control action, we assume that for all
∫Tα(t,x)dx=ˉα. | (75) |
Since
ρ0(x,z)=rH(ζ0(x)−z),r<ρ∗,andζ0(x)>0, | (76) |
Since the slope
α(x)=Cαρ∗(ζmax−ζ0(x)), | (77) |
where
In order to compare the temporal and spatial evolution of
ω1(t,z):=∫t0∫TΦ(ρ(s,x,z),σ(s,x,z))dxds,ω2(t,z):=∫TΦ(ρ(t,x,z),σ(t,x,z)dx,ω3(t,z):=∫t0∫Tρ(s,x,z)dxds. | (78) |
The quantity
For our numerical tests, we set
In this paper, we have analyzed a recently derived mathematical model for the evolution of processed data in large-scale, asynchronous computers [5]. After the introduction of an auxiliary variable, the model is expressed as a system of partial differential equations. It is possible to prove existence of discontinuous solutions to this system for a particular class of initial and boundary conditions. These solution describes the flow of information as a series of propagating fronts. A numerical scheme for the reformulated system has also been designed based on a relaxation approximation. Numerical simulations based on this scheme demonstrate qualitative agreement with theoretical findings.
We have also briefly explored the effects of local processor speed on quantities of interest predicted by the model. In future work, we intend to investigate more extensively control mechanisms for optimizing important objectives related to performance of the large-scale computers. Further it is planned to validate the model based on experimental data.
MH and GV thank the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) for the financial support through 20021702/ GRK2326, 333849990/IRTG-2379, HE5386/14, 15, 18-1, 19-1, 22-1 and under Germany's Excellence Strategy EXC-2023 Internet of Production 390621612. The funding through HIDSS-004 is acknowledged. This work has been supported also by the US National Science Foundation, RNMS (KI-Net) grant 11-07444, and the U.S. Department of Energy, Office of Advanced Scientific Computing Research. The work of CDH was performed at the Oak Ridge National Laboratory, which is managed by UT-Battelle, LLC under Contract No. De-AC05-00OR22725.
[1] |
Two-way multi-lane traffic model for pedestrians in corridors. Netw. Heterog. Media (2011) 6: 351-381. ![]() |
[2] |
A model for the dynamics of large queuing networks and supply chains. SIAM J. Appl. Math. (2006) 66: 896-920. ![]() |
[3] |
A scalar conservation law with discontinuous flux for supply chains with finite buffers. SIAM J. Appl. Math. (2011) 71: 1070-1087. ![]() |
[4] |
Derivation of continuum traffic flow models from microscopic follow-the-leader models. SIAM J. Appl. Math. (2002) 63: 259-278. ![]() |
[5] |
A mathematical model of asynchronous data flow in parallel computers. IMA Journal of Applied Mathematics (2020) 85: 865-891. ![]() |
[6] |
On the modelling crowd dynamics from scaling to hyperbolic macroscopic models. Mathematical Models and Methods in Applied Sciences (2008) 18: 1317-1345. ![]() |
[7] |
H. Chan, M. J. Cherukara, B. Narayanan, T. D. Loeffler, C. Benmore, S. K. Gray and S. K. R. S. Sankaranarayanan, Machine learning coarse grained models for water, Nature Communications, 10 (2019), 379. doi: 10.1038/s41467-018-08222-6
![]() |
[8] |
Pedestrian flow models with slowdown interactions. Mathematical Models and Methods in Applied Sciences (2014) 24: 249-275. ![]() |
[9] |
Polygonal approximations of solutions of the initial value problem for a conservation law. J. Math. Anal. Appl. (1972) 38: 33-41. ![]() |
[10] |
C. M. Dafermos, Hyperbolic Conservation Laws in Continuum Physics, vol. 325 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], 2nd edition doi: 10.1007/3-540-29089-3
![]() |
[11] |
Rigorous derivation of nonlinear scalar conservation laws from follow-the-leader type models via many particle limit. Archive for Rational Mechanics and Analysis (2015) 217: 831-871. ![]() |
[12] |
J. Dongarra, J. Hittinger, J. Bell, L. Chacón, R. Falgout, M. Heroux, P. Hovland, E. Ng, C. Webster and S. Wild, Applied Mathematics Research for Exascale Computing, Technical report, U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research Program, 2014. doi: 10.2172/1149042
![]() |
[13] |
L. C. Evans, Partial Differential Equations, American Mathematical Society, 2010. doi: 10.1090/gsm/019
![]() |
[14] |
Modeling selective local interactions with memory. Physica D: Nonlinear Phenomena (2013) 260: 176-190. ![]() |
[15] | A fluid dynamic model for the movement of pedestrians. Complex Systems (1992) 6: 391-415. |
[16] |
Models for dense multilane vehicular traffic. SIAM J. Math. Anal. (2019) 51: 3694-3713. ![]() |
[17] | C. Murray et al., Basic Research Needs for Microelectronics: Report of the Office of Science Workshop on Basic Research Needs for Microelectronics, Technical report, USDOE Office of Science (SC)(United States), 2018. |
[18] |
J. S. Vetter et al., Extreme Heterogeneity 2018-Productive Computational Science in the Era of Extreme Heterogeneity: Report for DOE ASCR Workshop on Extreme Heterogeneity, Technical report, USDOE Office of Science (SC), Washington, DC (United States), 2018. doi: 10.2172/1473756
![]() |
[19] |
Roofline: An insightful visual performance model for multicore architectures. Communications of the ACM (2009) 52: 65-76. ![]() |
1. | Richard C Barnard, Kai Huang, Cory Hauck, A mathematical model of asynchronous data flow in parallel computers*, 2020, 85, 0272-4960, 865, 10.1093/imamat/hxaa031 | |
2. | Mohammad Daneshvar, Richard C. Barnard, Cory Hauck, Ilya Timofeyev, Modeling information flow in a computer processor with a multi-stage queuing model, 2024, 01672789, 134446, 10.1016/j.physd.2024.134446 |