Professional Documents
Culture Documents
Abstract—One of the fundamental problems of using us to a-posteriori1 methods. Some examples of a-posteriori
optimization models that use different time series as data input, methods include [7], [8], [9]; however, they either contain
is the trade-off between model accuracy and computational some kind of heuristic components or are tailored to toy
tractability. To overcome computational intractability of these
problems.
full optimization models, the dimension of input data and model
size is commonly reduced through time series aggregation (TSA) In this paper we show that first, a-priori TSA methods are
methods. However, traditional TSA methods often apply a one- not a one-size-fits-all solution when used for optimization
size-fits-all approach based on the common belief that the clusters models, and that they should ultimately be replaced by
that best approximate the input data also lead to the aggregated a-posteriori methods. To that purpose, we first define full
model that best approximates the full model, while the metric and aggregated optimization models and apply a traditional
that really matters – the resulting output error in optimization k-means clustering technique to an illustrative example in
results – is not well addressed. In this paper, we plan to chal-
lenge this belief and show that output-error based TSA methods
Section II. And second, that when a-posteriori methods are
with theoretical underpinnings have unprecedented potential of based on theoretical underpinnings - as the basis-oriented
computational efficiency and accuracy. method proposed in Section III - they outperform a-priori
methods by orders of magnitude. Section IV concludes the
Index Terms—Time series aggregation, optimization.
paper.
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:10:23 UTC from IEEE Xplore. Restrictions apply.
WOGRIN: TSA FOR OPTIMIZATION: ONE-SIZE-FITS-ALL? 2491
assume there are no time-period-linking constraints5 so we can Ax = BxB + NxN = E(bi ). By definition xN = 0, so we
analyze each time step individually. In particular, we focus on further simplify Ax = BxB = E(bi ) and it follows that
the optimal solution for this single time step, e.g., an hour, −1
xB = B (E(bi )). We now substitute this optimal solution
and its corresponding basis B (in the simplex framework). in the objective function of (E), which yields cT x = cTB xB =
Consider two different time steps whose TS data only dif- −1 −1 −1 −1
fer in the right-hand-side (RHS) vector of the constraints, cTB B (E(bi )) = cTB B ( i bIi ) = 1I cTB B b1 +. . .+ 1I cTB B bI .
i.e., b1 and b2 , but both have an identical optimal basis B Since we know that B is an optimal basis for each problem
−1 −1
for the underlying LP. Let us aggregate the two individual (Ei ), we can say that: 1I cTB B b1 + . . . + 1I cTB B bI ≥
1 T −1 1 T −1
I cB B b1 + . . . + I cB B bI = cB xB . This would imply that
T
time steps into one representative: the corresponding RHS
constraint vector becoming b1 +b 2
2 ; and the objective function B is an optimal basis for (E), which is a contradiction.
weight becoming 2. If the optimal basis of this aggregated
time step were the same as the optimal basis of the two orig- B. Economic Dispatch and Basis-Oriented Clustering
inal time steps, then - in expectation - the aggregated model
results would be identical to the full model results, i.e., yield Applying basis-oriented clustering to the above-mentioned
the same objective function value, and the same expected ED example shows that there are only 3 different bases.
results for the variables. Hence, those two time steps can Therefore, we only require 3 clusters. In Figure 1(b), we
be merged without losing any accuracy in the final aggre- have color-coded each hour depending on the basis (and
gated model. The Theorem below proves that the previous corresponding cluster) this hour belongs to: blue (the wind
statements is true for LPs, in particular, that a representative generator is the marginal generator), black (thermal is on
time step whose RHS is the expected value of other time steps the margin), and red (hours with non-supplied energy). The
with the same underlying LP basis, also has the same optimal MSE in the input space is 0.0385, which is 2.3 times larger
basis. The Theorem hence allows to increase model efficiency than the MSE obtained by k-means, so the input error is
by reducing the amount of time steps used in the aggregated higher with the obtained clusters. However, if we cluster all
model. This is unrelated to the topic of omitting non-binding hours within their basis, then the aggregated optimization
constraints within the model, which is outside the scope of results are exact! Solving the aggregated optimization model
this paper but could also be applied after the basis-oriented with only 3 representative periods that have been clustered,
aggregation process. accounting for the corresponding bases, yields an error of 0%.
In other words, if different time steps are aggregated within Apart from being theoretically exact, basis-oriented clustering
their basis and represented by the cluster centroid, then the also establishes the maximum number of clusters necessary
aggregated model results will be exactly the same as the full to obtain exact optimization results, which is nothing cur-
hourly model results and have zero output error in expectation. rent TSA methods can offer. This is of course a small but
This shows: first, that TSA for optimization purposes must be illustrative example of an ED. However, preliminary find-
based on the impact of the aggregation on the optimization out- ings indicate that basis-oriented TSA has potential to scale
put error and not on similarity of input data as in traditional up numerically for more realistic and large-scale applications
methods; second, it introduces basis-oriented TSA as one of the ED (to include more generators and more complicat-
promising approach of clustering for aggregated optimization ing constraints), for example, by using a-priori offline training.
problems. But this requires extensive future research both on developing
Theorem 1: For each i = 1, . . . , I consider the following theoretical framework and numerical methods.
LPs (Ei ): min cT x s.t. Ax = bi , where x ∈ Rn , A ∈ Rmxn , c ∈
Rn , bi ∈ Rm and the LPs only differ in the RHS values bi . IV. C ONCLUSION
Then, B is also an optimal basis for the problem (E): min cT x
The takeaways from this paper are as follows. First, a-priori
s.t. Ax = E(bi ), where E(bi ) = i bIi .
TSA methods are fundamentally flawed when used in/for
Proof: We proof this by contradiction. Assume that B is not
optimization models and therefore should be abandoned and
the optimal basis for problem (E): min cT x s.t. Ax = E(bi ).
replaced by a-posteriori methods. As we have shown by
Instead, let B (= B) and N be the optimal basis and the
counter-example, the lowest input error (as indicated by MSE)
non-basis matrices of (E). Under this assumption, it follows
does not translate into the lowest (or even a low) output error
that cTB xB < cTB xB for this problem. In (E) we obtain that
when approximating full optimization model results. Second,
basis-oriented TSA can achieve a tremendous reduction (3 out
5 If there are no time-period-linking constraints in the optimization problem, of 8760 hours) in input data of several orders of magnitude
then analyzing time steps separately does not incur an error. However, this is while replicating full model results exactly. This confirms
a very strong assumption that must be explained further. Hence, we separate
two separate discussions here: one on the data aggregation technique used;
that picking clusters intelligently can outperform traditional
and two, on the formulation of the optimization model itself and how realistic one-size-fits-all a-priori TSA methods (such as k-means) even
that is. Regarding the first, this letter shows that one of the most common if those use most of the original data. Finally, more research is
approaches, i.e., k-means clustering, for data aggregation is fundamentally required to develop a theoretical framework on most efficient
flawed when it is used for aggregating data that is later on inserted as input
into an optimization model. This fundamental result is true irrespective of a-posteriori TSA methods. The basis-oriented TSA proposed
whether there are time-period linking constraints or not. Regarding the lat- here could be a starting point. In future research, we plan to
ter discussion, time-linking constraints are very important for power system explore basis-oriented TSA further to see if it can be extended
models (e.g., unit commitment, ramping, storage etc.). However, introducing
such constraints also hugely complicates the analysis shown in the letter and
beyond LPs and in particular, to mixed integer and nonlinear
will therefore be addressed in ongoing (i.e., ramping and network constraints) problems, but also to large-scale, realistic problems with, e.g.,
and future research. time-linking and discrete constraints.
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:10:23 UTC from IEEE Xplore. Restrictions apply.
2492 IEEE TRANSACTIONS ON SMART GRID, VOL. 14, NO. 3, MAY 2023
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:10:23 UTC from IEEE Xplore. Restrictions apply.