An explicit description of null controllable regions of linear systems with saturating actuators

Telechargé par mednaceurbenfredj
Systems & Control Letters 47 (2002) 65 78
www.elsevier.com/locate/sysconle
An explicit description of null controllable regions of linear
systems with saturating actuators
Tingshu Hua;;1, Zongli Lina;1,LiQiu
b;2
aDepartment of Electrical Engineering, University of Virginia, Charlottesville, VA 22903, USA
bDepartment of Electrical & Electronic Engineering, Hong Kong University of Science & Technology, Clear Water Bay,
Kowloon, Hong Kong
Received 25 September 2000; received in revised form 13 March 2002; accepted 16 April 2002
Abstract
We give simple exact descriptions of the null controllable regions for general linear systems with saturating actuators.
The description is in terms of a set of extremal trajectories of the anti-stable subsystem. For lower order systems or
systems with only real eigenvalues, this description is further simplied to result in explicit formulae for the boundaries
of the null controllable regions. c
2002 Elsevier Science B.V. All rights reserved.
Keywords: Actuator saturation; Bounded controls; Null controllable region
1. Introduction
One of the most fundamental issues associated with the control of a system is its controllability. Since all
practical control inputs are bounded (due to actuator saturation), the constrained controllability was formulated
earlier than the unconstrained one. While the unconstrained controllability has been well understood for several
decades, there have been continual eorts towards full understanding of the constrained controllability (see,
e.g., [1–3,6–8,10,14–17] and the references therein).
For a linear system with a constrained input, the null controllable region at a time T(0;), denoted as
C(T), is dened to be the set of states that can be steered to the origin in time Twith a constrained control.
The union of C(T) for all T(0;), denoted as C, is called the null controllable region. In the earlier studies,
the null controllable region, also called the controllable set or the reachable set (of the time reversed system),
was closely related to the time optimal control (e.g., [2,6,9,11]): for a given initial state x0, the time optimal
control problem has a solution if and only if x0C;Ifx0is on the boundary of C(T), then the minimal
time to steer x0to the origin is T. It is well-known that the time optimal controls are bang–bang controls.
For discrete-time systems, the time optimal control can be computed through linear programming and C(T)
can be exactly obtained, although the computational burden increases as Tincreases. Also closely related to
the time optimal control is the model predictive control or the receding horizon control. The model predictive
Corresponding author.
E-mail addresses: [email protected] (Tingshu Hu), [email protected] (Zongli Lin), [email protected] (Li Qiu).
1Work supported in part by the US Oce of Naval Research Young Investigator Program under grant N00014-99-1-0670.
2Work supported by Hong Kong Research Grant Council under grant HKUST6046=00E.
0167-6911/02/$ - see front matter c
2002 Elsevier Science B.V. All rights reserved.
PII: S 0167-6911(02)00176-7
66 Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 78
control has been extensively studied and has found wide applications in slow processes (see, e.g., [12,13] for
a survey). The development of the model predictive control also contributes to the characterization of C(T)
for discrete-time systems. On the contrary, for continuous-time systems, since the time optimal control is
generally impossible to compute except by numerical approximation, there has been no result on the explicit
or analytical characterization of C(T)orCof exponentially unstable systems. There are however numerical
algorithms available to obtain approximations of Cbased on some partial properties about the boundary of C
for second-order systems (e.g., [16,17]). There are also numerical methods for testing if a particular point in
the state space is inside C(e.g., [5]). In this paper, we will focus on the analytic characterization of Cfor
general linear systems.
We recall that a linear system is said to be anti-stable if all its poles are in the open right-half plane and
semi-stable if all its poles are in the closed left-half plane.
For a semi-stable linear system, it is well-known [11,14,15] that the null controllable region is the whole
state-space as long as the system is controllable in the usual linear system sense. For a general system with
exponentially unstable modes, there exists a nice decomposition result concerning the null controllable region
[4]. Suppose such a system is decomposed into the sum of a controllable semi-stable subsystem and an
anti-stable subsystem, then the null controllable region of the whole system is the Cartesian product of the
null controllable region of the rst subsystem, which is its whole state space, and that of the second subsystem,
which is a bounded convex open set.
However, little was known about the null controllable region of an anti-stable system. This paper is dedicated
to solving this problem. We will give simple exact descriptions of the null controllable region of a general
anti-stable linear system with saturating actuators in terms of a set of extremal trajectories of its time reversed
system. This set of extremal trajectories is particularly easy to describe for low order systems or systems with
only real eigenvalues. For example, for a second-order system, the boundary of its null controllable region is
covered by at most two extremal trajectories; and for a third-order system, the set of extremal trajectories can
be described in terms of parameters in a real interval.
The remainder of the paper is organized as follows. Section 2 contains some preliminaries and denitions
of notation. Section 3 gives a simple exact description of the null controllable regions of anti-stable linear
systems with bounded controls. Section 4 draws a brief conclusion to this paper.
2. Preliminaries and notation
Consider a linear system
˙x(t)=Ax(t)+Bu(t);(1)
where x(t)Rnis the state and u(t)Rmis the control. Let
Ua={u:uis measurable and u(t)61;tR};(2)
where u(t)= maxi|ui(t)|. A control signal uis said to be admissible if uUa. In this paper, we are
interested in the control of system (1) by using admissible controls. Our concern is the set of states that can
be steered to the origin by admissible controls.
Denition 2.1. A state x0is said to be null controllable if there exist a T[0;) and an admissible control u
such that the state trajectory x(t) of the system satises x(0)= x0and x(T)= 0. The set of all null controllable
states is called the null controllable region of the system and is denoted by C.
Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 78 67
With the above denition, we see that x0Cif and only if there exist T[0;)andauUasuch that
0=x(T)=e
AT x0+T
0
eA(T)Bu()d=e
AT x0+T
0
eABu()d:
It follows that
C=
T[0;)x=T
0
eABu()d:uUa:(3)
The minus sign “” before the integral can be removed since Uais a symmetric set. In what follows we
recall from the literature some existing results on the characterization of the null controllable region.
Proposition 2.1. Assume that (A; B)is controllable.
(a) If A is semi-stable;then C=Rn.
(b) If A is anti-stable;then Cis a bounded convex open set containing the origin.
(c) If
A=A10
0A2
with A1Rn1×n1anti-stable and A2Rn2×n2semi-stable;and B is partitioned as
B1
B2
accordingly;then C=C1×Rn2where C1is the null controllable region of the anti-stable system
˙x1(t)=A1x1(t)+B1u(t).
Statement (a) is well-known [11,14,15]. Statements (b) and (c) are proven in [4]. Because of this propo-
sition, we can concentrate on the study of null controllable regions of anti-stable systems. For this kind of
systems,
C=x=
0
eABu()d:uUa;(4)
where
Cdenotes the closure of C. We will use “@” to denote the boundary of a set. In [5], a nonlinear
programming based algorithm is proposed to test if a point in the state space belongs to C. In Section 3, we
will derive a method for explicitly describing @C. To this end, we will need some more preliminaries.
If B=[b1b2::: b
m] and the null controllable region of the system ˙x(t)=Ax(t)+biui(t)isCi,i=1;:::;m,
then
C=
m
i=1
Ci={x1+x2+···+xm:xiCi;i=1;2;:::;m}:(5)
In view of (5) and Proposition 2.1, in the study of the null controllable regions we will assume, without loss
of generality, that (A; B) is controllable, Ais anti-stable, and m= 1. For clarity, we rename Bas b.
For a general system
˙x=f(x; u);(6)
its time reversed system is
˙z=f(z; v):(7)
It is easy to see that x(t) solves (6) with x(0) = x0;x(t1)=x1, and certain uif and only if z(t)=x(t1t)
solves (7) with z(0)= x1,z(t1)=x0, and v(t)=u(t1t). The two systems have the same curves as trajectories,
but traverse in opposite directions.
68 Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 78
Consider the time reversed system of (1):
˙z(t)=Az(t)bv(t):(8)
Denition 2.2. A state zfis said to be reachable if there exist T[0;) and an admissible control vsuch
that the state trajectory z(t) of system (8) satises z(0) = 0 and z(T)=zf. The set of all reachable states is
called the reachable region of system (8) and is denoted by R.
It is known that Cof (1) is the same as Rof (8) (see, e.g., [11]). To avoid confusion, we will continue
to use the notation x; u and Cfor the original system (1), and z; v and Rfor the time-reversed system (8).
3. Null controllable regions
In Section 3.1, we show that the boundary of the null controllable region of a general anti-stable linear
system with saturating actuator is composed of a set of extremal trajectories of the time reversed system. The
descriptions of this set are further simplied for systems with only real poles and for systems with complex
poles in Sections 3.2 and 3.3, respectively.
3.1. Description of the null controllable regions
We will characterize the null controllable region Cof system (1) through studying the reachable region R
of its time reversed system (8).
Since Ais anti-stable, we have
R=z=
0
eAbv()d:vUa=z=0
−∞
eAbv()d:vUa:
The change of integration interval from [0;]to[−∞;0] is crucial to our development, as will be clear
from Eq. (17). Noticing that eA =e
A(0), we see that a point zin
Ris a state of the time-reversed system
(8) at t= 0 by applying an admissible control vfrom −∞ to 0.
Theorem 3.1.
@R=z=0
−∞
eAbsgn(ceAb)d:cRn\{0}:(9)
Ris strictly convex. Moreover;for each z@R;there exists a unique admissible control vsuch that
z=0
−∞
eAbv()d: (10)
Remark 3.1. We give some simple facts about convex sets in this remark. Consider a closed set S.IfSis
convex and z@S; then by separation theorem;there exists a hyperplane cz=kthat is tangential to @S at
zand the set Slies completely to one side of the hyperplane;i.e.;
cz6k=cz;zS:
A set Sis said to be strictly convex if it is convex and for any two points z1;z
2@S,z1+(1)z2∈ @S for
all (0;1). This is equivalent to saying that any hyperplane that is tangential to @S has only one intersection
point with @S, or, for any c= 0, there exists a unique z@S such that cz= maxzScz.
Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 78 69
Proof of Theorem 1. First;the convexity of
Rcan easily be veried by denition. Let z@R. Then;there
exists a nonzero vector cRnsuch that
cz= max
z
R
cz= max
vUa0
−∞
ceAbv()d: (11)
Since c= 0 and (A; b) is controllable;c
eAb≡ 0. Since ceAt bhas a nite number of zeros in any nite
interval;
({t:ceAt b=0})=0;(12)
where (·) denotes the measure of a set.
It is easy to see that
v(t) = sgn(ceAt b)
maximizes the right-hand side of (11). We maintain that vis the unique optimal solution of (11). To verify
this, we need to show that for any vUa;v=v,
0
−∞
ceAbv()d¿0
−∞
ceAbv()d: (13)
Since v=v, there are a set E1[−∞;0] with nonzero measure, i.e., (E1)=1¿0, and a number 1¿0
such that
|v(t)v(t)|¿1;tE1:
By (12), there exist a set EE1, with (E)=¿0, and a positive number ¿0 such that
|ceAt b|¿; tE:
Noting that vUa, we have
ceAt b(v(t)v(t))¿0;t[−∞;0]:
It then follows that
0
−∞
ceAb(v()v()) d
¿E
ceAb(v()v(t)) d=E
|ceAb||v()v()|d¿1¿0:
This shows that v(t) is the unique optimal solution of (11) and hence the unique admissible control satisfying
z=0
−∞
eAbv()d: (14)
On the other hand, if
z=0
−∞
eAbsgn(ceAb)d
for some nonzero c, then obviously
cz= max
z
R
cz:
This shows that z@Rand we have (9).
Since for each c= 0, the optimal solution v(t) and zof (11) is unique, we see that
Ris strictly convex.
1 / 14 100%
La catégorie de ce document est-elle correcte?
Merci pour votre participation!

Faire une suggestion

Avez-vous trouvé des erreurs dans linterface ou les textes ? Ou savez-vous comment améliorer linterface utilisateur de StudyLib ? Nhésitez pas à envoyer vos suggestions. Cest très important pour nous !