Null Controllable Regions of Linear Systems

Telechargé par mednaceurbenfredj

Téléchargement

Systems & Control Letters 47 (2002) 65 – 78

www.elsevier.com/locate/sysconle

An explicit description of null controllable regions of linear

systems with saturating actuators

Tingshu Hua;∗;1, Zongli Lina;1,LiQiu

b;2

aDepartment of Electrical Engineering, University of Virginia, Charlottesville, VA 22903, USA

bDepartment of Electrical & Electronic Engineering, Hong Kong University of Science & Technology, Clear Water Bay,

Kowloon, Hong Kong

Received 25 September 2000; received in revised form 13 March 2002; accepted 16 April 2002

Abstract

We give simple exact descriptions of the null controllable regions for general linear systems with saturating actuators.

The description is in terms of a set of extremal trajectories of the anti-stable subsystem. For lower order systems or

systems with only real eigenvalues, this description is further simplied to result in explicit formulae for the boundaries

of the null controllable regions. c

Keywords: Actuator saturation; Bounded controls; Null controllable region

1. Introduction

One of the most fundamental issues associated with the control of a system is its controllability. Since all

practical control inputs are bounded (due to actuator saturation), the constrained controllability was formulated

earlier than the unconstrained one. While the unconstrained controllability has been well understood for several

decades, there have been continual eorts towards full understanding of the constrained controllability (see,

e.g., [1–3,6–8,10,14–17] and the references therein).

For a linear system with a constrained input, the null controllable region at a time T∈(0;∞), denoted as

C(T), is dened to be the set of states that can be steered to the origin in time Twith a constrained control.

The union of C(T) for all T∈(0;∞), denoted as C, is called the null controllable region. In the earlier studies,

the null controllable region, also called the controllable set or the reachable set (of the time reversed system),

was closely related to the time optimal control (e.g., [2,6,9,11]): for a given initial state x0, the time optimal

control problem has a solution if and only if x0∈C;Ifx0is on the boundary of C(T), then the minimal

time to steer x0to the origin is T. It is well-known that the time optimal controls are bang–bang controls.

For discrete-time systems, the time optimal control can be computed through linear programming and C(T)

can be exactly obtained, although the computational burden increases as Tincreases. Also closely related to

the time optimal control is the model predictive control or the receding horizon control. The model predictive

∗Corresponding author.

E-mail addresses: [email protected] (Tingshu Hu), [email protected] (Zongli Lin), [email protected] (Li Qiu).

1Work supported in part by the US Oce of Naval Research Young Investigator Program under grant N00014-99-1-0670.

2Work supported by Hong Kong Research Grant Council under grant HKUST6046=00E.

0167-6911/02/$ - see front matter c

PII: S 0167-6911(02)00176-7

66 Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 – 78

control has been extensively studied and has found wide applications in slow processes (see, e.g., [12,13] for

a survey). The development of the model predictive control also contributes to the characterization of C(T)

for discrete-time systems. On the contrary, for continuous-time systems, since the time optimal control is

generally impossible to compute except by numerical approximation, there has been no result on the explicit

or analytical characterization of C(T)orCof exponentially unstable systems. There are however numerical

algorithms available to obtain approximations of Cbased on some partial properties about the boundary of C

for second-order systems (e.g., [16,17]). There are also numerical methods for testing if a particular point in

the state space is inside C(e.g., [5]). In this paper, we will focus on the analytic characterization of Cfor

general linear systems.

We recall that a linear system is said to be anti-stable if all its poles are in the open right-half plane and

semi-stable if all its poles are in the closed left-half plane.

For a semi-stable linear system, it is well-known [11,14,15] that the null controllable region is the whole

state-space as long as the system is controllable in the usual linear system sense. For a general system with

exponentially unstable modes, there exists a nice decomposition result concerning the null controllable region

[4]. Suppose such a system is decomposed into the sum of a controllable semi-stable subsystem and an

anti-stable subsystem, then the null controllable region of the whole system is the Cartesian product of the

null controllable region of the rst subsystem, which is its whole state space, and that of the second subsystem,

which is a bounded convex open set.

However, little was known about the null controllable region of an anti-stable system. This paper is dedicated

to solving this problem. We will give simple exact descriptions of the null controllable region of a general

anti-stable linear system with saturating actuators in terms of a set of extremal trajectories of its time reversed

system. This set of extremal trajectories is particularly easy to describe for low order systems or systems with

only real eigenvalues. For example, for a second-order system, the boundary of its null controllable region is

covered by at most two extremal trajectories; and for a third-order system, the set of extremal trajectories can

be described in terms of parameters in a real interval.

The remainder of the paper is organized as follows. Section 2 contains some preliminaries and denitions

of notation. Section 3 gives a simple exact description of the null controllable regions of anti-stable linear

systems with bounded controls. Section 4 draws a brief conclusion to this paper.

2. Preliminaries and notation

Consider a linear system

˙x(t)=Ax(t)+Bu(t);(1)

where x(t)∈Rnis the state and u(t)∈Rmis the control. Let

Ua={u:uis measurable and u(t)∞61;∀t∈R};(2)

where u(t)∞= maxi|ui(t)|. A control signal uis said to be admissible if u∈Ua. In this paper, we are

interested in the control of system (1) by using admissible controls. Our concern is the set of states that can

be steered to the origin by admissible controls.

Denition 2.1. A state x0is said to be null controllable if there exist a T∈[0;∞) and an admissible control u

such that the state trajectory x(t) of the system satises x(0)= x0and x(T)= 0. The set of all null controllable

states is called the null controllable region of the system and is denoted by C.

Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 – 78 67

With the above denition, we see that x0∈Cif and only if there exist T∈[0;∞)andau∈Uasuch that

0=x(T)=e

AT x0+T

eA(T−)Bu()d=e

AT x0+T

e−ABu()d:

It follows that

C=

T∈[0;∞)x=−T

e−ABu()d:u∈Ua:(3)

The minus sign “−” before the integral can be removed since Uais a symmetric set. In what follows we

recall from the literature some existing results on the characterization of the null controllable region.

Proposition 2.1. Assume that (A; B)is controllable.

(a) If A is semi-stable;then C=Rn.

(b) If A is anti-stable;then Cis a bounded convex open set containing the origin.

A=A10

0A2

with A1∈Rn1×n1anti-stable and A2∈Rn2×n2semi-stable;and B is partitioned as

B1

B2

accordingly;then C=C1×Rn2where C1is the null controllable region of the anti-stable system

˙x1(t)=A1x1(t)+B1u(t).

Statement (a) is well-known [11,14,15]. Statements (b) and (c) are proven in [4]. Because of this propo-

sition, we can concentrate on the study of null controllable regions of anti-stable systems. For this kind of

systems,



C=x=∞

e−ABu()d:u∈Ua;(4)

where 

Cdenotes the closure of C. We will use “@” to denote the boundary of a set. In [5], a nonlinear

programming based algorithm is proposed to test if a point in the state space belongs to C. In Section 3, we

will derive a method for explicitly describing @C. To this end, we will need some more preliminaries.

If B=[b1b2::: b

m] and the null controllable region of the system ˙x(t)=Ax(t)+biui(t)isCi,i=1;:::;m,

then



i=1

Ci={x1+x2+···+xm:xi∈Ci;i=1;2;:::;m}:(5)

In view of (5) and Proposition 2.1, in the study of the null controllable regions we will assume, without loss

of generality, that (A; B) is controllable, Ais anti-stable, and m= 1. For clarity, we rename Bas b.

For a general system

˙x=f(x; u);(6)

its time reversed system is

˙z=−f(z; v):(7)

It is easy to see that x(t) solves (6) with x(0) = x0;x(t1)=x1, and certain uif and only if z(t)=x(t1−t)

solves (7) with z(0)= x1,z(t1)=x0, and v(t)=u(t1−t). The two systems have the same curves as trajectories,

but traverse in opposite directions.

68 Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 – 78

Consider the time reversed system of (1):

˙z(t)=−Az(t)−bv(t):(8)

Denition 2.2. A state zfis said to be reachable if there exist T∈[0;∞) and an admissible control vsuch

that the state trajectory z(t) of system (8) satises z(0) = 0 and z(T)=zf. The set of all reachable states is

called the reachable region of system (8) and is denoted by R.

It is known that Cof (1) is the same as Rof (8) (see, e.g., [11]). To avoid confusion, we will continue

to use the notation x; u and Cfor the original system (1), and z; v and Rfor the time-reversed system (8).

3. Null controllable regions

In Section 3.1, we show that the boundary of the null controllable region of a general anti-stable linear

system with saturating actuator is composed of a set of extremal trajectories of the time reversed system. The

descriptions of this set are further simplied for systems with only real poles and for systems with complex

poles in Sections 3.2 and 3.3, respectively.

3.1. Description of the null controllable regions

We will characterize the null controllable region Cof system (1) through studying the reachable region R

of its time reversed system (8).

Since Ais anti-stable, we have



R=z=∞

e−Abv()d:v∈Ua=z=0

−∞

eAbv()d:v∈Ua:

The change of integration interval from [0;∞]to[−∞;0] is crucial to our development, as will be clear

from Eq. (17). Noticing that eA =e

−A(0−), we see that a point zin 

Ris a state of the time-reversed system

(8) at t= 0 by applying an admissible control vfrom −∞ to 0.

Theorem 3.1.

@R=z=0

−∞

eAbsgn(ceAb)d:c∈Rn\{0}:(9)



Ris strictly convex. Moreover;for each z∗∈@R;there exists a unique admissible control v∗such that

z∗=0

−∞

eAbv∗()d: (10)

Remark 3.1. We give some simple facts about convex sets in this remark. Consider a closed set S.IfSis

convex and z∗∈@S; then by separation theorem;there exists a hyperplane cz=kthat is tangential to @S at

z∗and the set Slies completely to one side of the hyperplane;i.e.;

cz6k=cz∗;∀z∈S:

A set Sis said to be strictly convex if it is convex and for any two points z1;z

2∈@S,z1+(1−)z2∈ @S for

all ∈(0;1). This is equivalent to saying that any hyperplane that is tangential to @S has only one intersection

point with @S, or, for any c= 0, there exists a unique z∗∈@S such that cz∗= maxz∈Scz.

Tingshu Hu et al. / Systems & Control Letters 47 (2002) 65 – 78 69

Proof of Theorem 1. First;the convexity of 

Rcan easily be veried by denition. Let z∗∈@R. Then;there

exists a nonzero vector c∈Rnsuch that

cz∗= max

z∈

cz= max

v∈Ua0

−∞

ceAbv()d: (11)

Since c= 0 and (A; b) is controllable;c

eAb≡ 0. Since ceAt bhas a nite number of zeros in any nite

interval;

({t:ceAt b=0})=0;(12)

where (·) denotes the measure of a set.

It is easy to see that

v∗(t) = sgn(ceAt b)

maximizes the right-hand side of (11). We maintain that v∗is the unique optimal solution of (11). To verify

this, we need to show that for any v∈Ua;v=v∗,

0

−∞

ceAbv∗()d¿0

−∞

ceAbv()d: (13)

Since v=v∗, there are a set E1⊂[−∞;0] with nonzero measure, i.e., (E1)=1¿0, and a number 1¿0

such that

|v(t)−v∗(t)|¿1;∀t∈E1:

By (12), there exist a set E⊂E1, with (E)=¿0, and a positive number ¿0 such that

|ceAt b|¿; ∀t∈E:

Noting that v∈Ua, we have

ceAt b(v∗(t)−v(t))¿0;∀t∈[−∞;0]:

It then follows that

0

−∞

ceAb(v∗()−v()) d

¿E

ceAb(v∗()−v(t)) d=E

|ceAb||v∗()−v()|d¿1¿0:

This shows that v∗(t) is the unique optimal solution of (11) and hence the unique admissible control satisfying

z∗=0

−∞

eAbv∗()d: (14)

On the other hand, if

z∗=0

−∞

eAbsgn(ceAb)d

for some nonzero c, then obviously

cz∗= max

z∈

cz:

This shows that z∗∈@Rand we have (9).

Since for each c= 0, the optimal solution v∗(t) and z∗of (11) is unique, we see that 

Ris strictly convex.

1 / 14 100%

Documents connexes

V rification symbolique de programmes manipulant des pointeurs

14th_ICSIA_WA04_BLA

CS221 Practice Midterm - Autumn 2012

Open access

Tail Calls Slides

Open access

Merci pour votre participation!

Faire une suggestion

Avez-vous trouvé des erreurs dans l'interface ou les textes ? Ou savez-vous comment améliorer l'interface utilisateur de StudyLib ? N'hésitez pas à envoyer vos suggestions. C'est très important pour nous!

GDPR Confidentialité Conditions d''utilisation

Null Controllable Regions of Linear Systems

Documents connexes

Faire une suggestion

Produits

Assistance

Produits

Assistance

Null Controllable Regions of Linear Systems

Documents connexes

Faire une suggestion

Produits

Assistance

Ajouter ce document à la (aux) collections

Ajouter ce document à enregistré

Suggérez-nous comment améliorer StudyLib