On Quantized Consensus by Means of Gossip Algorithm - Part II: Convergence Time - Lavaei2009p82762009_American_Control_Conference_Vols

On Quantized Consensus by Means of

Gossip Algorithm – Part II: Convergence Time

Javad Lavaei and Richard M. Murray

Abstract

— This paper deals with the distributed averaging

problem over a connected network of agents, subject to a

quantization constraint. It is assumed that at each time update,

only a pair of agents can update their own numbers in terms

of the quantized data being exchanged. The agents are also

required to communicate with one another in a stochastic

fashion. In the first part of the paper, it was shown that

the quantized consensus is reached by means of a stochastic

gossip algorithm proposed in a recent paper, for any arbitrary

quantization. The current part of the paper considers the

expected value of the time at which the quantized consensus

is reached. This quantity (corresponding to the worst case) is

lower and upper bounded in terms of the topology of the graph,

for uniform quantization. In particular, it is shown that the

upper bound is related to the principal minors of the weighted

Laplacian matrix. A convex optimization is also proposed to

determine the set of probabilities (used to pick a pair of agents)

which leads to the fast convergence of the gossip algorithm.

I. I

NTRODUCTION

During the past few decades, there has been a particular

interest in the area of distributed computations, which aims

to compute some quantity over a network of processors in

a decentralized fashion [1], [2]. The distributed averaging

problem, as a particular case, is concerned with computing

the average of numbers owned by the agents of a group

[3], [4]. This problem has been investigated through the

notion of consensus in several papers, motivated by different

applications [5], [6], [7], [8], [9], [10]. For instance, the

synchronization of coupled oscillators, arising in biophysics,

neurobiology, and systems biology, is studied in [5] and

[6] to explore how to reach a consensus on the oscillation

frequencies of all agents. Moreover, the problem of aligning

the heading angles of a group of mobile agents (e.g. a flock of

birds) is treated in [11]. Given a sensor network comprising

a set of sensors measuring the same quantity in a noisy

environment, the problem of consensus on state estimates

is discussed in [12]. The consensus problem for networks

of dynamic agents with fixed and switching topologies is

tackled in [3], where it is shown that the convergence rate

is related to the algebraic connectivity of the network. The

work [13] elaborates on the relationship between the amount

of information exchanged by the agents and the rate of

convergence to the consensus. A more complete survey on

this topic is given in the recent paper [4].

Consider the distributed average consensus in which the

values owned by the agents are to be averaged in a distributed

This work has been supported by AFOSR and Air Force MURI.

The authors are with the Department of Control and Dynamical

Systems, California Institute of Technology, Pasadena, USA (emails:

lavaei@cds.caltech.edu; murray@cds.caltech.edu).

fashion. Since it may turn out in some applications that all

agents cannot update their numbers synchronously, the gossip

algorithm has been widely exploited by researchers to handle

the averaging problem asynchronously [14], [15]. This type

of algorithm selects a pair of agents at each time instance,

and updates their values based on some averaging policy.

The consensus problem in the context of gossip algorithm has

been thoroughly investigated in the literature [16], [17], [18],

[19]. For instance, the work [16] studies the convergence of a

general randomized gossip algorithm, and derives conditions

under which the algorithm converges. That paper also shows

that the averaging time of a gossip algorithm depends on

the second largest eigenvalue of a doubly stochastic matrix

characterizing the algorithm.

In light of communication constraints, the data being

exchanged between each pair of agents is normally quan-

tized. This has given rise to the emergence of quantized

gossip algorithms. The notion of quantized consensus is

defined in [18] for the case when quantized values (inte-

gers) are to be averaged over a connected network with

digital communication channels. This paper shows that the

quantized gossip algorithm leads to reaching the quantized

consensus. This result is extended in [19] to the case when

the quantization is uniform, and the initial numbers owned

by the agents are reals (as opposed to being integers). The

paper [19] shows that the quantized gossip algorithm works

for a particular choice of the updating parameter, although

it conjunctures that this result is true for a wide range of

updating parameters. A related paper on quantized consensus

gives a synchronous algorithm in order to reach a consensus

with arbitrary precision, at the cost of not preserving the

average of the initial numbers [20].

In this paper, a weighted connected graph is considered

together with a set of scalars sitting on its vertices. The

weight of each edge represents the probability of establishing

a communication between its corresponding vertices through

the updating procedure. It was shown in Part I of this work

that the quantized consensus is reached under the stochastic

gossip algorithm proposed in [19], for a wide range of

updating parameters and any arbitrary quantizer including

uniform and logarithmic ones [21]. The current part of the

paper is concerned with the convergence time of the gossip

algorithm. More precisely, consider the expected value of

the time at which the consensus is reached, and take its

maximum over all possible initial states belonging to a given

hypercube. Lower and upper bounds on the this quantity are

provided for a uniform quantizer, which turn out to be related

to the Laplacian of the weighted graph. The upper bound is

2009 American Control Conference

Hyatt Regency Riverfront, St. Louis, MO, USA

June 10-12, 2009

ThB10.6

2958

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

then minimized in order to obtain the best weights resulting

in a small convergence time. To do so, a convex optimization

problem is proposed, which can be solved by a semidefinite

program.

This paper is organized as follows. Some preliminaries

are presented in Section II, and the problem is formulated

accordingly. The main results on the convergence time are

provided in Section III. The results are then illustrated

in Section IV with a numerical example. Later on, some

concluding remarks are drawn in Section V. A number of

proofs are finally given in an appendix.

II. P

ROBLEM FORMULATION AND PRELIMINARIES

Consider a connected weighted graph

= (

)

where:

•

{

, v

, ..., v

}

is the set of vertices of

;

•

is the set of undirected edges of

;

•

{

}

i,j

is the set of weights assigned to the edges

Assume that:

•

The quantity

∑

is equal to 1, where the sum is

taken over all numbers

i, j

∈

{

, ..., ν

}

such

that

≤

•

The number

(

i, j

∈

) is equal to zero if

(

i, j

)

6∈E

; otherwise, it is strictly positive. In particular,

, p

, ..., p

νν

are all equal to zero.

The scalar

associated with the edge

(

i, j

)

represents the

probability of choosing the edge

(

i, j

)

when an edge of

is to be picked at random. Suppose that a real number

has been assigned to the vertex

, for all

∈

. Let

(

) :

→

be a general quantization operator characterized as

follows:

(

) =

{

∈

[

]

∈

(

, L

]

∀

∈

(1)

where

{

}

∞

−∞

is a monotonically increasing sequence of

integers representing the quantization levels, and:

∀

∈

(2)

Note that

denotes the set of integers. In what follows, a

quantized stochastic gossip algorithm is presented [19].

Stochastic Gossip (SG) Algorithm:

Step 1

: Given a positive real

, set

= 0

. Define

[0] :=

for any

∈

Step 2

: Pick an edge of

at random.

Step 3

: Suppose that the ending vertices of the edge selected

in Step 2 possess the values

[

]

and

[

]

. Perform the

following updates:

[

+ 1] =

[

] +

(

[

])

−

(

[

])

)

[

+ 1] =

[

] +

(

[

])

−

(

[

])

)

[

+ 1] =

[

]

∀

∈

i, j

}

(3)

Step 4

: Increase

by 1 and jump to Step 2.

Let the short-hand notation:

[

] =

[

]

[

]

···

[

]

, k

∈

(4)

be used hereafter. Observe that the SG algorithm is stochastic

in the sense that an edge must be chosen

at random

each time update. The deterministic version of this algorithm,

referred to as the deterministic gossip (DG) algorithm, can

be obtained by replacing its step 2 with the following:

Step 2

: Pick an edge of

arbitrarily

(at the discretion of

the user).

Throughout this paper, the symbol

(

)

refers to the

weighted graph

, whereas the symbol

(

)

refers to the

graph

with the weights on its edges removed.

Definition 1:

Given a quantization-based protocol

act-

ing on

(

)

, denote with

[

]

∈

∪{

}

, the vector

of values on the vertices of

at time

, obtained using this

protocol. It is said that the

quantized consensus

is reached

for the graph

under the protocol

if for every arbitrary

initial state

[0]

∈

, there exist a natural number

and

an integer

such that either of the following sets of relations

holds:







∑

[

] =

∑

[0]

[

]

∈

[

, L

]

∀

≥

∀

∈

(5)

or:







∑

[

] =

∑

[0]

[

]

∈

(

]

∀

≥

∀

∈

(6)

In line with the above definition, if the protocol

stochastic, one would say that the quantized consensus is

reached

almost surely

if there exists a number

∈

with probability 1, for which either of the relations (5)

or (6) holds. For simplicity, the short name

consensus

used hereafter in order to refer to

quantized consensus

. A few

definitions and notations will be introduced in the sequel.

Definition 2:

Define

to be the set of all

-tuple

(

, α

, ..., α

)

such that

∈

[

min

, x

max

]

and, in addition,

−

is an integer multiple of

, for all

∈

, where:

max

:= max

∈

, x

min

:= min

∈

(7)

(The notations

d·e

and

b·c

stand for the ceiling and floor

operators, respectively).

Definition 3:

Let

and

be:

= max

∈

Z

s.t.

≤

ave

= min

∈

Z

s.t.

≥

ave

(8)

where

ave

1

2

···

Definition 4:

Define:

{

(

, α

, ..., α

)

∈S

∣

∈

(

, η

]

∀

∈

}

Furthermore, let

(

)

∈

, be defined as the set of all

-tuple

(

, α

, ..., α

)

∈S

such that:

∈

(

−

(

−

)

(

−

)

]

∀

∈

(9)

2959

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

Definition 5:

Define the distance function

(

) :

S→

as:

(

) := min

∈S

−

∀

∈S

(10)

where

|·|

denotes the

norm. Define also the distance

function

(

))

in the same vein.

The next result was proved in Part I of the paper [21].

Theorem 1:

Given

∈

, apply the SG algorithm to

the graph

(

)

with the initial state

[0]

. There exists

a positive number

for which one of the following cases

takes place with probability 1:

[

]

belongs to the set

, for every

≥

ii)

[

]

belongs to the set

(

)

, for every

≥

iii)

[

]

belongs to the set

(

)

, for every

≥

It is noteworthy that the above theorem proves reaching the

consensus and, besides, describes the steady-state behavior

of the system. In order to study the convergence time of the

SG algorithm, it is desired to find lower and upper bounds

on the quantity

introduced in Theorem 1. Throughout the

rest of this paper, assume that

(

)

is a uniform quantizer,

i.e., it rounds each real number

to its nearest integer (by

convention, assume that

(

+ 0

5) =

, for all integers

III. C

ONVERGENCE TIME

Since the SG algorithm is stochastic, the quantity

introduced in Theorem 1 is a random variable. Define

(

)

to be equal to

max

E

{

}

, where the maximum is taken

over all initial states

[0]

∈

[

min

, x

max

]

that belong to

none of the steady-state sets

(

)

and

(

)

(note

that

E

{·}

denotes the expectation operator). The term

(

)

indeed quantifies the expected value of the convergence time

in the worst case. This section aims to characterize

(

)

in terms of

min

, x

max

, ε

, and the topology of the graph

(together with the probabilities assigned to its edges).

Definition 6:

Given an integer

, apply the SG (or DS)

algorithm to the graph

(

)

with the initial state

[0]

The action of choosing an edge at time

(

∈

) in step 2

of the SG (or DS) algorithm is said to be a

positive action

with respect to (w.r.t.)

(

+ 0

5))

if the inequality:

(

[

]

(

+ 0

5))

< d

(

[

−

(

+ 0

5))

(11)

holds; otherwise, it is called a

trivial action

, meaning the

following (see Lemma 2 in [21]):

(

[

]

(

+ 0

5)) =

(

[

−

(

+ 0

5))

(12)

Remark 1:

Regarding Definition 6, one can observe that

there is a reduction in the Lyapunov function

(

5))

by at least 1 during each positive action. Moreover,

having assumed that the vertices

and

are chosen at

time

, where

[

−

> x

[

−

, it is straightforward to

show that a positive action occurs at this time if and only if

either of the following sets of relations holds:

•

[

−

> r

+ 0

5 +

and

[

−

≤

+ 0

; or

•

[

−

> r

+ 0

and

[

−

≤

+ 0

−

Definition 7:

Given

∈

and

∈

, define

r,ε

(

)

to be the time at which the first positive action is taken w.r.t.

(

+ 0

5))

, provided the SG algorithm is applied to

the graph

(

)

with the initial state

. Notice that

since the SG algorithm is stochastic,

r,ε

(

)

is a random

variable.

Definition 8:

Given

∈

, let

denote an infinite

sequence whose elements all belong to

(i.e.

is an infinite

sequence of edges). Define

r,ε

(

∣

)

to be equal to the time

when the first positive action is taken w.r.t.

(

5))

provided that the DG algorithm is applied to the graph

(

)

with the initial state

, where the edge selected at

time

in step 2 of this algorithm is indeed the

element

, for all

∈

Definition 9:

For every integer

and infinite sequence of

edges

, define:

(

ε, r,

) := max

r,ε

(

∣

)

(13)

where the maximum is taken over all

-tuple

[

···

]

∈

[

min

, x

max

]

with the following prop-

erties:

•

6∈S

(

+ 0

;

•

There exist

i, j

∈

such that:

–

> r

+ 0

5 +

and

≤

+ 0

; or

–

> r

+ 0

and

≤

+ 0

−

Theorem 2:

Given an integer

and an infinite sequence

of edges

, there exists a vector

[

···

]

such that:

(

ε, r,

) =

(

∣

)

(14a)

{

, α

, ..., α

}

{

, ....,

}

(14b)

where that the number

appears

−

times in the set

given above

Proof:

The proof is based on a series of Lemmas provided

in the appendix.

Theorem 2 states that the quantity

(

ε, r,

)

, introduced in

Definition 9, is independent of

and

. Instead, it is continent

upon only

and

Define

as follows:

Φ := max

E

{

(

)

}

(15)

where the maximum is taken over all

-tuple

[

···

]

satisfying the relation

{

, α

, ..., α

}

{

, ....,

}

in which the value

appears

−

times. The following theorem presents one of the main

results of this section.

Theorem 3:

Given a real

∈

, the quantity

(

)

can be lower and upper bounded as follows:

≤

(

)

≤

(

max

−

min

+ 2)Φ

(16)

Proof:

It follows immediately from the definition of

that

this quantity is a lower bound on

(

)

. To prove the other

part of the inequality, assume with no loss of generality that:

ave

−

min

≥

max

−

ave

(17)

Recall that the proof of Theorem 2 in Part I of the paper

introduces two storage functions

[

]

and

[

]

. It also

suggests minimizing

[

]

until it reaches a constant level,

2960

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

and subsequently minimizing

[

]

(in other words, the edge

being selected in step 2 of the algorithm at each update

is to be chosen in such a way that the storage function is

minimized). It follows from inequality (17) and the relation

ave

−

|≤

that:

[0]

≤

∑

−

≤

ave

−

min

+ 1

(18)

At time

where the minimum of

[

]

is reached,

two possibilities can occur according to Lemma 3 in Part I

of the paper. The first one is that

[

]

∈S

(

)

, which

implies that the consensus is reached, and there is no need

to minimize

[

]

any longer. The second scenario is that

the relation

[

]

> η

holds for all

∈

. Assume that the

latter one is the case. It is easy to verify that:

[

]

≤

∑

[

]

−

≤

max

−

ave

+ 1

(19)

As a result:

[0] +

[

]

≤

(

max

−

min

+ 2)

(20)

On the other hand, the aforementioned discussion indicates

that at most

[0] +

[

]

positive actions are required

to reach the consensus. Moreover, it can be inferred from

Theorem 2 that the expected value of the time between two

consecutive positive actions is at most equal to

(until the

consensus is reached). These facts along with inequality (20)

complete the proof.

Theorem 3 implies that

(

)

can be upper bounded by a

term which is proportional to the inverse of

. The question

arises: how to compute

systematically? This is addressed

in the sequel. The following definitions/notations will be

convenient to proceed with the development of the paper.

•

Let

be the Laplacian of the weighted graph

. In other

words,

is a

matrix whose

(

i, j

)

off-diagonal

entry,

i, j

∈

, i

, is equal to

−

and its

(

i, i

)

diagonal entry,

∈

, is equal to

∑

•

For every

∈

and

∈

, define

∼

to be the

matrix obtained from

by removing its

row and

column.

Theorem 4:

The quantity

can be obtained as follows:

Φ = max

∈

∣

(

∼

)

−

∣

∞

(21)

where

|·|

∞

stands for the infinity norm, and

∈

−

a vector of 1’s.

Proof:

For every

i, j

∈

, i

, let

denote a

dimensional vector whose elements are all equal to

except for the

and

entries which are

and

respectively. In addition, denote the quantity

E

{

(

)

}

with

(

i, j

)

for simplicity. The goal is to contrive a recursive

equation characterizing

(

i, j

)

. To this end, consider the

graph

with the initial state

. The expected value of

the time at which the first positive action is taken (under the

SG algorithm) is, by definition, equal to

(

i, j

)

. To count

this number in another way, run the algorithm one iteration.

Assume that the edge

is chosen in this iteration, which

leads to the following possibilities:

•

is equal to the edge

(

i, k

)

, for some

∈

}

this case, due to the equality

[0] =

, the vector

[1]

is obtained as

. Hence, it is expected to take the first

positive action after

(

k, j

)

time updates (in addition

to the first time update taken at the beginning).

•

is equal to the edge

(

i, j

)

This means that a positive

action is already taken at the first time update.

•

is equal to the edge

(

k, l

)

, for some

k, l

∈

}

In this case, it is easy to show that

[1] =

[0] =

This implies that it is expected to take the first positive

action after

(

i, j

)

time updates (other than the first one

already taken).

The above reasoning yields the recursive equation:

(

i, j

) = 1 +

∑

∈

}

(

k, j

)

(

−

∑

)

(

i, j

)

∀

∈

}

(22)

Stack the scalars

, j

)

, ...,

(

−

, j

)

(

+ 1

, j

)

...,

(

ν, j

)

in a column and denote the resulting vector with

(

)

∈

−

. Equation (22) can be re-arranged as:

∼

(

) =

∀

∈

(23)

Therefore:

Φ = max

i,j

∈

, i

(

i, j

) = max

∈

(

)

∞

= max

∈

(

∼

)

−

∞

(24)

This completes the proof.

The results of Theorems 3 and 4 can be combined to

explicitly bound the quantity

(

)

as follows:

max

∈

(

∼

)

−

∞

≤

(

)

(25)

and

(

)

≤

(

max

−

min

+ 2)

max

∈

(

∼

)

−

∞

(26)

The next theorem directly relates the upper bound on

(

)

to the spectral of the principal submatrices of the Laplacian.

Theorem 5:

The scalar

(

)

satisfies the following in-

equality:

(

)

≤

√

−

max

−

min

+ 2)

(

max

∈

min

{

∼

}

)

(27)

where

min

(

)

represents the smallest eigenvalue of a matrix.

Proof:

One can write:

(

∼

)

−

∞

≤|

(

∼

)

−

∞

(

∼

)

−

∞

≤

√

−

(

∼

)

−

(28)

where

|·|

stands for the 2-norm. Since the weighted Lapla-

cian matrix

is positive semi-definite (PSD), its principal

submatrix

∼

is PSD too. As a result, it can be deduced

from the above inequality that:

(

∼

)

−

∞

≤

√

−

min

{

∼

}

(29)

2961

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

The proof is completed by combining inequalities (26)

and (29).

Remark 2:

Theorem 5 states that the convergence time

(

)

is related to the

(

−

order submatrices of the

Laplacian of the graph (i.e.

∼

∈

), rather than

the Laplacian itself. Let

(

)

denote the second smallest

eigenvalue of

. Since the graph

is connected,

(

)

is strictly positive. Now, the interlacing theorem can be

exploited to argue that:

< λ

min

{

∼

}≤

(

)

(30)

This means that unlike the unquantized consensus whose

convergence mainly depends on

(

)

, a more subtle de-

pendency on

(

)

is governed for the quantized case (in

fact,

is not directly related to

(

)

Remark 3:

The quantity

introduced in this section is

an important parameter, which characterizes the expected

value of the maximum number of iterations that must be

followed until a positive action is taken. The derived lower

and upper bounds on

(

)

have been related to this quantity

corresponding to the worst case. Since the graph system is

stochastic, it is unlikely in practice that the system operates

in the worst case, which implies that the actual convergence

time of a random system may be much better than the bounds

obtained here. Nevertheless, these bounds are instrumental in

understating the worst-case behavior of the system and how

the convergence time is related to the topology of the graph.

A. Special graphs

This subsection aims to obtain lower and upper bounds on

the quantity

(

)

for both complete and path graphs in the

case when all edges have the same weight. In this regard,

assume that each edge is associated with the same weight

Corollary 1:

For a complete graph

with equally

weighted edges, the following inequality holds:

(

−

≤

(

)

≤

(

−

1)(

max

−

min

+ 2)

(31)

Proof:

The weight

for this graph is equal to

(

−

Using this fact and by means of Theorems 3 and 4, it is

straightforward to show the validity of inequality (31). The

details are omitted here.

Corollary 2:

Let

be a path graph with equally weighted

edges so that

is connected to

, for all

∈{

, ..., ν

−

}

(these are the only edges of the graph). The inequality

given below holds:

(

−

≤

(

)

≤

(

−

(

max

−

min

+ 2)

(32)

Proof:

Since the graph

has

−

edges, the weight

equal to

−

. On the other hand, it is easy to show that

Φ =

, ν

)

. This leads to the following recursive equations

(in light of (22)):

, ν

)

−

, ν

) = 1

(33a)

−

(

−

, ν

) + 2

(

i, ν

)

−

(

+ 1

, ν

) = 1

(33b)

−

(

−

, ν

) + 2

(

−

, ν

) = 1

(33c)

where the argument

in equation (33b) belongs to the set

{

, ..., ν

−

}

. Adding up these equalities results in the

relation:

(

−

, ν

) =

−

(34)

The (backward) recursive equation (33b) can be solved using

conventional techniques to conclude that there exist two

constants

and

such that:

(

i, ν

) =

−

, i

−

, ν

−

, ...,

(35)

One can employ the final conditions given by (33c) and (34)

to arrive at:

−

, b

(36)

This implies that:

Φ =

, ν

) =

−

(

−

(37)

The proof follows immediately from the above equation and

Theorem 3.

B. Optimal edge weights

In this subsection, it is desired to find out what probabil-

ities the edges of

should possess so that the consensus is

reached quickly. For this purpose, observe that the quantity

(

)

has been related to the spectral of the submatrices of

the Laplacian in (27). Having fixed

min

max

and

, the

provided upper bound on

(

)

is a multiple of the term:

max

∈

min

{

∼

}

(38)

Therefore, it is desired to minimize the function (38) over all

possible (discrete) probability distributions captured by

for

the sake of finding a sub-optimal edge-selection probability

distribution. This is accomplished in the sequel.

Problem 1:

Minimize the scalar variable

−

subject to the

constraints:

min

{

∼

}≥

μ, i

= 1

, ..., ν

(39)

where

is a matrix variable representing the Laplacian of

the weighted graph

. Denote the global minimizer of this

optimization with

(

∗

, P

∗

)

(note that there are some implicit

constraints stating that the weights on the edges are positive

and sum up to 1).

Since the operator

min

(

)

is concave w.r.t. its symmetric

argument, it is easy to show that Problem 1 is convex. More

precisely, the constraint

min

{

∼

}≥

can be expressed

∼

μI

, which is a semidefinite constraint. Hence, the

solution

∗

can be found efficiently. On the other hand, one

can verify that:

∗

= max

min

{

∼

}

(40)

or equivalently:

∗

= min

max

min

{

∼

}

(41)

2962

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

This implies that the solution

∗

gives a sub-optimal edge-

selection probability distribution (resulting in the fast con-

vergence of the SG algorithm), because of minimizing the

term given in (38).

IV. N

UMERICAL EXAMPLE

Consider the graph

drawn in Figure 1. The objective

is to find out what probabilities should be assigned to the

edges of

so that the consensus is reached quickly under

the SG algorithm. To this end, let the convex optimization

provided in Problem 1 be solved. This yields the following

probability distribution:

= 0

2087

= 0

1146

= 0

1241

(42)

The quantity

corresponding to this set of edge-selection

Fig. 1. The graph

G

rendered in the numerical example

probabilities turns out to be equal to

1770

. One can make

a comparison with two heuristic methods for designing the

probability set

, which are spelled out below:

•

The most naive approach is to assume that the edges

of the graph are equally weighted. This leads to the

probability

on each edge. The associated quantity

is obtained as

•

Another technique is to devise the probability distribu-

tion

in such a way that all vertices have the same

probability of being chosen at each time update, i.e.:

(43)

Note that

∀

i, j

∈

. The above set of

equations has a unique symmetric solution (complying

with the symmetry of the graph

) as follows:

= 0

(44)

The corresponding

is equal to

Hence, the value of

corresponding to the sub-optimal

solution is better from the ones obtained using these two

rudimentary techniques.

An interesting fact about the edge selection can be seen

in this example. Assume that the graph

does not have the

edge

(i.e. remove this edge). In this case, Problem 1

leads to the following solution:

= 0

3781

, p

= 0

1757

= 0

, p

= 0

1352

(45)

associated with

Φ = 23

1292

. Notice that

= 0

, which

signifies that although a complete graph has the best con-

vergence, if some edges do not exist (e.g. the edge

it might be better to ignore some other edges too (e.g. the

edge

). This is interesting as it reveals the fact that some

communications are redundant.

To compare the value obtained for

with other possible

values, consider the case when all edges have the same

weight. This results in the equality

Φ = 36

. Therefore, there

is a noticeable improvement in the value of

via the solution

of Problem 1.

For the purpose of simulation, the following points have

been randomly generated in the interval

100]

[0] = 20

1185

, x

[0] = 13

6221

, x

[0] = 97

8356

[0] = 45

5033

, x

[0] = 45

9224

(46)

The stochastic gossip algorithm was run 1000 times and the

average of the scalar

was calculated accordingly (note

that

is a random variable). This value for the probability

distribution (45) was obtained as 48.5710, while that for

the identical probability distribution (equal edge weights)

turned out to be 65.3580. This demonstrates that one could

significantly save in the convergence time if the solution of

Problem 1 is employed, which also obviates the usage of the

edge

V. C

ONCLUSIONS

This paper tackles the average consensus problem over

a connected weighted graph subject to a quantization con-

straint. It is assumed that each pair of vertices can be chosen

with a certain probability in order to update their numbers

in term of the quantized data being exchanged. In the first

part of the paper, it was shown that the quantized consensus

is reached under the stochastic gossip algorithm given in a

recent paper. This part of the paper deals with the time at

which the consensus is reached. Lower and upper bounds

on the expected value of this quantity in the worst case

are provided, which depend on the principal minors of the

Laplacian matrix of the weighted graph. These bounds are

explicitly computed for equally weighted complete and path

graphs. Finally, a convex optimization is provided to obtain

a set of weights on the edges of the graph that results in the

fast convergence of the gossip algorithm.

EFERENCES

[1] G. Tel, “Introduction to distributed algorithms,” Cambridge University

Press, 2000.

[2] N. A. Lynch, “Distributed algorithms,” Morgan Kaufmann Publishers,

Inc., San Francisco, CA, 1996.

2963

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

[3] R. Olfati-Saber and R. M. Murray, “Consensus problems in networks

of agents with switching topology and time-delays,”

IEEE Transac-

tions on Automatic Control

, vol. 49, no. 9, pp. 1520-1533, 2004.

[4] R. Olfati-Saber, J. A. Fax and R. M. Murray, “Consensus and coop-

eration in networked multi-agent systems,”

Proceedings of the IEEE

vol. 95, no. 1, pp. 215-233, 2007.

[5] Y. Kuramoto, “Chemical oscillators, waves, and turbulance,” Springer-

Verlag, Berlin, 1984.

[6] S. H. Strogatz, “Exploring complex networks,”

Nature

, vol. 410, pp.

268-276, 2001.

[7] D. P. Bertsekas and J. N. Tsitsiklis, “Parallel and distributed compu-

tation: Numerical methods,” Belmont, MA: Athena Scientific, 1997.

[8] Y. Rabani, A. Sinclair and R. Wanka,“Local divergence of Markov

chains and the analysis of iterative load-balancing schemes,” in

Pro-

ceedings of IEEE Conference on Foundations of Computer Science

1998.

[9] A. V. Savkin, “Coordinated collective motion of groups of autonomous

mobile robots: Analysis of Vicsek

s model,”

IEEE Transactions on

Automatic Control

, vol. 49, no. 6, pp. 981982, 2004.

[10] R. Olfati-Saber, “Flocking for multi-agent dynamic systems: Algo-

rithms and theory,”

IEEE Transactions on Automatic Control

, vol. 51,

no. 3, pp. 401420, 2006.

[11] A. Jadbabaie, J. Lin, and A. S. Morse, “Coordination of groups

of mobile autonomous agents using nearest neighbor rules,”

IEEE

Transactions on Automatic Control

, vol. 48, no. 6, pp. 9881001, 2003.

[12] A. Speranzon, C. Fischione and K.H. Johansson, “Distributed and col-

laborative estimation over wireless sensor networks,” in

Proceedings

of the 45th IEEE Conference on Decision and Control

, 2006.

[13] R. Carli, F. Fagnani, A. Speranzon and S. Zampieri, “Communication

constraints in the average consensus problem,”

Automatica

, vol. 44,

no. 3, pp. 671-684, 2008.

[14] J. Tsitsiklis, “Problems in decentralized decision making and com-

putation,” PhD thesis, Dept. of Electrical Engineering and Computer

Science, M.I.T., Boston, MA, 1984.

[15] S. Boyd, A. Ghosh, B. Prabhakar and D. Shah , “Analysis and

optimization of randomized gossip algorithms,” in

Proceedings of the

43rd IEEE Conference on Decision and Control

, 2004.

[16] S. Boyd, A. Ghosh, B. Prabhakar and D. Shah , “Randomized gossip

algorithms,”

IEEE Transactions on Information Theory

, vol. 52, no.

6, pp. 2508-2530, 2006.

[17] F. Benezit, A. G. Dimakis, P. Thiran and M. Vetterli, “Gossip along the

way: Order-optimal consensus through randomized path averaging,” in

Proceedings of the Allerton Conference on Communication, Control,

and Computing

, 2007.

[18] A. Kashyap, T. Basara and R. Srikanta, “Quantized consensus,”

Automatica

, vol. 43, no. 7, pp. 1192-1203, 2007.

[19] P. Frasca, R. Carli, F. Fagnani and S. Zampieri, “Average consensus

by gossip algorithms with quantized communication,” in

Proceedings

of the 47th IEEE Conference on Decision and Control

, 2008.

[20] A. Censi and R. M. Murray, “A biologically inspired approach to

real-valued average consensus over quantized channels with arbitrary

deterministic accuracy,” in

Proceedings of the 2009 American Control

Conference

, 2009.

[21] J. Lavaei and R. M. Murray, “On quantized consensus by means of

gossip algorithm – Part II: convergence time,” in

Proceedings of the

2009 American Control Conference

, 2009.

PPENDIX

This appendix derives a number of results in order to prove

Theorem 2.

Lemma 1:

Given

∈

and

[

···

]

6∈

(

+ 0

, the following hold for every infinite sequence

of edges

i) If

> r

+ 0

5 +

, then:

r,ε

(

∣

)

≤

r,ε

(

−

ε, α

, ..., α

∣

)

(47)

ii) If

≤

+ 0

−

, then:

r,ε

(

∣

)

≤

r,ε

(

ε, α

, ..., α

∣

)

(48)

Proof:

Let case (i) be proved here, as the other case is

analogous. Apply the DG algorithm to the graph

(

)

that its step 2 selects edges from the sequence

in order.

For the initial states

(

, α

, ..., α

)

and

(

−

ε, α

, ..., α

)

denote the resulting numbers on the vertices of

at time

with

[

] := (

[

]

, w

[

]

, ..., w

[

])

and

[

] :=

( ̄

[

]

[

]

, ...,

[

])

, respectively, for all

∈

∪{

}

Furthermore, for notational simplicity, define:

r,ε

(

−

ε, α

, ..., α

∣

)

(49)

In order to proceed with the proof by contradiction, assume

that there is no positive action w.r.t.

(

+ 0

5))

time instants

, ..., m

, by starting from the initial state

In other words:

(

[0]

(

+ 0

5)) =

(

[

]

(

+ 0

5))

(50)

for any

∈{

, ..., m

}

. Now, one can draw a number of

conclusions as follows:

[

]

is always greater than or equal to

[

]

for every

satisfying the inequality

≤

ii) It is a consequence of property (i) that

[

]

≥

[

]

for every

∈

and

∈{

, ..., m

}

iii) The relation

[

] = ̄

[

]

holds if

[

]

≤

+ 0

[

]

≤

+ 0

, for every

∈

and

∈

{

, ..., m

−

}

. This result can be easily proven by

induction on

, taking property (ii) into account, and

using the equality (50).

Assume that the

element of

is the edge

(

i, j

)

. With

no loss of generality, suppose that

[

−

≥

[

−

Since a positive action occurs at time

w.r.t.

(

5))

by starting from the initial state

[0]

, it follows from

Remark 1 that either of the cases pointed out below occurs:

[

−

> r

+ 0

5 +

and

[

−

≤

+ 0

; or

[

−

> r

+ 0

and

[

−

≤

+ 0

−

Assume that case (a) happens (the other case is similar).

Properties (ii) and (iii) mentioned above yield that:

[

−

≥

[

−

> r

+ 0

5 +

ε,

[

−

1] = ̄

[

−

≤

+ 0

(51)

Since the edge

(

i, j

)

is chosen at time

in step 2 of the

DG algorithm, the above inequalities and Remark 1 signify

that a positive action occurs at time

for the graph

(

)

with the initial state

[0]

. In other words,

r,ε

(

∣

)

must be equal to

, while it has been already assumed that

this quantity is greater than

. This contradiction completes

the proof.

Proposition 1:

Given an integer

and an infinite sequence

of edges

, there exist an integer

∈

and a vector

[

···

]

subject to:

(

ε, r,

) =

r,ε

(

∣

)

(52a)

+ 0

−

ε <α

≤

+ 0

5 +

ε,

∀

∈

}

(52b)

In addition,

satisfies one of the following relations:

+ 0

5 +

ε < α

≤

+ 0

5 + 2

(53)

or:

+ 0

−

ε < α

≤

+ 0

−

(54)

2964

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.

Proof:

The proof follows from Lemma 1. The details

are omitted here for brevity. Note that the reason why

inequality (52) is not satisfied for every

(i.e. there

is a

for which this inequality does not hold) is that

should not belong to

(

+ 0

, in light of Definition 9.

Lemma 2:

Consider a vector

[

···

]

and

an integer

∈

satisfying the relations:

+ 0

5 +

ε < α

≤

+ 0

5 + 2

ε,

+ 0

−

ε < α

≤

+ 0

−

ε < α

≤

+ 0

5 +

ε,

∀

∈

}

(55)

For every infinite sequence of edges

, the inequality given

below holds:

r,ε

(

∣

)

≤

r,ε

(

, α

ε, ..., α

∣

)

(56)

Proof:

Apply the DG algorithm to the graph

(

)

and

select edges in its step 2 from the sequence

succes-

sively. For the initial states

(

, α

, ..., α

)

and

(

, α

ε, ..., α

)

, denote the resulting numbers on the vertices of

at time

with

[

] := (

[

]

, u

[

]

, ..., u

[

])

and

[

] :=

( ̄

[

]

[

]

, ...,

[

])

, respectively, for all

∈

∪{

}

Define also:

r,ε

(

, α

ε, ..., α

∣

)

(57)

For a proof by contradiction, assume that

r,ε

(

∣

)

> g

Observe that:

i) Since

(

) =

+ 1

, for all

∈

(

+ 0

, r

+ 0

5 + 2

]

it can be verified that:

[

] = ̄

[

] =

[

]

[

]

∈

(

+ 0

−

ε, r

+ 0

5 +

]

(58)

for all

∈{

, ..., g

−

}

and

∈

}

ii) Using property (i) and by means of induction on

, one

can show that if

[

]

≤

+ 0

for some

∈

and

∈{

, ..., g

−

}

, then

[

] = ̄

[

]

Let the

element of

be the edge

(

i, j

)

, where

i < j

It results from the definition of

and property (i) that

= 1

and

[

−

≤

+ 0

(this is the only way to generate

a positive action). Therefore, by properties (i) and (ii), one

can write:

[

−

1] = ̄

[

−

≤

+ 0

[

−

> r

+ 0

5 +

(59)

As a result, Remark 1 leads to the conclusion that selecting

the edge

(

i, j

)

at time

results in a positive action for the

graph

(

)

with the initial state

; i.e.

r,ε

(

∣

) =

This contradicts the aforementioned assumption.

Proposition 2:

Consider the objects

and

(

ε, r,

)

introduced in Proposition 1 and Definition 9. There exists a

vector

[

···

]

such that:

(

ε, r,

) =

r,ε

(

∣

)

(60a)

{

, α

, ..., α

}

{

+ 0

−

, r

+ 0

5 +

, ....,

+ 0

5 +

, r

+ 0

5 + 3

}

(60b)

(note that the term

+ 0

5 +

appears

−

times in the

above set).

Proof:

It follows from Proposition 1 and Lemma 2 that

there exist a vector

[

···

]

and integers

, μ

∈

with the properties:

•

(

ε, r,

) =

r,ε

(

∣

)

•

The set of inequalities:

+ 0

5 +

ε < α

1

≤

+ 0

5 + 2

ε,

+ 0

−

ε < α

2

≤

+ 0

< α

≤

+ 0

5 +

ε,

∀

∈

, μ

}

(61)

or:

+ 0

−

ε < α

1

≤

+ 0

−

ε,

+ 0

< α

2

≤

+ 0

5 +

ε,

+ 0

−

ε < α

≤

+ 0

∀

∈

, μ

}

(62)

holds.

Due to the symmetry, one can assume with no loss of

generality that the set of inequalities given in (61) holds.

It is straightforward to show (using the above properties)

that

r,ε

(

∣

)

is unchanged if the following replacements

are made:

•

1

with

+ 0

5 +

;

•

2

with

+ 0

−

;

•

with

+ 0

5 +

, for any

∈

, μ

}

This completes the proof.

The proof of Theorem 2 is a direct consequence of

Proposition 2 given above.

2965

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on April 12,2010 at 17:59:01 UTC from IEEE Xplore. Restrictions apply.