This post walks through the existing work on the $k$-gossip problem in dynamic graphs from [KLO10]. Our main objective is to understand their $\Omega(n\log{k})$ lower bound.
Problem: In the $k$-gossip problem ($k\leq n$) on a $1$-interval connected dynamic graph $\mathcal{G}$, a set of $k$ tokens, $\mathcal{T} = \{1, 2, ..., k\}$, is allocated amongst the $n$ nodes of $\mathcal{G}$ according to a function $f : V → 2^{\mathcal{T}}$ where $|\bigcup_{v\in V}{f(v)}| = k$. The problem is solved if each node terminates with output $k$.
Question 1: Let's first look at the $1$-gossip problem. How quickly can we solve it?
The first thing to try is for every node to broadcast any token it knows to all its neighbors. Since the dynamic graph is $1$-interval connected, there must be at least one new node each round receiving the token. Suppose this was false, then all nodes who have not yet received the token, neighboring a node with a token, would not receive it. The only way this could happen is if all nodes have already received the token, since there can not be any disconnected nodes. Thus, after $n-1$ rounds all nodes are guaranteed to have the token. So we have the following fact.
Fact 1: $1$-gossip can be solved in $n-1$ rounds on $1$-interval connected dynamic graphs.
Notice Fact $1$ holds independent of the chosen adversary controlling the evolution of $\mathcal{G}$. It also implies that any global function which is associative and commutative can be computed in $n-1$ rounds. This can be useful as a primitive in dynamic graph algorithms.
Also by the same logic, $k$-gossip, can be solved in $n-1$ rounds if we allow message sizes of $O(n\log{n})$, since a single node can broadcast multiple tokens in a single round. So in $n-1$ rounds every token will have propagated to every node.
Finally, if we assume messages of size $O(\log{n})$, then we can solve $k$-gossip in $O(nk)$ time, by first sending the token number $1$ to all nodes, and after $n-1$ rounds the node with token number $2$ does the same thing, etc. This will need $k$ rounds. However, this does assume that the nodes know the value of $n$.
Question 2: What is a lower bound for the $k$-gossip problem on $1$-interval connected dynamic graphs?
[KLO10] prove a lower bound on the $k$-gossip problem for an online adaptive adversary.
If a single node knows all the tokens, we know that in $n+k-1$ rounds all nodes will have received all tokens (can be proved by induction). Thus we look at the opposite case when $k$ nodes start with one token each.
We observe that in the earlier rounds lots of nodes learn new tokens, since any token they receive must be new to them.
Observation 1: In the first round of broadcast in the $k$-gossip algorithm, at least $k$ nodes learn a new token.
proof idea. All $k$ nodes with tokens must have at least one neighbor. In the first round any neighbors will not have seen any token other than the ones they possibly have, thus at least $k$ nodes learn a new token.
As rounds progress, it becomes more likely that nodes will receive tokens they have already seen. This motivates a cost scheme on the order in which tokens are discovered for each node; where tokens discovered in earlier rounds are cheaper, since they are easier to find, but tokens discovered in later rounds are more costly.
We can give the $i^{\text{th}}$ token discovered a cost of $1/(k-i+1)$: the first token discovered will have cost $1/k$, whereas the last token discovered will have cost $1$.
We denote the total cost of all discovered tokens in round $r$ as $\Phi(r)$ where by lemma 1 $\Phi(1)=1$ and for the final round $r_f$, we have $\Phi(r_f)=n\cdot(1/k + 1/(k-1) + ... + 1)$ where
$$n\cdot(\frac{1}{2}\log_2{(k+1)}-1)\leq\Phi(r_f)\leq n\cdot\log_2{(k+1)} - 1)$$
The total discovered cost is very useful since if for some instance of $k$-gossip, the best we can do is invariantly increase $\Phi$ by some constant $c$ each round, then we need at least
$$\lceil(\Phi(r_f)-\Phi(1))/c\rceil\geq \frac{n}{c}\cdot(\frac{1}{2}\log_2{(k+1)}-1) -\frac{1}{c}=\Omega(n\log{k})$$
rounds to solve $k$-gossip on that instance. Thus any algorithm for $k$-gossip must have round complexity $\Omega(n\log{k})$. Now we just need to construct a dynamic graph where the total cost from round to round increases by at most some constant.
Note that if a centralized algorithm for $k$-gossip needs $X$ rounds, then a distributed algorithm definitely needs $X$ rounds too since centralized algorithms have access to global information whereas distributed algorithms only have direct access to local information. Therefore, suppose we have a centralized algorithm which decides $t_v(r)$ to be the message broadcast by node $v\in V$ for round $r$. Additionally, we denote the set of tokens known to $v\in V$ in round $r$ as $A_v(r)$.
Observation 2: Given two vertices $u,v\in V$ in round $r$, if $t_u(r)\in A_v(r)$ and $t_v(r)\in A_u(r)$ then $u$ and $v$ do not contribute to each other discovering new tokens. Thus, connecting them by an edge will not change the total cost $\Phi(r)$. We call such an edge $(u,v)\in E(r)$ a free edge.
The first step of our construction of $G_r$ is thus to add all free edges. Suppose after adding all free edges, $G_r$ has connected components $C_1, C_2, ..., C_l$. We want to connect these components while incurring minimal cost. A simple idea is to arbitrarily fix representative nodes $v_1\in C_1, v_2 \in C_2, ..., v_l\in C_l$ for each connected component, and then construct a connected subgraph over $V_l=\{v_1, v_2, ..., v_l\}$ which has constant cost.
Intuition: We want to avoid/be careful about connecting to nodes who know "many" tokens since if one of them discovers a new token, the cost is much higher than when a node knowing "few" tokens discovers a new token.
To formalize the idea of nodes knowing many tokens we define
$$\text{many}=\{v_i\in V_l | v_i \text{ is missing } \leq l/6 \text{ tokens}\}$$
to be the set of all nodes knowing "many" tokens, and
$$\text{few}=\{v_i\in V_l | v_i \text{ is missing } > l/6 \text{ tokens}\}$$ to be the set of all nodes knowing "few" tokens.
From our intuition, it seems reasonable to just connected all nodes in $\text{few}$ in a line since the cost of discovering a token is low. However, for nodes in $\text{many}$, we would ideally like for them to learn no new tokens since the cost is higher. A way for $u\in \text{many}$ to learn no new tokens is to connect it to a $v\in\text{few}$ where $t_v(r)\in A_u(r)$. The question is: can we always do this for every node $u\in\text{many}$? Supposing for a second that we can, then the cost of this construction would contribute is bounded by
$$\sum_{u\in\text{few}}{\sum_{i=1}^{\min(3,\text{missing(u)})}{\frac{1}{(\text{missing}(u)-(i-1)}}}\leq |\text{few}|\cdot\frac{3}{\frac{l}{6} - 2}\leq |\text{few}| \cdot \frac{6}{\frac{l}{6}} \leq l \cdot (36/l) = 36$$
Now, we just need a way to match each $v\in\text{many}$ to a single node $u\in\text{few}$, without $v$ discovering any new tokens. In order to do this we use a counting argument. If $|\text{few}|$ is larger than $|\text{many}|$, then each node $v$ has lots of options for nodes $u$ to connect to, where each $u$ knows few tokens.
Bounding $\text{|many|}$ and $|\text{few}|$:
We easily get that $\sum_{u\in\text{many}}\text{missing}(u) \leq |\text{many}|\cdot l/6$ from the definition of the $\text{many}$ set.
Secondly, note that ${|\text{many}| \choose 2}\leq\sum_{u\in\text{many}}\text{missing}(u)$. If we add an edge $\{u, v\}$ where $u, v\in V_l$, then we know that at least one node will learn a new token since otherwise, $\{u,v\}$ would be a free edge, implying that $u$ and $v$ would be in the same connected component which is a contradiction.
Thus we conclude that ${|\text{many}| \choose 2} \leq |\text{many}|\cdot l/6$. We can solve this inequality to get the bound $|\text{many}| \leq l/3 + 1$, which implies $|\text{few}|=l - |\text{many}| \geq 2l/3 - 1$.
Now, to argue that our matching from $\text{many}$ and $\text{few}$ always exist, we first observe that all nodes in $\text{few}$ will broadcast distinct tokens. This is because if any two broadcast the same token, then we know that there must be a free edge between them, which is a contradiction. For any node $v\in\text{many}$, we know $v$ is missing at most $l/6$ tokens by definition. Hence there are at least $|\text{few}| - l/6$ tokens broadcast collectively by the nodes in $\text{few}$ which $v$ already knows. Finally, note
$$|\text{many}| \leq (2l/3 - 1) - l/6 \leq |\text{few}| - l/6$$
so such a matching must exist, and we are done since this completes the proof that a construction with constant cost increase exists.
Note that this proof only works in the online adaptive adversary case since for the construction given above to be used the adversary needs to know $t_u(r)$ for each node and round, which it cannot know in advance.
[KLO10] Fabian Kuhn, Nancy Lynch, and Rotem Oshman. “Distributed
Computation in Dynamic Networks”. In: Proceedings of the FortySecond ACM Symposium on Theory of Computing. STOC ’10.
Cambridge, Massachusetts, USA: Association for Computing Machinery, 2010, pp. 513–522. isbn: 9781450300506. doi: 10.1145/
1806689.1806760. url: https://doi.org/10.1145/1806689.
1806760.