Core Concepts

Author

Sadamori Kojaku

Published

August 17, 2025

1 What to learn in this module

In this module, we will explore network robustness through the lens of infrastructure design. Starting from the historical challenge of building cost-effective power grids, we will learn:

  • How minimum spanning trees provide optimal cost-efficiency for network connectivity
  • Why real-world networks have redundancies beyond minimum connectivity requirements
  • How networks respond to random failures versus targeted attacks
  • Quantitative measures of network robustness and percolation theory
  • Design principles for balancing cost efficiency with resilience

Keywords: minimum spanning tree, Kruskal’s algorithm, Prim’s algorithm, network redundancy, random failures, targeted attacks, connectivity loss, R-index, percolation, robustness paradox

2 Pen-and-Paper Exercise: From MST to Robust Grid Design

  • ✍️ Pen and Paper Exercise: Starting with a minimum spanning tree for cost efficiency, design a power grid network that maintains connectivity even when key components fail.

3 Network Design Challenges

🚀 Interactive Demo

4 Power Grid Design Challenge

In the aftermath of World War I, the newly formed Czechoslovakia faced massive reconstruction challenges. Cities and towns across Moravia needed electricity, but the young nation had limited resources. Every resource spent on unnecessary infrastructure was a resource not available for hospitals, schools, or economic recovery. Engineers at the West Moravian Power Company faced a critical question: How do you connect every town and village to the electrical grid while using the minimum length of cable?

Otakar Borůvka (1899-1995) was a Czech mathematician who is best known for his work on the minimum spanning tree problem.

The problem reached mathematician Otakar Borůvka through his friend at the power company. Borůvka’s 1926 solution gave us the first systematic approach to what we now call the minimum spanning tree problem: finding the cheapest way to connect all locations in a network.

Minimum Spanning Tree

A minimum spanning tree (MST) of a weighted network is a tree that:

  • Spans all nodes (connects every location in the network)
  • Is a tree (connected with no cycles - no redundant loops)
  • Has minimum total weight among all possible spanning trees

Otakar Borůvka devised the first algorithm to solve this problem: Borůvka’s algorithm. It is not the only one, however. Several algorithms find the minimum spanning tree; we will cover two of them, Kruskal’s algorithm and Prim’s algorithm, which are easier to understand and implement.

Finding the Minimum Spanning Tree

Kruskal’s Algorithm

Kruskal’s algorithm embodies a remarkably simple yet powerful intuition: always choose the cheapest available option, but never create wasteful loops. While this sounds like a greedy heuristic, the algorithm in fact leads to the globally optimal solution!

The algorithm works by first sorting every possible connection from cheapest to most expensive, like arranging all the cable segments by cost. Then, it examines each connection in order, asking a crucial question: “If I add this cable, will it create a redundant loop?” If the answer is no, the cable joins the growing network. If adding it would create a cycle—meaning the two locations are already connected through some other path—the algorithm skips it as wasteful. This process continues until every location is connected, guaranteeing both minimum cost and complete coverage.
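To make this concrete, here is a minimal Python sketch of Kruskal’s algorithm based on a union-find structure. The function name, the edge-list format, and the toy town network are illustrative choices, not code from this module.

```python
# A minimal sketch of Kruskal's algorithm with a union-find structure.
def kruskal_mst(nodes, edges):
    """Return the MST edges of a connected weighted network.

    nodes: iterable of node labels
    edges: list of (u, v, cost) tuples
    """
    parent = {v: v for v in nodes}

    def find(v):
        # Find the root of v's component (with path compression).
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    mst = []
    for u, v, cost in sorted(edges, key=lambda e: e[2]):  # cheapest first
        root_u, root_v = find(u), find(v)
        if root_u != root_v:         # adding this cable creates no loop
            parent[root_u] = root_v  # merge the two components
            mst.append((u, v, cost))
    return mst

# Toy example: four towns and the costs of the candidate cables
towns = ["A", "B", "C", "D"]
cables = [("A", "B", 1), ("B", "C", 2), ("A", "C", 3), ("C", "D", 4), ("B", "D", 5)]
print(kruskal_mst(towns, cables))  # [('A', 'B', 1), ('B', 'C', 2), ('C', 'D', 4)]
```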

Prim’s Algorithm

Prim’s algorithm takes a fundamentally different approach, embodying the intuition of organic growth from a single starting point. Picture an engineer beginning at the central power plant and asking: “What’s the cheapest way to connect one more location to our existing grid?” This local growth strategy builds the network incrementally, always expanding from what’s already been constructed.

The algorithm begins by selecting any location as its starting point, often the power plant in our analogy. From this initial seed, it repeatedly identifies the cheapest connection that would bring a new, unconnected location into the growing network. Unlike Kruskal’s global view, Prim’s algorithm maintains a clear distinction between locations already in the network and those still waiting to be connected. At each step, it finds the minimum-cost bridge between these two groups, gradually expanding the connected region until it encompasses every location.

This local expansion strategy mirrors how many real-world infrastructure projects actually develop. Engineers often start from existing facilities and expand outward, always seeking the most cost-effective way to serve additional areas. Prim’s algorithm formalizes this natural growth process.
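A corresponding sketch of Prim’s algorithm, using a priority queue to repeatedly pick the cheapest bridge out of the growing network. Again, the adjacency-list format and the function name are illustrative.

```python
# A minimal sketch of Prim's algorithm with a priority queue.
import heapq

def prim_mst(adj, start):
    """adj: dict node -> list of (cost, neighbor); start: seed node (e.g., the power plant)."""
    visited = {start}
    heap = [(cost, start, nbr) for cost, nbr in adj[start]]
    heapq.heapify(heap)
    mst = []
    while heap:
        cost, u, v = heapq.heappop(heap)  # cheapest bridge out of the tree
        if v in visited:
            continue                      # v is already connected; skip
        visited.add(v)
        mst.append((u, v, cost))
        for c, w in adj[v]:               # new candidate bridges from v
            if w not in visited:
                heapq.heappush(heap, (c, v, w))
    return mst

# The same toy network as before, written as an adjacency list
adj = {
    "A": [(1, "B"), (3, "C")],
    "B": [(1, "A"), (2, "C"), (5, "D")],
    "C": [(3, "A"), (2, "B"), (4, "D")],
    "D": [(5, "B"), (4, "C")],
}
print(prim_mst(adj, start="A"))  # [('A', 'B', 1), ('B', 'C', 2), ('C', 'D', 4)]
```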

Note

The two algorithms find the same minimum spanning tree when all connection costs are distinct. If some connections share the same cost, there can be multiple minimum spanning trees of equal total cost, and which tree is found depends on the algorithm. In particular, Prim’s algorithm may find different trees when starting from different locations.
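In practice, you rarely implement these algorithms from scratch: networkx exposes Kruskal’s, Prim’s, and Borůvka’s algorithms through a single function. The quick check below uses distinct edge weights, so the minimum spanning tree is unique and both algorithms return a tree of the same total cost.

```python
import networkx as nx

G = nx.Graph()
G.add_weighted_edges_from([
    ("A", "B", 1), ("B", "C", 2), ("A", "C", 3), ("C", "D", 4), ("B", "D", 5),
])

T_kruskal = nx.minimum_spanning_tree(G, algorithm="kruskal")
T_prim = nx.minimum_spanning_tree(G, algorithm="prim")

# Both return a spanning tree with the same minimum total cable length
print(T_kruskal.size(weight="weight"))  # 7.0
print(T_prim.size(weight="weight"))     # 7.0
```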

5 Why Minimum Spanning Tree is Not Enough

A minimum spanning tree is an efficient way to connect all locations in a network at the minimum total cost. However, such a network is vulnerable to failures: because a tree has no redundant paths, the failure of any single non-leaf node disconnects the network, and nodes close to the center are especially damaging to lose. This is why real power grids include many redundancies beyond the minimum spanning tree, for the sake of resilience against failures.

Figure 1: The power grid of the United States.

Measuring Network Damage

Not every failure is equal; some failures are more damaging than others. For example, removing some nodes in a grid network can be catastrophic, while removing other nodes is more tolerable.

While there can be many metrics to quantify network damage, we will focus on a purely topological metric: the fraction of nodes remaining in the largest connected component after removal.

\text{Connectivity} = \frac{\text{Size of largest component after removal}}{\text{Original network size}}
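As a quick illustration of this metric, the sketch below (using networkx; the helper name is illustrative) removes one node at a time from a small network made of two tightly-knit clusters joined by a single bridge node, and reports the fraction of nodes left in the largest component.

```python
# Illustrative sketch of the connectivity metric.
import networkx as nx

def connectivity_after_removal(G, nodes_to_remove):
    """Fraction of the original nodes left in the largest connected component."""
    H = G.copy()
    H.remove_nodes_from(nodes_to_remove)
    if H.number_of_nodes() == 0:
        return 0.0
    largest = max(nx.connected_components(H), key=len)
    return len(largest) / G.number_of_nodes()

# Two 5-node clusters joined by a single bridge node (node 5)
G = nx.barbell_graph(5, 1)
print(connectivity_after_removal(G, [0]))  # peripheral node: ~0.91
print(connectivity_after_removal(G, [5]))  # bridge node: ~0.45
```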

Figure 2: The impact of removing a single node varies based on which node is removed.

The robustness profile plots connectivity against the fraction of nodes removed, revealing how networks fragment. Crucially, the shape of this profile depends entirely on the order in which nodes are removed - random removal creates one pattern, while strategic targeting creates dramatically different patterns.

Figure 3: Robustness profile of a network under a sequential failure of nodes.

To compare networks with a single metric, we use the R-index - the area under this curve:

R = \frac{1}{N} \sum_{k=1}^{N-1} y_k

where y_k is the connectivity (the fraction of nodes in the largest component) after removing k nodes. The R-index measures how robust a network is under a sequential failure of nodes: the higher the R-index, the more robust the network.
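Here is a minimal sketch of how the robustness profile and the R-index can be computed, assuming networkx; the function names and the example network are illustrative. Nodes are removed one at a time in a given order, and the connectivity y_k is recorded after each removal.

```python
# Sketch: robustness profile and R-index for a given node-removal order.
import random
import networkx as nx

def robustness_profile(G, removal_order):
    """Return the list of y_k values (connectivity after removing k nodes)."""
    N = G.number_of_nodes()
    H = G.copy()
    ys = []
    for node in removal_order[:-1]:  # k = 1, ..., N-1
        H.remove_node(node)
        largest = max(nx.connected_components(H), key=len)
        ys.append(len(largest) / N)
    return ys

def r_index(G, removal_order):
    return sum(robustness_profile(G, removal_order)) / G.number_of_nodes()

G = nx.erdos_renyi_graph(200, 0.03, seed=42)
order = list(G.nodes())
random.seed(0)
random.shuffle(order)  # a random failure sequence
print(round(r_index(G, order), 3))
```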

Networks can exhibit different robustness profiles under different attack strategies. One form of disruption is a random failure, where nodes are removed at random. Another is a targeted attack, where nodes are removed strategically.

Random failures are like earthquakes or equipment malfunctions; they strike unpredictably. In power grids, generators might fail due to technical problems. In computer networks, servers might crash randomly.

Even if a network survives random failures beautifully, it might crumble under targeted attacks. Adversaries strategically choose which nodes to attack for maximum damage. The most intuitive strategy targets high-degree nodes (hubs) first, much like targeting the busiest airports to disrupt air travel.

The asymmetry between random failures and targeted attacks is one of the most counterintuitive discoveries in network science. A network that seems robust can have hidden vulnerabilities that smart adversaries can exploit.
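This asymmetry can be seen directly by comparing the R-index of a scale-free network under the two strategies. The self-contained sketch below uses a simple, non-adaptive targeted attack that removes nodes in order of their initial degree; the network model and parameter values are illustrative.

```python
# Sketch: random failures vs. a degree-targeted attack on a scale-free network.
import random
import networkx as nx

def r_index(G, removal_order):
    N = G.number_of_nodes()
    H = G.copy()
    total = 0.0
    for node in removal_order[:-1]:
        H.remove_node(node)
        total += len(max(nx.connected_components(H), key=len)) / N
    return total / N

G = nx.barabasi_albert_graph(500, 2, seed=1)

random_order = list(G.nodes())
random.seed(0)
random.shuffle(random_order)

# Targeted attack: remove the highest-degree nodes (hubs) first
targeted_order = sorted(G.nodes(), key=lambda v: G.degree(v), reverse=True)

print("R (random failures):", round(r_index(G, random_order), 3))
print("R (targeted attack):", round(r_index(G, targeted_order), 3))  # noticeably lower
```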

6 Theoretical Framework for Network Robustness

To understand these patterns mathematically, we can view network attacks as the reverse process of percolation. Percolation theory studies phase transitions in connectivity by asking: as we randomly add nodes to a grid, when does a giant connected component emerge? Network robustness asks the opposite: as we remove nodes, when does the giant component disappear?

Percolation theory originated in physics to understand how liquids flow through porous materials. The same mathematics explains how networks fragment under node removal - a beautiful example of how physics concepts illuminate network behavior.

Percolation vs. Robustness: Two Sides of the Same Coin

Percolation theory asks: “Starting from isolation, how many nodes must we connect to form a giant component?” - increasing connectivity from p = 0 to p = 1.

Robustness analysis asks: “Starting from full connectivity, how many nodes must we remove to fragment the network?” - decreasing connectivity from p = 1 to p = 0.

These are mathematically equivalent processes, just viewed in opposite directions along the same connectivity parameter.

The Phase Transition

Imagine a grid where each square randomly becomes a “puddle” with probability p. As p increases, something dramatic happens - suddenly, a giant puddle spanning the entire grid appears! This phase transition occurs at a critical probability p_c. Crucially, the exact order in which squares fill does not matter; only the fraction of nodes present or removed determines connectivity.
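A small simulation makes this transition visible. The sketch below (assuming networkx; the lattice size and p values are illustrative) occupies each site of a square lattice with probability p and tracks the relative size of the largest cluster; a spanning cluster emerges sharply near the known critical point p_c ≈ 0.593 for site percolation on the square lattice.

```python
# Sketch: site percolation on an L x L square lattice.
import random
import networkx as nx

def largest_cluster_fraction(L, p, seed=0):
    rng = random.Random(seed)
    G = nx.grid_2d_graph(L, L)
    occupied = [v for v in G.nodes() if rng.random() < p]  # keep each site w.p. p
    H = G.subgraph(occupied)
    if H.number_of_nodes() == 0:
        return 0.0
    return len(max(nx.connected_components(H), key=len)) / (L * L)

for p in [0.3, 0.5, 0.59, 0.7, 0.9]:
    print(p, round(largest_cluster_fraction(100, p), 3))
# The largest cluster stays tiny for small p and spans the lattice above p_c ~ 0.593
```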

The Molloy-Reed Criterion

For networks with arbitrary degree distributions, the Molloy-Reed criterion determines whether a giant component exists - that is, whether the network contains a single large connected component that includes most of the nodes:

\kappa = \frac{\langle k^2 \rangle}{\langle k \rangle} > 2

where \langle k \rangle is the average degree and \langle k^2 \rangle is the average of squared degrees. The ratio \kappa measures degree heterogeneity - networks with hubs have high \kappa, while degree homogeneous networks have low \kappa. When \kappa > 2, a giant component forms that dominates the network connectivity. See the Appendix for the proof of the Molloy-Reed criterion.
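The criterion is easy to check empirically. The sketch below (assuming networkx; names and parameters are illustrative) computes \kappa from the degree sequence of a random network below and above the threshold and compares it with the observed size of the largest component.

```python
# Sketch: the Molloy-Reed ratio kappa = <k^2>/<k> vs. the giant component.
import networkx as nx

def molloy_reed_kappa(G):
    degrees = [k for _, k in G.degree()]
    mean_k = sum(degrees) / len(degrees)
    mean_k2 = sum(k * k for k in degrees) / len(degrees)
    return mean_k2 / mean_k

for p in [0.0005, 0.003]:  # mean degree ~0.5 (below) and ~3 (above)
    G = nx.erdos_renyi_graph(1000, p, seed=0)
    kappa = molloy_reed_kappa(G)
    giant = len(max(nx.connected_components(G), key=len)) / G.number_of_nodes()
    print(f"p={p}: kappa={kappa:.2f}, giant component={giant:.2f}")
```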

The Molloy-Reed criterion is a powerful tool to predict the existence of a giant component in a network, and it allows us to find the critical fraction of nodes that must be removed to break the network. This critical fraction depends on the attack strategy as well as the degree distribution. For simplicity, let us restrict ourselves to random failures, for which the critical fraction is given by:

f_c = 1 - \frac{1}{\kappa - 1}

The value of \kappa depends on the degree distribution, and below, we showcase two examples of degree distributions.

Degree homogeneous network

In the case of a degree homogeneous network, such as the random network considered in the exercise above, the critical fraction is given by:

f_c = 1 - \frac{1}{\langle k \rangle}

This follows because a random network has a Poisson degree distribution, for which \langle k^2 \rangle = \langle k \rangle(\langle k \rangle + 1) and thus \kappa = \langle k \rangle + 1. The threshold is therefore determined by the average degree \langle k \rangle: a larger \langle k \rangle results in a larger f_c, meaning that the network is more robust against random failures.
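This prediction can be tested by simulation. The sketch below (assuming networkx; parameter values are illustrative) builds a random network with \langle k \rangle \approx 4, so that the predicted threshold is f_c \approx 0.75, and measures the giant component after randomly removing a fraction f of the nodes.

```python
# Sketch: random failures on a degree homogeneous (Erdos-Renyi) network.
import random
import networkx as nx

def giant_fraction_after_random_removal(G, f, seed=0):
    rng = random.Random(seed)
    keep = [v for v in G.nodes() if rng.random() > f]  # each node fails with prob. f
    H = G.subgraph(keep)
    if H.number_of_nodes() == 0:
        return 0.0
    return len(max(nx.connected_components(H), key=len)) / G.number_of_nodes()

G = nx.erdos_renyi_graph(5000, 4 / 5000, seed=1)   # <k> ~= 4
mean_k = sum(k for _, k in G.degree()) / G.number_of_nodes()
print("predicted f_c:", round(1 - 1 / mean_k, 2))  # ~0.75
for f in [0.5, 0.7, 0.75, 0.8, 0.9]:
    print(f, round(giant_fraction_after_random_removal(G, f), 3))
```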

Degree heterogeneous network

Most real-world networks are degree heterogeneous, i.e., the degree distribution P(k) \sim k^{-\gamma} follows a power law (a so-called scale-free network). For 2 < \gamma \leq 3, the power-law degree distribution has a divergent second moment, i.e., \langle k^2 \rangle \rightarrow \infty, and thus f_c \rightarrow 1, meaning that nearly all nodes must be removed to break the network into disconnected components. This holds in the limit of infinite network size, where the maximum degree is unbounded. When the maximum degree is restricted to a finite value, the critical fraction is given by:

f_c = \begin{cases} 1 - \dfrac{1}{\frac{\gamma-2}{3-\gamma} k_{\text{min}} ^{\gamma-2} k_{\text{max}}^{3-\gamma} -1} & \text{if } 2 < \gamma < 3 \\ 1 - \dfrac{1}{\frac{\gamma-2}{\gamma-3} k_{\text{min}} - 1} & \text{if } \gamma > 3 \\ \end{cases}

where k_{\text{min}} and k_{\text{max}} are the minimum and maximum degree, respectively. The variable \gamma is the exponent of the power law degree distribution, controlling the degree heterogeneity, where a lower \gamma results in a more degree heterogeneous network.

  • In the regime 2 < \gamma < 3, the critical threshold f_c is determined by the extreme values of the degree distribution, k_{\text{min}} and k_{\text{max}}, and f_c \rightarrow 1 as the maximum degree k_{\text{max}} \in [k_{\text{min}}, N-1] increases. Notably, in this regime the maximum degree grows with the network size N, which drives f_c \rightarrow 1 for large networks.

  • In the regime \gamma > 3, the critical threshold f_c is governed by the minimum degree k_{\text{min}}. In contrast to k_{\text{max}}, k_{\text{min}} remains constant as the network size N grows. Consequently, the network disintegrates once a finite fraction of its nodes is removed.
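The finite-size formula above is straightforward to evaluate numerically. The sketch below (parameter values are illustrative) shows how f_c approaches 1 as k_{\text{max}} grows in the 2 < \gamma < 3 regime, while it stays bounded away from 1 for \gamma > 3.

```python
# Sketch: evaluating the critical fraction f_c for random failures of a
# scale-free network, using the finite-k_max formula above.
def critical_fraction_scale_free(gamma, k_min, k_max=None):
    if 2 < gamma < 3:
        term = (gamma - 2) / (3 - gamma) * k_min ** (gamma - 2) * k_max ** (3 - gamma)
        return 1 - 1 / (term - 1)
    if gamma > 3:
        return 1 - 1 / ((gamma - 2) / (gamma - 3) * k_min - 1)
    raise ValueError("the formula assumes 2 < gamma < 3 or gamma > 3")

# Larger k_max (i.e., larger networks) pushes f_c toward 1 when 2 < gamma < 3
for k_max in [100, 1000, 10000]:
    print(k_max, round(critical_fraction_scale_free(2.5, k_min=2, k_max=k_max), 3))
print("gamma = 3.5:", round(critical_fraction_scale_free(3.5, k_min=2), 3))
```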

Robustness Under Attack

While scale-free networks show remarkable robustness against random failures, they exhibit a fundamental vulnerability to targeted attacks that deliberately target high-degree nodes (hubs). This asymmetry reveals the “Achilles’ heel” property of complex networks, where the same structural features that provide robustness against random failures create critical vulnerabilities to strategic attacks.

Rather than removing nodes randomly, an adversary with knowledge of the network structure can systematically remove the highest-degree nodes first, followed by the next highest-degree nodes, and so on. Under this targeted hub removal strategy, scale-free networks fragment rapidly and dramatically. The critical threshold for attacks, f_c^{\text{attack}}, is dramatically lower than for random failures. While random failures require f_c^{\text{random}} \approx 1 (nearly all nodes must be removed), targeted attacks need only f_c^{\text{attack}} \ll 1 (a small fraction of hubs) to fragment the network.

To understand how networks fragment under targeted attacks, we must consider two key effects that occur when the highest-degree nodes are systematically removed. First, the removal of hub nodes changes the maximum degree of the remaining network from k_{\max} to a new lower value k'_{\max}. Second, since these removed hubs had many connections, their elimination also removes many links from the network, effectively changing the degree distribution of the surviving nodes.

The mathematical analysis of this process relies on mapping the attack problem back to the random failure framework through careful accounting of these structural changes. When we remove an f fraction of the highest-degree nodes in a scale-free network, the new maximum degree becomes k'_{\max} = k_{\min} f^{1/(1-\gamma)}, where the power-law exponent \gamma determines how rapidly the degree sequence declines.

For scale-free networks with degree exponent \gamma, the critical attack threshold f_c satisfies:

f_c^{\frac{2-\gamma}{1-\gamma}} = 2 + \frac{2-\gamma}{3-\gamma} k_{\min} \left(f_c^{\frac{3-\gamma}{1-\gamma}} - 1\right)

The fractional exponents (2-\gamma)/(1-\gamma) and (3-\gamma)/(1-\gamma) arise from the power-law degree distribution and determine how quickly the network fragments as hubs are removed. For networks with 2 < \gamma < 3 (highly heterogeneous degree distributions), solving this equation yields very small values of f_c, meaning that removing just a tiny fraction of hubs can destroy network connectivity.
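Because the equation is implicit in f_c, it is usually solved numerically. The sketch below applies a simple bisection to the implicit equation above for a few illustrative values of \gamma and k_{\text{min}}.

```python
# Sketch: solving the implicit equation for the targeted-attack threshold f_c.
def attack_threshold(gamma, k_min):
    a = (2 - gamma) / (1 - gamma)
    b = (3 - gamma) / (1 - gamma)

    def residual(f):
        return f ** a - 2 - (2 - gamma) / (3 - gamma) * k_min * (f ** b - 1)

    lo, hi = 1e-9, 1 - 1e-9  # residual > 0 near 0 and < 0 near 1
    for _ in range(100):     # simple bisection
        mid = (lo + hi) / 2
        if residual(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

for gamma in [2.2, 2.5, 2.8]:
    print(gamma, round(attack_threshold(gamma, k_min=2), 3))
# These thresholds are far smaller than the near-1 values for random failures.
```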

This vulnerability has profound real-world implications across multiple domains. Power grids invest heavily in protecting major substations and transmission hubs because their failure could cascade throughout the system. Internet infrastructure includes hub redundancy and protection protocols to maintain connectivity when major routing nodes are compromised. Transportation networks maintain backup routes and alternative pathways when major airports or train stations fail. Even biological systems have evolved protective mechanisms for critical proteins that serve as hubs in cellular networks.

The robustness paradox demonstrates that no single network structure can be optimal against all types of failures. There’s always a fundamental trade-off between efficiency, which naturally favors hub-based architectures for optimal resource distribution, and security, which requires redundancy and distributed connectivity to prevent catastrophic failures from targeted attacks.

7 Design Principles for Robust Networks

How do we design networks that resist both random failures and targeted attacks? Key principles include:

  1. Balanced Degree Distribution: Avoid both extreme homogeneity and extreme hub concentration
  2. Multiple Redundant Pathways: Ensure removing any single node doesn’t isolate large portions
  3. Strategic Hub Protection: In hub-based networks, invest heavily in protecting critical nodes
  4. Hierarchical Design: Combine local clusters with hub connections and redundant backbones
  5. Adaptive Responses: Design systems that can reconfigure when attacks are detected

These strategies reflect lessons learned from our historical power grid challenge: moving beyond the minimum spanning tree to create networks that balance efficiency with resilience.

8 References

  1. Borůvka, O. (1926). O jistém problému minimálním [About a certain minimal problem]. Práce Moravské Přírodovědecké Společnosti, 3, 37-58. [Original work on minimum spanning trees]

  2. Molloy, M., & Reed, B. (1995). A critical point for random graphs with a given degree sequence. Random Structures & Algorithms, 6(2-3), 161-180. [Molloy-Reed criterion]

  3. Albert, R., Jeong, H., & Barabási, A. L. (2000). Error and attack tolerance of complex networks. Nature, 406(6794), 378-382. [Seminal paper on network robustness and the “Achilles’ heel” property]

  4. Cohen, R., Erez, K., ben-Avraham, D., & Havlin, S. (2000). Resilience of the Internet to random breakdowns. Physical Review Letters, 85(21), 4626-4629. [Mathematical framework for random failures]

  5. Cohen, R., Erez, K., ben-Avraham, D., & Havlin, S. (2001). Breakdown of the Internet under intentional attack. Physical Review Letters, 86(16), 3682-3685. [Mathematical analysis of targeted attacks]

  6. Callaway, D. S., Newman, M. E., Strogatz, S. H., & Watts, D. J. (2000). Network robustness and fragility: Percolation on random graphs. Physical Review Letters, 85(25), 5468-5471. [Percolation theory approach to network robustness]

  7. Cohen, R., & Havlin, S. (2010). Complex Networks: Structure, Robustness and Function. Cambridge University Press. [Comprehensive treatment of network robustness theory]

  8. Newman, M. E. J. (2018). Networks. Oxford University Press. [Modern textbook covering network robustness and percolation]