2876. Count Visited Nodes in a Directed Graph
Problem Description
In this problem, we are given a directed graph which is represented by a 0-indexed array edges
. Each element edges[i]
indicates that there is a directed edge from node i
to node edges[i]
. The graph consists of n
nodes labeled from 0
to n - 1
, and n
directed edges.
The task is to determine, for each starting node i
, how many distinct nodes can be visited if we follow the directed edges starting from node i
until we return to any node visited earlier in the same process. This essentially means we're calculating how many unique nodes we encounter before we enter a cycle or revisit any node during traversal which implies the start of a cycle.
Intuition
To solve this problem, we need to keep track of two key pieces of information:
- The number of nodes visited in the current traversal, starting from each node
i
. - Whether or not we've entered a cycle, and if so, the length of that cycle.
To do this, we can start at each unvisited node and follow edges to traverse the graph, using an array vis
to record the order in which we visited nodes. During the traversal, we do one of two things:
-
If we find a node that has already been visited in this traversal (
vis[j]
is nonzero), we then know that we have entered a cycle. We now know the number of nodes outside the cycle that have been visited and also the length of the cycle. For each unvisited node, its answer is then the number of unique nodes visited — which is the sum of the length of the cycle and the path length to the cycle for nodes outside the cycle, or simply the cycle length for nodes inside the cycle. -
If we find a node for which we've already computed an answer, then, for each visited node, its answer is the distance from the current node to this node plus the answer of this node we've already computed.
The algorithm involves iterating over each of the nodes of the graph and performing a depth-first traversal. During the traversal, we update the vis
array to reflect the visitation order and calculate the necessary counts for the cycle and the path leading to the cycle. Once a traversal from a starting node is completed or we encounter a previously computed answer, we backtrack while updating the ans
array with the number of nodes that can be visited for each start node.
This method allows us to efficiently compute the number of nodes that can be visited from each starting node by leveraging the information gained during each traversal and avoiding redundant computations.
Learn more about Graph, Memoization and Dynamic Programming patterns.
Solution Approach
The solution implements a form of depth-first traversal. Here's a breakdown of the approach based on the provided Reference Solution Approach and Python solution:
-
Initialization: Create two arrays,
ans
andvis
.ans
will store the final answer for each node (i.e., the number of different nodes visited), andvis
will store the visitation order for each node. Both are initially filled with zeros. -
Outer Loop: Iterate through all the nodes with a
for
loop, using each node as a starting point for the process. -
Traversal Initiation: For each starting node, if
ans[i]
is zero, meaning that we have not computed the answer for this node, we start traversing from this nodei
. -
Inner Loop - Traversing Nodes: In a
while
loop, continue to follow the directed edges until we encounter a node that we have already visited in this traversal (which implies a cycle) or a node that we have already computed the answer for in a different traversal. For each nodej
we visit:- Increment the count (
cnt
) which keeps track of how many nodes we have visited in this traversal. - Update
vis[j]
with the visitation ordercnt
. - Move to the next node using
j = edges[j]
.
- Increment the count (
-
Cycle and Path Detection: Once we've either hit an already visited node within this traversal or an already computed node, calculate the cycle length if necessary and the total number of nodes visited. If a cycle is detected, the length is determined by the difference between the current
cnt
value and the visitation order of the node that's part of the cycle (vis[j]
). -
Backtracking and Answer Update: Once we've hit a node for which an answer was already computed or the start of a cycle has been determined, loop back through the nodes we just visited to update their answers (
ans[j]
). The nodes inside the cycle all have the same answer (the cycle length). -
Population of Answers: Loop through the previously visited nodes and update their answers based on whether they are inside or outside the cycle.
The implementation uses the max(total, cycle)
to ensure the answer for each node is either the total number of nodes visited (for those outside the cycle) or the length of the cycle (for those inside the cycle).
Data Structures Used:
- One-dimensional lists (
ans
to store answers for each node,vis
to store visitation order).
Algorithm Patterns Used:
- Depth-First Search (DFS) traversal to explore the graph.
- Cycle detection and cycle length calculation within a directed graph.
- Backtracking to propagate the calculated answers to all starting points included in a traversal.
This approach minimizes redundant computations and efficiently calculates the answers for all the nodes in a directed graph that potentially contains cycles.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's illustrate the solution approach using a small example with a directed graph:
Suppose we have the following directed graph represented by edges = [1, 2, 0, 4, 5, 3]
.
This means we have six nodes, and the directed edges are as follows:
- Edge from node
0
to node1
- Edge from node
1
to node2
- Edge from node
2
to node0
(forming a cycle with nodes 0, 1, 2) - Edge from node
3
to node4
- Edge from node
4
to node5
- Edge from node
5
to node3
(forming a cycle with nodes 3, 4, 5)
We can visualize the graph like this:
0 → 1 → 2 ↑ ↓ └────┘ 3 → 4 → 5 ↑ ↓ └───────┘
Now, let's walk through the solution steps:
-
Initialization: Create two arrays
ans
andvis
with length 6 (the number of nodes), initializing all values to zero. -
Outer Loop: We start with node
0
and sinceans[0]
is zero, we begin traversal. -
Traversal Initiation: Set
cnt = 1
since we are starting traversal from node0
. Then we setvis[0] = 1
to indicate node0
is visited first. -
Inner Loop - Traversing Nodes: We follow the directed edge to node
1
. Increasecnt
to2
and markvis[1] = 2
. Next, visit node2
, incrementcnt
to3
, and markvis[2] = 3
. -
Cycle Detection: When we attempt to move from node
2
to node0
, we notice thatvis[0]
is already marked (indicating a cycle since we've returned to an already visited node within this traversal). The cycle includes nodes 0, 1, and 2. -
Backtracking and Answer Update: The length of the cycle is
cnt - vis[0] = 3 - 1 = 2
. We then backtrack, updatingans[0]
,ans[1]
, andans[2]
to2
as they are part of the cycle. -
Population of Answers for Nodes Outside the Cycle: Since there are no nodes outside the cycle for this starting node, we don't update any additional answers.
Next, we would proceed with node 3
as the starting node and repeat the steps. Eventually, we would identify the cycle among nodes 3, 4, and 5, and after backtracking and updating, we'd have ans[3]
, ans[4]
, and ans[5]
all set to 3
.
After processing all nodes, the final ans
array is [2, 2, 2, 3, 3, 3]
, indicating the number of nodes we can visit from each starting node before encountering a cycle or revisiting a node.
During the entire process, we only traverse the graph starting from each node one time, ensuring an efficient solution that avoids redundant calculations. The vis
array helps to quickly determine the presence of a cycle and the nodes involved in the cycle for accurate answer updates.
Solution Implementation
1from typing import List
2
3class Solution:
4 def countVisitedNodes(self, edges: List[int]) -> List[int]:
5 num_nodes = len(edges) # Determine the total number of nodes
6 visit_counts = [0] * num_nodes # Initialize list to store visit counts for each node
7 visited = [0] * num_nodes # Keep track of visited nodes with a timestamp
8
9 # Traverse all nodes to calculate visit counts
10 for node in range(num_nodes):
11 # Check if we haven't already calculated the visit count for this node
12 if not visit_counts[node]:
13 count = 0 # Initialize count for counting the nodes visited in this traversal
14 current_node = node # Start traversing from the current node
15
16 # Traverse nodes until we reach a node we've already visited
17 while not visited[current_node]:
18 count += 1 # Increase visit count
19 visited[current_node] = count # Mark current node as visited with a timestamp
20 current_node = edges[current_node] # Move to the next node in the sequence
21
22 # Analyze the traversal to distinguish between cycle and path lengths
23 cycle_length, total_length = 0, count + visit_counts[current_node]
24 if not visit_counts[current_node]:
25 # If the target node hasn't been calculated, we found a cycle
26 cycle_length = count - visited[current_node] + 1
27 total_length = count
28
29 # Now backfill the visit_counts array with appropriate visit counts
30 current_node = node # Start again from the initial node
31 while not visit_counts[current_node]:
32 # The visit count at each node is the maximum of total length and cycle length
33 visit_counts[current_node] = max(total_length, cycle_length)
34 total_length -= 1 # Reduce total_length as we move back
35 current_node = edges[current_node] # Move to the next node
36
37 return visit_counts # Return the computed visit counts for each node
38
1class Solution {
2 public int[] countVisitedNodes(List<Integer> edges) {
3 int n = edges.size(); // The size of the graph (number of nodes)
4 int[] visitedCount = new int[n]; // Array to store the count of visits for each node
5 int[] visitedState = new int[n]; // Array to track the state of each node during traversal
6
7 // Loop through each node
8 for (int currentNode = 0; currentNode < n; ++currentNode) {
9 // Only process nodes that have not been counted yet
10 if (visitedCount[currentNode] == 0) {
11 int count = 0; // Initialize visit count
12 int nextNode = currentNode; // Start traversal from the current node
13
14 // Traverse the graph until a previously visited node is encountered
15 while (visitedState[nextNode] == 0) {
16 visitedState[nextNode] = ++count; // Increment visit count
17 nextNode = edges.get(nextNode); // Move to the next node in the edge list
18 }
19
20 // Variables to determine the length of the cycle and total visits
21 int cycleLength = 0;
22 int totalVisits = count + visitedCount[nextNode];
23
24 // If the node encountered is the start of the cycle
25 if (visitedCount[nextNode] == 0) {
26 cycleLength = count - visitedState[nextNode] + 1; // Calculate cycle length
27 }
28
29 nextNode = currentNode; // Reset traversal starting node
30 // Assign the maximum of total visits or cycle length to each node in the path
31 while (visitedCount[nextNode] == 0) {
32 visitedCount[nextNode] = Math.max(totalVisits--, cycleLength);
33 nextNode = edges.get(nextNode); // Move to the next node in the edge list
34 }
35 }
36 }
37
38 return visitedCount; // Return the count array for all nodes
39 }
40}
41
1#include <vector>
2#include <algorithm>
3
4class Solution {
5public:
6 // Function to count the number of visited nodes in a directed graph.
7 // edges: An array where the i-th element is the node visited after node i.
8 std::vector<int> countVisitedNodes(std::vector<int>& edges) {
9 int numNodes = edges.size();
10 std::vector<int> visitCount(numNodes);
11 std::vector<int> visited(numNodes);
12
13 // Iterate through all nodes in the graph.
14 for (int i = 0; i < numNodes; ++i) {
15 // If the current node already has a determined visit count, skip it.
16 if (!visitCount[i]) {
17 int currentCount = 0, currentNode = i;
18 // Traverse the graph until a visited node or end of path is found.
19 while (visited[currentNode] == 0) {
20 visited[currentNode] = ++currentCount;
21 currentNode = edges[currentNode];
22 }
23
24 int cycleSize = 0, totalVisits = currentCount + visitCount[currentNode];
25 // If visitCount at the current node is zero, it means we've found a cycle.
26 if (visitCount[currentNode] == 0) {
27 cycleSize = currentCount - visited[currentNode] + 1;
28 }
29
30 // Backtrack and assign maximum between total visits and cycle size.
31 currentNode = i;
32 while (visitCount[currentNode] == 0) {
33 visitCount[currentNode] = std::max(totalVisits--, cycleSize);
34 currentNode = edges[currentNode];
35 }
36 }
37 }
38
39 // Return the visit count for each node.
40 return visitCount;
41 }
42};
43
1function countVisitedNodes(edges: number[]): number[] {
2 // Get the total number of nodes (length of the edges array)
3 const nodeCount = edges.length;
4 // Initialize the answer array with zeros
5 const answer: number[] = new Array(nodeCount).fill(0);
6 // Initialize the visited array with zeros to track if a node has been visited
7 const visited: number[] = new Array(nodeCount).fill(0);
8
9 // Iterate over each node
10 for (let currentNode = 0; currentNode < nodeCount; ++currentNode) {
11 // Only process nodes that have not determined the visited count yet
12 if (answer[currentNode] === 0) {
13 let visitCount = 0; // Counter for number of nodes visited in the current path
14 let nodeIndex = currentNode; // Current node index being visited
15
16 // Traverse the nodes starting from the current node until a visited one is encountered
17 while (visited[nodeIndex] === 0) {
18 visitCount++;
19 visited[nodeIndex] = visitCount; // Mark the node as visited with the count
20 nodeIndex = edges[nodeIndex]; // Move to the next connected node
21 }
22
23 // Calculate the total visits including the current path and potential prior answers
24 let cycleLength = 0;
25 let totalVisits = visitCount + answer[nodeIndex];
26
27 // If the next node doesn't have an answer yet, it means we found a cycle
28 if (answer[nodeIndex] === 0) {
29 cycleLength = visitCount - visited[nodeIndex] + 1; // Calculate the cycle length
30 }
31
32 // Reset nodeIndex to the start of the current path
33 nodeIndex = currentNode;
34
35 // Update the answer array with the maximum visits for each node in the current path
36 while (answer[nodeIndex] === 0) {
37 answer[nodeIndex] = Math.max(totalVisits, cycleLength);
38 totalVisits--;
39 nodeIndex = edges[nodeIndex]; // Move to the next node in the path
40 }
41 }
42 }
43
44 // Return the filled answer array with the visit count for each node
45 return answer;
46}
47
Time and Space Complexity
The time complexity of the given code is O(n)
. This is because each node is visited exactly once in a single pass to count the number of nodes visited before reaching a previously visited node (or itself, forming a cycle). The outer loop runs for each node (n
iterations), and the inner while loops process each node only once, without revisits due to the vis
array marking visited nodes.
The space complexity of the given code is also O(n)
. This is due to the allocation of two arrays of size n
: ans
array to store the answer for each node, and the vis
array to keep track of the visitation status of each node. There are no additional data structures or recursive calls that would increase the space usage, so it remains linear with respect to the number of nodes n
.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the following array represent a max heap?
Recommended Readings
https algomonster s3 us east 2 amazonaws com cover_photos graph svg Graph Fundamentals Tree with 0 cycle At this point you should be pretty familiar with trees A tree is a special kind of graph a connected acyclic cycle less graph A graph may contain cycle s and nodes could
Memoization Prereq Backtracking problems backtracking Memoization is a fancy word for a simple concept so is the case for a lot of things we learn in school It means saving the previous function call result in a dictionary and reading from it when we do the exact same call again
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
Want a Structured Path to Master System Design Too? Don’t Miss This!