1854. Maximum Population Year
Problem Description
You are provided with a 2D integer array called logs
. Each element of logs
is a two-element array that represents the birth and death years of a person, with birth[i]
being the year they were born and death[i]
being the year they died. The goal is to find which year had the highest population. A person is considered to be alive in a particular year if that year is between their birth year and the year before they died (inclusive of their birth year and exclusive of their death year).
The problem requires determining the year with the maximum population and, in case of a tie, returning the earliest year. This is akin to finding the year with the peak of a population timeline when plotted on a graph.
Intuition
To solve this problem, the first intuition is that it's not necessary to count the population for every single year in the range. Since the relevant years are defined by the birth or death years of individuals, the population changes only at these points.
We can track the population changes throughout the years by creating an array that represents the deltas in population size. For simplicity, let's base the years on an offset, considering 1950 as year 0, since the problem statement limits years to the 20th century.
For each person, we increment the population for their birth year and decrement it for their death year. This is because a birth adds one to the population, while a death reduces it by one, but not until the following year.
After populating the array with the deltas, we can find the year with the highest population by iterating through the array, maintaining a running sum which represents the current population, and updating the maximum population and year as we go. This process is like creating a cumulative sum or prefix sum array where each element is the total population up to that year.
The solution scans through the array, tracking the highest value seen and the earliest year this maximum occurred. The years we track in the array are offset by 1950, so when we find the result, we must add back the 1950 to obtain the actual year as the final answer.
Solution Approach
The solution uses an array d
to represent the changes in population. The array has a length of 101 to cover each year from 1950 to 2050 (inclusive of 1950 and exclusive of 2050), which is enough to cover the possible range of years given the problem constraints.
The offset of 1950 is used to normalize the years so that they can easily fit into a fixed-size array and simplify the index calculations.
The solution follows these steps:
-
Initialize an array
d
to track the population change for each year, setting all elements to 0 initially. -
Iterate through each person's logs. For each log, increment the population at the birth year and decrement the population at the death year. Since the person is not counted in their year of death, the decrement occurs at the index corresponding to their death year.
For example, for a person born in 1950 (
birth[i] = 1950
) and died in 1951 (death[i] = 1951
), the population increments by 1 at index 0 and decrements by 1 at index 1 in thed
array. -
Next, we need to convert the
d
array into a running sum of population changes. This is done by iterating overd
and adding the current change to a running sums
. At each step, the sums
represents the population for that year. -
While generating the running sum, the solution keeps track of the maximum population
mx
seen so far and the corresponding yearj
. If the current running sum is greater than the previously recorded maximum, the maximum value and year are updated to reflect the new maximum. -
After processing all of the years, the solution returns the year with the maximum population, which is
j + offset
to convert from the normalized index back to the actual year.
Within the code:
offset
is set to 1950 to normalize the years within the given range.- The
for a, b in logs
loop processes each person's birth and death years and reflects them in thed
array. - The for loop
for i, x in enumerate(d)
iterates over thed
array and maintains the running sums
, updates the maximum populationmx
, and keeps track of the earliest yearj
with this population. - Finally,
return j + offset
normalizes the index back to the actual year.
This approach uses the prefix sum pattern to efficiently compute cumulative values over a sequence, and it leverages the fact that person counts only change at birth or death years, thus significantly reducing the number of necessary computations.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's illustrate this solution approach with a small example:
Suppose we have the following logs
:
1logs = [[1950, 1960], [1955, 1970], [1950, 1955]]
Here's how we would walk through the problem:
-
Initialize the population changes array
d
with length 101, to cover years from 1950 to 2050. All values ind
start at0
. -
Process the birth and death years from
logs
:- For the first person, born in 1950 and died in 1960, increment
d[0]
(1950 - 1950) by1
and decrementd[10]
(1960 - 1950) by1
. - For the second person, born in 1955 and died in 1970, increment
d[5]
(1955 - 1950) by1
and decrementd[20]
(1970 - 1950) by1
. - For the third person, born in 1950 and died in 1955, increment
d[0]
by1
(already1
from the first person, now becomes2
) and decrementd[5]
(already1
from the second person, now becomes0
).
- For the first person, born in 1950 and died in 1960, increment
-
After processing
logs
, ourd
array now has these changes (only showing the first few indices with changes):
1d = [2, 0, 0, 0, 0, 0, 0, 0, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 0, 0, -1, 0, ... ]
-
Now, we convert
d
to a running sum,s
, starting at0
, and we keep track of the maximum populationmx
and the yearj
:- Start at index
0
,s = 2
,mx = 2
,j = 0
(which means the year 1950). - At index
5
, there's no change, sos
remains2
. - At index
10
, decrements
by1
. Nows = 1
.mx
is still2
, andj
remains0
. - Continue scanning until index
20
, where we decrements
by1
again. Nows = 0
.
- Start at index
-
The running sum does not exceed the maximum population (
mx
) of2
which occurred first at index0
. -
The final step is to add back the offset to
j
to find the actual year. Sincej = 0
initially, and the offset is1950
, the year with the highest population in our example isj + offset = 0 + 1950 = 1950
.
Using this method allows us to scale the process for a larger set of input data efficiently, focusing only on years where population changes occur due to births and deaths. We avoid unnecessary calculations for years where no changes happen, thus optimizing the solution.
Solution Implementation
1class Solution:
2 def maximumPopulation(self, logs: List[List[int]]) -> int:
3 # Initialize a list to represent the population changes
4 year_deltas = [0] * 101 # There are 101 years from 1950 to 2050
5
6 # Define the offset for the year 1950, as all years are based on it
7 year_offset = 1950
8
9 # Accumulate births and deaths in year_deltas
10 for birth, death in logs:
11 birth -= year_offset
12 death -= year_offset
13 year_deltas[birth] += 1 # Increment population for birth
14 year_deltas[death] -= 1 # Decrement population for death
15
16 # Initialize variables to track the maximum population and the year
17 sum_population = max_population = year_with_max_population = 0
18
19 # Loop over each year to find the year with the max population
20 for year, delta in enumerate(year_deltas):
21 sum_population += delta
22 if sum_population > max_population:
23 max_population = sum_population
24 year_with_max_population = year
25
26 # Return the year with the maximum population by adding back the offset
27 return year_with_max_population + year_offset
28
1class Solution {
2 public int maximumPopulation(int[][] logs) {
3
4 // Create an array to record the changes in population for each year.
5 // Since we are interested in years between 1950 and 2050, we use an array of size 101.
6 int[] populationDeltas = new int[101];
7 // Offset to adjust years to indexes (1950 becomes 0, 1951 becomes 1, etc.).
8 final int offset = 1950;
9
10 // Iterate over each log entry.
11 for (int[] log : logs) {
12 // Calculate the starting index based on the birth year.
13 int birthYearIndex = log[0] - offset;
14 // Calculate the end index based on the death year.
15 int deathYearIndex = log[1] - offset;
16
17 // Increment the population for the birth year index.
18 ++populationDeltas[birthYearIndex];
19 // Decrement the population for the death year index.
20 --populationDeltas[deathYearIndex];
21 }
22
23 // Sum and max population to find the maximum population year.
24 int currentPopulation = 0; // Current accumulated population.
25 int maxPopulation = 0; // Maximum population we have seen so far.
26 int maxPopulationYearIndex = 0; // Index (relative to offset) of the year with the maximum population.
27
28 // Iterate through each year to find the year with the maximum population.
29 for (int i = 0; i < populationDeltas.length; ++i) {
30 // Update the current population by adding the delta for the current year.
31 currentPopulation += populationDeltas[i];
32 // If the current population is greater than the maximum seen before.
33 if (maxPopulation < currentPopulation) {
34 // Update the maximum population to the current population.
35 maxPopulation = currentPopulation;
36 // Update the year index to the current index.
37 maxPopulationYearIndex = i;
38 }
39 }
40
41 // Return the actual year by adding the offset to the index.
42 return maxPopulationYearIndex + offset;
43 }
44}
45
1class Solution {
2public:
3 int maximumPopulation(vector<vector<int>>& logs) {
4 int populationDeltas[101] = {0}; // Initialize all years' population changes to 0
5 const int baseYear = 1950; // Use 1950 as the offset since all years are 1950 or later
6 for (const auto& log : logs) {
7 int birthYearIndex = log[0] - baseYear; // Convert birth year to index based on base year
8 int deathYearIndex = log[1] - baseYear; // Convert death year to index based on base year
9 ++populationDeltas[birthYearIndex]; // Increment population for birth year
10 --populationDeltas[deathYearIndex]; // Decrement population for death year
11 }
12 int currentPopulation = 0; // Start with zero population
13 int maxPopulation = 0; // Initialize max population to zero
14 int yearWithMaxPopulation = 0; // This will store the year with the highest population
15 for (int i = 0; i < 101; ++i) {
16 currentPopulation += populationDeltas[i]; // Update population for the year
17 if (maxPopulation < currentPopulation) {
18 maxPopulation = currentPopulation; // Update the maximum population if current is greater
19 yearWithMaxPopulation = i; // Record the year index that has the new maximum population
20 }
21 }
22 return yearWithMaxPopulation + baseYear; // Convert the index back to an actual year and return it
23 }
24};
25
1// Define the function maximumPopulation which expects logs as an array of number pairs
2function maximumPopulation(logs: number[][]): number {
3 // Create an array to store the population changes over the years, initialized to zero.
4 // The range covers years from 1950 to 2050 because of the problem constraints.
5 const populationDeltas: number[] = new Array(101).fill(0);
6
7 // Set offset to match the starting year 1950 to array index 0
8 const baseYear = 1950;
9
10 // Iterate through each log entry
11 for (const [birth, death] of logs) {
12 // Increment the population count for the birth year
13 populationDeltas[birth - baseYear]++;
14 // Decrement the population count for the death year
15 populationDeltas[death - baseYear]--;
16 }
17
18 // Initialize variables for tracking the max population and the year with max population
19 let maxPopulationYearIndex: number = 0;
20 for (let i = 0, currentPopulation = 0, maxPopulation = 0; i < populationDeltas.length; ++i) {
21 // Accumulate population changes to get the current population for each year
22 currentPopulation += populationDeltas[i];
23
24 // Check if the current population is greater than the max population observed so far
25 if (maxPopulation < currentPopulation) {
26 // Update max population and the index of the year with max population
27 maxPopulation = currentPopulation;
28 maxPopulationYearIndex = i;
29 }
30 }
31
32 // Return the year corresponding to the index with the max population
33 return maxPopulationYearIndex + baseYear;
34}
35
Time and Space Complexity
Time Complexity:
The time complexity of the solution involves iterating through the logs and updating the corresponding years, followed by finding the year with the maximum population. The for a, b in logs:
loop runs once for each set of birth and death years in logs
, hence it is O(N)
, where N
is the number of elements in logs
. The second for
loop to find the maximum population runs for a fixed size array d
of size 101 (since the years are offset by 1950 and only range between 1950 and 2050). This iteration is therefore O(101)
, which is a constant O(1)
operation. The overall time complexity is the higher of the two, so the time complexity is O(N)
.
Space Complexity:
The space complexity is determined by the additional space used by the solution. The array d
has a fixed size of 101, which is O(1)
space complexity. The space used for the variables a
, b
, s
, mx
, and j
is also constant. Therefore, the space complexity is O(1)
.
Learn more about how to find time and space complexity quickly using problem constraints.
How many times is a tree node visited in a depth first search?
Recommended Readings
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Runtime Overview When learning about algorithms and data structures you'll frequently encounter the term time complexity This concept is fundamental in computer science and offers insights into how long an algorithm takes to complete given a certain input size What is Time Complexity Time complexity represents the amount of time