1727. Largest Submatrix With Rearrangements
Problem Description
In this problem, we are given a binary matrix matrix
with dimensions m x n
, meaning it has m
rows and n
columns, where each element can either be 0
or 1
. The task is to rearrange the columns in any order such that we can find the largest submatrix composed entirely of 1
s. The output should be the area of this largest submatrix, i.e., the number of 1
s it contains.
Intuition
The intuition behind the solution relies on transforming the problem into a histogram problem, where each row of the matrix can be seen as the base of a histogram with heights denoting the number of consecutive 1
s above. To get this histogram from our binary matrix, we iterate over each element, and if it is a 1
, we add the value of the element above it, thereby accumulating the number of continuous 1
s in the column up to that row.
Once we have the histogram representation for each row, the next step is to sort each row in descending order. This sorting step helps us to ensure that, when calculating the potential area of the submatrix formed by 1
s, we always start with the tallest bar in the histogram (which corresponds to the longest consecutive sequence of 1
s in that reordered column).
After sorting, we determine the area of the largest rectangle we can form with the sorted sequence of bars in the histogram. Essentially, we iterate through each value in the sorted row, calculating the area as current_height * current_width
, where current_height
is the value of the bar (the sequence of 1
s in this column), and current_width
is the current index in the sorted array plus one (since the array is zero-indexed). We keep a running maximum of these calculated areas, which will give us the area of the largest submatrix that can be formed by rearranging the columns optimally.
Solution Approach
The provided solution approach uses dynamic programming and sorting to find the area of the largest submatrix composed of 1
s after columns are optimally reordered. Here's a step-by-step analysis:
-
Dynamic Programming: The first part of the solution is to convert the binary matrix into a form that represents the continuous vertical
1
s as the height of histograms. We do this by updating the matrix in-place. For each cell that contains a1
, we look directly above it (the cell in the previous row, same column). If the above cell is also a1
, we accumulate the value. In other words,matrix[i][j]
becomesmatrix[i][j] + matrix[i - 1][j]
ifmatrix[i][j]
equals 1.This first step is critical because it allows us to apply a histogram-based reasoning to the problem. Each row in the transformed matrix will represent a "base" and the numbers will represent the heights of "bars" that make up a histogram. In this transformed histogram, a rectangle of height
h
means that there areh
continuous1
s from the current row upwards in that column. -
Sorting: After transforming each row into a histogram, we sort each row in descending order. This enables us to treat each row as a series of potentially "stackable" rectangles where each rectangle's height is the value of the cell and the width is how many cells we've counted so far.
-
Calculating Maximum Area: For each sorted row, we calculate the area that the rectangle would occupy if we used the cell as the smallest height in the rectangle (which is also the furthest left side of the rectangle). The area is calculated by multiplying the height of the bar (
v
, the value of the cell) by the width (j
, the index of the value in the row plus one, because indices are zero-based but width counts are one-based).We iterate through each element in the row, and hence, for each element, we calculate the potential area of the submatrix if we were to stop at that particular column. Taking the maximum of these calculated areas gives us the maximum rectangular area for that particular permutation of rows.
The final answer is the largest area found after considering all such maximal rectangles for every row. This is a very efficient solution as it handles the problem with a complexity that is approximately O(m * n * log(n)), because the most expensive operation is sorting each row of the matrix, which takes O(n * log(n)) and is done m
times.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's illustrate the solution approach with a small example of a binary matrix. Consider the following 3 x 4
matrix:
matrix = [ [1, 0, 1, 1], [1, 1, 1, 1], [0, 1, 1, 0] ]
Now, let's walk through the steps:
-
Dynamic Programming:
- We start from the first row. Since there's no row above it, the values stay as they are.
- Moving onto the second row, we add the value from the above cell if the current cell is
1
. After doing this for the second row, our matrix becomes:
matrix = [ [1, 0, 1, 1], [2, 1, 2, 2], [0, 1, 1, 0] ]
- For the third row, we apply the same method. However, since the first and the last cells in the third row are
0
, they don't get accumulated and reset any potential stack of1
s. Our matrix now looks like:
matrix = [ [1, 0, 1, 1], [2, 1, 2, 2], [0, 2, 3, 0] ]
-
Sorting:
- Next, we sort each row in descending order to make 'bars' of 'histogram' aligned by their heights. This process will give us:
matrix = [ [1, 1, 1, 0], // Sorted row 1 [2, 2, 2, 1], // Sorted row 2 [3, 2, 0, 0] // Sorted row 3 ]
-
Calculating Maximum Area:
- We now calculate the areas for each row by treating each cell as the height of a histogram bar, with the width being the index of the cell +1 (since indices are zero-based but widths are one-based).
- For the first sorted row:
(1*1), (1*2), (1*3); max area = 3
. - For the second sorted row:
(2*1), (2*2), (2*3), (1*4); max area = 6
. - For the third sorted row:
(3*1), (2*2); max area = 4
.
The largest area obtained from our calculations is 6
, and this is the solution to our problem. This means the largest submatrix composed entirely of 1
s that can be obtained by rearranging the columns of the original matrix has an area of 6
.
Solution Implementation
1from typing import List
2
3class Solution:
4 def largestSubmatrix(self, matrix: List[List[int]]) -> int:
5 # Update the matrix such that each cell in the matrix represents the height
6 # of the histogram that can be formed with the base at that cell
7 for row in range(1, len(matrix)):
8 for col in range(len(matrix[0])):
9 # Increase the cell value by one if the top cell is 1;
10 # otherwise, leave it as it is (i.e., 0).
11 if matrix[row][col]:
12 matrix[row][col] = matrix[row - 1][col] + 1
13
14 # Initialize the maximum area to 0
15 max_area = 0
16
17 # Iterate over each row to find the largest rectangular area possible
18 for row in matrix:
19 # Sort each row in descending order to facilitate the calculation
20 # of the maximum rectangle in the histogram
21 row.sort(reverse=True)
22
23 # Iterate over each value in the sorted row to calculate the maximum area
24 # Each value in the sorted row corresponds to the height of a bar in the histogram,
25 # and its index represents potential width of the rectangle at that height
26 for index, height in enumerate(row):
27 # Calculate the width by adding 1 to the index since 'enumerate' is 0-based
28 width = index + 1
29
30 # Calculate the area of the rectangle that can be formed with this height
31 area = width * height
32
33 # Update the maximum area found so far
34 max_area = max(max_area, area)
35
36 # Return the maximum area found
37 return max_area
38
1class Solution {
2 public int largestSubmatrix(int[][] matrix) {
3 // Get the dimensions of the matrix
4 int rows = matrix.length;
5 int cols = matrix[0].length;
6
7 // Preprocess the matrix to count continuous ones vertically
8 for (int i = 1; i < rows; ++i) {
9 for (int j = 0; j < cols; ++j) {
10 // If the current cell has a '1', add to the count from the cell above
11 if (matrix[i][j] == 1) {
12 matrix[i][j] += matrix[i - 1][j];
13 }
14 }
15 }
16
17 // Initialize the variable to track the largest area of the submatrix
18 int maxArea = 0;
19
20 // Iterate over each row
21 for (int[] row : matrix) {
22 // Sort the row to group the column heights together
23 Arrays.sort(row);
24
25 // Iterate from the end of the row to find the maximum area
26 for (int j = cols - 1, height = 1; j >= 0 && row[j] > 0; --j, ++height) {
27 // Calculate the area of the submatrix ending at (i,j)
28 int area = row[j] * height;
29 // Update the maximum area
30 maxArea = Math.max(maxArea, area);
31 }
32 }
33
34 // Return the largest submatrix area found
35 return maxArea;
36 }
37}
38
1class Solution {
2public:
3 int largestSubmatrix(vector<vector<int>>& matrix) {
4 int numRows = matrix.size(); // Store the number of rows in the matrix
5 int numCols = matrix[0].size(); // Store the number of columns in the matrix
6
7 // Preprocess the matrix to compute the height of each column
8 for (int i = 1; i < numRows; ++i) {
9 for (int j = 0; j < numCols; ++j) {
10 // If the current cell has a 1, add the value from the cell above.
11 // This will end up with each cell containing the number of consecutive 1's above it + 1 for itself.
12 if (matrix[i][j]) {
13 matrix[i][j] += matrix[i - 1][j];
14 }
15 }
16 }
17
18 int largestArea = 0; // Initialize the largest area found to be 0
19 // Iterate through each preprocessed row to calculate the max area for each row
20 for (auto& row : matrix) {
21 // Sort the row in non-ascending order so that we can create the largest rectangle by considering the previous heights
22 sort(row.rbegin(), row.rend());
23
24 // Iterate over each element of the row to calculate the area
25 for (int j = 0; j < numCols; ++j) {
26 // Update the largest area if the current area (height * width) is greater
27 largestArea = max(largestArea, (j + 1) * row[j]);
28 }
29 }
30
31 // Return the largest area found
32 return largestArea;
33 }
34};
35
1function largestSubmatrix(matrix: number[][]): number {
2 const numRows: number = matrix.length; // Store the number of rows in the matrix
3 const numCols: number = matrix[0].length; // Store the number of columns in the matrix
4
5 // Preprocess the matrix to compute the height of each column
6 for (let i = 1; i < numRows; ++i) {
7 for (let j = 0; j < numCols; ++j) {
8 // If the current cell has a 1, add the value from the cell above.
9 // Each cell stores the count of consecutive 1's above it + 1 (itself, if it's 1).
10 if (matrix[i][j]) {
11 matrix[i][j] += matrix[i - 1][j];
12 }
13 }
14 }
15
16 let largestArea: number = 0; // Initialize the largest area found to be 0
17
18 // Iterate over each preprocessed row to calculate the max area for each row
19 for (const row of matrix) {
20 // Sort the row in non-ascending order to maximize the potential area
21 // based on the previous calculated heights
22 row.sort((a, b) => b - a); // Sorts the row in descending order
23
24 // Calculate the area considering each column's height
25 for (let j = 0; j < numCols; ++j) {
26 // Calculate the area (height * width) and update the largest area if the current area is greater
27 largestArea = Math.max(largestArea, (j + 1) * row[j]);
28 }
29 }
30
31 // Return the largest area found
32 return largestArea;
33}
34
Time and Space Complexity
The given Python code first preprocesses the input matrix by computing the maximum height of a stack of 1s ending at each cell. It then sorts these stack heights row by row and calculates the maximum area of a rectangle formed by these stacks. Now, let's break down the time and space complexities:
Time Complexity
- The first
for
loop runs through the rows of the matrix, starting at the second row, takingO(m)
time wherem
is the number of rows. - Inside this loop, the nested
for
loop runs through all the columns,n
times for each row, resulting inO(n)
time per row. - Multiplying the row iterations by the column iterations, we get
O(m * n)
for this preprocessing part. - The second part of the code, where each row is sorted in reverse order, takes
O(n log n)
time for each row, as the.sort()
operation takesO(n log n)
time. - As we apply the sorting to each of
m
rows, the total time for this step isO(m * n log n)
. - After sorting, there is another nested loop structure, which runs through all the elements of the matrix (in their sorted row form) once again. However, the complexity of this part is
O(m * n)
times, since for each ofm
rows, we are iterating overn
columns. - Combining all the parts, the dominant part of the time complexity is
O(m * n log n)
, since this term will outweighO(m * n)
for larger values ofn
.
Hence, the total time complexity of the algorithm is O(m * n log n)
.
Space Complexity
- The space complexity of the preprocessing step does not require additional space beyond the input matrix as it is performed in place, so this part is
O(1)
. - The sorting step does not use any extra space either, aside from the negligible internal space used by the sorting algorithm (which can typically be considered as constant space for purposes of complexity analysis).
Considering the above points, the space complexity of the code is O(1)
additional space, with matrix
itself taking O(m * n)
but not counting towards additional space as it is the input.
Learn more about how to find time and space complexity quickly using problem constraints.
How many ways can you arrange the three letters A, B and C?
Recommended Readings
Greedy Introduction div class responsive iframe iframe src https www youtube com embed WTslqPbj7I title YouTube video player frameborder 0 allow accelerometer autoplay clipboard write encrypted media gyroscope picture in picture web share allowfullscreen iframe div When do we use greedy Greedy algorithms tend to solve optimization problems Typically they will ask you to calculate the max min of some value Commonly you may see this phrased in the problem as max min longest shortest largest smallest etc These keywords can be identified by just scanning
Sorting Summary Comparisons We presented quite a few sorting algorithms and it is essential to know the advantages and disadvantages of each one The basic algorithms are easy to visualize and easy to learn for beginner programmers because of their simplicity As such they will suffice if you don't know any advanced
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Want a Structured Path to Master System Design Too? Don’t Miss This!