604. Design Compressed String Iterator

EasyDesignArrayHash TableStringIterator
Leetcode Link

Problem Description

The task is to design a data structure that can efficiently iterate through a compressed string. The compressed format represents a string where each distinct character is followed by an integer that indicates the number of times the character appears consecutively in the original, uncompressed string. For example, the compressed string "a2b1c5" represents "aabc". The class StringIterator should have the following functionalities:

  • next(): This function should return the next character in the uncompressed version of the string. If there are no more characters to return, it should return a white space character.
  • hasNext(): This function should tell us whether there are more characters that can be uncompressed from the string, returning true if more characters exist and false otherwise.

Intuition

The fundamental intuition behind the solution is to maintain a data structure that can easily track the current character and the number of times it still needs to be returned by the next() function.

To solve this, we can use a list of pairs, where each pair consists of a character and a number indicating how many times that character should be repeated in the uncompressed string.

Here's the thinking process for designing the StringIterator:

  1. Initialization: We parse the compressed string to fill our list with pairs of characters and their respective counts.

  2. Next: Each call to next() should return the current character and decrement the count. If the count reaches zero, that means we have fully uncompressed that character, and we need to move on to the next character.

  3. HasNext: To determine whether more characters need to be iterated upon, we check if the current position is less than the length of the list and the count at that position is greater than zero.

By keeping track of our position within the list, we can efficiently return the correct character for each next() call until all characters are fully uncompressed.

Not Sure What to Study? Take the 2-min Quiz to Find Your Missing Piece:

Which of these properties could exist for a graph but not a tree?

Solution Approach

The Reference Solution Approach you provided outlines a way to implement the StringIterator class using the logic previously described. Here's how the solution is implemented step-by-step:

  1. Initialization in __init__:

    • We parse the compressed string character by character.
    • A pointer i iterates over the string. When i points to a letter, we initialize a counter x to 0 and start building the count by moving to the next character and continue until we reach a character that is not a digit.
    • We multiply x by 10 and add the integer value of the current digit to x with each consecutive digit to build the count for the current letter correctly.
    • This count along with the character is stored as a pair (a list with two elements) in the list self.d which represents the decompressed format of the given compressed string.
  2. next Method:

    • First, we check if there is a next character by calling the hasNext method. If not, we return a space character as per the problem statement.
    • If hasNext is true, we retrieve the current character from self.d using our current position self.p.
    • We then decrement the count (which is the second element of the pair) for that character at self.p as we have returned this character once.
    • If the count becomes zero, it means we have exhausted this character; hence, we increment self.p to move to the next character.
    • We return the character that was next in line.
  3. hasNext Method:

    • We check if the current position self.p is less than the length of self.d which means there are characters left to be returned.
    • Additionally, we ensure that the count of the current letter at position self.p is greater than zero, indicating that the current character is still available to be returned by the next method.
    • The method returns True if both conditions are met, and False otherwise.

This approach efficiently solves the problem using a linear scan to parse the compressed string and then providing O(1) operations to return the next character and check for remaining characters.

Discover Your Strengths and Weaknesses: Take Our 2-Minute Quiz to Tailor Your Study Plan:

Which of the following problems can be solved with backtracking (select multiple)

Example Walkthrough

Let's consider a small example to illustrate the solution approach using the class StringIterator for the compressed string "a2b1c5". This represents the uncompressed string "aabccccc".

  1. Initialization: First, we initialize our StringIterator with the compressed string "a2b1c5". During the initialization:

    • The constructor (__init__) parses the string and fills self.d with the list of pairs: [('a', 2), ('b', 1), ('c', 5)].
    • The current pointer self.p is set to 0 to start from the first character pair.
  2. First next Call: We call next() for the first time.

    • hasNext() is true because self.p is 0, which is less than the length of self.d (which is 3), and the count for 'a' at this position is 2 (greater than zero).
    • next() returns 'a' and decrements the count of 'a' in the pair to 1.
  3. Second next Call: We call next() again.

    • hasNext() is still true for the same reasons as before.
    • next() returns 'a' again, as there's still 1 count left for 'a', and decrements the count of 'a' to 0.
  4. Third next Call: We call next() once more.

    • hasNext() is again true since now self.p is referring to the next character pair, ('b', 1), after incrementing self.p since 'a' count reached 0.
    • next() returns 'b' and the count of 'b' in the pair becomes 0, and self.p is incremented to point to ('c', 5).
  5. Subsequent next Calls: If we continue to call next(), it would return 'c' five times, decrementing the count each time. After this, self.p would be 3, which is the length of self.d, and no character would be left.

  6. Final next Call: After exhausting all characters, when we call next(), hasNext() would return false since self.p is equal to the length of self.d. As per the requirement, next() would return a space character ' ' indicating no more characters are left to return.

  7. Calling hasNext: At any point, we can call hasNext() to check if there are more characters to be uncompressed. This would return true until the next method has been called enough times to exhaust all the characters as per their counts in self.d.

The solution implemented as described allows the StringIterator to iterate through the compressed string efficiently and return uncompressed characters one at a time.

Solution Implementation

1class StringIterator:
2    def __init__(self, compressed_string: str):
3        self.decoded = []  # List to store the characters and their counts
4        self.pointer = 0  # Pointer to track the current character
5        length = len(compressed_string)  # Length of the compressed string
6        index = 0
7      
8        # Iterate over the compressed string and decode it
9        while index < length:
10            char = compressed_string[index]
11            count = 0
12            index += 1
13          
14            # Extract the count for the current character
15            while index < length and compressed_string[index].isdigit():
16                count = count * 10 + int(compressed_string[index])
17                index += 1
18              
19            # Append the character and its count to the decoded list
20            self.decoded.append([char, count])
21
22    def next(self) -> str:
23        # Return the next character if available
24        if not self.hasNext():
25            return ' '  # If no next character, return a space
26        current_char = self.decoded[self.pointer][0]  # Get current character
27        self.decoded[self.pointer][1] -= 1  # Decrement its count
28      
29        # If the current character's count hits 0, move to the next character
30        if self.decoded[self.pointer][1] == 0:
31            self.pointer += 1
32        return current_char
33
34    def hasNext(self) -> bool:
35        # Check if there is a next character available
36        return self.pointer < len(self.decoded) and self.decoded[self.pointer][1] > 0
37
38
39# Usage example:
40# string_iterator = StringIterator("a2b1")
41# print(string_iterator.next())  # Outputs: 'a'
42# print(string_iterator.hasNext())  # Outputs: True
43
1class StringIterator {
2    // Node class to store individual characters and their counts.
3    private static class Node {
4        char character;
5        int count;
6
7        Node(char character, int count) {
8            this.character = character;
9            this.count = count;
10        }
11    }
12
13    // A list to hold all the nodes derived from the compressed string.
14    private List<Node> data = new ArrayList<>();
15    // The current position in the list of nodes.
16    private int position;
17
18    // Constructor to parse the compressed string and initialize the data list.
19    public StringIterator(String compressedString) {
20        int length = compressedString.length();
21        int index = 0; // Index to iterate the compressed string.
22
23        while (index < length) {
24            // Extract the character at the current index.
25            char currentChar = compressedString.charAt(index);
26            int count = 0; // To keep track of the count of characters.
27
28            // Move to the next index and parse the digit(s) to find the count.
29            while (++index < length && Character.isDigit(compressedString.charAt(index))) {
30                count = count * 10 + (compressedString.charAt(index) - '0');
31            }
32
33            // Add the extracted character and its count to the data list as a new Node.
34            data.add(new Node(currentChar, count));
35        }
36    }
37
38    // Returns the next character in the sequence.
39    public char next() {
40        // If hasNext() returns false, return a space.
41        if (!hasNext()) {
42            return ' ';
43        }
44
45        // Get the current node based on the position and retrieve its character.
46        Node currentNode = data.get(position);
47        char resultChar = currentNode.character;
48
49        // Decrement the count of the character and move the position if count reaches 0.
50        if (--currentNode.count == 0) {
51            position++;
52        }
53
54        // Return the character.
55        return resultChar;
56    }
57
58    // Check if there are more characters to return.
59    public boolean hasNext() {
60        // hasNext is true if the current position is within the bounds of the data list
61        // and the count of the current node is greater than 0.
62        return position < data.size() && data.get(position).count > 0;
63    }
64}
65
66/**
67 * This class simulates an iterator over a string that has been compressed
68 * where characters are followed by their counts. The iterator provides the
69 * functionality to iterate over the uncompressed sequence of characters.
70 *
71 * Example usage:
72 * StringIterator iterator = new StringIterator("L1e2t1C1o1d1e1");
73 * char c = iterator.next(); // returns 'L'
74 * boolean hasNext = iterator.hasNext(); // returns true since there are more characters
75 */
76
1#include <string>
2#include <vector>
3#include <utility>
4using namespace std;
5
6class StringIterator {
7public:
8    // Constructor that initializes the object with the compressed string
9    StringIterator(string compressedString) {
10        int length = compressedString.length(); // Get the length of the compressed string
11        int index = 0; // Index to track our position in the string
12      
13        // Loop through the compressed string
14        while (index < length) {
15            char letter = compressedString[index]; // Current character to be decoded
16            int count = 0; // Count of how many times the current character is repeated
17          
18            // Increase index and calculate the count
19            while (++index < length && isdigit(compressedString[index])) {
20                count = count * 10 + (compressedString[index] - '0');
21            }
22          
23            // Save the character and its count to the deque
24            data.push_back({letter, count});
25        }
26    }
27  
28    // Returns the next character in the uncompressed version of the string or ' ' if there is none
29    char next() {
30        if (!hasNext()) {
31            return ' '; // If there are no more characters, return a space
32        }
33      
34        // Get the current character to return
35        char result = data[position].first;
36      
37        // Decrease the count of how many times this character should be repeated
38        if (--data[position].second == 0) {
39            ++position; // Move to the next character when current is exhausted
40        }
41      
42        return result;
43    }
44
45    // Check if there is any character left to return
46    bool hasNext() {
47        return position < data.size() && data[position].second > 0;
48    }
49
50private:
51    // A deque to store pairs of characters and their respective counts
52    vector<pair<char, int>> data;
53  
54    // Position variable to keep track of the current character
55    int position = 0;
56};
57
58/**
59 * Your StringIterator object will be instantiated and called as such:
60 * StringIterator* obj = new StringIterator(compressedString);
61 * char param_1 = obj->next();
62 * bool param_2 = obj->hasNext();
63 */
64
1// Define a data type for the pair of character and its count
2type CharCountPair = {
3  character: string;
4  count: number;
5};
6
7// Create an array to store CharCountPair
8let data: CharCountPair[] = [];
9
10// Position variable to keep track of the current character pair index
11let position: number = 0;
12
13// Function to initialize the object with the compressed string
14function initialize(compressedString: string): void {
15  const length = compressedString.length; // Get the length of the compressed string
16  let index = 0; // Index to track our position in the string
17
18  // Loop through the compressed string
19  while (index < length) {
20    const letter = compressedString[index]; // Current character to be decoded
21    let count = 0; // Count of how many times the current character is repeated
22
23    // Increase index and calculate the count
24    while (++index < length && isDigit(compressedString[index])) {
25      count = count * 10 + (parseInt(compressedString[index]));
26    }
27
28    // Save the character and its count to the data array
29    data.push({character: letter, count: count});
30  }
31}
32
33// Utility function to check if a character is a digit
34function isDigit(char: string): boolean {
35  return !isNaN(parseInt(char, 10));
36}
37
38// Function to return the next character in the uncompressed version of the string or ' ' if there is none
39function next(): string {
40  if (!hasNext()) {
41    return ' '; // If there are no more characters, return a space
42  }
43
44  // Get the current character pair
45  const currentPair = data[position];
46
47  // Get the current character to return
48  const result = currentPair.character;
49
50  // Decrease the count of how many times this character should be repeated
51  if (--currentPair.count === 0) {
52    ++position; // Move to the next character pair when current is exhausted
53  }
54
55  return result;
56}
57
58// Function to check if there is any character left to return
59function hasNext(): boolean {
60  return position < data.length && data[position]?.count > 0;
61}
62
63// Examples of usage:
64// initialize("x2y4"); // Example initialization with compressed string "x2y4"
65// console.log(next()); // Returns the next character 'x'
66// console.log(hasNext()); // Returns true if there are characters remaining, false otherwise
67
Not Sure What to Study? Take the 2-min Quiz:

How many times is a tree node visited in a depth first search?

Time and Space Complexity

Time Complexity

  • __init__ function: This function has a while loop that iterates over the entire length of the input string compressedString. Within this loop, there is a nested while loop that processes the digits following a letter, which can at most be O(m) where m is the number of digits following a letter. However, since each character is only visited once, the overall time complexity of __init__ is O(n) where n is the length of compressedString.

  • next function: The next method decreases the count of the current character and moves the pointer self.p forward if needed. These operations are done in constant time, hence the time complexity of the next function is O(1).

  • hasNext function: This function checks if there is a next element in the iterator, which is done by simply comparing the current pointer self.p with the length of the list self.d. This is a constant time operation, so the time complexity of hasNext is O(1).

Space Complexity

  • __init__ function: Since we are storing each character and its count in a list self.d, and there can be at most n/2 different characters if we alternate between characters and single digits, the space complexity is O(n).

  • next and hasNext functions: These functions do not use any additional space that scales with the input size, so their space complexity is O(1).

Combining the above, the space complexity for the entire StringIterator class is dominated by the __init__ function, which is O(n).

Learn more about how to find time and space complexity quickly using problem constraints.

Fast Track Your Learning with Our Quick Skills Quiz:

In a binary min heap, the minimum element can be found in:


Recommended Readings


Got a question? Ask the Teaching Assistant anything you don't understand.

Still not clear? Ask in the Forum,  Discord or Submit the part you don't understand to our editors.


TA 👨‍🏫