346. Moving Average from Data Stream
Problem Description
In the given problem, we are required to calculate the moving average of a stream of integers with a specified window size. The moving average is simply the average of the most recent size
numbers in the stream; as a new number comes in, the oldest number in the window gets removed.
To solve this, we create a class MovingAverage
that accomplishes two tasks:
-
It initializes the window with a specific size, and to track the numbers within the window, it stores them in an array.
-
It calculates the next average each time a new value is added to the window. If the window is full, it replaces the oldest number with the new value. If the window has not yet filled to its specified size, it simply adds the new value.
The result is that at any given time, we can obtain the moving average of the last size
elements, efficiently and without having to sum the entire set of values each time.
Intuition
The intuition behind the solution is to maintain a dynamic sum of the integers within the current window so that we don't have to sum all the integers each time we want to compute the average; this would be inefficient, especially for large window sizes.
The method is to:
-
Store the numbers in a fixed-size array, where the size is equal to the given window size.
-
Keep track of a running sum,
s
, of the numbers in the current window. -
Use a counter,
cnt
, to determine which element in our array is the oldest and should be replaced with the new incoming value.
Each time a new value comes in:
- We compute the index for the oldest value by taking
cnt
modulo the window size. - We update the running sum
s
by subtracting the value that's being replaced and adding the new value. - We then update the value at the computed index with the new value.
- Increment the count
cnt
.
When calculating the moving average, we divide s
by the window size. However, until the window fills up for the first time, our window size is effectively the number of elements we have received so far. So we take the minimum of the counter cnt
and the window size to know by what we should divide our running sum.
This approach ensures an efficient and scalable way to calculate the moving average as the stream progresses, with a time complexity of O(1) for each call to next
.
Learn more about Queue and Data Stream patterns.
Solution Approach
The MovingAverage
class provides an elegant solution for calculating the moving average of a stream of integers using a fixed-size sliding window. The code snippet implements this using a circular array pattern, a running sum, and a counter.
Here's a step-by-step explanation of the implementation:
-
As part of the class initialization (
__init__
method), an arrayarr
is created with a length equal to the size of the window, initially filled with zeros. This array will serve as the circular buffer. Additionally, variabless
(running sum) andcnt
(counter) are initialized. Variables
will keep track of the sum of the integers in the current window, whilecnt
will be used to count the elements that have been processed. -
The
next
method takes an integerval
as input, which represents the next number in the stream. It calculates the index where the new value should be inserted using the modulo operationidx = self.cnt % len(self.arr)
. This ensures that once the array is filled, we overwrite the oldest value, thereby maintaining the size of the sliding window. -
The running sum is updated by subtracting the value at
arr[idx]
(which is the oldest value in the window or a zero initially) and adding the newval
to it:self.s += val - self.arr[idx]
. -
The value of
val
is then stored in the array at indexidx
:self.arr[idx] = val
. -
Before calculating the average,
cnt
is incremented by one:self.cnt += 1
. -
To compute the moving average, the running sum
s
is divided by the minimum ofcnt
and the window sizelen(self.arr)
. Usingmin(self.cnt, len(self.arr))
ensures that, before the window has been completely filled for the first time, we only divide by the actual number of elements that have been added, which will be less than the maximum window size.
The code efficiently implements this approach, ensuring that regardless of the stream length or window size, the time complexity of each call to next(val)
remains constant, O(1). This is due to the fixed array size and the simple arithmetic operations involved in updating the running sum and calculating the average.
The data structures used here are:
- An array (
self.arr
) to store the most recent values up to the window size. - Variables (
self.s
andself.cnt
) to store the running sum and the count.
The main algorithmic pattern used is a circular array to maintain the sliding window efficiently.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
To illustrate the solution approach, let's use a small example where the window size for the moving average is set to 3, and we receive the following stream of integers: [5, 10, 15, 20].
Let's walk through the process:
- Initialize the
MovingAverage
class with a window size of 3. - The array
arr
is initialized with[0, 0, 0]
,s
(running sum) is set to0
, andcnt
(counter) is set to0
.
Now, we start adding numbers to the stream:
-
First number is 5:
cnt % 3
=0 % 3
=0
. We updatearr
to[5, 0, 0]
.- Update running sum
s
: we subtractarr[0]
(which is0
) and add5
, sos
becomes5
. - Now, the average is
5 / 1
=5
as we have only one number (5) in our window.
-
Second number is 10:
cnt % 3
=1 % 3
=1
. We updatearr
to[5, 10, 0]
.- Update running sum
s
: we subtractarr[1]
(which is0
) and add10
, sos
becomes15
. - The average is
15 / 2
=7.5
because we now have two numbers (5 and 10) in our window.
-
Third number is 15:
cnt % 3
=2 % 3
=2
. We updatearr
to[5, 10, 15]
.- Update running sum
s
: we subtractarr[2]
(which is0
) and add15
, sos
becomes30
. - The average is
30 / 3
=10
as we have three numbers in our window.
-
Fourth number is 20:
cnt % 3
=3 % 3
=0
(this index wraps around since the window size is reached).- We update
arr
: before it was[5, 10, 15]
and now the oldest value (5) will be replaced by 20, soarr
becomes[20, 10, 15]
. - Update running sum
s
: we subtractarr[0]
(which was5
) and add20
, sos
becomes30 + 20 - 5
=45
. - The average is
45 / 3
=15
because the window is full, and we only consider the last three numbers (20, 10, and 15).
This example clearly demonstrates how the MovingAverage
class efficiently updates and computes the new moving average as each number in the stream is added, by maintaining a sliding window of the most recent values.
Solution Implementation
1class MovingAverage:
2 def __init__(self, size: int):
3 """
4 Initialize a MovingAverage object with the specified size.
5
6 :param size: The size of the moving window for which the average
7 should be calculated.
8 """
9 self.queue = [0] * size # Initialize a fixed-size list to store the values.
10 self.window_sum = 0 # Sum of elements currently within the moving window.
11 self.count = 0 # Total count of values processed, used to calculate the index.
12
13 def next(self, val: int) -> float:
14 """
15 Calculate the moving average by adding a new value and
16 removing the oldest one from the sum if necessary.
17
18 :param val: The new value to be added in the moving average calculation.
19 :returns: The current moving average after adding the new value.
20 """
21 # Calculate the index of the oldest value based on the count of elements processed modulo the window size.
22 index = self.count % len(self.queue)
23
24 # Update the sum by adding the new value and subtracting the value that is being replaced.
25 self.window_sum += val - self.queue[index]
26
27 # Replace the oldest value in the queue with the new value.
28 self.queue[index] = val
29
30 # Increment the count of values processed.
31 self.count += 1
32
33 # Calculate and return the moving average.
34 # Use min to get the correct count of elements if the window is not fully filled.
35 return self.window_sum / min(self.count, len(self.queue))
36
37# Your MovingAverage object will be instantiated and called as such:
38# obj = MovingAverage(size)
39# param_1 = obj.next(val)
40
1class MovingAverage {
2 private int[] window; // Array to hold the values for the moving average calculation
3 private int sum; // Sum of the elements currently in the window
4 private int count; // The number of elements that have been inserted
5
6 /**
7 * Constructor to initialize the MovingAverage with a specific size.
8 *
9 * @param size The size of the window for the moving average
10 */
11 public MovingAverage(int size) {
12 window = new int[size];
13 }
14
15 /**
16 * Inserts a new value into the MovingAverage and returns the current average.
17 *
18 * @param val The new value to be added to the moving average
19 * @return The current moving average after inserting the new value
20 */
21 public double next(int val) {
22 int index = count % window.length; // Find the index to insert the new value
23 sum -= window[index]; // Subtract the value that will be replaced from the sum
24 sum += val; // Add the new value to the sum
25 window[index] = val; // Replace the old value with the new value in the window
26 count++; // Increment the count of total values inserted
27
28 // Calculate and return the moving average.
29 // The denominator is the smaller of the count of values inserted and the window size.
30 return (double) sum / Math.min(count, window.length);
31 }
32}
33
1#include <vector>
2#include <algorithm>
3
4// Class to calculate the moving average of the last N numbers in a stream.
5class MovingAverage {
6public:
7 // Constructor initializes the circular buffer 'values_' with the given 'size' and resets the sum 'sum_' to 0.
8 MovingAverage(int size) : size_(size), sum_(0), count_(0), values_(size, 0) {}
9
10 // Function to add a new 'value' to the stream and compute the updated moving average.
11 double next(int value) {
12 // Determine the index in the circular buffer to update
13 int index = count_ % size_;
14
15 // Update sum: add the new value and subtract the value that is being replaced in the buffer
16 sum_ += value - values_[index];
17
18 // Assign the new value to the appropriate position in the circular buffer
19 values_[index] = value;
20
21 // Increase the count of the total number of values processed
22 ++count_;
23
24 // Calculate the average based on the minimum of 'count_' and 'size_'
25 // This accounts for the case where fewer than 'size_' values have been processed
26 return static_cast<double>(sum_) / std::min(count_, size_);
27 }
28
29private:
30 int size_; // Size of the window for the moving average
31 long long sum_; // Sum of the last 'size_' values
32 int count_; // Total number of values that have been processed
33 std::vector<int> values_; // Circular buffer to hold the last 'size_' values
34};
35
36// Usage example (not part of the MovingAverage class)
37/*
38int main() {
39 MovingAverage* obj = new MovingAverage(size);
40 double avg1 = obj->next(val1); // Adds val1 to the stream and calculates moving average
41 double avg2 = obj->next(val2); // Adds val2 to the stream and calculates moving average
42 // ...
43 delete obj; // Clean up the allocated object
44 return 0;
45}
46*/
47
1// The size of the window for the moving average.
2let size: number;
3
4// The sum of the last 'size' values.
5let sum: number = 0;
6
7// The total number of values that have been processed.
8let count: number = 0;
9
10// Array to hold the last 'size' values, implementing circular buffer functionality.
11let values: number[];
12
13// Initialize the variables with a given size for the moving average calculation.
14function initializeMovingAverage(windowSize: number): void {
15 size = windowSize;
16 sum = 0;
17 count = 0;
18 values = new Array(windowSize).fill(0);
19}
20
21// Add a new value to the stream and compute the updated moving average.
22function next(value: number): number {
23 // Determine the index in the circular buffer to update.
24 let index = count % size;
25
26 // Update sum: add the new value and subtract the value that is being replaced in the buffer.
27 sum += value - values[index];
28
29 // Assign the new value to the appropriate position in the circular buffer.
30 values[index] = value;
31
32 // Increase the count of the total number of values processed.
33 count++;
34
35 // Calculate the average based on the minimum of 'count' and 'size'.
36 // This accounts for the case where fewer than 'size' values have been processed.
37 return sum / Math.min(count, size);
38}
39
40// Usage example (Note: 'initializeMovingAverage' must be called before 'next')
41/*
42initializeMovingAverage(size);
43let avg1 = next(value1); // Adds value1 to the stream and calculates moving average
44let avg2 = next(value2); // Adds value2 to the stream and calculates moving average
45// ...
46*/
47
Time and Space Complexity
The given Python class MovingAverage
implements a moving average with a fixed-size window. Its complexity characteristics are:
Time Complexity
Initialization (__init__
): The constructor initializes an array with a length equal to the size, which takes O(size)
time as it must allocate space for size
integers and fill them with zeros. It also initializes two integer variables, self.s
and self.cnt
, which is done in constant time, O(1)
. Hence, the time complexity for initialization is O(size)
.
Next (next
): Each next
operation involves a modulo operation to find the index, an addition, and a subtraction, all of which are O(1)
operations. Updating the sum self.s
by adding the new val
and subtracting the value overwritten in the array takes constant time. Since the array self.arr
is fixed in size, no extra time for resizing or shifting elements is required. As such, the next
method has a time complexity of O(1)
.
Space Complexity
Space Complexity: The class maintains a fixed-size array, self.arr
, to store the last size
elements, which consumes O(size)
space. The other variables, self.s
and self.cnt
, consume a constant space, O(1)
. Thus, the total space complexity for the MovingAverage
class is O(size)
, as this is the dominant term.
Learn more about how to find time and space complexity quickly using problem constraints.
Which data structure is used to implement priority queue?
Recommended Readings
Queue Intro Think of the last time you stood in line to buy movie tickets The first person in line gets their ticket first and any newcomers join the end of the line This system where the first person to arrive is the first to be served is a queue in real
Median of Data Stream Given a stream of numbers find the median number at any given time accurate to 1 decimal place Example add_number 1 add_number 2 add_number 3 get_median 2 0 add_number 4 get_median 2 5 Try it yourself Explanation Intuition Brute force way is to sort the entire list every time we get a new number This would be O N log N for each add_number operation One step up is to notice that the list is sorted before we add any new number to it So every
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Want a Structured Path to Master System Design Too? Don’t Miss This!