Data stream sampling algorithm(s) when sample storage is limited and data storage is not?

Data stream sampling algorithm(s) when sample storage is limited and data storage is not? The problem is: I have limited sample storage and unlimited data storage. Data is ingested : 1. in a real-time. but can be ingested 2. incrementally (periodic batch mode) Simple approach “take every n/th sample” does not work for obvious reason. …

Data stream sampling algorithm(s) when sample storage is limited and data storage is not? Read More »