Campus Units

Electrical and Computer Engineering

Document Type

Article

Publication Version

Accepted Manuscript

Publication Date

2-2016

Journal or Book Title

Algorithmica

Volume

74

Issue

2

First Page

787

Last Page

811

DOI

10.1007/s00453-015-9974-0

Abstract

In many stream monitoring situations, the data arrival rate is so high that it is not even possible to observe each element of the stream. The most common solution is to subsample the data stream and use the sample to infer properties and estimate aggregates of the original stream. However, in many cases, the estimation of aggregates on the original stream cannot be accomplished through simply estimating them on the sampled stream, followed by a normalization. We present algorithms for estimating frequency moments, support size, entropy, and heavy hitters of the original stream, through a single pass over the sampled stream.

Comments

The final publication is available at Springer via https://doi.org/X10.1007/s00453-015-9974-0. McGregor, Andrew, A. Pavan, Srikanta Tirthapura, and David P. Woodruff. "Space-Efficient Estimation of Statistics Over Sub-Sampled Streams." Algorithmica 74, no. 2 (2016): 787-811. Posted with permission.

Copyright Owner

Springer Science+Business Media New York

Language

en

File Format

application/pdf

Published Version

Share

COinS