Abstract: Non-duplicate sampling (NDS) is a recently proposed technique that selects items from a data stream with probability p only on their first appearance, effectively handling duplicate items.