This paper addresses the problem of determinizing probabilistic data for storage in legacy systems that only accept deterministic input, focusing on optimizing the quality of answers to queries over this data. It introduces a framework and efficient algorithms that achieve near-optimal solutions for determining the representations of uncertain objects, improving upon traditional thresholding methods. The authors also explore future applications of their methods in ranking-based object retrieval contexts.
Related topics: