This document discusses approaches for extracting and modeling distributed data from integrated web technologies based on set theory. It proposes a management approach for distributed information that involves data warehousing techniques, data fragmentation, and distributed query optimization. A mathematical model is developed using concepts from set theory to conceptualize how data from different sources can be combined into an integrated environment. An algorithm is also proposed to compute the execution time of queries on this distributed data based on the set theory composition model.