The paper presents a continuous text extraction system designed to efficiently manage high document traffic and retrieve relevant documents based on user queries using a sliding window technique. It proposes an architectural model leveraging a memory-based index complemented by MapReduce for task distribution and incremental threshold methods to maintain efficiency in processing. The system also addresses duplicate document detection and ranks results based on user feedback to enhance the relevance of retrieved documents.