The document describes an algorithm for detecting text in camera-captured images. It begins with preprocessing steps like converting the color image to grayscale, applying edge detection and morphological operations like dilation and erosion. This gives initial bounding boxes containing candidate text regions. Further processing includes applying geometrical constraints to filter boxes, performing multiresolution analysis, connected component analysis and filtering by area to get the final text regions. Inversion and addition steps are used to handle text against different backgrounds.