This document describes a Malleable Hardware Accelerator (MAHA) for data intensive applications. MAHA implements a reconfigurable computing fabric within off-chip memory to enable computation within the memory. It presents modifications made to a NAND flash memory architecture to realize the MAHA framework, including adding control logic and custom datapaths to memory blocks. It also describes the software architecture for mapping applications to MAHA, including decomposing and fusing operations. Evaluation results show the area and energy overhead of MAHA and performance/energy improvements when mapping applications compared to a general-purpose processor.