The document discusses research on enhancing perceptual sound quality of deep neural network (DNN)-based source enhancement techniques through black-box optimization methods. It introduces various approaches, including time-frequency mask selection and estimation, and compares these methods against standard training techniques. The findings suggest that DNNs can effectively be trained to improve objective sound quality assessment scores, albeit with challenges in achieving state-of-the-art results.