This document proposes a robust audio watermarking technique that embeds watermarks in the cepstrum domain, based on the relationship between the mean values of consecutive groups of samples. The host audio is divided into frames, and each frame is divided into four equal-sized sub-frames, which are then transformed to the cepstrum domain. Watermarks are embedded by either interchanging or updating the differences between the mean values of the first two sub-frames and of the last two sub-frames, which selectively distorts the sample values in those sub-frames. This keeps the embedding imperceptible, and extraction is computationally simple and does not require knowledge of the distortion amounts. Simulation results show that the watermark is imperceptible and robust against attacks.
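
The mean-difference rule summarized above can be illustrated with a minimal sketch. It is one possible reading of the summary, not the authors' implementation. It assumes that one bit is embedded per frame, that bit 1 is encoded as d1 > d2 and bit 0 as d1 < d2 (where d1 and d2 are the mean differences of the first and last sub-frame pairs), and that a small `margin` separates the two cases. The function names, the `margin` parameter, and the even split of the distortion across the four sub-frames are all hypothetical. The cepstrum transform and its inverse are omitted, so `frame` stands for coefficients that are already in the cepstrum domain.

```python
import numpy as np


def mean_differences(frame):
    """Return (d1, d2): mean differences of the first and last sub-frame pairs."""
    if frame.size % 4 != 0:
        raise ValueError("frame length must be a multiple of 4")
    m = np.asarray(frame, dtype=float).reshape(4, -1).mean(axis=1)
    return m[0] - m[1], m[2] - m[3]


def embed_bit(frame, bit, margin=0.01):
    """Embed one bit by enforcing the sign of d1 - d2 (assumed convention).

    If the sign already matches the bit and |d1 - d2| >= margin, the frame
    is returned unchanged. If the sign is wrong, the two differences are
    interchanged. If they are closer than `margin`, they are updated so
    that they are exactly `margin` apart.
    """
    d1, d2 = mean_differences(frame)
    subs = np.array(frame, dtype=float).reshape(4, -1)  # working copy
    gap = d1 - d2
    target = max(abs(gap), margin) * (1.0 if bit else -1.0)
    shift = (target - gap) / 4.0
    # A constant offset per sub-frame changes only its mean:
    # d1 grows by 2*shift and d2 shrinks by 2*shift.
    subs[0] += shift
    subs[1] -= shift
    subs[2] -= shift
    subs[3] += shift
    return subs.reshape(-1)


def extract_bit(frame):
    """Recover the bit from the sign of d1 - d2; no distortion amounts needed."""
    d1, d2 = mean_differences(frame)
    return 1 if d1 > d2 else 0


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.normal(size=64)  # stand-in for one frame of cepstral coefficients
    for bit in (0, 1):
        marked = embed_bit(frame, bit)
        print(bit, extract_bit(marked))
```

In this sketch the extractor compares only the two mean differences, which is why it needs neither the original frame nor the size of the offsets applied during embedding.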