This document discusses various sound features that can be extracted for speech processing, including energy, tempo, pitches, zero crossing rate, spectrogram, and spectral centroid. Energy measures the loudness of a signal. Tempo estimates the speed of a musical piece in beats per minute. Pitches depend on the vibration frequency, with higher pitches for faster vibrations. Zero crossing rate indicates the rate of sign changes in a signal. A spectrogram plots the amplitude of different frequencies over time. The spectral centroid characterizes where the "center of mass" of the spectrum is located and relates to the brightness of a sound.
Related topics: