The document proposes a new optimization algorithm, the Generalized Baum-Welch (GBW) algorithm, for discriminative training of hidden Markov models (HMMs). GBW is based on Lagrange relaxation of a transformed optimization problem. Both the Baum-Welch algorithm for maximum likelihood estimation of HMM parameters and the extended Baum-Welch (EBW) algorithm for discriminative training are special cases of GBW. The performance of GBW and EBW is compared on a Farsi large vocabulary continuous speech recognition task.