This paper introduces a generic two-parameter reinforcement scheme and presents an optimization method using a breeder genetic algorithm to enhance its performance. It compares different derived schemes in terms of speed and efficiency, demonstrating the algorithm's ability to adapt in stochastic environments. The findings support the development of absolutely expedient learning schemes which are particularly applicable in areas such as robotics and intelligent vehicle control.