This study investigates the optimal hyper-parameters of the Convolutional Recurrent Neural Network (CRNN) for acoustic event detection, focusing on input segment length, learning rate, and convergence criteria. Experimental results indicated that a longer input segment and batch normalization significantly improved performance, while varying batch data during training also had a positive effect. The findings suggest that the optimal learning rate and data processing methods are crucial for enhancing the effectiveness of CRNN in acoustic event detection.