The paper presents a model for log message anomaly detection that tackles the issue of imbalanced data using a sequence generative adversarial network (seqgan) for oversampling. The proposed model incorporates an autoencoder for feature extraction and a gated recurrent unit (GRU) for classification, demonstrating improved anomaly detection accuracy on three datasets (BGL, OpenStack, and Thunderbird) when compared to without oversampling. Results indicate that appropriate data balancing techniques significantly enhance performance in classifying sparse log messages.
Related topics: