The document discusses the problem of vocabulary mismatch in source code and software artifacts, emphasizing the need to normalize identifiers for improved performance in information retrieval techniques. It presents an algorithm for identifier normalization, explaining its application and evaluation with various identifiers and expansions. Future work includes exploring different sources of co-occurrence data and normalization in the context of information retrieval tasks.