This document presents an approach for named entity recognition (NER) on idiosyncratic web collections. It proposes a two-step method using candidate selection followed by supervised classification. Candidate n-grams are first extracted based on frequency and then classified using decision trees trained on part-of-speech tags and features from knowledge bases. Experiments on scientific paper collections show the approach achieves up to 85% accuracy, outperforming traditional NER and maximum entropy models. Leveraging graphs of scientific concepts and domain-specific resources are found to be important for this task.