This study validated a natural language processing (NLP) protocol for detecting signs and symptoms of heart failure (HF) in electronic health record (EHR) text notes. The protocol extracted mentions of 15 diagnostic criteria from the Framingham HF study from 400 EHR notes with an overall F-score of 0.91. The protocol also labeled encounters with the criteria mentioned with an F-score of 0.93. While challenges remain around data quality and syntactic diversity, information extracted about HF criteria appears useful for early detection of HF in downstream applications.