The document explores the creation of digital profiles using public social media information, highlighting the increasing reliance of employers on social media for pre-screening job applicants. It reviews existing tools for data harvesting, including web APIs and web scraping techniques using Selenium 2.0 to collect structured data. Additionally, it discusses the challenges in capturing data from dynamic web sources and provides performance results from testing different social media platforms with an implemented prototype called Scrapya.
Related topics: