Researchers have proposed "living labs" as a new evaluation paradigm for information retrieval (IR) systems. Living labs involve replacing components of an existing search engine with experimental systems and evaluating them using interactions from real users of the live site. Key aspects include an API that allows experimental systems to generate rankings offline and upload them to the API to be interleaved with the production ranking for test queries. This approach allows evaluation of experimental systems using real user data and behaviors in a realistic online setting at a large scale, addressing limitations of traditional offline evaluation methodologies. Open research platforms based on this living labs approach could help advance IR evaluation.