Real-time media can use P2P, server-based infrastructure, or a hybrid approach. Originally WebRTC used a direct P2P model but this does not scale well. Server-based approaches using media servers like SFUs and MCUs can scale to millions of users but require transcoding and break end-to-end encryption. A hybrid approach with an end-to-end media layer can provide scalability while maintaining low-latency and encryption by adapting to the weakest link in the network path.