Smart vision systems embed intelligence through advanced image processing to enhance visual experiences, improve interactions, and support decision-making. They rely on increasingly complex algorithms for tasks such as 3D reconstruction, gesture recognition, and event detection, and these algorithms must be matched to diverse platform architectures with differing performance and power constraints. Successful system design therefore requires matching applications, algorithms, and architectures to one another. The document discusses examples including 3D video interpolation, eye-gaze corrected video chatting, and a 3D camera prototype for elderly monitoring. It concludes that IBBT brings together competences in applications, algorithms, and architectures to enable new smart vision systems.