The document discusses differences between touch and in-air gestures, as well as potential solutions. Touch input has states like idle and active but lacks a tracking state when fingers are lifted. In-air gestures always require tracking via microphones but lack a clear mechanism to differentiate intended gestures from unintended movements. Possible solutions include adding reserved clutch actions to indicate intent and combining in-air gestures with other input modes.