This document describes an automated system using speech recognition to control desktop operations. The system was developed using Python libraries like pyttsx3 for text-to-speech conversion and speech recognition for speech-to-text. It allows users to perform various tasks like searching and managing files, automating web browsers like Google Chrome and YouTube, and other system operations through voice commands. The system was tested to successfully search files, copy/move/delete files, bookmark files, search the web, manage browser tabs, play videos and more using only voice. Future work may include improving speech recognition accuracy, adding eye tracking, incorporating machine learning and connecting to mobile devices for remote control.