This document describes a framework for a self-driving robot to follow natural language commands. The framework uses sequence modeling to learn the meaning of sentences describing a path and identify relevant objects and prepositions. It then uses this information in the cognizance phase to generate a path and move the robot to accomplish the navigational goal described in the input sentence. The researchers created a virtual environment using Unity game engine to simulate the robot and collect training data on floor plans, sentences, and robot paths. They preprocessed this data and used hidden Markov models and a probabilistic graphical model to represent the temporal segments of sentences and learn the relationships between objects for sequence modeling.