Date of this Version
United States Patent Application Publication Farahmand et al. Pub . No . : US 2018 / 0100662 A1
A controller for controlling an operation of an air - conditioning system conditioning an indoor space includes a data input to receive state data of the space at multiple points in the space , a memory to store a code of a reinforcement learning algorithm and a history of the state data and a history of control commands having been applied to the air - conditioning system , wherein the history of the control commands is associated with the state data and history of rewards , a processor coupled to the memory determines a value function outputting a cumulative value of the rewards and transmits a control command by using the reinforcement learning algorithm , and a data output to receive the control command from the processor and transmit a control signal to the air - conditioning system , wherein the control signal controls at least one actuator of the air - conditioning system according to the control command .