Thanks for the discussion!
We have indoor temperature measurements for the last two and half years.
There is apparent cycle pattern of daily 24 hours.
As we discussed, we just don't have the data of AC not working, or windowed opened. We do have some outdoor temperatures, and we may start to collect more.
So far, we found that a linear model of A liner model of f(t+4) = f(t) + rolling_average(f(t))*4 tracks very well with the past overheat history. But the model does not work well for general future prediction, as it overshoots significantly when the temperatures oscillate rapidly.
(That is the model has good recall, but may have poor precision as far as temperature regression is concerned.)
Currently, we may use the linear model as a base for overheat perdition together in considerations of the other necessary conditions, such as outdoor temperatures, the current temperatures, and the past history of the sites being overheat or not.
I'd like to learn if there are better methodologies or workflows, to make the model more robust, and simpler.