Introduction To Stochastic Control Theory (FHD 2024)

: She met an old sailor named Bellman, who told her: "An optimal policy means that no matter where the wind blows you today, your decision for tomorrow must be optimal from that new starting point." This was the Principle of Optimality , the heart of stochastic dynamic programming . The Voyage Home

Astra realized her deterministic math was useless here. She needed a new way to think—. The Three Lessons of the Isles To succeed, Astra had to master three new concepts: Introduction to stochastic control theory

Once upon a time, in the precisely measured kingdom of , there lived a royal engineer named Astra . Astra was famous for her clockwork inventions. In Determinista, everything followed strict laws: if you pushed a lever by exactly one inch, the gears turned exactly ten times. Her life was a series of predictable, deterministic equations . : She met an old sailor named Bellman,

One day, the King tasked Astra with building a self-steering ship to reach the distant . Confident, Astra built a ship with a perfect steering motor. But as soon as the ship hit the open ocean, it failed. The winds (disturbances) were unpredictable, and the ship’s compass (the sensor) vibrated with "noise," giving fuzzy readings. The Three Lessons of the Isles To succeed,

: She accepted that she could never know the ship's exact position. Instead of a single point, she viewed the ship as a "probability distribution"—a cloud of where it might be. She used a Kalman Filter to combine her fuzzy compass readings with her motor's known power to get the best possible guess.

She returned home not just as an engineer, but as a master of the stochastic world—knowing that while you can't control the wind, you can optimally control how you react to it. Stochastic Control Theory: Dynamic Programming Principle

: In Determinista, she only cared about reaching the goal. In the Isles, she had to balance. If she steered too hard to correct a small wave, she might waste all her fuel. She adopted the Linear-Quadratic Gaussian (LQG) approach: minimizing the "expected cost" of both the error and the effort used to fix it.