000 00492nam a22001697a 4500
005 20250215164626.0
008 250215b |||||||| |||| 00| 0 eng d
020 _a9781119815037
041 _aeng
082 _a006.31
100 _aPowell, Warren B.
_919190
245 _aReinforcement learning and stochastic optimization
_b: a unified framework for sequential decisions
260 _aU.S.A.
_bWiley
_c2022
300 _a1099p.
650 _aSpecial computer methods
_918109
942 _cBK
999 _c30661
_d30661