Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

Reinforcement Learning for Sequential Decision and Optimal Control

Best Price (Coupon Required):
Buy Reinforcement Learning for Sequential Decision and Optimal Control for $80.10 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

1 Offer Price Range: $89.00 - $89.00
BEST PRICE

Single Product Purchase

$80.10
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$89.00 $89.00

10% OFF
This deals requires coupon
$80.10
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9789811977831
Condition
New
Product Description

Have you ever wondered how AlphaZero learns to defeat the top human Go players? Do you have any clues about how an autonomous driving system can gradually develop self-driving skills beyond normal drivers? What is the key that enables AlphaStar to make decisions in Starcraft, a notoriously difficult strategy game that has partial information and complex rules? The core mechanism underlying those recent technical breakthroughs is reinforcement learning (RL), a theory that can help an agent to develop the self-evolution ability through continuing environment interactions. In the past few years, the AI community has witnessed phenomenal success of reinforcement learning in various fields, including chess games, computer games and robotic control. RL is also considered to be a promising and powerful tool to create general artificial intelligence in the future. As an interdisciplinary field of trial-and-error learning and optimal control, RL resembles how humans reinforce their intelligence by interacting with the environment and provides a principled solution for sequential decision making and optimal control in large-scale and complex problems. Since RL contains a wide range of new concepts and theories, scholars may be plagued by a number of questions: What is the inherent mechanism of reinforcement learning? What is the internal connection between RL and optimal control? How has RL evolved in the past few decades, and what are the milestones? How do we choose and implement practical and effective RL algorithms for real-world scenarios? What are the key challenges that RL faces today, and how can we solve them? What is the current trend of RL research? You can find answers to all those questions in this book. The purpose of the book is to help researchers and practitioners take a comprehensive view of RL and understand the in-depth connection between RL and optimal control. The book includes not only systematic and thorough explanations of theoretical basics but also methodical guidance of practical algorithm implementations. The book intends to provide a comprehensive coverage of both classic theories and recent achievements, and the content is carefully and logically organized, including basic topics such as the main concepts and terminologies of RL, Markov decision process (MDP), Bellmans optimality condition, Monte Carlo learning, temporal difference learning, stochastic dynamic programming, function approximation, policy gradient methods, approximate dynamic programming, and deep RL, as well as the latest advances in action and state constraints, safety guarantee, reference harmonization, robust RL, partially observable MDP, multiagent RL, inverse RL, offline RL, and so on.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

British Historical Facts, 1830-1900

British Historical Facts, 1830-1900

$74.99
Buchfhrung und Jahresabschlu

Buchfhrung und Jahresabschlu

$74.99
Das Widerstands- und Ultraschallschweien als Verfahren zum Verbinden kleinster Bauelemente in der E

Das Widerstands- und Ultraschallschweien als Verfahren zum Verbinden kleinster Bauelemente in der E

$59.99
Philip Larkin: Art and Self

Philip Larkin: Art and Self

$54.99
Mining and the Freshwater Environment

Mining and the Freshwater Environment

$39.99
Moving Objects Detection Using Machine Learning

Moving Objects Detection Using Machine Learning

$54.99
Wechselwirkungen zwischen Landnutzung und Klimawandel

Wechselwirkungen zwischen Landnutzung und Klimawandel

$49.99
Algebraic Geometry

Algebraic Geometry

$34.99
Rhetorik

Rhetorik

$69.99
Migrant Remittances in South Asia

Migrant Remittances in South Asia

$54.99
The Very Busy Spider

The Very Busy Spider

$8.21
The Complete Handbook of the Internet

The Complete Handbook of the Internet

$99.99
Die Telegraphentechnik

Die Telegraphentechnik

$59.99
Verbrennungen

Verbrennungen

$129.99
Building a Columnar Database on RAMCloud

Building a Columnar Database on RAMCloud

$54.99
How to be Critically Open-Minded: A Psychological and Historical Analysis

How to be Critically Open-Minded: A Psychological and Historical Analysis

$39.99
Re-imagining Senior Secondary Religious Education

Re-imagining Senior Secondary Religious Education

$54.99
Neural Information Processing

Neural Information Processing

$39.99
Public Credit Rating Agencies

Public Credit Rating Agencies

$109.99
Ueber die Vortheile der Anwendung hoch erhitzter Luft fr die Verbrennung im Allgemeinen, sowie im B

Ueber die Vortheile der Anwendung hoch erhitzter Luft fr die Verbrennung im Allgemeinen, sowie im B

$84.99
Mechanisms of Power in the Soviet Union

Mechanisms of Power in the Soviet Union

$54.99
The Catholic Church in China

The Catholic Church in China

$54.99
Prozessmanagement in Einkauf und Logistik

Prozessmanagement in Einkauf und Logistik

$19.99
Mechanics of Composite, Hybrid and Multifunctional Materials, Fracture, Fatigue, Failure and Damage

Mechanics of Composite, Hybrid and Multifunctional Materials, Fracture, Fatigue, Failure and Damage

$189.00
Hydrogen-Bonded Liquids

Hydrogen-Bonded Liquids

$219.99
Kundenberatung

Kundenberatung

$19.99
Prions and Diseases

Prions and Diseases

$169.99
Nonlinear Dynamics in Optical Complex Systems

Nonlinear Dynamics in Optical Complex Systems

$169.99
Die tiologie der Syphilis

Die tiologie der Syphilis

$59.99
Circumcision and Human Rights

Circumcision and Human Rights

$169.99
Egyptian Coastal Lakes and Wetlands: Part I

Egyptian Coastal Lakes and Wetlands: Part I

$379.99
Semantic Web Services and Web Process Composition

Semantic Web Services and Web Process Composition

$54.99
Materiality and Subject in Marxism, (Post-)Structuralism, and Material Semiotics

Materiality and Subject in Marxism, (Post-)Structuralism, and Material Semiotics

$54.99
Legacies of David Cranz's 'Historie von Grnland' (1765)

Legacies of David Cranz's 'Historie von Grnland' (1765)

$139.99
Quantum Transport in Submicron Devices

Quantum Transport in Submicron Devices

$109.99
Die beseelte Organisation und ihr Geist

Die beseelte Organisation und ihr Geist

$29.99
Phnom Penh Water Story

Phnom Penh Water Story

$99.99
Rodent Quality Control: Genes and Bugs

Rodent Quality Control: Genes and Bugs

$179.99
Befunde empirischer Forschung zu Umweltbildung und Umweltbewutsein

Befunde empirischer Forschung zu Umweltbildung und Umweltbewutsein

$59.99
Welfare Aspects of Transgenic Animals

Welfare Aspects of Transgenic Animals

$109.99
previous
next