Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

This course delves into the principles and practical application of reinforcement learning (RL) and sequential decision-making within agentic AI systems. Participants will acquire the skills to design, train, and assess agents that interact dynamically with their surroundings to achieve long-term objectives through continuous learning and adaptation.

This instructor-led, live training (available online or onsite) is tailored for advanced engineers and researchers seeking to integrate reinforcement learning and planning algorithms into agentic systems for automation, robotics, and adaptive reasoning.

Upon completion of this training, participants will be able to:

Grasp the mathematical foundations of reinforcement learning and decision-making.
Implement core RL algorithms, including DQN, PPO, and A3C, using Python and PyTorch.
Model environments using OpenAI Gym and design custom simulation scenarios.
Train, evaluate, and debug agents for both continuous and discrete control tasks.
Apply reinforcement learning techniques to agentic AI applications in robotics and planning.
Balance exploration, exploitation, and safety constraints for real-world deployment.

Course Format

Instructor-led lectures accompanied by live coding demonstrations.
Hands-on exercises utilising open-source frameworks and simulation environments.
Applied project focusing on integrating decision-making into an agentic AI system.

Customisation Options

To request a tailored training session for this course, please contact us to make arrangements.

This course is available as onsite live training in Botswana or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction to Reinforcement Learning and Agentic AI

Decision-making under uncertainty and sequential planning.
Key components of RL: agents, environments, states, and rewards.
The role of RL in adaptive and agentic AI systems.

Markov Decision Processes (MDPs)

Formal definition and properties of MDPs.
Value functions, Bellman equations, and dynamic programming.
Policy evaluation, improvement, and iteration.

Model-Free Reinforcement Learning

Monte Carlo and Temporal-Difference (TD) learning.
Q-learning and SARSA.
Hands-on: implementing tabular RL methods in Python.

Deep Reinforcement Learning

Combining neural networks with RL for function approximation.
Deep Q-Networks (DQN) and experience replay.
Actor-Critic architectures and policy gradients.
Hands-on: training an agent using DQN and PPO with Stable-Baselines3.

Exploration Strategies and Reward Shaping

Balancing exploration vs. exploitation (ε-greedy, UCB, entropy methods).
Designing reward functions and avoiding unintended behaviours.
Reward shaping and curriculum learning.

Advanced Topics in RL and Decision-Making

Multi-agent reinforcement learning and cooperative strategies.
Hierarchical reinforcement learning and options framework.
Offline RL and imitation learning for safer deployment.

Simulation Environments and Evaluation

Using OpenAI Gym and custom environments.
Continuous vs. discrete action spaces.
Metrics for agent performance, stability, and sample efficiency.

Integrating RL into Agentic AI Systems

Combining reasoning and RL in hybrid agent architectures.
Integrating reinforcement learning with tool-using agents.
Operational considerations for scaling and deployment.

Capstone Project

Design and implement a reinforcement learning agent for a simulated task.
Analyse training performance and optimise hyperparameters.
Demonstrate adaptive behaviour and decision-making in an agentic context.

Summary and Next Steps

Requirements

Strong proficiency in Python programming.
Solid understanding of machine learning and deep learning concepts.
Familiarity with linear algebra, probability, and basic optimisation methods.

Audience

Reinforcement learning engineers and applied AI researchers.
Robotics and automation developers.
Engineering teams working on adaptive and agentic AI systems.

28 Hours

Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793

Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

Course Outline

Requirements

Testimonials (3)

CLIFFORD TABARES - Universal Leaf Philippines, Inc.

Course - Agentic AI for Business Automation: Use Cases & Integration

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

Course Outline

Requirements

Testimonials (3)

CLIFFORD TABARES - Universal Leaf Philippines, Inc.

Course - Agentic AI for Business Automation: Use Cases & Integration

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI

Related Courses

Autonomous Decision-Making with Agentic AI

Understanding Agentic AI: Concepts and Capabilities

Agentic AI for Business Automation: Use Cases & Integration

Agentic AI for Enterprise Applications

Agentic AI and the Future of Work

Governance and Security Patterns for WrenAI in the Enterprise

Modernizing Legacy BI with WrenAI: Adoption, Migration, and Change Management

Quality and Observability for WrenAI: Evaluation, Prompt Tuning, and Monitoring

Course Format

Customisation Options

Building with the WrenAI API: Applications, Charts, and NL to SQL

WrenAI Cloud Essentials: From Data Sources to Dashboards

WrenAI for Financial Analytics: KPI Modeling and Regulatory-Aware Dashboards

WrenAI OSS Deep Dive: Semantic Modeling, Text to SQL, and Guardrails

WrenAI for Product Teams: Conversational Analytics and Self-Service BI

Deploying WrenAI for SaaS: Embedded GenBI in Customer-Facing Products

Operational Analytics with WrenAI Spreadsheets and Metrics Library

Related Categories

Agentic AI

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites