LogoAIAny
Icon for item

Tianshou

Provides a modular PyTorch-based reinforcement learning library with dual high-level and procedural APIs — supporting online/offline, model-based and multi-agent algorithms, vectorized environments, and configurable training pipelines for research and engineering.

Introduction

RL implementations often stall between paper pseudocode and reliable training pipelines; Tianshou is focused on closing that gap by offering abstractions that make both algorithm development and experiment runs concise and reproducible.

What Sets It Apart
  • Dual API design: separates a high-level, declarative ExperimentBuilder-style API for running experiments from a procedural API for algorithm authors, so you can prototype quickly while retaining fine-grained control.
  • Broad algorithm coverage with pragmatic engineering: many canonical on-policy, off-policy and offline RL algorithms are implemented (DQN variants, PPO/SAC/TD3, CQL, PER, etc.), meaning fewer reimplementations when comparing methods.
  • Vectorized and high-performance workflows: built-in support for vectorized environments and integration-friendly interfaces (logging, multi-GPU, env acceleration) — useful when moving from single-run research to larger benchmarking.
  • Clear abstractions (Algorithm vs Policy, trainer parameters): reduces accidental coupling between training loop and policy implementation, making code easier to maintain and extend.
Great fit if...
  • You need a PyTorch-native RL codebase that lets you iterate on algorithms and run reproducible benchmarks without rewriting low-level training code.
  • You want both easy experiment configuration (high-level) and the ability to implement novel algorithmic details (procedural API).

Look elsewhere if...

  • You require a turnkey production inference/serving platform (Tianshou focuses on training and research plumbing rather than model serving).
  • Your work depends on a different deep-learning backend (it’s PyTorch-first).
Where It Fits

Tianshou sits between minimal educational RL code (good for toy examples) and heavyweight RL platforms: it emphasizes modular, readable algorithm implementations and experiment reproducibility, making it a solid choice for RL researchers and engineers who want production-quality training pipelines without sacrificing research flexibility.

Information

  • Websitegithub.com
  • Authorsthu-ml (Tsinghua University Machine Learning Group)
  • Published date2020/03/09

Categories