Tag
Explore by tags
Orchestrates and scales Python-based AI/ML workloads from laptop to thousands of GPUs — exposing task and actor primitives plus high-level libraries for training, hyperparameter tuning, serving, RL, and data processing. Designed for heterogeneous accelerators and production ML pipelines.
Provides a modular PyTorch-based reinforcement learning library with dual high-level and procedural APIs — supporting online/offline, model-based and multi-agent algorithms, vectorized environments, and configurable training pipelines for research and engineering.
Hands-on coding tutorial series for large language models with slides and runnable notebooks covering fine-tuning, prompting, RLHF, safety, steganography, watermarking, multimodal models, GUI agents, and deployment. Community-maintained, free course materials for students and researchers.
