Provides APIs to build, learn, and run Bayesian and dynamic Bayesian networks, perform probabilistic inference, and compute interventional/counterfactual queries. Ships example notebooks, tutorials, and PyPI/conda packages. ([github.com](https://github.com/pgmpy/pgmpy))
High-performance, scalable gradient-boosted decision tree library for regression, classification, ranking and custom objectives. Multi-language bindings (Python, R, Java, Scala, C++), single-node, distributed and GPU training — widely used for tabular data and ML competitions.
Programmatically author, schedule, and monitor data workflows using Python-defined DAGs. Features modular executors, rich provider/operator ecosystem (Kubernetes, AWS, GCP), and built-in scheduling/monitoring for batch and ML pipelines.
Processes and indexes digital evidence for forensic analysis (disk images, files, timelines). Offers high-speed carving, OCR (Tesseract), NER, similar-document/image and face search, audio transcription and scriptable parsers — Java-based and extensible for investigators.