Processes and indexes digital evidence (disk images, files, timelines) for forensic analysis. Offers high-speed file carving, OCR (Tesseract), named-entity recognition (NER), similar-document and similar-image search, face search, audio transcription, and scriptable parsers. Java-based and extensible, aimed at investigators.
Provides research-grade implementations and pretrained models for sequence tasks (translation, language modeling, speech). Offers multi-GPU training, fast generation (beam search, sampling, lexical constraints), mixed-precision training, and state sharding. Aimed at researchers reproducing or extending papers.
FunASR is an open-source, end-to-end automatic speech recognition (ASR) toolkit led by Alibaba DAMO Academy. It supports ASR, voice activity detection (VAD), punctuation restoration, speaker verification and diarization, multi-talker ASR, emotion recognition, and more. FunASR provides many industrial-grade pretrained models, inference scripts, and deployment runtimes for research and production use.