Tag
Explore by tags
Processes and indexes digital evidence (disk images, files, timelines) for forensic analysis. Offers high-speed file carving, OCR (Tesseract), named-entity recognition, similar-document, similar-image, and face search, audio transcription, and scriptable parsers. Java-based and extensible, aimed at investigators.
Provides research-grade implementations and pretrained models for sequence tasks (translation, language modeling, speech). Offers multi-GPU training, fast generation (beam search, sampling, lexically constrained decoding), mixed-precision training, and state sharding. Aimed at researchers reproducing or extending papers.
A 57-subject multiple-choice benchmark for measuring broad language understanding in LLMs. Provides per-subject configs and test/dev/auxiliary_train splits for few-shot and zero-shot evaluation. Widely used for model comparison and academic reporting.
