olmOCR is an open-source toolkit from the Allen Institute for AI (AI2) / AllenNLP team for converting image-based documents (PDF, PNG, JPEG) into clean, readable plain text or Markdown. It uses a 7B-parameter vision-language model to handle complex layouts, equations, tables and handwriting, removes headers/footers, and outputs text in natural reading order. The repo includes a processing pipeline, benchmark suite (olmOCR-Bench), training and RL components, Docker images, and an online demo. Licensed under Apache 2.0.