Ollama

A lightweight open-source platform for running, managing, and integrating large language models locally via a simple CLI and REST API.

Visit Website

Introduction

Ollama lets developers pull, run, and customize state-of-the-art open-source LLMs such as Llama 3, Qwen, and Gemma directly on macOS, Linux, and Windows machines. Its Go-based runtime provides a command-line interface (ollama run, ollama list, etc.) and an OpenAI-compatible REST API, making local models drop-in replacements for cloud endpoints. Beyond basic chat completion, Ollama supports embeddings, tool/function calling, structured JSON outputs, streaming responses, and multi-modal vision models. The project ships pre-built binaries with GPU acceleration (NVIDIA, AMD, Apple Silicon) and can also run in Docker. A growing model library and Python/JavaScript client SDKs simplify integration into RAG pipelines, VS Code extensions, and other AI-powered apps. Founded by Jeffrey Morgan and Michael Chiang (YC W21), Ollama is fully open source under the MIT license and has an active community on GitHub and Discord.

Back

Information

Websiteollama.ai
AuthorsJeffrey Morgan, Michael Chiang
Published date2023/08/01

More Items

ElizaOS

2024

ElizaOS (elizaos)

ElizaOS is an open-source, extensible platform for building, deploying, and managing autonomous AI agents. It provides a CLI, web UI, modular core, and plugin system; is model-agnostic (supports many model providers); offers connectors to chat platforms, document ingestion (RAG), and is designed for multi-agent orchestration and production deployment.

ai-agent plugin ai-framework ai-development ai-tools+2

Perplexica

2024

ItzCrazyKns

Perplexica is an open-source, privacy-focused AI answering engine that runs on your own hardware. It combines private web search (bundled SearxNG) with local and cloud LLMs (supports local models via Ollama and cloud providers) to produce cited answers, file-based Q&A, image/video search, and configurable search modes. It’s designed for self-hosting and developer integration (Docker, API, docs).

LLM RAG ai-client ai-tools ai-api+3

SkyPilot

2021

skypilot-org, Sky Computing Lab (UC Berkeley)

SkyPilot is an open-source MLOps / AI infrastructure project that provides a unified control plane and CLI to run, manage, and scale AI workloads on any compute — Kubernetes, Slurm, 20+ clouds, or on-prem clusters. It supports job-as-code (YAML/Python), intelligent scheduling and cost optimization (spot instances, autostop), automatic setup/sync of environments, auto-recovery, and integrations for training, serving and inference workflows.

mlops ai-serving ai-train ai-workflow ai-inference+2