AI Client2023

Tabbyml.com

Self-hosted AI coding assistant you run on your own hardware as an alternative to cloud Copilot. Offers context-aware completion, an in-IDE answer engine and chat, using RAG over your repositories so suggestions match your team's code.

Visit Website

Introduction

The case for a coding assistant you host yourself is rarely about the model quality — it's about who sees your code. For teams under data-sovereignty rules or working on proprietary codebases, sending every keystroke to a cloud service is a non-starter, and that constraint, not feature parity, is what Tabby is built around. It runs entirely on-premises with no external database or cloud dependency, so the whole loop stays inside your network.

What Sets It Apart

Context comes from your own repositories: RAG-based completion pulls in team-specific patterns rather than only the open buffer, so suggestions match how your codebase actually works.
It is more than autocomplete — an Answer Engine indexes internal docs and code to answer questions inside the IDE, plus inline chat and data connectors for external sources.
It targets accessible hardware: inference runs on consumer-grade GPUs, lowering the bar to self-hosting versus infrastructure that assumes a data-center fleet.
An OpenAPI interface makes it plug into existing setups (cloud IDEs, internal tooling) instead of forcing a closed ecosystem.

Who It's For

Great fit if you need code AI that never leaves your perimeter — regulated industries, security-conscious teams, or anyone wanting an open-source, auditable alternative to Copilot they can run on hardware they own. Look elsewhere if you want zero-setup convenience and don't care where your code goes: a hosted service will be faster to start and likely sharper out of the box, since self-hosting means you own deployment, GPU provisioning, and ongoing maintenance.

Back

Information

Websitewww.tabbyml.com
AuthorsTabbyML
Published date2023/03/16

More Items

AI Agent2026

OpenHarness

HKUDS

Provides a lightweight Python harness that turns LLMs into working agents with tool-use, skills, persistent memory, permission controls and multi-agent coordination. Ships with a CLI/React TUI, 43+ built-in tools, a plugin/skill system and the ohmo personal-agent for chat gateways. Best for developers prototyping agent workflows and multi-agent experiments.

agent-skills mcp mcp-client ai-agent cli+10

AI Model2026

Bonsai Demo

PrismML-Eng, PrismML

Runs the Bonsai family of quantized LLMs locally (including vision-capable 27B): provides scripts and demo UIs to run 1-bit and ternary Bonsai models on macOS (Metal), Linux/Windows (CUDA/Vulkan/ROCm), or CPU, with long context, tool-calling and an optional Open WebUI agent demo.

llm vision multimodal huggingface github+5

AI Client2025

BrowserOS

Nithin Sonti, Nikhil Sonti +1Felafax, Inc., browseros-ai (GitHub)

Turns Chromium into a local-first AI browser with an embedded assistant that can summarise pages, extract structured data, automate web tasks, and run scheduled agents. Built as an open-source Chromium fork with 53+ built-in browser tools, 40+ app integrations, and support for BYO AI keys or fully local models (Ollama / LM Studio).

mcp mcp-server mcp-client ai-agent ai-client+8