Tag

Explore by tags

AI API · 2023

Ollama

Run and manage open and community LLMs locally via a compact CLI and REST API. Supports model import, Docker deployment, and official Python/JS SDKs for local inference, RAG, and dev workflows.

ollama llm ai-inference ai-serving docker +5

AI Infra · 2010

Distributed search and analytics engine and vector database built on Lucene that enables near-real-time full-text and vector search, indexing, and analytics over large datasets. Provides vector embeddings support, REST APIs, RAG-friendly features, and deployment options including Elastic Cloud and Docker.

java github embeddings RAG ai-serving +1

MLOps · 2014

Apache Airflow

Apache Software Foundation, Maxime Beauchemin (originated at Airbnb)

Programmatically author, schedule, and monitor data workflows as Python-defined DAGs; the scheduler handles dependencies, retries, and backfills. Pluggable executors (Local, Celery, Kubernetes) and a broad provider ecosystem for AWS, GCP, and databases.

mlops python docker ai-workflow ai-development +3

AI Infra · 2017

Ray (by Anyscale)

Anyscale, RISELab (UC Berkeley)

Scales any Python or ML workload across CPUs and GPUs with a few decorators, instead of rewriting code for Spark or MPI. Bundles libraries for distributed training, hyperparameter tuning, RL, batch inference, and online model serving on one cluster.

mlops ai-inference ai-serving ai-train ai-development +6

AI Infra · 2018

NautilusTrader

Nautech Systems Pty Ltd

Rust-native, event-driven trading platform for backtesting and live execution across crypto, forex, equities, and futures on 27+ venues. The same strategy code runs in nanosecond backtests and in production, giving true research-to-live parity.

github python rust mlops ai-train +2

MLOps · 2018

Prefect

PrefectHQ

Orchestrates and schedules Python data pipelines and workflows with primitives for retries, caching, parameters, and deployments. Provides either a self-hosted server or managed Prefect Cloud for monitoring, observability, and integrations across common data tools.

mlops python ai-workflow docker cli +2

AI Deploy · 2018

Triton Inference Server

NVIDIA Corporation

Serves machine learning and deep learning models in cloud, data center, edge, and embedded environments. Supports multiple frameworks and backends, dynamic and sequence batching, HTTP/gRPC APIs, Docker deployment, and NVIDIA-optimized runtimes.

nvidia ai-inference ai-serving tensorrt pytorch +5

AI Deploy · 2019

Bento: Run Inference at Scale

BentoML Team; Atalaya Tech, Inc. (BentoML), Modular

Turns Python ML code into production inference APIs that scale on Kubernetes or any cloud. Bundles models, dependencies, and serving logic into versioned "Bentos" with autoscaling, scale-to-zero, and multi-GPU serving for LLMs and custom models.

mlops ai-inference ai-serving ai-deploy python +3

AI Agent · 2019

Baserow

Baserow B.V.

Open-source Airtable alternative for building databases, apps, automations, and AI agents without code over a PostgreSQL-backed REST API. The Kuma assistant turns plain language into tables and workflows; self-hostable with full data ownership.

ai-agent ai-tools ai-development github postgres +4

AI Others · 2019

n8n.io

Jan Oberhauser, n8n GmbH

Node-based platform for building automation workflows that wire together 400+ apps and 70+ LangChain AI nodes, supporting agents, RAG, and 12+ LLM providers. Fair-code licensed and self-hostable, so pricing is server time rather than per-operation.

nodejs docker github ai-api ai-workflow +1

Chatbot · 2019

Chatwoot

Chatwoot Inc.

Consolidates customer conversations from website chat, email, social, and messaging channels into a single support inbox, with self-hosting and Docker or one-click deployment options. Includes an optional AI agent (Captain) for automated replies, multilingual translation, and integrations.

chatbot ai-tools ai-agent docker github +3

AI Infra · 2019

Milvus

Zilliz, LF AI & Data Foundation

Runs approximate nearest-neighbor search over billions of vector embeddings, separating compute from storage so reads and writes scale independently. Offers HNSW, IVF, DiskANN, and GPU CAGRA indexes plus hybrid dense+sparse and BM25 retrieval.

embeddings RAG ai-serving ai-inference python +2