AI Image · 2018

Kornia

Brings classic computer vision into PyTorch as differentiable, GPU-accelerated tensor operators — filters, geometric transforms, feature matching, camera calibration — so each step lives inside autograd and trains end-to-end with neural networks.

Introduction

Most computer-vision toolkits sit outside the neural network: OpenCV warps an image or detects features, then hands a detached array back to your model, and the gradient chain stops cold there. Kornia's bet is to rebuild that classic toolkit so every operation is a plain differentiable tensor function — meaning a homography warp, a Sobel filter, or a camera-calibration step can sit inside the forward pass and receive gradients like any other layer.

What Sets It Apart

Differentiable by construction — 500+ operators (filtering, morphology, homography warping, epipolar geometry, pose estimation) are all autograd-compatible and run on GPU, so geometry and photometry become trainable rather than frozen preprocessing.
Tensor-native, batched, device-agnostic — everything operates on (B, C, H, W) tensors, so a whole batch of augmentations or warps runs on the same device as your model with no NumPy round-trips.
CV primitives, not just augmentation — beyond AutoAugment/RandAugment-style pipelines, it ships feature matching (LoFTR, local descriptors), face detection, and multi-view geometry, work that usually means stitching several libraries together.

Great Fit If / Look Elsewhere If

Great fit if you're building end-to-end pipelines where the camera model or a geometric step needs gradients: self-supervised depth, differentiable rendering, robotics, spatial AI.

Look elsewhere if you just need fast classic CV on CPU with no training loop: OpenCV is lighter and broader there. Differentiability and the PyTorch dependency are the price of admission, and some half-precision paths carry documented limits.

Information

Website: github.com
Organizations: kornia.ai
Authors: Kornia contributors, E. Riba, D. Mishkin, D. Ponsa, E. Rublee, G. Bradski
Published date: 2018/08/22

More Items

AI Image · 2026

无限画布 (infinite-canvas)

Node-based infinite-canvas web workstation for iterative visual creation — integrates image/video generation, reference editing, prompt library, multi-agent assistants, and asset management. Runs in-browser with configurable OpenAI-compatible endpoints; suited for local/personal deployment (AGPL-3.0).

ai-image image ai-agent mcp-client typescript+6

AI Model · 2026

Giga-World-1

open-gigaai

Diffusion-based generative model for scene and video synthesis, providing full Diffusers checkpoints and scene LoRA for fast adaptation. Includes Stage‑1 nano (1.3B) and pro (5B) variants and modular transformer/VAE components.

diffusers huggingface lora pytorch image+4

AI Image · 2026

Sun Direction LoRA (Flux2Klein 9B)

eric-venti-seeds

Applies or repositions directional sunlight in outdoor images by using a LoRA trained for Flux2Klein 9B to match a reference sun elevation and rotation. Workflow uses an overcast intermediate and a sphere (ball) reference; includes a ComfyUI node and Blender scene for rendering the reference.

ai-image image huggingface diffusers ai-demos