AIAny - AI Video

AI Video2024

Veo

Generates cinematic video from text and image prompts, with newer versions adding native audio and tighter creative controls. It is built for high-fidelity clips that can move from quick Gemini experiments to API and Flow workflows.

ai-tools ai-video vision

AI Video2024

Hailuo AI

MiniMax

Turns text prompts or still photos into short video clips via effect templates (dance, skydiving, character morphing) plus image-to-video animation. Adds synced AI voiceover and music; Hailuo 2.3 targets stable physics and micro-expressions.

ai-tools ai-video

AI Video2025

Seedance

ByteDance Seed

Generates 1080p videos from text or images, with native multi-shot storytelling that keeps subjects, style, and atmosphere consistent across cuts. Ranked first on Artificial Analysis T2V and I2V leaderboards, ahead of Veo 3 and Kling 2.0.

ai-tools ai-video bytedance

AI Video2023

Runway

Runway AI, Inc.

Turns text, images, and source footage into AI-generated video and world-model outputs. Its edge is the bridge between browser tools, research models, and production workflows for creative teams.

ai-tools ai-video vision

AI Train2023

AI Toolkit

Ostris

Trains and fine-tunes diffusion models on consumer GPUs: LoRA and LoKr for image families like FLUX.1/2, SDXL and Qwen-Image, plus video models such as Wan 2.x and LTX. Layer-specific targeting, configurable VRAM, and a browser dashboard for runs.

github ai-train ai-image ai-video huggingface

AI Video2024

KlingAI

Kuaishou Technology

Generates videos and images from text or reference images, with model updates aimed at higher motion realism and creator-friendly controls. Best for fast concept clips, ads, and social assets rather than fully predictable production footage.

ai-tools ai-image ai-video vision

AI Video2022

Reddit Video Maker Bot 🎥

Lewis Menelaws (Elebumm), TMRRW

Generate short social videos from Reddit threads in one command — captures post content, assembles visuals and optional TTS narration, and outputs an upload-ready MP4. Runs locally with Python + Playwright; does not auto-upload for safety.

ai-video video github python cli

AI Video2022

SadTalker

Wenxuan Zhang, Xiaodong Cun +6

Generate a lip-synced talking-head video from a single portrait image and an audio clip using learned 3D motion coefficients for realistic expression and head motion. Offers still/reference modes, Colab/HuggingFace demos, and an Apache-2.0 license.

audio video ai-video pytorch github+3

AI Image2023

X-AnyLabeling

Wei Wang, CVHub

X-AnyLabeling is a powerful annotation tool integrated with an AI engine for fast and automatic labeling. Designed for multi-modal data engineers, it offers industrial-grade solutions for complex tasks. Supports images and videos, GPU acceleration, custom models, one-click inference for all task images, and import/export formats like COCO, VOC, YOLO. Handles classification, detection, segmentation, captioning, rotation, tracking, estimation, OCR, VQA, grounding, etc., with various annotation styles including polygons, rectangles, rotated boxes.

github ai-tools vision ai-image ai-video+4

AI Image2023

Comfy.org

Comfy Org

Create and run node-based generative AI workflows for images, video, 3D, and audio — reusable, shareable node graphs with custom nodes, live previews, and local/cloud runtime options. Open-source with Comfy Cloud and Hub for creators.

ai-tools ai-image ai-video audio ai-client+1

AI Image2023

Open Generative AI

Anil-matcha (matchaman11), Muapi.ai

Provides an uncensored, self‑hostable studio for generating AI images, videos, and lip‑synced talking videos in browser or desktop. Integrates 200+ models via Muapi.ai, supports local inference (stable-diffusion.cpp), multi-image inputs and workflow automation — no content filters.

github ai-image ai-video ai-api AIGC+4

AI Image2023

Generative Models by Stability AI

Stability AI

Reference implementation for Stability AI's diffusion models: SDXL base/refiner/Turbo for text-to-image, plus Stable Video Diffusion, SV3D, and SV4D for image-to-video and 4D synthesis. A modular engine separates samplers, guiders, and conditioners.

pytorch video vision ai-image ai-video+5

Category

Explore by categories

All Categories

AI Leaderboard

AI Agent Tutorials

AI Coding Tutorials

AI Model

AI Agent Papers

Chatbot

AI Dataset

Machine Learning Foundation Books

AI Train

AI Deploy

AI Client

Machine Learning Foundation Papers

Machine Learning Foundation Tutorials

AI Image Demos

AI Agent

Large Language Model Tutorials

Large Language Model Papers

Machine Learning Engineering Papers

Computer Vision Tutorials

Computer Vision Papers

Natural Language Processing Papers

Reinforcement Learning Papers

Speech Technology Papers

AI API

AI Coding

AI Image

AI Video

MLOps

MCP Client

MCP Server

AI Video Papers

AI Audio

AI Others

AI Infra

Embodied AI

Veo

Hailuo AI

Seedance

Runway

AI Toolkit

KlingAI

Reddit Video Maker Bot 🎥

SadTalker

X-AnyLabeling

Comfy.org

Open Generative AI

Generative Models by Stability AI