LogoAIAny
  • Search
  • Collection
  • Category
  • Tag
  • Blog
LogoAIAny

Category

Explore by categories

LogoAIAny

Curated AI Resources for Everyone

support@aiany.app
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
Company
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
  • All

  • AI Leaderboard

  • AI Agent Tutorials

  • AI Coding Tutorials

  • AI Agent Papers

  • Chatbot

  • Machine Learning Foundation Books

  • AI Train

  • AI Deploy

  • AI Client

  • Machine Learning Foundation Papers

  • Machine Learning Foundation Tutorials

  • AI Image Demos

  • AI Agent

  • Large Language Model Tutorials

  • Large Language Model Papers

  • Machine Learning Engineering Papers

  • Computer Vision Tutorials

  • Computer Vision Papers

  • Natural Language Processing Papers

  • Reinforcement Learning Papers

  • Speech Technology Papers

  • AI API

  • AI Coding

  • AI Image

  • AI Video

  • MLOps

  • MCP Client

  • MCP Server

  • AI Video Papers

  • AI Audio

  • AI Infra

  • Embodied AI

Icon for item

Veo

2024
Google DeepMind

Veo is a state-of-the-art video generation model developed by Google DeepMind, designed to empower filmmakers and storytellers.

ai-toolsai-videovision
Icon for item

Sora2

2025
OpenAI

OpenAI’s latest video-and-audio generation model with improved physics, realism, controllability, and synchronized dialogue and sound effects, available via the new Sora app.

ai-videoopenaisora
Icon for item

Hailuo AI

2024
MiniMax

An AI-powered video generation platform by MiniMax that transforms text descriptions or images into short, dynamic AI videos.

ai-toolsai-video
Icon for item

Seedance

2025
ByteDance Seed

A model that supports multi-shot video generation from both text and image. It achieves breakthroughs in semantic understanding and prompt following, and can create 1080p videos with smooth motion, rich details, and cinematic aesthetics.

ai-toolsai-videobytedance
Icon for item

Runway

2023
Runway AI, Inc.

With Runway Gen-4, you are now able to precisely generate consistent characters, locations and objects across scenes. Simply set your look and feel and the model will maintain coherent world environments while preserving the distinctive style, mood and cinematographic elements of each frame. Then, regenerate those elements from multiple perspectives and positions within your scenes.

ai-toolsai-videovision
Icon for item

AI Toolkit

2023
Ostris

AI Toolkit is an all-in-one training suite for finetuning diffusion models, supporting image and video models on consumer-grade hardware. It offers GUI and CLI interfaces, making it user-friendly yet feature-rich, with capabilities for dataset handling, LoRA/LoKr training, layer-specific training, and integrations with platforms like RunPod and Modal. It supports models like FLUX.1 and SDXL, requiring an NVIDIA GPU with at least 24GB VRAM.

githubai-trainai-imageai-videohuggingface
Icon for item

KlingAI

2024
Kuaishou Technology

Kling AI, tools for creating imaginative images and videos, based on state-of-art generative AI methods.

ai-toolsai-imageai-videovision
Icon for item

X-AnyLabeling

2023
Wei Wang, CVHub

X-AnyLabeling is a powerful annotation tool integrated with an AI engine for fast and automatic labeling. Designed for multi-modal data engineers, it offers industrial-grade solutions for complex tasks. Supports images and videos, GPU acceleration, custom models, one-click inference for all task images, and import/export formats like COCO, VOC, YOLO. Handles classification, detection, segmentation, captioning, rotation, tracking, estimation, OCR, VQA, grounding, etc., with various annotation styles including polygons, rectangles, rotated boxes.

githubai-toolsvisionai-imageai-video+4
Icon for item

Generative Models by Stability AI

2023
Stability AI

An open-source repository from Stability AI that collects implementations, training configs, demos and inference scripts for multiple generative models (e.g. SDXL, SV3D, SV4D, SV4D 2.0, Stable Video Diffusion). It is modular and config-driven, provides sampling/demo scripts, training examples, and references to model weights on Hugging Face.

pytorchvideovisionai-imageai-video+5
Icon for item

Deep-Live-Cam

2023
hacksider, s0md3v

Deep-Live-Cam is an open-source real-time face-swap / deepfake tool that can replace faces in live webcam streams or videos using only a single source image. Key features include one-click live deepfakes, mouth-mask to retain original mouth motion, multi-face mapping, and pre-built binaries for Windows and Apple Silicon. It supports multiple execution providers (CUDA, CoreML, DirectML, OpenVINO) and includes built-in content checks and ethical guidance.

ai-videovideoai-toolsgithubAIGC+2
Icon for item

DiffSynth-Studio

2023
ModelScope Community, Artiprocher

DiffSynth-Studio is an open-source Diffusion model engine developed and maintained by the ModelScope Community, focusing on image and video generation. It supports mainstream models like FLUX, Wan, and Qwen-Image, offering efficient memory management and flexible training frameworks. Key features include VRAM optimization, low-memory inference, LoRA/ControlNet training, and innovative techniques like EliGen and Nexus-Gen for pushing generative model boundaries.

githubAIGCai-toolsai-imageai-video+5
Icon for item

VideoCaptioner

2024
WEIFENG2333

VideoCaptioner is an AI-powered video subtitling assistant that combines ASR (local or cloud) with LLM-based subtitle segmentation, correction and translation. It supports offline GPU transcription, concurrent chunk transcription, VAD, speaker-aware processing, batch subtitling and one-click subtitle-to-video synthesis, with both GUI and CLI options.

videoai-videoaudioASRLLM+3
  • Previous
  • 1
  • 2
  • Next