AIAny

Tag

Explore by tags

Curated AI Resources for Everyone

Copyright © 2026 All Rights Reserved.
  • All

  • 30u30

  • ASR

  • ChatGPT

  • GNN

  • IDE

  • RAG

  • agent-skills

  • ai

  • ai-agent

  • ai-api

  • ai-api-management

  • ai-client

  • ai-coding

  • ai-demos

  • ai-deploy

  • ai-development

  • ai-framework

  • ai-image

  • ai-image-demos

  • ai-inference

  • ai-leaderboard

  • ai-library

  • ai-rank

  • ai-serving

  • ai-tools

  • ai-train

  • ai-video

  • ai-workflow

  • AIGC

  • algorithms

  • alibaba

  • amazon

  • android

  • anthropic

  • audio

  • aws

  • biology

  • blog

  • book

  • bytedance

  • chatbot

  • chatgpt

  • chemistry

  • claude

  • claude-code

  • cli

  • code

  • codex

  • copilot

  • course

  • cursor

  • deepmind

  • deepseek

  • depth

  • devops

  • diffusers

  • docker

  • drug-discovery

  • electron

  • embeddings

  • engineering

  • facebook

  • finance

  • foundation

  • foundation-model

  • gemini

  • gemini-cli

  • gemma

  • genomics

  • gitHub

  • github

  • go

  • google

  • gradient-booting

  • grok

  • huggingface

  • image

  • ios

  • java

  • javascript

  • LLM

  • llm

  • math

  • mcp

  • mcp-client

  • mcp-server

  • meta-ai

  • meta-pytorch

  • microsoft

  • mlops

  • mobile

  • multilingual

  • multimodal

  • NLP

  • nlp

  • nodejs

  • nvidia

  • ocr

  • ollama

  • openai

  • opencode

  • pandas

  • paper

  • physics

  • plugin

  • postgres

  • privacy

  • prompt-engineering

  • python

  • pytorch

  • RL

  • robotics

  • rust

  • science

  • security

  • shodan

  • skillkit

  • sora

  • speech

  • ssh

  • tensorrt

  • terminal

  • transformers

  • translation

  • tutorial

  • typescript

  • vibe-coding

  • video

  • vision

  • vllm

  • voice

  • xAI

  • xai

Nano Banana Demos

2025

Prompt and image demos for Nano Banana.

ai-image, ai-demos, ai-image-demos
GitHub

Kornia

2018
Kornia contributors, E. Riba +4

Provides differentiable, GPU-accelerated computer-vision operators and geometric building blocks on top of PyTorch; includes 500+ ops, augmentation pipelines, and pre-trained models for detection, matching, and segmentation—suitable for research and production vision pipelines.

vision, pytorch, ai-library, gitHub, ai-image, +1 more
GitHub

YOLOv5

2020
Ultralytics

Real-time object detection and training toolkit in PyTorch — provides pretrained YOLOv5 models, training and evaluation scripts, and exporters to ONNX/TFLite/CoreML for fast inference and deployment across devices.

vision, pytorch, github, ai-image, ai-tools, +1 more
Hugging Face

Diffusers (Hugging Face)

2022
Patrick von Platen, Suraj Patil +12

Provides modular PyTorch pipelines and tools for training and running diffusion models across image, video, and audio. Ships ready pipelines (Stable Diffusion, img2img, inpainting, video), hardware optimizations, safety checks, and community examples — good for researchers and product teams.

huggingface, ai-image, pytorch, vision, ai-tools, +1 more
GitHub

Brush

2024
Arthur Brussee

Produces real-time 3D reconstructions from multi-view images using Gaussian splatting, with on-device training and interactive viewing across native desktops, Android, and the browser. Uses WebGPU and the Burn ML framework to ship dependency-free binaries, a CLI, live training visualization, and streaming .ply support.

vision, ai-image, ai-train, rust, github, +5 more
GitHub

Structured 3D Latents for Scalable and Versatile 3D Generation

2024
Microsoft

Generates high-quality, editable 3D assets from text or images and decodes them into radiance fields, 3D Gaussians, or textured meshes. Ships pretrained models of up to 2B parameters, a 500K-asset dataset, and training code; best used with image conditioning and a ≥16GB NVIDIA GPU.

microsoft, gitHub, huggingface, ai-image, ai-image-demos, +3 more
GitHub

ViMax: Agentic Video Generation

2025
HKUDS

Automates idea→script→storyboard→video with a multi-agent pipeline that handles long-script RAG segmentation, reference selection, consistency checks, and parallel shot generation. Best for prototyping end-to-end AI video workflows; depends on external model APIs.

ai-video, video, ai-agent, agent-skills, python, +6 more
Hugging Face

Anima

2026
CircleStone Labs, Comfy Org

Generates anime-style and other non-photorealistic illustrations from text prompts. A 2B-parameter diffusion base preview trained on millions of anime images (and ~800k non-anime art) and released under a non-commercial license; best used in ComfyUI around ~1MP resolution.

huggingface, nvidia, ai-image, AIGC, ai-train, +3 more
GitHub

Modly

2026
Lightning Pixel

Turns photos into exportable 3D meshes using open-source AI models that run entirely on your GPU. Desktop app for Windows and Linux with an extension system to install local model generators and export common 3D formats.

ai-image, image, nodejs, python, github, +2 more
Hugging Face

ERNIE-Image

2026
Baidu

An open text-to-image generation model built on an 8B Diffusion Transformer that focuses on layout-sensitive, text-heavy, and instruction-following image synthesis. Notable for accurate text rendering, structured and compositional generation (posters, comics), and the ability to run on consumer 24GB GPUs when paired with prompt enhancement.

vision, ai-image, huggingface, pytorch, prompt-engineering, +3 more
Hugging Face

HY-World 2.0

2026
Tencent

Generates and reconstructs navigable, editable 3D worlds from text, single images, multi-view photos, or video; outputs meshes and Gaussian Splatting assets and includes WorldMirror 2.0 for fast multi-view reconstruction. Suited for research and production pipelines that import assets into engines; requires substantial GPU resources.

vision, ai-image, huggingface, pytorch, ai-demos, +3 more
Hugging Face

LingBot-Map: Geometric Context Transformer for Streaming 3D Reconstruction

2026
Robbyant Team, Chen, Lin‑Zhuo +10

Performs feed‑forward streaming 3D reconstruction from image sequences, combining coordinate grounding, dense geometric cues and trajectory memory to correct long‑range drift; uses paged KV‑cache attention for ~20 FPS inference at 518×378 and supports sequences >10,000 frames.

vision, pytorch, foundation-model, huggingface, ai-inference, +2 more