LogoAIAny

Tag

Explore by tags

Curated AI Resources for Everyone

support@aiany.app
Copyright © 2026 All Rights Reserved.
  • All

  • 30u30

  • ASR

  • ChatGPT

  • GNN

  • IDE

  • RAG

  • ai-agent

  • ai-api

  • ai-api-management

  • ai-client

  • ai-coding

  • ai-demos

  • ai-development

  • ai-framework

  • ai-image

  • ai-image-demos

  • ai-inference

  • ai-leaderboard

  • ai-library

  • ai-rank

  • ai-serving

  • ai-tools

  • ai-train

  • ai-video

  • ai-workflow

  • AIGC

  • alibaba

  • amazon

  • anthropic

  • audio

  • blog

  • book

  • bytedance

  • chatbot

  • chemistry

  • claude

  • claude-code

  • course

  • deepmind

  • deepseek

  • engineering

  • finance

  • foundation

  • foundation-model

  • gemini

  • github

  • google

  • gradient-booting

  • grok

  • huggingface

  • LLM

  • llm

  • math

  • mcp

  • mcp-client

  • mcp-server

  • meta-ai

  • microsoft

  • mlops

  • NLP

  • nvidia

  • ocr

  • ollama

  • openai

  • paper

  • physics

  • plugin

  • pytorch

  • RL

  • robotics

  • science

  • security

  • sora

  • translation

  • tutorial

  • vibe-coding

  • video

  • vision

  • xAI

  • xai

Seedance

2025
ByteDance Seed

A model that supports multi-shot video generation from both text and image inputs. It achieves breakthroughs in semantic understanding and prompt following, and can create 1080p videos with smooth motion, rich details, and cinematic aesthetics.

Tags: ai-tools, ai-video, bytedance

MineContext

2025
Volcengine

MineContext is an open-source, proactive, context-aware AI partner designed to bring clarity and efficiency to your work, study, and creation. It captures and understands the context of your digital world through screenshots and content comprehension (with future support planned for multi-modal sources such as documents, images, videos, and code). Using a context engineering framework, it proactively delivers insights, daily/weekly summaries, to-do lists, and activity records.

Tags: github, bytedance, ai-tools, ai-client, ai-agent, +4 more

UI-TARS Desktop

2025
ByteDance

UI-TARS Desktop is a native desktop GUI agent by ByteDance that enables multimodal, vision-language-driven control of local and remote computers and browsers. It provides precise mouse/keyboard control, screenshot-based visual recognition, cross-platform support, and integration with the Agent TARS ecosystem and MCP tools. It focuses on private/local processing and building human-like task completion workflows.

Tags: bytedance, github, ai-agent, vision, LLM, +7 more

Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

2025
ByteDance

Dolphin is an open-source document image parsing project from ByteDance that uses heterogeneous anchor prompting and a document-type-aware two-stage architecture. It handles both digital-born and photographed documents, offering page-level and element-level parsing (text, tables, formulas, code). Dolphin-v2 (3B) improves accuracy and adds multi-page PDF support, deployment recipes (vLLM, TensorRT-LLM), and Hugging Face model hosting. The repository includes code, demos, pretrained models, and a BibTeX citation; license: MIT.

Tags: github, bytedance, ocr, paper, vision, +3 more