AIAny - Large Language Model Papers

Large Language Model Papers2022

ReAct: Synergizing Reasoning and Acting in Language Models

Shunyu Yao, Jeffrey Zhao +5Google Research, Princeton University

Interleaves chain-of-thought reasoning with tool-using actions in one LLM loop: the model plans, queries a source like Wikipedia, then revises from results. Cuts hallucination versus reasoning-only prompting and beats trained agents on interactive tasks.

paper LLM NLP ai-agent google+1

AI Agent Papers2024

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

John Yang, Carlos E. Jimenez +5Princeton Language and Intelligence, Princeton University

Treats the interface between an LM agent and a computer as a design variable. A custom agent-computer interface (ACI) with concise file-edit, repo-navigation, and test commands plus compact feedback reaches 12.5% pass@1 on SWE-bench, 87.7% on HumanEvalFix.

paper ai-agent LLM ai-coding engineering

Large Language Model Papers2024

LightRAG

Zirui Guo, Lianghao Xia +3University of Hong Kong, Beijing University of Posts and Telecommunications

A graph-based RAG framework pairing a knowledge graph with vector retrieval and a dual-level (low/high) query mode. New documents merge into the graph via set operations instead of triggering a rebuild, cutting the cost of keeping the index current.

RAG LLM NLP github ai-development+5

Large Language Model Papers2017

Attention Is All You Need

Ashish Vaswani, Noam Shazeer +6Google Brain, Google Research +1

The 2017 paper that replaced recurrence with pure self-attention, making sequence models fully parallelizable — and, almost as a side effect, laying the architectural foundation for nearly every large language model that followed, from BERT to GPT.

NLP LLM AIGC 30u30 paper+1

Machine Learning Foundation Papers2018

Relational recurrent neural networks

Adam Santoro, Ryan Faulkner +8DeepMind, University College London

Embeds multi-head self-attention inside an LSTM-style memory, so stored memories can attend to one another instead of just sitting in separate slots — sharpening relational reasoning and topping WikiText-103, Project Gutenberg, and GigaWord.

foundation 30u30 paper NLP LLM

Large Language Model Papers2018

GPT1: Improving Language Understanding by Generative Pre-Training

Alec Radford, Karthik Narasimhan +2OpenAI

Introduced the two-stage recipe behind the GPT lineage: unsupervised generative pre-training on unlabeled text, then supervised fine-tuning per task. A single 12-layer Transformer decoder beat bespoke architectures on 9 of 12 NLP benchmarks.

openai transformers foundation-model paper LLM+1

Large Language Model Papers2019

GPT2: Language Models are Unsupervised Multitask Learners

Alec Radford, Jeffrey Wu +4OpenAI

A 1.5B-parameter model trained only to predict the next token on diverse web text does translation, summarization, and QA zero-shot, with no fine-tuning. It recast NLP tasks as conditional language modeling and sparked the staged-release misuse debate.

LLM NLP openai paper

Large Language Model Papers2020

Scaling Laws for Neural Language Models

Jared Kaplan, Sam McCandlish +8OpenAI, Johns Hopkins University

Demonstrated that language model loss falls as a smooth power law in model size, data, and compute across more than seven orders of magnitude — turning "make it bigger" from a hunch into a budget you can plan, and justifying the GPT-3 scale-up.

LLM NLP openai 30u30 paper

Large Language Model Papers2020

GPT3: Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann +29OpenAI

At 175 billion parameters, this autoregressive model becomes a strong few-shot learner: it handles translation, QA, and reasoning from a few prompt examples with no gradient updates, establishing in-context learning as an alternative to fine-tuning.

LLM NLP openai paper

Large Language Model Papers2021

Codex: Evaluating Large Language Models Trained on Code

Mark Chen, Jerry Tworek +2OpenAI

Showed that fine-tuning a GPT model on public GitHub code yields a capable program synthesizer, and introduced HumanEval — the docstring-to-function benchmark that still anchors code-generation evaluation. A production variant powers GitHub Copilot.

openai code codex copilot evaluation+2

Large Language Model Papers2022

InstructGPT: Training Language Models to Follow Instructions with Human Feedback

Long Ouyang, Jeff Wu +4OpenAI

Made reinforcement learning from human feedback (RLHF) the standard alignment recipe: collect demonstrations and preference rankings, train a reward model, then optimize with PPO. A 1.3B aligned model was preferred over the 175B GPT-3 by human raters.

openai RL paper LLM NLP

Large Language Model Papers2023

GPT-4 Technical Report

Josh Achiam, Steven Adler +277OpenAI

A multimodal model that accepts image and text inputs and returns text, scoring at human level on professional exams — including a bar exam in the top 10%. Its performance was forecast from models using 1/1000th the compute, showing predictable scaling.

LLM NLP openai paper multimodal

Category

Explore by categories

All Categories

AI Leaderboard

AI Agent Tutorials

AI Coding Tutorials

AI Model

AI Agent Papers

Chatbot

AI Dataset

Machine Learning Foundation Books

AI Train

AI Deploy

AI Client

Machine Learning Foundation Papers

Machine Learning Foundation Tutorials

AI Image Demos

AI Agent

Large Language Model Tutorials

Large Language Model Papers

Machine Learning Engineering Papers

Computer Vision Tutorials

Computer Vision Papers

Natural Language Processing Papers

Reinforcement Learning Papers

Speech Technology Papers

AI API

AI Coding

AI Image

AI Video

MLOps

MCP Client

MCP Server

AI Video Papers

AI Audio

AI Others

AI Infra

Embodied AI