Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated daily.

19th of October 2023

AgentTuning: Enabling Generalized Agent Abilities for LLMs

AgentTuning: Improves LLM capability on user tasks through instruction tuning, using the AgentInstruct dataset to create AgentLM.

18th of October 2023

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale

Language agent to automatically identify and quantify the extent of stereotypes in generated images.
Planning and Reasoning. Tool usage: Intent understanding, Instruction generation, Instruction retrieval, Prompt optimization & Stereotype score generation.

16th of October 2023

OpenAgents: An Open Platform for Language Agents in the Wild

OpenAgents-platform: Data agent, Plugin/Tools and Web agent
Automatic tool selection from over 200 tools

13th of October 2023

A Zero-Shot Language Agent for Computer Control with Structured Reflection

Zero-shot agent plans executable actions in the environment and iteratively progresses by learning from mistakes using self-reflection and structured thoughts management.
Better generalization, outperforms best iterative-planning agents

12th of October 2023

AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems

AgentCF: LLM agent-based recommender system with User and Item Agents.
User & Item Agents interact autonomously and the discrepancies between the two are stored in the memory to help guide better future recommendations.

12th of October 2023

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Octopus: Uses Vision-Language Model with Reinforcement Learning from Environmental Feedback (RLEF).
Generates action sequences and executable code.

12th of October 2023

MemGPT: Towards LLMs as Operating Systems

MemGPT: OS-inspired design in which an LLM processor manages its own context and long-term memory, using functions to make changes and events to manage the order in which data is processed.
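
The memory-management idea above can be illustrated with a minimal sketch. It is not MemGPT's actual implementation: the class `PagedMemory`, the method `archival_search` and the use of an item count in place of a token budget are all assumptions made for illustration.

```python
from collections import deque


class PagedMemory:
    """Toy two-tier memory: a bounded main context plus unbounded archival storage."""

    def __init__(self, max_context_items: int = 3):
        self.max_context_items = max_context_items  # stands in for a token budget
        self.main_context = deque()  # what the (hypothetical) LLM would see
        self.archive = []            # long-term storage outside the context

    def append(self, message: str) -> None:
        """Add a message; evict the oldest ones to the archive when over budget."""
        self.main_context.append(message)
        while len(self.main_context) > self.max_context_items:
            self.archive.append(self.main_context.popleft())

    def archival_search(self, keyword: str) -> list:
        """Function the model could call to look up evicted messages."""
        return [m for m in self.archive if keyword.lower() in m.lower()]


memory = PagedMemory(max_context_items=2)
for msg in ["My name is Ada.", "I like chess.", "What is the weather?"]:
    memory.append(msg)

print(list(memory.main_context))
print(memory.archival_search("name"))
```

Here the oldest message leaves the main context once the budget is exceeded, but it stays reachable through an explicit function call.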

12th of October 2023

Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques

Promptor: Automatic prompt generation.
Builds prompts based on: User goals, User Profiles, Data Profile, Contextual information & Output constraints
System prompt includes: instructions, Actions, Facts and Examples.

12th of October 2023

Towards Robust Multi-Modal Reasoning via Model Selection

Dynamic model selection by taking into account input & sub-task dependencies.

9th of October 2023

FireAct: Toward Language Agent Fine-tuning

Fine-tuning LLMs with agent trajectories for better autonomous agents.

8th of October 2023

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

MemWalker: navigates long context iteratively and constructs memory as a tree-like structure.
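
A rough sketch of the tree-memory idea, under loud assumptions: `summarize` and `relevance` are simple word-based stand-ins for the LLM calls a real system would make, and `Node`, `build_tree` and `navigate` are hypothetical names. The input is assumed to be a non-empty list of text chunks.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    summary: str
    children: list = field(default_factory=list)
    text: str = ""  # only set on leaves


def summarize(texts):
    """Stand-in for an LLM summarizer: keep the distinct longer words."""
    words = []
    for text in texts:
        for word in text.lower().replace(".", "").split():
            if len(word) > 3 and word not in words:
                words.append(word)
    return " ".join(words)


def build_tree(chunks, fanout=2):
    """Group nodes bottom-up until a single root remains."""
    nodes = [Node(summary=summarize([c]), text=c) for c in chunks]
    while len(nodes) > 1:
        groups = [nodes[i:i + fanout] for i in range(0, len(nodes), fanout)]
        nodes = [Node(summary=summarize([n.summary for n in g]), children=g) for g in groups]
    return nodes[0]


def relevance(query, summary):
    """Stand-in for the LLM's choice: count words shared with the query."""
    return len(set(query.lower().split()) & set(summary.split()))


def navigate(root, query):
    """Walk from the root to a leaf, picking the most relevant child each step."""
    node = root
    while node.children:
        node = max(node.children, key=lambda child: relevance(query, child.summary))
    return node.text


chunks = [
    "The castle was built in 1200.",
    "The dragon sleeps under the mountain.",
    "The baker sells bread every morning.",
    "The river floods in spring.",
]
answer_chunk = navigate(build_tree(chunks), "where does the dragon sleep")
print(answer_chunk)
```

Only the summaries along one root-to-leaf path are inspected, so the full text never has to fit into a single context window.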

7th of October 2023

Crystal: Introspective Reasoners Reinforced with Self-Feedback

Introspective reasoning of the knowledge.

6th of October 2023

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Language Agents Tree Search (LATS): Self-Refine, Memory, Reasoning, Decision Making & Planning.
Uses multiple reasoning paths and learns from experience by integrating external feedback & self-reflection.

5th of October 2023

Agent Instructs Large Language Models to be General Zero-Shot Reasoners

AgentInstruct: generates instructions for the problem and then solves it using these instructions, improving the Chain of Thought (CoT) zero-shot reasoning.

5th of October 2023

Balancing Autonomy and Alignment: A Multi-Dimensional Taxonomy for Autonomous LLM-powered Multi-Agent Architectures

Characteristics of Autonomous Agents: Goal-driven task management, Intelligent Agents with LLMs, Multi-Agents collaboration, Context interaction, Balancing Autonomy vs. Alignment.

3rd of October 2023

Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation

Self-Taught Optimizer (STOP): Ask LLM to improve initial program by providing improvement candidates and then output best solution.
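
One improvement step of this kind might look like the sketch below. It covers only a single "propose candidates, keep the best" step, not the recursive self-improvement of the improver itself. `propose_candidates` is a hypothetical stand-in for the LLM call, and the utility function (fraction of passed test cases for a squaring function) is an assumption chosen for illustration.

```python
def utility(program: str) -> float:
    """Score a program by the fraction of test cases its f(x) passes."""
    namespace = {}
    exec(program, namespace)  # toy only: never exec untrusted model output
    cases = [(0, 0), (2, 4), (3, 9)]
    return sum(namespace["f"](x) == y for x, y in cases) / len(cases)


def propose_candidates(program: str) -> list:
    """Hypothetical stand-in for an LLM that suggests improved programs."""
    return [
        "def f(x):\n    return x * 2",
        "def f(x):\n    return x * x",
    ]


def improve(program: str, utility_fn, propose_fn) -> str:
    """Return the highest-utility program among the original and the candidates."""
    candidates = [program] + propose_fn(program)
    return max(candidates, key=utility_fn)


initial = "def f(x):\n    return x + x"
best = improve(initial, utility, propose_candidates)
print(best)
```

Keeping the original program among the candidates means a step can never lower the utility score.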

3rd of October 2023

Lyfe Agents: Generative agents for low-cost real-time social interactions

LyfeAgents Brain: Sensory processing, Internal states, Self-monitor, Action selection and Memory.
Internal states are text based: current goal, memory, recent events and sensory inputs.
Cognitive controller selects high-level actions. Action model selects actions until termination condition is reached.
Self-monitoring maintains and emphasizes recent and novel events towards agent goals
Memories are clustered and summarized before moving them to long-term storage (vector database)

3rd of October 2023

Large Language Models as Analogical Reasoners

LLM self-generates examples/knowledge related to the task.

3rd of October 2023

Conceptual Framework for Autonomous Cognitive Entities

Conceptual framework for Autonomous entities.

2nd of October 2023

SmartPlay : A Benchmark for LLMs as Intelligent Agents

SmartPlay: a benchmark to test LLM-based agents from 9 perspectives.
Tests: Reasoning with object dependencies, planning ahead, spatial reasoning, learning from history, and understanding randomness.

2nd of October 2023

GRID: A Platform for General Robot Intelligence Development

GRID: General Robot Intelligence Development
Solves complex tasks using simulation and/or real-world data
Task specification, robot configuration and sensor/API.
Foundation Mosaic: a neural architecture.

1st of October 2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

RoleLLM: Role-profile constructor, Context-based Instruction generation, Role-based Prompting (RoleGPT), Role-conditioned Instruction-tuning.

28th of September 2023

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

Promptbreeder uses thinking styles and mutation-prompts and is able to improve mutation/task prompts.

23rd of September 2023

Natural Language based Context Modeling and Reasoning with LLMs: A Tutorial

LLM-driven Context-aware Computing (LCaC) approach.

20th of September 2023

You only look at the screens: Multimodal Chain-of-Action Agents

Multimodal Chain-of-Actions Agents (Auto-UI) interacts directly with the UI
Chain-of-Action technique using a series of action histories and future action plans.

14th of September 2023

The Rise and Potential of Large Language Model Based Agents: A Survey

A conceptual framework for LLM-based agents with three components: brain, perception, and action.

14th September 2023

Agents: An Open-source Framework for Autonomous Language Agents

Multi-agent: Planning, memory, tool usage, multi-agent communication & symbolic control.
Open source library.

12th of September 2023

Life-inspired Interoceptive Artificial Intelligence for Autonomous and Adaptive Agents

Interoceptive AI: monitoring own internal state of the artificial agent.

8th of September 2023

Unleashing the Power of Graph Learning through LLM-based Autonomous Agents

AutoGraph procedure: data, configuration, searching and tuning agents.

28th of August 2023

RecMind: Large Language Model Powered Agent For Recommendation

RecMind: a recommender-focused LLM agent with reasoning, planning to sub-tasks, memory & tools.

22nd of August 2023

A Survey on Large Language Model based Autonomous Agents

Systematic review of LLM based Autonomous Agents.
Use cases and evaluation strategies and future use cases.

21st of August 2023

https://arxiv.org/abs/2308.10848

AgentVerse: multi-agent collaboration and individual agents' social behaviours.

25th of July 2023

WebArena: A Realistic Web Environment for Building Autonomous Agents

An environment for testing autonomous agents with tools and external knowledge.

8th June 2023

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

Builds a multi-agent simulation environment to generate a dataset of real-world API usage.
Small models can achieve comparable performance to larger models on tool usage.

5th June 2023

SELFEVOLVE: A Code Evolution Framework via Large Language Models

Generates intermediate code based on input prompt.
Use LLM to act as expert programmer to debug the generated code by receiving errors from Python interpreter.
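
The debugging loop described above can be sketched as follows. This is a toy illustration, not the framework's code: `fake_llm_fix` is a hypothetical stand-in for the LLM acting as an expert programmer, and it only knows how to repair the one bug used in the example.

```python
import traceback


def run_and_capture(code: str):
    """Execute code and return (namespace, error_text); error_text is None on success."""
    namespace = {}
    try:
        exec(code, namespace)  # toy only: sandbox real model output
        return namespace, None
    except Exception:
        return namespace, traceback.format_exc()


def fake_llm_fix(code: str, error: str) -> str:
    """Hypothetical stand-in for the LLM debugger: patches one known bug."""
    if "NameError" in error:
        return code.replace("totl", "total")
    return code


def debug_loop(code: str, max_rounds: int = 3):
    """Run the code, feed interpreter errors back to the fixer, and retry."""
    for _ in range(max_rounds):
        namespace, error = run_and_capture(code)
        if error is None:
            return code, namespace
        code = fake_llm_fix(code, error)
    raise RuntimeError("could not repair the code within the round limit")


buggy = "total = sum([1, 2, 3])\nresult = totl * 2"
fixed, ns = debug_loop(buggy)
print(fixed)
print(ns["result"])
```

The interpreter's traceback is the only feedback signal here, which mirrors the idea of debugging without human-written test hints.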

3rd June 2023

Prompt Sapper: LLM-Empowered Software Engineering Infrastructure for AI-Native Services

Human AI collaborative intelligence methodology & technical practices, where the idea is not to have "full Auto-GPT" from user input to direct resolution by LLM, but rather human reviews steps between.
User inputs an objective and the LLM asks for clarification. The user then adds clarifications and the LLM constructs an AI chain for the human to review. Finally, the LLM executes the AI chain with user acceptance tests.

3rd June 2023

Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions

Auto-GPT with GPT-4 outperforms supervised state-of-the-art Imitation Learning (IL) models on the WebShop and ALFWorld benchmarks in unknown external environments.
The Additional Opinions algorithm, which takes into account additional opinions from external expert models, improves performance.

24th May 2023

Gorilla: Large Language Model Connected with Massive APIs

Gorilla is a retriever-aware fine-tuned LLaMA-7B model for API calls, using self-instruct to generate Instruction-API pairs.

17th May 2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Tree of Thoughts (ToT)-technique makes decisions using multiple different reasoning paths, self-evaluating choices to decide next action with ability to look back/forward for global decisions.
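
As a rough illustration, a breadth-first variant with a fixed beam width could be sketched like this. The toy task (pick three numbers that sum to a target) and the functions `propose` and `evaluate` are assumptions standing in for LLM thought generation and self-evaluation. The sketch also omits backtracking.

```python
def propose(state, choices=(1, 3, 5)):
    """Stand-in for LLM thought generation: extend the partial solution by one step."""
    return [state + (c,) for c in choices]


def evaluate(state, target):
    """Stand-in for LLM self-evaluation: closer to the target is better."""
    return -abs(target - sum(state))


def tree_of_thoughts(target, depth=3, beam_width=2):
    """Expand every kept state, score the results, and keep the best few per level."""
    frontier = [()]
    for _ in range(depth):
        candidates = [nxt for state in frontier for nxt in propose(state)]
        candidates.sort(key=lambda s: evaluate(s, target), reverse=True)
        frontier = candidates[:beam_width]
    return frontier[0]


solution = tree_of_thoughts(target=11)
print(solution, sum(solution))
```

Keeping more than one state per level is what separates this from a single chain of thought: a path that looks slightly worse early on can still win at the final level.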

11th April 2023

Teaching Large Language Models to Self-Debug

The model generates new code together with a code explanation. The code is then executed, and the execution result is sent back as feedback together with the code explanation. This feedback

7th of April 2023

ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions

ChatPipe - Iterative, data preparation program with ChatGPT using 1. Operation Recommendation, 2. Program generation, 3. Version management.
Recommends the next data preparation operation. Easily roll back to a previous program for version control.

6th April 2023

Generative Agents: Interactive Simulacra of Human Behavior

Enable believable human behavior: observation, planning, and reflection.
An agent wants to throw a Valentine’s Day party. The agents autonomously spread invitations, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time.
GPTeam is inspired by this approach.

31 March 2023

CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society

CAMEL attempts to facilitate autonomous cooperation among communicative agents through role-playing framework.
The approach manages complete tasks with minimal human input.

30th of March 2023

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace

An LLM (such as ChatGPT) accesses the HuggingFace community to look for AI models to complete the given task.
It can handle multiple modalities by outsourcing tasks like image recognition to a specialized image model.

DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents

Dialog-Enabled Resolving Agents (DERA) uses two roles: Researcher and Decider to perform discussion between these two agents.
Researcher role processes information and Decider role uses judgement.

29th of March 2023

TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs

Multimodal conversational foundation model (MCFM). MCFM generates a textual solution outline, then API selector chooses most relevant API from collection of APIs (with API name, parameter list, description, usage example and example when combining it with another API).
MCFM generates action code using recommended API and the API call is executed. Finally, output is provided back to developer.

28th March 2023

Task-driven Autonomous Agent Utilizing GPT-4, Pinecone, and LangChain for Diverse Applications

Task-driven autonomous agent, with vector database and LangChain. BabyAGI includes: execution, task creation and prioritization agents.
Takes objective, pulls an item from task queue and moves it to execution agent with access to memory.
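
The loop described above can be sketched in a few lines. All three agent functions are hypothetical stubs for what would be LLM calls, and a plain list stands in for the vector database.

```python
from collections import deque


def execute(objective, task, memory):
    """Hypothetical execution agent: returns a result string for the task."""
    return f"done: {task}"


def create_tasks(objective, task, result):
    """Hypothetical task-creation agent: spawns one follow-up for the first task only."""
    return ["summarize findings"] if task == "research topic" else []


def prioritize(tasks):
    """Hypothetical prioritization agent: shortest task name first."""
    return deque(sorted(tasks, key=len))


def run(objective, first_task, max_steps=10):
    """Pull a task, execute it, store the result, then create and reorder tasks."""
    queue, memory = deque([first_task]), []
    for _ in range(max_steps):
        if not queue:
            break
        task = queue.popleft()
        result = execute(objective, task, memory)
        memory.append((task, result))  # a vector database in the described setup
        queue.extend(create_tasks(objective, task, result))
        queue = prioritize(queue)
    return memory


log = run("write a report", "research topic")
print(log)
```

The `max_steps` cap is an addition for safety in this sketch, since a task-creation agent that always spawns new tasks would otherwise loop forever.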

20th March 2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Reflexion agents reflect on task feedback, use it from memory to make better decisions and new attempts.
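
A toy version of this trial-and-reflect loop is sketched below. The number-guessing task and the functions `act`, `evaluate` and `reflect` are illustrative assumptions. In a real agent each would be an LLM call or an environment signal, and the reflections would be free-form text.

```python
SECRET = 5  # hidden target of the toy task


def act(reflections):
    """Hypothetical actor: starts at 3 and adjusts using the stored reflections."""
    guess = 3
    for note in reflections:
        guess += 1 if "too low" in note else -1
    return guess


def evaluate(guess):
    """Environment feedback: success flag plus a short verbal signal."""
    if guess == SECRET:
        return True, "correct"
    return False, ("too low" if guess < SECRET else "too high")


def reflect(guess, feedback):
    """Turn raw feedback into a verbal note kept in memory."""
    return f"Guess {guess} was {feedback}; adjust next time."


def reflexion_loop(max_trials=5):
    """Retry the task, carrying verbal reflections from one trial to the next."""
    reflections = []
    for trial in range(1, max_trials + 1):
        guess = act(reflections)
        success, feedback = evaluate(guess)
        if success:
            return guess, trial, reflections
        reflections.append(reflect(guess, feedback))
    return None, max_trials, reflections


guess, trials, notes = reflexion_loop()
print(guess, trials, notes)
```

No model weights change between trials. The only thing that improves the next attempt is the growing list of verbal notes.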
