Large Language Models Archives

RestGPT Framework: Controlling Real-World Applications via RESTful APIs

2025-04-01 by AI Agent

©PaperWeekly Original · Author | Yifan Song · Affiliation | PhD Student, Institute of Computational Linguistics, Peking University · Research Area | Natural Language Processing · Paper Title: RestGPT: Connecting Large Language Models with Real-World RESTful APIs · Paper Link: https://arxiv.org/abs/2306.06624 · Code Link: https://github.com/Yifan-Song793/RestGPT · Research Background: Large language models (LLMs) such as ChatGPT and GPT-4 have demonstrated many powerful … Read more

Multi-Agent Collaboration Mechanisms: A Review of Large Language Models

2025-04-01 by AI Agent

With the latest advancements in large language models (LLMs), agentic artificial intelligence (Agentic AI) has made significant progress in real-world applications, moving towards intelligent agents based on multiple large language models that achieve perception, learning, reasoning, and collaborative actions. These multi-agent systems (MASs) based on large language models enable a group of agents to collaborate … Read more

2025 AI Engineering Advancement Guide: Unlocking 10 Core Areas with 50 Must-Read Papers!

2025-04-01 by AI Agent

Hello everyone, I am Mu Yi, an internet technology product manager with a continuing focus on the AI field. I hold an undergraduate degree from a top-2 university in China, a graduate degree from a top-10 CS program in the US, and an MBA. I firmly believe that AI is the “power-up” for ordinary people, which is why I created the WeChat public … Read more

Essential Papers for AI Engineers in 2025

2025-03-31 by AI Agent

Part 1: Cutting-Edge Large Language Models. The GPT series includes papers on GPT1, GPT2, GPT3, Codex, InstructGPT, and GPT4. These papers are straightforward and clear. GPT3.5, 4o, o1, and o3, by contrast, are documented mainly through release announcements and system cards. GPT1: https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf GPT2: https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf GPT3: https://arxiv.org/pdf/2005.14165 Codex: https://arxiv.org/abs/2107.03374 InstructGPT: https://arxiv.org/pdf/2203.02155 GPT4: https://arxiv.org/abs/2303.08774 Claude and Gemini … Read more

Microsoft Open Sources The Phi Series: Technological Evolution, Capability Breakthroughs, And Future Prospects

2025-03-31 by AI Agent

1. Introduction: In recent years, the parameter scale of large language models (LLMs) has shown an exponential growth trend, demonstrating strong general intelligence and achieving groundbreaking progress in numerous natural language processing tasks. However, these large models come with high training costs, … Read more

AI Innovations in Frontend Technology

2025-03-31 by AI Agent

Author | Zhou Tingshuai · Planning | Xue Liang. In an era dominated by large language models (LLMs), conversational flows have become the mainstream method for most interactive cloud communications. How can traditional frontend technologies integrate with these powerful models in light of this trend? Besides user interfaces (UI), in what other areas can frontend technology … Read more

Ollama 0.5.7 Deployment Guide: Easily Build Your AI Assistant!

2025-03-30 by AI Agent

Introduction: Do you want to run large language models locally and create your own AI assistant? The latest Ollama 0.5.7 version makes this easier than ever. By following the steps below, you will easily complete the deployment and embark on an intelligent journey! 💡 🎯 What is Ollama? Ollama is a tool designed to help users … Read more

Installing Ollama Local Large Model for AI Workflows

2025-03-30 by AI Agent

To pave the way for future workflows and local models, we first need to cover installing Ollama, a backend for local large language models. 1. Ollama: Before installing, it helps to clarify what Ollama is. Ollama is a backend service for large language models that helps users easily deploy large language models without … Read more

Building a Minimal RAG Model Step by Step

2025-03-29 by AI Agent

Datawhale Insights · Author: Song Zhixue, Datawhale Member. Hello everyone, I am not a fan of garlic and ginger. Next, I will guide you step by step through implementing Tiny-RAG, a simplified version of RAG that only includes the core functions of … Read more

Key Module Analysis of RAG Full Link

2025-03-29 by AI Agent

Original: https://zhuanlan.zhihu.com/p/682253496 · Organizer: Qingke AI. 1. Background Introduction: RAG (Retrieval-Augmented Generation) combines retrieval-based models with generative models to improve the quality and relevance of generated text. This method was proposed by Meta in the 2020 paper “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks”[1], allowing language models (LM) to acquire … Read more