Quickly Build an Agent with Llama-Index

Meow! In the previous article, we used Tongyi Qianwen to build an intelligent customer service agent with four major functions, driven by four system-level prompts. This article builds an upgraded agent by calling Tongyi Qianwen in combination with Llama-Index. First, let’s implement the simplest example using ReActAgent and FunctionTool to create a …

Transforming Text to SQL with LLaMA2: A Local LLM Guide

With the rapid development of large-model technology, how to make full use of AI while protecting data privacy has become a hot topic. Open-source local large language models (LLMs) are gradually becoming an important tool for solving this problem. Today, we introduce a star open-source model, LLaMA2, and see how it seamlessly implements the “text to …
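
The excerpt above is cut off before it shows any implementation, so the following is only a generic sketch of the text-to-SQL idea, not the article's method. It assumes the model is exposed as a plain Python callable; `fake_llm`, `build_prompt`, `text_to_sql`, and the `users` table are hypothetical names introduced for illustration, with the stub standing in for a locally served LLaMA2.

```python
import sqlite3
from typing import Callable


def build_prompt(schema: str, question: str) -> str:
    """Combine the table schema and the user's question into one prompt."""
    return (
        "You translate questions into SQLite queries.\n"
        f"Schema:\n{schema}\n"
        f"Question: {question}\n"
        "Reply with a single SELECT statement and nothing else."
    )


def text_to_sql(question: str, schema: str, llm: Callable[[str], str]) -> str:
    """Ask the model for SQL and accept only a read-only SELECT statement."""
    sql = llm(build_prompt(schema, question)).strip().rstrip(";")
    if not sql.lower().startswith("select"):
        raise ValueError(f"Refusing to run non-SELECT statement: {sql!r}")
    return sql


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a local model; always returns a canned answer."""
    return "SELECT name FROM users WHERE age > 30;"


SCHEMA = "CREATE TABLE users (name TEXT, age INTEGER)"

# Build a throwaway in-memory database to run the generated query against.
conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.executemany("INSERT INTO users VALUES (?, ?)", [("Ann", 41), ("Bob", 25)])

sql = text_to_sql("Who is older than 30?", SCHEMA, fake_llm)
rows = conn.execute(sql).fetchall()
print(sql, rows)
```

Passing the model in as a callable keeps the sketch independent of any particular serving stack, and the SELECT-only check is a minimal guard against running destructive statements that a model might generate.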

Tang Guoliang Llama Model Architecture: Theory to Practice

Follow the official account above and reply “Course Resources” to get the materials for this course, Tang Guoliang Llama Model Architecture: From Theory to Practice. In today’s era of rapid advancement in artificial intelligence, …

Automating IT Interviews with Ollama and Python Audio Features

Are you still troubled by the mixed quality and poor performance of domestic AI? Then take a look at Dev Cat AI (3in1)! It is an integrated AI assistant that combines GPT-4, Claude3, and Gemini and covers all models from these three AI tools, including GPT-4o and Gemini Flash. Now you can own them …

Local AI – The Ultimate Productivity Stack with Ollama

Are you still troubled by the uneven quality and poor performance of AI in China? Then take a look at Dev Cat AI (3in1). It is an integrated AI assistant that combines GPT-4, Claude3, and Gemini and covers all models from these three AI tools, including GPT-4o and Gemini Flash. Now you can own …

How to Deploy Private Free Large Models Locally with Ollama

Ollama is an open-source framework for conveniently deploying and running large language models (LLMs) on local machines. Its core features are simplified usage and an efficient technical architecture, which let developers easily access and use powerful AI language models. Ollama supports local …
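
As a rough illustration of what using a locally deployed model can look like, here is a minimal sketch that sends one prompt to Ollama's local REST endpoint using only the Python standard library. It assumes an Ollama server is already running on its default port (11434) and that a model has been pulled; the model name `llama3` in the usage note and the helper names `build_payload` and `generate` are assumptions made for this example.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumed unchanged).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(model: str, prompt: str) -> bytes:
    """Encode a non-streaming generation request as a JSON body."""
    body = {"model": model, "prompt": prompt, "stream": False}
    return json.dumps(body).encode("utf-8")


def generate(model: str, prompt: str, url: str = OLLAMA_URL, timeout: float = 60.0) -> str:
    """POST the prompt to the local server and return the generated text."""
    request = urllib.request.Request(
        url,
        data=build_payload(model, prompt),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        reply = json.loads(response.read().decode("utf-8"))
    # With "stream": False the server answers with one JSON object
    # whose "response" field holds the full completion.
    return reply["response"]
```

With the server running, calling `generate("llama3", "Say hello in one sentence.")` should return the model's reply as a string; without a running server the call raises a `urllib.error.URLError`.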

Local Deployment of Ollama for Offline AI Model Usage

Ollama is a framework for running large models locally: it lets users run and use large language models (LLMs) on their own computers. Its design goal is to simplify operating large models so that ordinary, non-professional users can easily work with models that typically require high-end hardware and complex setup. Currently, …

Windsurf Editor: The Future Programming Assistant for Architecture

In the architecture industry, designers and engineers rely on various software tools, such as CAD (Computer-Aided Design) and document processing software (like Word), to realize their ideas and designs. As technology advances, improving efficiency and collaboration in design work has become a major challenge …

Local Invocation of Llama3 Large Model Development

1. Test using the trained weights

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextGenerationPipeline
import torch

# Local snapshot directory of the base model in the Hugging Face cache
model_dir = r"E:\大模型AI开发\AI大模型\projects\gpt2\model\models--uer--gpt2-chinese-cluecorpussmall\snapshots\c2c0249d8a2731f269414cc3b22dff021f8e07a3"

tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForCausalLM.from_pretrained(model_dir)

# Load our own trained weights (Chinese poetry)
model.load_state_dict(torch.load("net.pt"))

# Use the built-in pipeline tool to generate content (device=0 is the first GPU)
pipeline = TextGenerationPipeline(model, tokenizer, device=0)
print(pipeline("天高", max_length=24))
```

The results are actually not good:

2. Post-process the AI-generated results

# Customized …

WindSurf vs Cursor AI: Choosing the Right AI Code Editor

On a sunny afternoon, I sat in front of my computer while a gentle breeze whispered in my ear: “WindSurf or Cursor AI, that is the question.” Recently, discussions about these two AI code editors have surged online like a tsunami, leaving me unsure which to choose. Since that’s the case, why not explore and …