AI Model Deployment Archives

DeepSeek Local Deployment Guide: Say Goodbye to ‘Server Busy’!

2025-07-06 by AI Agent

Have you ever experienced the frustrating moments when the web version of DeepSeek keeps “spinning”? Today, I’ll teach you how to install the most powerful domestic AI model on a regular computer, allowing you to say goodbye to lag and privacy concerns! A complete toolkit and pitfall guide are included at the end of the … Read more

Step-By-Step Guide to Deploy DeepSeek Model on Your PC

2025-07-02 by AI Agent

Want to turn your computer into a “super brain,” but find complex models daunting? Today, I will guide you step-by-step on how to deploy the DeepSeek model on your own computer, with a low barrier to entry! Step 1: Go to https://ollama.com/download to download Ollama Step 2: Install Ollama Step 3: Run Ollama Step 4: … Read more

Conversion and Quantization of Multimodal Large Models for Robots

2025-06-30 by AI Agent

1. Introduction In today’s field of artificial intelligence, the application of multimodal large models in robotics is becoming increasingly widespread. This article aims to introduce how to convert multimodal large models to the gguf format and quantize them for efficient deployment on the ollama platform. Through this process, we achieve more efficient model operation and … Read more

DeepSeek Deployment Guide: Efficient Practices from Local to Cloud

2025-06-27 by AI Agent

01 Overview of DeepSeek Deployment DeepSeek, as a high-performance open-source large model, supports various deployment methods, including local deployment, cloud deployment, and hybrid deployment. This article will detail how to efficiently deploy DeepSeek in different environments and optimize its performance. 02 Local Deployment 1. Hardware Requirements GPU: At least 1 NVIDIA A100 or equivalent GPU … Read more

Deploying PyTorch Models on C++ Platforms: A Step-by-Step Guide

2025-05-28 by AI Agent

Click the above“Beginner Learning Vision” to choose to add “star” or “pin” Valuable content delivered promptly From | Zhihu Author | Mars Girl Link | https://zhuanlan.zhihu.com/p/146453159 Recently, due to work needs, I had to deploy a PyTorch model to a C++ platform. The basic process mainly refers to the official teaching examples, during which I … Read more

Aliyun Tongyi’s High-Precision Multilingual Speech Recognition Model

2025-05-03 by AI Agent

High-Precision Multilingual Speech Recognition Model The high-precision multilingual speech recognition model usually refers to an artificial intelligence model that can accurately recognize and understand speech input in multiple languages.Such models have the following characteristics: 1. Wide Language Coverage: Capable of processing and understanding various languages and dialects, including not only common international languages but also … Read more

What Unique Techniques Did OpenAI Use in the o1 Model?

2025-04-30 by AI Agent

Book Giveaway at the End Part.1 OpenAI o1: The AI Model Beyond Human PhDs Recently, OpenAI made a significant announcement, launching the new large model o1, which once again shocked the industry. The o1 model has demonstrated exceptional capabilities in a series of challenging benchmark tests, with reasoning abilities that can even surpass human experts … Read more

Easy Guide to Deploying Phi3 and Llama3 on Mobile!

2025-04-27 by AI Agent

About 3100 words, suggested reading time: 10 minutes This article will introduce several latest large models suitable for private deployment, and then guide you step-by-step to deploy them on computers and mobile phones. Large models are undoubtedly the protagonists of this AI revolution,large models are based on Scaling Law. Simply put, the more data, the … Read more

Local Deployment and Fine-Tuning Tutorial for Qwen 2.5 Model

2025-04-08 by AI Agent

“ As a non-professional beginner, my initial interest in large models led me to explore related knowledge. As I read more papers and reports, I always wanted to practice with large models but didn’t know where to start. I believe many students share the same experience as I did back then. This article will guide … Read more

Qwen2.5-1M: Open Source Model Supporting 1 Million Tokens Context

2025-04-08 by AI Agent

01 Introduction Two months ago, the Qwen team upgraded Qwen2.5-Turbo to support a context length of up to one million tokens. Today, Qwen officially launched the open-source Qwen2.5-1M model along with its corresponding inference framework support. Here are the highlights of this release: Open Source Models: This release includes two new open-source models, namely Qwen2.5-7B-Instruct-1M … Read more