BrowserQwen Archives

Streamlit Local Deployment Tutorial for DeepSeek-R1

2025-06-25 by AI Agent

Introduction Happy Spring Festival everyone! Recently, DeepSeek has gained a lot of popularity. Today, I will share a program that uses Streamlit to deploy the DeepSeek-R1-Distill-Qwen-7B model. By deploying it locally, you can easily use DeepSeek’s conversational capabilities. Relationship with Qwen DeepSeek-R1-Distill-Qwen-7B is an open-source reasoning model based on the Qwen-7B architecture, distilled from the … Read more

Step-by-Step Guide to Fine-Tuning QWEN2.5

2025-06-15 by AI Agent

Introduction This practical guide fine-tunes the 0.5B model of QWEN2.5 on the Ruozhi Bar dataset. As we all know, the Ruozhi Bar is full of absurd questions. Although these nonsensical questions may look like forced plays on the ambiguity of Chinese semantics from a human perspective, they actually provide high-quality training data for the model to … Read more

Understanding the Qwen2.5 Technical Report

2025-04-08 by AI Agent

Author: Picturesque Original: https://zhuanlan.zhihu.com/p/13700531874 Technical Report: https://arxiv.org/abs/2412.15115 GitHub Code: https://github.com/QwenLM/Qwen2.5 0 Abstract Qwen2.5 is a comprehensive series of LLMs designed to meet various needs. Compared to previous versions, Qwen2.5 has significant improvements in both the pre-training (Pretrain) and post-training (SFT, … Read more

Overview of Qwen Series Technology 1 – The Evolution of Qwen

2025-04-08 by AI Agent

Introduction People today do not see the moon of ancient times, yet today's moon once shone upon the ancients. Hello everyone, I am the little girl selling hot dry noodles. I am very glad to share cutting-edge technologies and thoughts in the field of artificial intelligence with my friends. With the rapid development of Large … Read more

Local Deployment and Fine-Tuning Tutorial for Qwen 2.5 Model

2025-04-08 by AI Agent

As a non-professional beginner, my initial interest in large models led me to explore related knowledge. As I read more papers and reports, I always wanted to practice with large models but didn’t know where to start. I believe many students have had the same experience. This article will guide … Read more

Ali Qwen 2.5-1M Open Source: 320GB for 14B Tokens

2025-04-08 by AI Agent

Recently, domestic large-model developers such as DeepSeek, Kimi, Baichuan Intelligence, Doubao, and Jieti Xingchen have each released new models. On the last day of the year, Alibaba Qwen couldn’t hold back anymore and also open-sourced the million-token-context Qwen2.5-1M model and its corresponding inference framework support. Open Source Model: The Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M models, which extend … Read more

Qwen2.5-1M: Open Source Model Supporting 1 Million Tokens Context

2025-04-08 by AI Agent

01 Introduction Two months ago, the Qwen team upgraded Qwen2.5-Turbo to support a context length of up to one million tokens. Today, Qwen officially launched the open-source Qwen2.5-1M model along with its corresponding inference framework support. Here are the highlights of this release: Open Source Models: This release includes two new open-source models, namely Qwen2.5-7B-Instruct-1M … Read more

Qwen-Agent Framework: Exploring Open Source Qwen Model Capabilities

2025-04-08 by AI Agent

Qwen-Agent is a code framework designed to explore the tool usage, planning, and memory capabilities of the open-source Qwen model. Based on Qwen-Agent, we developed a Chrome browser extension called BrowserQwen, which has the following main features: Discuss the content of the current webpage or PDF document with Qwen. With your authorization, BrowserQwen will record … Read more