In-Depth Study of Qwen 2.5 Paper

In-Depth Study of Qwen 2.5 Paper

Introduction I must say, Qwen is really impressive. It seems that its foundational capabilities have firmly established it as the leader in open source, and it is not at all inferior compared to most closed sources. Many companies’ foundational teams are likely already being judged on the significance of foundational models. Qwen’s open-source momentum is … Read more

Qwen 1.5 Open Source! Best Practices for Magic Adaptation!

Qwen 1.5 Open Source! Best Practices for Magic Adaptation!

In recent months, the Tongyi Qianwen team has been working hard to explore how to build a ‘good’ model while optimizing the developer experience. Just before the Chinese New Year, the Tongyi Qianwen team shared the next version of the Qwen open-source series, Qwen 1.5. Qwen 1.5 has open-sourced six sizes of foundational and chat … Read more

Qwen Technical Report Details Sharing

Qwen Technical Report Details Sharing

Introduction Alibaba open-sourced the Qwen-7B model a long time ago, but for some reason, it was taken down. Just yesterday, Alibaba re-open-sourced the Qwen-14B model (the original 7B model was also released), and simultaneously released the technical report on Qwen. Today, I would like to share this with everyone. PS: Now domestic open-source large models … Read more

Understanding Alibaba’s Qwen Model and Local Deployment

Understanding Alibaba's Qwen Model and Local Deployment

Introduction Overview Pre-training Data Sources Pre-processing Tokenization Model Design Extrapolation Capability Model Training Experimental Results Deployment Testing Alignment Supervised Fine-tuning (SFT) RM Model Reinforcement Learning Alignment Results (Automatic and Human Evaluation) Automatic Evaluation Human Evaluation Deployment Testing Conclusion Introduction This article mainly introduces the Chinese large model Alibaba Qwen, specifically including model details interpretation and … Read more

Interpretation of Qwen2.5 Technical Report

Interpretation of Qwen2.5 Technical Report

MLNLP community is a well-known machine learning and natural language processing community both domestically and internationally, covering NLP master’s and doctoral students, university professors, and corporate researchers. The vision of the community is to promote communication and progress between the academic and industrial circles of natural language processing and machine learning, especially for the advancement … Read more

Introduction to Qwen-Agent: An Open Source Agent Development Framework

Introduction to Qwen-Agent: An Open Source Agent Development Framework

Recommended Reading Multi-Agent Framework Comparison —- Magentic-One, AutoGen, LangGraph, CrewAI, Swarm Open Source Search Engine MiniPerplx: An AI Search Engine Built with Agent Brain Pydantic Agents: A Recommendation System for Context Processing Based on Prompt Injection Top 5 Frameworks for Building Multi-Agents and Their Usage How to Build a General LLM Agent Overview of Foreign … Read more