The Intelligent Era: How AI Agents Reshape the World

The Intelligent Era: How AI Agents Reshape the World

Hong Mei: CEO of an AI robotics company, focusing on the application of artificial intelligence for B-end customer transactions, empowering individuals to grow into AI super individuals. In the boundless universe of technology, AI Agents (智能体/智能代理) shine like a dazzling morning star, completely overturning our life and work patterns with unimaginable speed and power. Are … Read more

How AI Transformed My Work Efficiency: Insights from Nicholas Carlini

How AI Transformed My Work Efficiency: Insights from Nicholas Carlini

Source | AI Researcher Introduction This article showcases the application of large language models (LLMs) in real work through the personal experience of Nicholas Carlini and discusses how AI technology is changing our work methods and enhancing productivity. Table of Contents: 1. Nuances 2. My Background 3. How to Utilize Language Models 4. Evaluating the … Read more

Phi Series Models: Small Size, Big Impact

Phi Series Models: Small Size, Big Impact

Today, Microsoft released the Phi3 model, which achieves results comparable to Mixtral-8x7B with a compact size of 3.8B, causing quite a stir in the community. Teacher Fuyao exclaimed, “Cannot compare to Li Jie!” A while ago, I tried to finetune the Phi2 model, and to be honest, the results were not very ideal. The default … Read more

How Kimi-k1.5 is Developed?

How Kimi-k1.5 is Developed?

Yesterday, everyone was shouting about the New Year! Kimi has released a rare technical report, let’s take a look at the technical details. The technical report is here: https://github.com/MoonshotAI/Kimi-k1.5 First, as always, Kimi emphasizes long context. So the question arises: if we provide the model with a longer ‘thinking space’, can it naturally learn to … Read more

Llama 3.2 Reasoning WebGPU: A Powerful In-Browser Model

Llama 3.2 Reasoning WebGPU: A Powerful In-Browser Model

Llama 3.2 Reasoning WebGPU: A compact and powerful reasoning language model that runs in the browser, akin to putting an intelligent brain into a webpage, enabling quick understanding and reasoning of various issues. References: [1] http://github.com/huggingface/transformers.js-examples/tree/main/llama-3.2-reasoning-webgpu Welcome to support my knowledge platform (NLP Engineering): Dify source code analysis and Q&A, Dify dialogue system source code, … Read more

Tang Guoliang Llama Model Architecture: Theory to Practice

Tang Guoliang Llama Model Architecture: Theory to Practice

Follow the official account above to reply:Course Resources can be obtained from this course There is a course on Tang Guoliang Llama model architecture from theory to practice Tang Guoliang Llama model architecture from theory to practice Tang Guoliang Llama Model Architecture: From Theory to Practice In today’s era of rapid advancement in artificial intelligence, … Read more

2024 AIGC Industry Research: Multimodal Large Models and Commercial Applications

2024 AIGC Industry Research: Multimodal Large Models and Commercial Applications

Sora has once again ignited the AIGC industry, accelerating the arrival of the AGI era. Author|36Kr Research Institute Source|36Kr Research Institute (ID: kr_research) Cover Source| Visual China In February 2024, OpenAI released its first video generation model, Sora, allowing users to generate high-definition videos with smooth scene transitions and clear details by simply inputting a … Read more

Neural Network Transfer Learning for Natural Language Processing

Neural Network Transfer Learning for Natural Language Processing

Recommended by New Intelligence Yuan Source: Zhuangzhi (ID: Quan_Zhuanzhi) [New Intelligence Yuan Guide] In reality, natural language processing faces various types of tasks across multiple domains and languages, making it impractical to label data for each task individually. Transfer learning allows for the transfer of learned knowledge to related scenarios. This article introduces Dr. Sebastian … Read more