In-Depth Analysis of DeepSeek by Tsinghua Professor

In-Depth Analysis of DeepSeek by Tsinghua Professor

Recently, CCF-Talk held an online seminar themed “Night Talk on DeepSeek: Technical Principles and Future Directions”. Associate Professor Liu Zhiyuan from Tsinghua University and Chief Scientist of Benwall Intelligence was one of the speakers, delivering an exciting presentation on “Technical Principles of Large Model Reinforcement Learning and Insights on Large Model Technology Development“. Liu Zhiyuan … Read more

In-Depth Analysis of ChatGPT’s Development, Principles, Architecture, and Future

In-Depth Analysis of ChatGPT's Development, Principles, Architecture, and Future

Source: Dolphin Data Science Laboratory This article is approximately 6000 words and is recommended for a 12-minute read. This is a deep technical popular science and interpretation article, without excessive technical terms. [ Introduction ] The author of this article is Dr. Chen Wei, who previously served as the chief scientist of a Huawei-affiliated natural … Read more

Understanding Reinforcement Learning in ChatGPT

Understanding Reinforcement Learning in ChatGPT

Author: Chen Zhiyan This article is about 2400 words long and is recommended for an 8-minute read. This article introduces reinforcement learning in ChatGPT. ChatGPT is based on OpenAI’s GPT-3.5 and is a derivative product of InstructGPT. It introduces a new method of incorporating human feedback into the training process, allowing the model’s output to … Read more

Understanding Machine Learning Through Visuals

Understanding Machine Learning Through Visuals

Source: DeepHub IMBA This article is about 2300 words long and is recommended for an 8-minute read. This article introduces the types of machine learning. Machine Learning Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed. Machine learning can … Read more

Understanding Machine Learning in One Article

Understanding Machine Learning in One Article

This article is reproduced from the public account sigua’s inner voice, a frontline small developer in the mathematics department, updating original articles on data structures and algorithms | deep learning | workplace technologies.This public account is followed by Google TensorFlow programmers, Apple company programmers, Microsoft programmers, a contestant from the ultimate quiz show, CTOs of … Read more

Reinforcement Learning: Enabling Autonomous Decision-Making in Machines

Reinforcement Learning: Enabling Autonomous Decision-Making in Machines

Reinforcement Learning, abbreviated as RL, is a term you will frequently hear in today’s AI research. RL techniques have been applied in many projects, such as AlphaGo. A Brief The algorithms we learned earlier are actually based on supervised learning. In any case, we provided them with label tags. Whether it’s CNN, RNN, GAN, or … Read more

Machine Learning: Definition, Development History, and Algorithm Classification

Machine Learning: Definition, Development History, and Algorithm Classification

1. Definition Machine learning is a multidisciplinary field that encompasses knowledge of probability theory, statistics, approximation theory, and complex algorithms. It uses computers as tools to simulate human learning in real-time and aims to effectively improve learning efficiency by structuring existing knowledge. There are several definitions of machine learning: (1) Machine learning is a science … Read more

Development and Application of Expert Control System for Cement Mill Based on Fuzzy Control and Machine Learning

Development and Application of Expert Control System for Cement Mill Based on Fuzzy Control and Machine Learning

Abstract The cement grinding process has nonlinear, large time-delay, and time-varying characteristics, making it difficult to establish an accurate cement mill model. The intelligent expert control system for cement mills can achieve intelligent control of the cement production process, which is a core control link for realizing intelligent manufacturing in cement production. It can reduce … Read more

Research on Intelligent Quantitative Trading System Based on Deep Hybrid Architecture

Research on Intelligent Quantitative Trading System Based on Deep Hybrid Architecture

Source: DeepHub IMBA This article is approximately 5500 words, recommended reading time is over 10 minutes. This article explores the hybrid modeling method that combines temporal features and static features in the field of quantitative trading. By integrating Stacked Sparse Denoising Autoencoder (SSDA) and Long Short-Term Memory based Autoencoder (LSTM-AE), we aim to build a … Read more

Introduction to Reinforcement Learning with DI-engine: Using RNN

Introduction to Reinforcement Learning with DI-engine: Using RNN

1. Data Processing The mini-batch data used for training RNNs differs from the usual data. This data should typically be arranged in a time series. For DI-engine, this processing is done during the collector phase. Users need to specify learn_unroll_len in the configuration file to ensure that the length of the sequence data matches the … Read more