Deep Learning (2) – Restricted Boltzmann Machine

Deep Learning (2) - Restricted Boltzmann Machine

Continuing from the previous article Deep Learning (1) – Overview, Distributed Representation, and Ideas 9. Common Models or Methods in Deep Learning 9.1 AutoEncoder The specific process is briefly described as follows: 1) Given unlabeled data, learn features using unsupervised learning: In our previous neural network, as shown in the first diagram, the samples we … Read more

Analysis of Qwen2.5 Coder Training Process and Data Distribution

Analysis of Qwen2.5 Coder Training Process and Data Distribution

I have read some papers and training data on Qwen2.5 Coder and summarized them. Paper link: https://arxiv.org/pdf/2409.12186 1. Introduction The Qwen2.5-Coder series is a major upgrade from its predecessor CodeQwen1.5, aimed at achieving top-notch code task performance across various model sizes. This series includes six models: Qwen2.5-Coder-0.5B Qwen2.5-Coder-1.5B Qwen2.5-Coder-3B Qwen2.5-Coder-7B Qwen2.5-Coder-14B Qwen2.5-Coder-32B The architecture of … Read more

Visualizing LSTM Networks: Exploring Memory Formation

Visualizing LSTM Networks: Exploring Memory Formation

Selected from Medium Author: Piotr Tempczyk Translated by Machine Heart Contributors: Chen Yunzhu, Liu Xiaokun There are many studies on visualization in the field of convolutional neural networks, but there are not enough similar tools for LSTM. Visualizing LSTM networks can yield interesting results; due to their time-related characteristics, we can explore the relationships between … Read more

How to Use DeepFake for Face Swapping in Videos

How to Use DeepFake for Face Swapping in Videos

Machine Heart Release Author: Feng Qinyuan Not long ago, the AV video face-swapping star DeepFake became popular. This article will guide you step by step on how to achieve face swapping. If you are hearing about DeepFake for the first time, be sure to click the video above to personally experience how Nicolas’s face occupies … Read more

DeepMind Scientist Analyzes Diffusion Models from Eight Perspectives

DeepMind Scientist Analyzes Diffusion Models from Eight Perspectives

Machine Heart Compilation Author: Sander Dieleman Editor: Panda W Diffusion models are very popular, and their descriptions vary widely. In this article, a DeepMind research scientist provides a comprehensive analysis of the topic “What is a diffusion model?” If you’ve tried one of the most popular AI painting tools, Stable Diffusion, then you’ve already experienced … Read more