SlimPajama: Cerebras’ Latest Commercial-Grade Language Model Dataset

A high-quality, large-scale dataset is a critical prerequisite for training large language models. To promote the open-source large-model ecosystem, Cerebras has released SlimPajama, a massive, very high-quality text dataset that can be used to train large language models. Cerebras is an American AI chip …

Worried About Prompt Leaking Privacy? This Framework Enables Secure Inference for LLaMA-7B

Machine Heart Report. Editor: Panda. Many providers now offer deep learning services. To use them, users must send the information contained in their prompts to the provider, which can leak private data. Service providers, for their part, are generally unwilling to disclose the model parameters they have painstakingly …

Building Local Large Model Programming Tools with PyCharm, Ollama, DeepSeek-Coder, and CodeGPT

1. PyCharm
PyCharm is a widely used Python IDE; simply download and install the Community Edition. Download link: https://www.jetbrains.com/zh-cn/pycharm/. Installation is straightforward and needs no further explanation.
2. Ollama
Ollama is a tool for running and customizing large language models in local environments. It provides a simple yet efficient interface for …
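As a rough illustration of the local interface Ollama exposes, the sketch below builds a request for Ollama's HTTP endpoint `/api/generate` on its default port 11434. The model tag `deepseek-coder`, the helper names, and the choice of a non-streaming request are assumptions made for this example, not details taken from the article; a running Ollama server with the model already pulled is needed before `generate` can return anything.

```python
import json
import urllib.request

# Default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt, model="deepseek-coder", url=OLLAMA_URL):
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="deepseek-coder"):
    """Send the request and return the generated text.

    Requires a running Ollama server; the model tag is an assumption.
    """
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `generate("Write a Python function that reverses a string")` would return the model's completion as a string.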

Building A Local Document Intelligence Stack Using Docling, Ollama, Phi-4

Still troubled by the uneven quality and poor performance of AI tools in China? Take a look at Dev Cat AI (3in1), an integrated AI assistant that combines GPT-4, Claude3, and Gemini into one. It covers all models of the three AI tools, including GPT-4o and Gemini flash. Now …

Building Dify Platform Based on Ollama Backend (Part 1): Installation

Purpose: Establish your own workflow through Dify to enhance personal AI capabilities. Goal: Run Dify locally in Docker containers to build your own workflow. This article covers only the deployment and installation of Dify. First, the origin of the name Dify: Do It For You. Official Dify GitHub address: https://github.com/langgenius/dify …

XRAG-Ollama: Enabling Lightweight Local RAG Framework Deployment

XRAG supports comprehensive RAG evaluation benchmarks and toolkits, covering more than 50 test metrics for thorough evaluation and optimization of RAG failure points. It supports comparison across four types of advanced RAG modules (query rewriting, advanced retrieval, question-answering models, post-processing), integrates multiple concrete implementations within each module, and supports the OpenAI large model API. The …

In-Depth Analysis of RL Strategies in Mainstream Open-Source LLMs

The author works at Meta as an internet industry practitioner, focusing on LLM4Code and LLM infra. Original text on Zhihu (https://zhuanlan.zhihu.com/p/16270225772). This article is for academic/technical sharing only. If there is any infringement, please contact us for removal. RLHF is an important part of LLM training. With the development of open-source models, we observe that some …

Deploy Your Local Large Language Model Knowledge Base with Ollama + Open WebUI

Install and set up Ollama (https://ollama.com/). After downloading and installing, Ollama starts automatically with the system by default, and models are installed to the system disk by default. There is no need to keep auto-start enabled, since you can start Ollama whenever you want; then set the default installation path for models. Delete …
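One common way to change where Ollama stores models is the `OLLAMA_MODELS` environment variable, which the server reads at startup. The sketch below is a minimal illustration under that assumption; the helper names and the example directory are hypothetical, and the article itself may configure the path differently (for example through system settings).

```python
import os
import subprocess
from pathlib import Path


def ollama_env(models_dir, base_env=None):
    """Return a copy of the environment with OLLAMA_MODELS set to models_dir."""
    env = dict(os.environ if base_env is None else base_env)
    env["OLLAMA_MODELS"] = str(Path(models_dir))
    return env


def start_server(models_dir):
    """Launch `ollama serve` manually with the custom model directory.

    Requires the `ollama` binary on PATH; models_dir is a hypothetical path.
    """
    return subprocess.Popen(["ollama", "serve"], env=ollama_env(models_dir))
```

Starting the server on demand with `start_server("/data/ollama/models")` fits the approach described above: auto-start stays off, and models land outside the system disk.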

Integrating Ollama Model with SiYuan Note

This is a simple record, and also a solution to some AI configuration issues in SiYuan Note. It mainly covers how to use a local Ollama model for the AI features in SiYuan Note. The main plugin used is SiYuan Note’s ‘Publishing Tool’ (surprising, right?).
1. Why
First, why do this? First, it’s …

Llama 3.2 Model Fine-Tuning Guide

1. Basic Description
   1. The models from Ollama are all stripped-down versions.
   2. They are relatively weak in distributed environments.
   3. Ollama is lightweight and suitable for individual users, but not for enterprise users.
   4. Ollama only performs model inference (used for running models).
   5. Ollama uses GGUF for quantization.
   6. Engineering deployment generally does …