Object Tracking Based on YOLO and Computer Vision

Object Tracking Based on YOLO and Computer Vision

Click on the above “Beginner Learning Vision” to select Star or Pin. Important content delivered immediately YOLO is a deep learning algorithm used for real-time object detection. You can use pre-trained YOLO models like YOLOv8 or YOLOv9, or train your own model on a custom dataset as needed. In this article, I will guide you … Read more

In-Depth Analysis of Kaiming He’s Team’s Four Major Works

In-Depth Analysis of Kaiming He's Team's Four Major Works

Source: New Intelligence Author: krish This article is 5000 words long and is recommended for a reading time of 10+ minutes. This article introduces some of the latest innovations from Kaiming He, Tsung-Yi Lin, and their team at FAIR in the field of computer vision, including Feature Pyramid Networks, RetinaNet, Mask R-CNN, and weakly supervised … Read more

Detailed Module Analysis of DETR Structure

Detailed Module Analysis of DETR Structure

Transformers shine in the field of computer vision, and the Detection Transformer (DETR) is a successful application of Transformers in object detection. By utilizing the attention mechanism in Transformers, it effectively models long-range dependencies in images, simplifying the object detection pipeline and constructing an end-to-end object detector. Object detection can be understood as a set … Read more

A Comprehensive Review of Multi-Modal Fusion Perception in Autonomous Driving

A Comprehensive Review of Multi-Modal Fusion Perception in Autonomous Driving

Introduction Multi-modal fusion is a crucial task in the perception of autonomous driving systems. This article will detail the multi-modal perception methods for autonomous driving, including object detection and semantic segmentation tasks involving LiDAR and cameras. From the perspective of the fusion stage, existing solutions are categorized into data-level, feature-level, object-level, and asymmetric fusion. Furthermore, … Read more

5 Key Technologies of Machine Vision and Their Common Applications

5 Key Technologies of Machine Vision and Their Common Applications

Click the blue text Follow us Shanghai Civil Engineering Society “ Computer vision refers to the process of enabling machines to simulate human vision through visual information such as digital images or videos, achieving understanding, recognition, classification, tracking, and reconstruction of objects. It is a branch of artificial intelligence that involves multiple fields including image … Read more

Cross-Domain Models: Using Transformers for Object Detection

Cross-Domain Models: Using Transformers for Object Detection

Report by Machine Heart Contributors: Racoon, Du Wei, Zhang Qian Since its introduction in 2017, the Transformer has swept the entire NLP field, with popular models like BERT and GPT-2 adopting Transformer-based architectures.Since it is so effective, why not apply it to CV?Recently, researchers at Facebook AI have attempted this by applying Transformers to object … Read more

YOLT: A YOLO-Based Framework for Large-Scale Image Recognition

YOLT: A YOLO-Based Framework for Large-Scale Image Recognition

This article shares a satellite image object detection framework based on an improved YOLO v2 — YOLT, which provides a good idea for many friends facing challenges in large satellite image recognition during the recent Amazon Web Services 【AI For Good – 2022】 competition. Currently, the YOLT framework has been updated to v4 and is … Read more

Generate Subtitles Using Python and OpenCV

Generate Subtitles Using Python and OpenCV

Click on the above “Beginner’s Visual Learning” to select and add Star or Top. Important content delivered at the first time View the code on GitHub https://github.com/antoneev/66DaysOfData/tree/main/captionSuggestionsUsingLyrics View the deployed application https://share.streamlit.io/antoneev/66daysofdata/main/captionSuggestionsUsingLyrics/app.py Table of Contents Introduction Color Detection Object Detection Similar Word Suggestions Lyrics Genius API Main Function Streamlit Deployment Resources Introduction The purpose of … Read more

Recommended Computer Vision Projects from Microsoft Research Asia

Recommended Computer Vision Projects from Microsoft Research Asia

ClickI Love Computer Vision to star and get CVML new technologies faster. Microsoft Research Asia has produced a lot of cutting-edge technologies in computer vision, as well as numerous high-quality open-source projects that are highly regarded. For those who love CV, the following recommendations are not to be missed. Additionally, at the end of the … Read more

Understanding Object Detection in Computer Vision

Understanding Object Detection in Computer Vision

Click the "Xiaobai Learns Vision" above, select to add "Star" or "Top" Heavyweight content delivered to you first Editor’s Recommendation Object Detection is a popular direction in the field of Computer Vision, widely used in autonomous driving, industrial inspection, video surveillance, and aerospace, among others. Its basic process involves locating the target of interest in … Read more