30 Essential Computer Vision Projects for Beginners

There are many questions online about how to get started in computer vision. Most authors emphasize that practical projects and theoretical learning are equally important, as over-focusing on theory while neglecting practice can easily lead to the path of ‘from beginner to giving up.’

Although there is a consensus on this, there is no systematic summary online about which projects beginners in CV should start with. Therefore, this article will organize some suitable projects for newcomers in the field of computer vision, categorized into five main tasks: Object Detection, Object Tracking, Image Segmentation, Image Classification, and Image Generation. A brief introduction and links to these projects will also be compiled in the article. Since external links cannot be inserted, you can click on ‘Read the original text’ at the end of the article. I hope this article can truly help many beginners in the field of computer vision!

Object Detection

1. License Plate Recognition using YOLOv4, OpenCV, and Tesseract OCR

30 Essential Computer Vision Projects for Beginners

Project Link: github.com/theAIGuysCod

2. YOLO V5 Object Detection using Pytorch

YOLOv5 is a series of object detection architectures and models pre-trained on the COCO dataset, representing Ultralytics’ open research on future visual AI methods, which incorporates the experience and best practices gained from thousands of hours of research and development.

Demo Link: youtube.com/watch?

Project Link: github.com/ultralytics/

Colab Link: colab.research.google.com

3. Face Mask Detection using Python, Keras, OpenCV, and MobileNet | Detect Masks in Real-Time Video Streams

Project Link: github.com/balajisriniv

4. YOLOR Card Object Detection DEMO

Demo Link: youtube.com/watch?

Project Link: github.com/WongKinYiu/y

5. Fire Detection / Disaster Prevention / Real-Time / PyTorch, Python Tutorial, OpenCV, YOLOv5

Demo Link: When YOLOv5 Meets Fire Detection!

Project Link: github.com/ai-coordinat

6. YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors

Demo Link: YOLOv7 – Object Detection Algorithm

Project Link: github.com/WongKinYiu/y

7. Drowning Detection

Demo Link: Computer Vision Application | Drowning Recognition

Project Link: github.com/Nico31415/Dr

Object Tracking

1. Multi-Object Tracking using YOLOv5

Demo Link: Multi-Object Tracking with YOLOv5! DeepSort to the Rescue!

Project Link: github.com/anil2k/YOLOv

2. Object Tracking Demo using dlib + opencv-python

Demo Link: Object Tracking Demo using dlib + opencv-python (with Project Link)

Project Link: github.com/bikz05/objec

3. A Discriminative Single-Stage Segmentation Tracker

Demo Link: youtube.com/watch?

Project Link: github.com/alanlukezic/

4. Monocular Quasi Dense 3D Object Tracking

Demo Link: youtube.com/watch?

Project Link: github.com/SysCV/qd-3dt

5. TraDes: Tracking Object Detection and Segmentation: Online Multi-Object Tracker

Demo Link: youtube.com/watch?

Project Link: github.com/JialianW/Tra

6. Global Instance Tracking (GIT) Task

Demo Link: A General Benchmark for Visual Tracking Intelligence Evaluation

Project Link: github.com/huuuuusy/vid

7. YOLO v7 + SORT for Object Detection Tracking

Demo Link: Source Code! YOLO v7 + SORT for Object Detection Tracking (Windows & Linux)

Project Link: patreon.com/TheCodingBu

Image Segmentation

1. Weakly Supervised Semantic Segmentation (WSSS) New Framework: Explicit Pseudo Pixel Supervision (EPS)

Demo Link: Weakly Supervised Semantic Segmentation (WSSS) New Framework: Explicit Pseudo Pixel Supervision (EPS) | CVPR2021

Project Link: github.com/halbielee/EP

2. Self-Supervised Depth Estimation to Enhance Semantic Segmentation Performance

Demo Link: Self-Supervised Depth Estimation to Enhance Semantic Segmentation Performance | CVPR2021

Project Link: github.com/lhoyer/impro

3. Unsupervised Learning Video Segmentation Method for Effective Learning of Spatiotemporal Correspondences

Demo Link: Unsupervised Learning Video Segmentation Method! Effective Learning of Spatiotemporal Correspondences | NeurIPS 2021

Project Link: GitHub – visinf/dense-ulearn-vos: Dense Unsupervised Learning for Video Segmentation (NeurIPS*2021)

4. Multi-Modal Transformer for Video Segmentation

Demo Link: CVPR2022 | Multi-Modal Transformer for Video Segmentation Effects Stunning

Project Link: github.com/mttr2021/MTT

5. Query-Based Instance Segmentation

Demo Link: Huazhong University of Science and Technology & Tencent Proposed: Query-Based Instance Segmentation! Code has been Open-Source | ICCV2021

Project Link: github.com/hustvl/Query

6. UNet for Medical Image Segmentation

Demo Link: A Step-by-Step Guide to Using UNet for Medical Image Segmentation

Project Link: github.com/肆十二/unet_42

Image Classification

1. Image Classification using Keras, TensorFlow | Cat and Dog Prediction | Convolutional Neural Network

Demo Link: youtube.com/watch?

Project Link: github.com/balajisriniv

2. Image Classification using CNN (CIFAR10 dataset) | Deep Learning Tutorial (TensorFlow & Python)

Demo Link: youtube.com/watch?

Project Link: github.com/codebasics/d

3. Image Classification With Deep Learning And TensorFlow: Intro Project

In this project, we will complete an end-to-end deep learning project using TensorFlow and Keras. We will read a dataset of dog images and train a convolutional neural network to classify them by breed. Finally, you will learn how to use Keras to train and optimize neural networks, and you will also learn how to process images using Python.

Demo Link: youtube.com/watch?

Project Link: github.com/dataquestio/

4. Real Python Neural Network Tutorial (Image Classification w/ CNN) | TensorFlow & Keras

Demo Link: youtube.com/watch?

Project Link: github.com/KeithGalli/n

Colab Link: colab.research.google.com

5. MNIST Handwritten Digit Recognition

Demo Link: Pure Python Implementation of CNN for Handwritten Digit Recognition + GUI Display MNIST Dataset [Source Code Attached]

Project Link: github.com/hamlinzheng/

Image Generation

1. Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

Demo Link: snap-research.github.io

Project Link: github.com/snap-researc

2. Barbershop: GAN-based Image Compositing using Segmentation Masks

Demo Link: Barbershop: GAN-based Image Compositing using Segmentation Masks | SIGGRAPH Asia 2021

Project Link: github.com/ZPdesu/Barbe

3. Pose-Controlled Face Generation

Demo Link: youtube.com/watch?

Project Link: github.com/Hangz-nju-cu

4. PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

Demo Link: PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering | ICCV2021

Project Link: github.com/RenYurui/PIR

5. Neural Texture Extraction and Distribution for Controllable Character Image Synthesis

Demo Link: CVPR2022 | Neural Texture Extraction and Distribution for Controllable Character Image Synthesis! Open Source

Project Link: github.com/RenYurui/Neu

1. License Plate Recognition using YOLOv4, OpenCV, and Tesseract OCR

2. YOLO V5 Object Detection using Pytorch

3. Face Mask Detection using Python, Keras, OpenCV, and MobileNet | Detect Masks in Real-Time Video Streams

4. YOLOR Card Object Detection DEMO

5. Fire Detection / Disaster Prevention / Real-Time / PyTorch, Python Tutorial, OpenCV, YOLOv5

6. YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors

7. Drowning Detection

Object Tracking

1. Multi-Object Tracking using YOLOv5

2. Object Tracking Demo using dlib + opencv-python

3. A Discriminative Single-Stage Segmentation Tracker

4. Monocular Quasi Dense 3D Object Tracking

5. TraDes: Tracking Object Detection and Segmentation: Online Multi-Object Tracker

6. Global Instance Tracking (GIT) Task

7. YOLO v7 + SORT for Object Detection Tracking

1. Weakly Supervised Semantic Segmentation (WSSS) New Framework: Explicit Pseudo Pixel Supervision (EPS)

2. Self-Supervised Depth Estimation to Enhance Semantic Segmentation Performance

3. Unsupervised Learning Video Segmentation Method for Effective Learning of Spatiotemporal Correspondences

4. Multi-Modal Transformer for Video Segmentation

5. Query-Based Instance Segmentation

6. UNet for Medical Image Segmentation

Image Classification

1. Image Classification using Keras, TensorFlow | Cat and Dog Prediction | Convolutional Neural Network

2. Image Classification using CNN (CIFAR10 dataset) | Deep Learning Tutorial (TensorFlow & Python)

3. Image Classification With Deep Learning And TensorFlow: Intro Project

4. Real Python Neural Network Tutorial (Image Classification w/ CNN) | TensorFlow & Keras

5. MNIST Handwritten Digit Recognition

Image Generation

1. Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

2. Barbershop: GAN-based Image Compositing using Segmentation Masks

3. Pose-Controlled Face Generation

4. PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

5. Neural Texture Extraction and Distribution for Controllable Character Image Synthesis

Leave a Comment Cancel reply