Best Computer Vision Projects With Source Code and Datasets
Building a simple computer vision model is not rocket science — all you need is access to quality data and a reliable training data platform to get started. Check out our ideas for computer vision projects for beginners and start building!
Here’s what we’ll cover:
1. People counting tool
2. Colors detection
3. Object tracking in a video
4. Pedestrian detection
5. Hand gesture recognition
6. Human emotion recognition
7. Road lane detection
8. Business card scanner
9. License plate recognition
10. Handwritten digit recognition
11. Iris Flowers Classification
12. Family photo face detection
13. LEGO Brick Finder
14. PPE Detection
15. Face mask detection
16. Traffic light detection
1. People counting tool
Building a people-counting solution can be both a fun project and one with real-world applications.
To detect and count people present in an image, you’ll need a relevant training dataset and a training data platform. You can use a free tool like OpenCV to label your data or an auto-annotation tool like V7 to complete this project faster.
Since the COVID-19 outbreak, people counting solutions have been growing in popularity, helping to enforce social distancing rules and improve safety.
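Once a detector returns boxes, the counting step itself is short. The sketch below is only an illustration: it assumes a hypothetical detector output format of (x1, y1, x2, y2, score) tuples and uses simple non-maximum suppression so that overlapping boxes for the same person are counted once. The function names and thresholds are assumptions, not part of any particular tool.

```python
from typing import List, Tuple

# Hypothetical detector output: (x1, y1, x2, y2, confidence score)
Detection = Tuple[float, float, float, float, float]


def iou(a: Detection, b: Detection) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def count_people(detections: List[Detection],
                 min_score: float = 0.5,
                 iou_threshold: float = 0.5) -> int:
    """Count people after dropping weak and duplicate detections."""
    # Keep confident detections only, highest score first.
    candidates = sorted(
        (d for d in detections if d[4] >= min_score),
        key=lambda d: d[4],
        reverse=True,
    )
    kept: List[Detection] = []
    for det in candidates:
        # Keep a box only if it does not overlap heavily with a kept one.
        if all(iou(det, k) < iou_threshold for k in kept):
            kept.append(det)
    return len(kept)


# Made-up detections for illustration only.
detections = [
    (10, 10, 50, 100, 0.92),    # person A
    (12, 12, 52, 102, 0.80),    # near-duplicate box for person A
    (200, 20, 240, 110, 0.75),  # person B
    (400, 30, 430, 90, 0.30),   # low confidence, ignored
]
print(count_people(detections))
```

The two thresholds are worth tuning on your own data: a lower IoU threshold merges more boxes, which can undercount people standing close together.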
Here’s a recommended dataset to get you started:
- People Counting Dataset (PCDS)
- Source Code
2. Colors detection
Next up is a simple color detector that you can use for a wide variety of visual tasks.
From building a green screen app (replacing the green background with a custom image or video) to simple photo editing software, a color recognizer is a great project for getting started with computer vision.
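Here is a rough sketch of both ideas using only Python's standard library. It converts an RGB pixel to HSV and names the color from its hue, then reuses that to swap green pixels for a background. The hue ranges, the thresholds, and the nested-list image format are assumptions chosen for illustration. A real project would more likely work on image arrays.

```python
import colorsys


def color_name(r: int, g: int, b: int) -> str:
    """Name an RGB color (0-255 per channel) from its hue.

    The boundaries below are illustrative assumptions, not a standard.
    """
    h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
    if v < 0.2:
        return "black"
    if s < 0.15:
        return "white" if v > 0.8 else "gray"
    hue = h * 360  # hue in degrees
    if hue < 15 or hue >= 345:
        return "red"
    if hue < 45:
        return "orange"
    if hue < 75:
        return "yellow"
    if hue < 165:
        return "green"
    if hue < 195:
        return "cyan"
    if hue < 265:
        return "blue"
    if hue < 295:
        return "purple"
    return "magenta"


def chroma_key(foreground, background):
    """Replace green foreground pixels with the matching background pixel.

    Both images are nested lists of (r, g, b) tuples with the same size.
    """
    return [
        [bg if color_name(*fg) == "green" else fg
         for fg, bg in zip(fg_row, bg_row)]
        for fg_row, bg_row in zip(foreground, background)
    ]


print(color_name(255, 0, 0))   # a pure red pixel
print(color_name(0, 255, 0))   # a pure green pixel
```

Working in HSV rather than RGB keeps the color decision mostly independent of brightness, which helps when lighting varies across the image.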
Here are a few interesting datasets you might want to use for your project:
3. Object tracking in a video
Object tracking is about estimating the state of the target object present in the scene from previous information.
You can build simple object tracking models using videos involving one object, such as a car, or multiple objects like pedestrians, animals, and whatnot.
Essentially, the model performs two tasks: predicting the object’s next state, and then correcting that prediction using the object’s actual observed state. Object tracking models find applications in traffic control and human-computer interaction.
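To make the predict-and-correct loop concrete, here is a minimal sketch using an alpha-beta filter on a single coordinate. This is one simple technique among many (Kalman filters are a common alternative), and the gains `alpha` and `beta` as well as the sample measurements are assumptions chosen for illustration.

```python
from typing import List


def track(measurements: List[float],
          dt: float = 1.0,
          alpha: float = 0.5,
          beta: float = 0.1) -> List[float]:
    """Track one coordinate of an object with an alpha-beta filter.

    Returns the corrected position estimate after each new measurement.
    """
    position = measurements[0]  # start from the first observation
    velocity = 0.0              # assume the object starts at rest
    estimates = []
    for observed in measurements[1:]:
        # Predict: move the object forward using the current velocity.
        predicted = position + velocity * dt
        # Correct: blend the prediction with what was actually observed.
        residual = observed - predicted
        position = predicted + alpha * residual
        velocity = velocity + (beta / dt) * residual
        estimates.append(position)
    return estimates


# Made-up noisy x-positions of an object moving to the right.
xs = [0.0, 1.1, 1.9, 3.2, 3.9, 5.1]
print(track(xs))
```

A higher `alpha` trusts the measurements more, while a lower one trusts the prediction more and smooths out noise. To track a point in an image, you could run the same loop for the x and y coordinates separately.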
Here are a few video datasets you might find interesting for this computer vision task:
4. Pedestrian detection
Building an object detection model to detect pedestrians is one of the simplest and fastest computer vision projects to complete.
All you need is a relevant dataset of high-quality images and a training data platform to train and test your model. You can use one of the free image annotation tools to label your data.
Pedestrian detectors are commonly used in the automotive industry for traffic safety as well as human-robot interactions and intelligent video systems.
Consider these datasets to get started:
- Caltech Pedestrian Dataset
- Penn-Fudan Database for Pedestrian Detection
- Pedestrian Detection Dataset (Kaggle)
5. Hand gesture recognition
Hand gesture recognition is a slightly more advanced computer vision task: you first separate the hand region from the background and then segment the fingers to predict the gesture.
You can use OpenCV if you want to keep your model simple. After training, you can test your model using a webcam. Hand gesture models can be used in VR games and sign language recognition.
Check out these datasets to get started:
- Hand Gestures of digits from 0 to 5
- Hand Gesture Recognition Database
- Multi-Modal Hand Gesture Dataset
6. Human emotion recognition
If you’d like a slightly more challenging task, consider building an emotion detection model. You can base your model on six main facial emotions: happiness, sadness, anger, fear, disgust, and surprise.
The three main components of this project are image pre-processing, feature extraction, and feature classification.
Here are the datasets that might come in handy:
7. Road lane detection
Road lane detection is yet another computer vision task that plays a key role in the development of the automotive industry.
Used primarily for self-driving cars, a road lane detector can be a fun beginner project that will help you get hands-on experience with both images and videos.
Here are a couple of datasets to help you out:
8. Business card scanner
You can build a business card scanner using Optical Character Recognition (OCR). Your trained model will find and extract information from business cards.
Essentially, this project is divided into three phases: image processing (noise removal), OCR (text extraction), and classification (labeling key properties of the extracted text).
You can use your business card reader to automate data entry.
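The classification phase can start out very simple. The sketch below assumes the OCR step has already produced a list of text lines and sorts them into fields with regular expressions. The patterns, field names, and sample card are illustrative assumptions and will not cover every real-world format.

```python
import re
from typing import Dict, List

# Deliberately loose patterns, for illustration only.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(\.[\w-]+)+")
URL_RE = re.compile(r"(https?://|www\.)\S+", re.IGNORECASE)
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")


def classify_line(line: str) -> str:
    """Assign one line of OCR text to a field type."""
    text = line.strip()
    if EMAIL_RE.search(text):
        return "email"
    if URL_RE.search(text):
        return "website"
    if PHONE_RE.search(text):
        return "phone"
    return "other"  # names, job titles, addresses, and so on


def parse_card(lines: List[str]) -> Dict[str, List[str]]:
    """Group the OCR lines of one card by field type."""
    fields: Dict[str, List[str]] = {}
    for line in lines:
        if not line.strip():
            continue  # skip blank lines from the OCR output
        fields.setdefault(classify_line(line), []).append(line.strip())
    return fields


# Made-up OCR output for a fictional card.
ocr_lines = [
    "Jane Doe",
    "Senior Engineer",
    "jane.doe@example.com",
    "+1 (555) 010-0199",
    "www.example.com",
]
print(parse_card(ocr_lines))
```

Lines that land in "other" are where a trained classifier would earn its keep, for example to tell a person's name from a company name.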
Pick one of these datasets to begin:
9. License plate recognition
A license plate recognizer is another idea for a computer vision project using OCR.
However, there are two challenges related to this project: data collection and the differences in license plate formats depending on the location/country.
Therefore, your model might not be accurate unless you train it on large amounts of data (if you manage to obtain it).
Note: License plate numbers are considered sensitive data, so make sure you stick to the publicly available datasets when building your models.
A simple automatic license plate recognition system can use basic image processing techniques, and you can build it using OpenCV and Python.
However, more advanced systems use object detectors like YOLO or Fast R-CNN.
Automatic license plate recognition can be used for security, parking, smart cities, automatic toll collection, and access control.
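As an example of the basic image processing approach, one common trick is to filter candidate regions by shape before running OCR, since plates are wide rectangles. The sketch below assumes you already have bounding boxes (for instance from contour detection) as (x, y, width, height) tuples. The aspect ratio and area limits are assumptions that you would need to adjust for the plate formats in your region.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def plate_candidates(boxes: List[Box],
                     min_ratio: float = 2.0,
                     max_ratio: float = 6.0,
                     min_area: int = 1000) -> List[Box]:
    """Keep boxes whose shape could plausibly be a license plate.

    The default limits are illustrative guesses, not standard values.
    """
    kept = []
    for x, y, w, h in boxes:
        if h <= 0:
            continue  # ignore degenerate boxes
        ratio = w / h
        if min_ratio <= ratio <= max_ratio and w * h >= min_area:
            kept.append((x, y, w, h))
    return kept


# Made-up boxes for illustration only.
boxes = [
    (10, 10, 120, 30),  # wide rectangle: plausible plate
    (50, 50, 40, 40),   # square: rejected
    (0, 0, 30, 8),      # right shape but too small: rejected
    (5, 5, 400, 20),    # far too elongated: rejected
]
print(plate_candidates(boxes))
```

Only the surviving regions need to be passed on to the OCR step, which keeps the pipeline fast and cuts down on false readings.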
Here are a few datasets you might consider:
10. Handwritten digit recognition
This project is a perfect start for computer vision newbies — you can build a simple digit recognizer using the MNIST dataset.
By training your model with a convolutional neural network (CNN), you’ll learn how to develop, evaluate, and use CNNs for image classification.
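If you are curious about what a convolutional layer actually computes, here is a tiny pure-Python sketch of the core operation: sliding a small kernel over an image and summing the products. Deep learning frameworks do this far more efficiently and learn the kernel values during training. The edge-detecting kernel and the 4x4 image below are made up for illustration.

```python
from typing import List

Grid = List[List[float]]


def conv2d(image: Grid, kernel: Grid) -> Grid:
    """Slide the kernel over the image with no padding and stride 1.

    Like most deep learning libraries, this does not flip the kernel,
    so strictly speaking it is a cross-correlation.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def relu(feature_map: Grid) -> Grid:
    """Set negative responses to zero, as a ReLU activation would."""
    return [[max(0, v) for v in row] for row in feature_map]


# A 4x4 image that is dark on the left and bright on the right.
image = [[0, 0, 1, 1] for _ in range(4)]
# A hand-written kernel that responds to dark-to-bright vertical edges.
kernel = [[-1, 1],
          [-1, 1]]
print(relu(conv2d(image, kernel)))
```

The output is strongest in the middle column, exactly where the edge sits. A CNN stacks many such filters so that later layers can respond to strokes, loops, and eventually whole digits.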
The MNIST dataset contains a training set of 60,000 examples and a test set of 10,000 examples. You can access it here:
11. Iris Flowers Classification
Here’s another computer vision project based on one of the most popular and thus readily available datasets for pattern recognition — Iris Flowers Classification Dataset.
It contains three classes of 50 instances each, where each class refers to a type of iris plant. Note that each instance is a set of four numeric measurements (sepal length, sepal width, petal length, and petal width) rather than an image, so this is a great beginner’s project for getting hands-on experience with classification as you train your model to predict the species of a new iris flower.
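A nearest-neighbor classifier is one of the simplest ways to try this out. The sketch below assumes each flower is described by four numeric measurements in centimeters (sepal length, sepal width, petal length, petal width) and hard-codes only a handful of illustrative rows. In a real project you would load all 150 instances and hold some out for testing.

```python
import math
from typing import List, Sequence, Tuple

Row = Tuple[Tuple[float, float, float, float], str]

# A few illustrative measurements per species, hard-coded for the sketch:
# (sepal length, sepal width, petal length, petal width) in cm.
TRAIN: List[Row] = [
    ((5.1, 3.5, 1.4, 0.2), "setosa"),
    ((4.9, 3.0, 1.4, 0.2), "setosa"),
    ((7.0, 3.2, 4.7, 1.4), "versicolor"),
    ((6.4, 3.2, 4.5, 1.5), "versicolor"),
    ((6.3, 3.3, 6.0, 2.5), "virginica"),
    ((5.8, 2.7, 5.1, 1.9), "virginica"),
]


def predict_species(sample: Sequence[float],
                    train: List[Row] = TRAIN) -> str:
    """Return the species of the closest training flower (1-NN)."""
    nearest = min(train, key=lambda row: math.dist(sample, row[0]))
    return nearest[1]


print(predict_species((5.0, 3.4, 1.5, 0.2)))
print(predict_species((6.5, 3.0, 5.8, 2.2)))
```

Note that `math.dist` requires Python 3.8 or newer. With the full dataset, comparing this baseline against other classifiers is a good next exercise.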
You can download the dataset here:
12. Family photo face detection
Grab your family album to collect original data and build a face recognition model to identify your family members in the photos.
You can label your data using a free annotation tool and train your model in less than an hour. This task is a multi-stage process consisting of face detection, alignment, feature extraction, and feature recognition.
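The final recognition stage often comes down to comparing feature vectors. As a minimal sketch, suppose an earlier stage has already turned each detected face into a numeric embedding. You can then match a new face against one reference embedding per family member using cosine similarity. The three-dimensional embeddings, the names, and the 0.8 threshold are made up for illustration, since real embeddings typically have many more dimensions.

```python
import math
from typing import Dict, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify(embedding: Sequence[float],
             gallery: Dict[str, Sequence[float]],
             threshold: float = 0.8) -> str:
    """Return the best-matching name, or "unknown" if nothing is close."""
    best_name, best_score = "unknown", threshold
    for name, reference in gallery.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical reference embeddings, one per family member.
gallery = {
    "mom": [0.9, 0.1, 0.0],
    "dad": [0.1, 0.9, 0.1],
}
print(identify([0.8, 0.2, 0.0], gallery))
print(identify([0.0, 0.0, 1.0], gallery))
```

The threshold controls the trade-off between mislabeling a stranger as a family member and failing to recognize someone in an unusual pose or lighting.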
To make your project more interesting and your model more accurate, consider using video data, too. If you can’t obtain data on your own, check out these datasets to get started with facial recognition projects:
13. LEGO Brick Finder
If you’ve ever spent hours building LEGO in your childhood, this project could be a perfect way to get you hooked on computer vision.
In its simplest form, you can build a model to detect and identify LEGO bricks in real time using your webcam or your phone camera. All you need is a large set of training data and a tool to train your model.
Here are the datasets for you:
14. PPE Detection
The goal of this computer vision project is to build a model that identifies items of personal protective equipment (PPE), such as face masks. You can complete it in a couple of hours and test it by wearing a face mask in front of your webcam.
PPE detection models find application in industries such as construction or healthcare (hospitals).
Check out these datasets to get started:
15. Face mask detection
As with PPE detection, you can build a simple face mask detection model to distinguish people who are wearing a mask in public from those who are not.
Remember to collect large amounts of data to ensure the model’s accuracy in handling varying kinds of occlusions.
Check out this dataset to get started:
16. Traffic light detection
Finally, consider spending some time training a traffic light detector. This project is relatively easy to complete because of the availability of data and research that you can access for free.
Traffic light detection finds applications in intelligent transportation, including popular use cases such as autonomous cars and smart cities.
Here are a few datasets you can use: