Computer Vision Projects
Filter Projects (show all): [Computer Vision] [Deep Learning] [Machine Learning] [Software Development]
Kaggle Right Whale Recognition | ||||||
Convolutional Neural Network model to recognize individual Right Whales | ||||||
We use several different methods to recognize individual Right Whales: fine-tuning a pre-trained CNN model, and reusing those models as feature extractors to fed into an SVM. | ||||||
Toolstack: C++, Python, Caffe | ||||||
Computer Vision
Deep Learning
Machine Learning
Image Classification
Fine-Grained Recognition
|
||||||
Bird and Squirrel Alert System | ||||||
System to recognize birds and squirrels on a bird-feeder from video | ||||||
We developed a system to automatically recognize and locate birds and squirrels on a bird-feeder in real-time from continous video input. We use optical flow, motion detection, and tracking. | ||||||
Toolstack: C++, OpenCV | ||||||
Computer Vision
Object Recognition
Object Localization
|
||||||
Video |
Human Recognition in Infrared Images | ||||||
Convolutional Neural Network model to recognize humans in infrared spectrum | ||||||
We manually collect human and non-human infrared images using Seek Thermal imaging camera and manually label/annotate them. We fine-tune pre-trained CNN models and use them as feature extractors to train SVMs. | ||||||
Toolstack: C++, Python, Caffe | ||||||
Computer Vision
Deep Learning
Machine Learning
Image Classification
|
First-person Scene Classification | ||||||
A model to recognize place, activities, and objects in first-person images | ||||||
We manually collect thousands of first-person photographs using Narrative Clip wearable camera and then we create a hierarchical structure of 17 classes in 3 main categories: places, activities and objects. We do multi-label classification using Caffe and scikit-learn's OneVsRestClassifier. | ||||||
Toolstack: C++, Python, Caffe, scikit-learn | ||||||
Computer Vision
Deep Learning
Machine Learning
Image Classification
Egocentric Vision
|
Optical Music Recognition | ||||||
A system for recognizing musical notations from sheet music | ||||||
Implemented a system to recognize musical notations by using template matching, music notation boundary detection, and staves detection using Hough transform. | ||||||
Toolstack: C++, CImg | ||||||
Computer Vision
Image Processing
Machine Learning
Object Recognition
|