SRA's seminar on Introduction to Computer Vision Fundamentals

Overview

Introduction to Computer Vision

This repository includes basics to :

  1. Python
  2. Numpy: A python library
  3. Git
  4. Computer Vision.

The aim of this repository is to provide:

  • A brief idea of algorithms involved in Computer Vision .
  • Introduction to Version Control System: Git and GitHub.
  • Computer Vision and Image Processing basics, idea of implementation of various algorithms involved using numpy (instead of any dedicated image processing library like OpenCV.)
  • Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Table Of Contents

Comments
  • Add suboptimal 2D convolution

    Add suboptimal 2D convolution

    This pull request intends to add a suboptimal implementation of generic 2D convolution. This is done for the purpose of giving a rough idea to Fys about how to work with python arrays/loops, etc. Fys will be asked to improve this implementation and complete tasks related to convolution on top of it.

    opened by meshtag 5
  • Morphology notes updated.

    Morphology notes updated.

    I have added images for dilation and erosion, replaced the previous gif of dilation and erosion with new ones and added a few lines explaining morphology.

    opened by Aryaman22102002 2
  • Updated cv-basics/

    Updated cv-basics/

    Optimised code and flow as discussed in:

    1. cv-basics/5_opencv_overview.ipynb
    2. python-numpy-basics/7_classes_and_objects.ipynb

    Added an image :

    1. cv-basics/image/bcci.png
    opened by dhairyashah1 1
  • Port to C++ : Assignments related to PIXELS seminar

    Port to C++ : Assignments related to PIXELS seminar

    Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

    Describe the solution you'd like

    • Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.
    • You might chose to add reference links in individual questions, which may provide additional material on a related topic for that question (this is suggested solely for the purpose of providing more (potentially real world) info related to the topic asked in original question and hence, should not in any way lead to the solution).
    enhancement 
    opened by meshtag 0
  • Add Content: Interpolations.

    Add Content: Interpolations.

    Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

    Describe the solution you'd like

    • Create a implementations of interpolation from scratch using necessary OpenCV C++ API.
    • Add a Makefile to compile and build executables.
    • Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

    Additional context Reference: Ancient Secrets of computer vision.

    Note: Content is not finalised and open for discussion

    enhancement 
    opened by amanchhaparia 0
  • Add Content: Image Storing Formats.

    Add Content: Image Storing Formats.

    Is your feature request related to a problem? Please describe. As discussed in the thread, It is important to have a familiarity of how images are store.

    Describe the solution you'd like

    • Add the theory of basic image storing formats such as .bmp, .tiff, .jpg, png etc.
    • Implement a .cpp file on how image can be read from the bmp format.
      • Consider only 8 bit grayscale BitMap image (Since they are easy to read and contains only 2D form of data).
      • Use simple posix read() api to read the image bitmap file.
      • Directly storing the values of various attributes of image in struct is suggested.
    • A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.
    • Add a Makefile to compile and build the executable.
    • Add a .md file explaining the theory and instructions to build and run the executables.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 2
  • Add Content: Build Systems

    Add Content: Build Systems

    Is your feature request related to a problem? Please describe. As discussed in the thread, Concepts of Build System should be added.

    Describe the solution you'd like

    • Content should be added for manual creating and linking the object files.
    • Importance of build systems.
    • Add the contents for Makefile.
    • Add contents for Cmake.

    Additional context Can refer from here: Embedded Study Group Week 2.

    Note: Content is not finalised and open for discussion.

    enhancement Build-Systems 
    opened by amanchhaparia 0
  • Add Content: C++ basic concepts for seminar.

    Add Content: C++ basic concepts for seminar.

    Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

    Describe the solution you'd like

    • Some advance concepts of C++ like handling 2D arrays/vector, pointer etc.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 1
Releases(v1.0)
  • v1.0(Sep 7, 2022)

    This release contains the 1st version of the PIXELS Seminar conducted in 2021. The content of this release is implemented in Python and uses numpy and OpenCV Python API.

    • This release can be used as a reference to basic Image Processing using Python.
    • Contains a tutorial for necessary numpy methods.
    • Tutorials on commonly used OpenCV functions in Python.
    • Implementation of blob detection a very commonly used algorithm in Python.
    Source code(tar.gz)
    Source code(zip)
Owner
Society of Robotics and Automation
The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation .
Society of Robotics and Automation
Perspective recovery of text using transformed ellipses

unproject_text Perspective recovery of text using transformed ellipses. See full writeup at https://mzucker.github.io/2016/10/11/unprojecting-text-wit

Matt Zucker 111 Nov 13, 2022
Augmenting Anchors by the Detector Itself

Augmenting Anchors by the Detector Itself Introduction It is difficult to determine the scale and aspect ratio of anchors for anchor-based object dete

4 Nov 06, 2022
Zoom , GoogleMeets에서 Vtuber 데뷔하기

EasyVtuber Facial landmark와 GAN을 이용한 Character Face Generation Google Meets, Zoom 등에서 자신만의 웹툰, 만화 캐릭터로 대화해보세요! 악세사리는 어느정도 추가해도 잘 작동해요! 안타깝게도 RTX 2070

Gunwoo Han 140 Dec 23, 2022
Image processing using OpenCv

Image processing using OpenCv Write a program that opens the webcam, and the user selects one of the following on the video: ✅ If the user presses the

M.Najafi 4 Feb 18, 2022
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

hocr-tools About About the code Installation System-wide with pip System-wide from source virtualenv Available Programs hocr-check -- check the hOCR f

OCRopus 285 Dec 08, 2022
Corner-based Region Proposal Network

Corner-based Region Proposal Network CRPN is a two-stage detection framework for multi-oriented scene text. It employs corners to estimate the possibl

xhzdeng 140 Nov 04, 2022
Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Streaming speaker diarization Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation by Juan Manuel Coria, Hervé

Juanma Coria 185 Jan 01, 2023
"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

ID Verification by LibraX.ai This is the first free Identity verification in the market. LibraX.ai is an identity verification platform for developers

LibraX.ai 46 Dec 06, 2022
kaldi-asr/kaldi is the official location of the Kaldi project.

Kaldi Speech Recognition Toolkit To build the toolkit: see ./INSTALL. These instructions are valid for UNIX systems including various flavors of Linux

Kaldi 12.3k Jan 05, 2023
learn how to use Gesture Control to change the volume of a computer

Volume-Control-using-gesture In this project we are going to learn how to use Gesture Control to change the volume of a computer. We first look into h

Diwas Pandey 49 Sep 22, 2022
A python screen recorder for low-end computers, provides high quality video output.

RecorderX - v1.0 A screen recorder made in Python with the help of OpenCv, it has ability to record your screen in high quality. No matter what your P

Priyanshu Jindal 4 Nov 10, 2021
An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

InceptText-Tensorflow An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Orien

GeorgeJoe 115 Dec 12, 2022
Computer vision applications project (Flask and OpenCV)

Computer Vision Applications Project This project is at it's initial phase. This is all about the implementation of different computer vision techniqu

Suryam Thapa 1 Jan 26, 2022
Geometric Augmentation for Text Image

Text Image Augmentation A general geometric augmentation tool for text images in the CVPR 2020 paper "Learn to Augment: Joint Data Augmentation and Ne

Canjie Luo 440 Jan 05, 2023
Balabobapy - Using artificial intelligence algorithms to continue the text

Balabobapy - Using artificial intelligence algorithms to continue the text

qxtony 1 Feb 04, 2022
Text modding tools for FF7R (Final Fantasy VII Remake)

FF7R_text_mod_tools Subtitle modding tools for FF7R (Final Fantasy VII Remake) There are 3 tools I made. make_dualsub_mod.exe: Merges (or swaps) subti

10 Dec 19, 2022
Document manipulation detection with python

image manipulation detection task: -- tianchi function image segmentation salie

JiaKui Hu 3 Aug 22, 2022
Controlling Volume by Hand Gestures

This program allows the user to control the volume of their device with specific hand gestures involving their thumb and index finger!

Riddhi Bajaj 1 Nov 11, 2021
基于Paddle框架的PSENet复现

PSENet-Paddle 基于Paddle框架的PSENet复现 本项目基于paddlepaddle框架复现PSENet,并参加百度第三届论文复现赛,将在2021年5月15日比赛完后提供AIStudio链接~敬请期待 AIStudio链接 参考项目: whai362-PSENet 环境配置 本项目

QuanHao Guo 4 Apr 24, 2022
Open Source Computer Vision Library

OpenCV: Open Source Computer Vision Library Resources Homepage: https://opencv.org Courses: https://opencv.org/courses Docs: https://docs.opencv.org/m

OpenCV 65.7k Jan 03, 2023