Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Differential Privacy for Heterogeneous Federated Learning : Utility & Privacy tradeoffs

    Differential Privacy for Heterogeneous Federated Learning : Utility & Privacy tradeoffs In this work, we propose an algorithm DP-SCAFFOLD(-warm), whic

    19 Nov 10, 2022
    ⚡ H2G-Net for Semantic Segmentation of Histopathological Images

    H2G-Net This repository contains the code relevant for the proposed design H2G-Net, which was introduced in the manuscript "Hybrid guiding: A multi-re

    André Pedersen 8 Nov 24, 2022
    Boostcamp CV Serving For Python

    Boostcamp-CV-Serving Prerequisites MySQL GCP Cloud Storage GCP key file Sentry Streamlit Cloud Secrets: .streamlit/secrets.toml #DO NOT SHARE THIS I

    Jungwon Seo 19 Feb 22, 2022
    Libraries, tools and tasks created and used at DeepMind Robotics.

    Libraries, tools and tasks created and used at DeepMind Robotics.

    DeepMind 270 Nov 30, 2022
    Human motion synthesis using Unity3D

    Human motion synthesis using Unity3D Prerequisite: Software: amc2bvh.exe, Unity 2017, Blender. Unity: RockVR (Video Capture), scenes, character models

    Hao Xu 9 Jun 01, 2022
    《Improving Unsupervised Image Clustering With Robust Learning》(2020)

    Improving Unsupervised Image Clustering With Robust Learning This repo is the PyTorch codes for "Improving Unsupervised Image Clustering With Robust L

    Sungwon Park 129 Dec 27, 2022
    Robust Lane Detection via Expanded Self Attention (WACV 2022)

    Robust Lane Detection via Expanded Self Attention (WACV 2022) Minhyeok Lee, Junhyeop Lee, Dogyoon Lee, Woojin Kim, Sangwon Hwang, Sangyoun Lee Overvie

    Min Hyeok Lee 18 Nov 12, 2022
    TianyuQi 10 Dec 11, 2022
    This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

    Trivial Augment This is the official implementation of TrivialAugment (https://arxiv.org/abs/2103.10158), as was used for the paper. TrivialAugment is

    AutoML-Freiburg-Hannover 94 Dec 30, 2022
    Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images

    Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images [ICCV 2021] © Mahmood Lab - This code is made avail

    Mahmood Lab @ Harvard/BWH 63 Dec 01, 2022
    Easily pull telemetry data and create beautiful visualizations for analysis.

    This repository is a work in progress. Anything and everything is subject to change. Porpo Table of Contents Porpo Table of Contents General Informati

    Ryan Dawes 33 Nov 30, 2022
    STEAL - Learning Semantic Boundaries from Noisy Annotations (CVPR 2019)

    STEAL This is the official inference code for: Devil Is in the Edges: Learning Semantic Boundaries from Noisy Annotations David Acuna, Amlan Kar, Sanj

    469 Dec 26, 2022
    Personalized Federated Learning using Pytorch (pFedMe)

    Personalized Federated Learning with Moreau Envelopes (NeurIPS 2020) This repository implements all experiments in the paper Personalized Federated Le

    Charlie Dinh 226 Dec 30, 2022
    A font family with a great monospaced variant for programmers.

    Fantasque Sans Mono A programming font, designed with functionality in mind, and with some wibbly-wobbly handwriting-like fuzziness that makes it unas

    Jany Belluz 6.3k Jan 08, 2023
    A Deep Learning Framework for Neural Derivative Hedging

    NNHedge NNHedge is a PyTorch based framework for Neural Derivative Hedging. The following repository was implemented to ease the experiments of our pa

    GUIJIN SON 17 Nov 14, 2022
    This repo is customed for VisDrone.

    Object Detection for VisDrone(无人机航拍图像目标检测) My environment 1、Windows10 (Linux available) 2、tensorflow = 1.12.0 3、python3.6 (anaconda) 4、cv2 5、ensemble

    53 Jul 17, 2022
    EMNLP 2021 Findings' paper, SCICAP: Generating Captions for Scientific Figures

    SCICAP: Scientific Figures Dataset This is the Github repo of the EMNLP 2021 Findings' paper, SCICAP: Generating Captions for Scientific Figures (Hsu

    Edward 26 Nov 21, 2022
    Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

    Keyword Spotting Transformer This is the unofficial TensorFlow implementation of the Keyword Spotting Transformer model. This model is used to train o

    Intelligent Machines Limited 8 May 11, 2022
    Code for "Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation" ICCV'21

    Skeletal-GNN Code for "Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation" ICCV'21 Various deep learning techniques have been propose

    37 Oct 23, 2022
    This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

    AS-MLP architecture for Image Classification Model Zoo Image Classification on ImageNet-1K Network Resolution Top-1 (%) Params FLOPs Throughput (image

    SVIP Lab 106 Dec 12, 2022