MITB Banner

Top PyTorch-Based Projects To Try Out In 2021

Share

Facebook released its open-source machine learning library PyTorch in 2016. Developers and researchers were immediately hooked, and PyTorch’s popularity soared. Organisations such as Microsoft and Tesla also use Facebook’s brainchild to drive innovation and solve business challenges.

Below, we list some PyTorch-based projects for all the enthusiasts out there.

CycelGAN for Image-To-Image Translation

Cycle Generative Adversarial Network or CycleGAN is a technique for automatic training of image-to-image translation models without using paired examples. 

CycleGAN is made of two kinds of networks–discriminators and generators. While the discriminator classifies images as real or fake, generators create convincing fake images for both types of images. Unlike conventional methods that require generating synthetic image datasets of the given version with a specific modification, CycleGan is easy and inexpensive.

GitHub link.

OneNet

OneNet is an end-to-end fully convolutional one-stage object detector which eliminates the requirement for techniques such as non-maximal suppression. Instead, it introduces new techniques such as minimum cost assignment.

Its code is based on Detectron2 and DETR, and the code requirements are Python-3.6+, Pytorch-1.5+, torchvision. It offers advantages such as:

  • End-to-end training
  • No RoI operations
  • Minimum cost of classification based label assignment as opposed to complex bipartite-matching.

Full paper link.

GitHub link.

PyText

It is a natural language processing framework based on PyTorch for large scale deployment. PyText is a library built on PyTorch and open-sourced by parent company Facebook in 2018. The previous frameworks suffered from latency and memory problems in production. PyText provides a unified framework from research to production, thereby ensuring a simpler workflow with faster implementation.

Facebook has already deployed PyText in their video calling device portal, M suggestions of Facebook Messenger, DeepText, and Conversational AI.

GitHub link.

ArtLine

ArtLine uses deep learning algorithms to produce quality line art portraits. Built using the APDrawing dataset and Anime line art pair, this project generates better high-quality images than the existing methods using PyTorch and Fastai libraries. However, this project’s limitation is that it needs smooth or plain backgrounds with good lighting to output quality results.

GitHub link.

Human Pose Estimation and Tracking

This project is based on the 2018 research work titled ‘Simple Baselines for Human Pose Estimation and Tracking’. The research offers simple and effective baseline methods for evaluating new ideas in the field of pose tracking and estimation. 

Post estimation in the experiment is based on deconvolutional layers added on the ResNet (Residual Network), a kind of artificial neural network. In terms of pose estimation, the project achieved an improvement of 0.7 percent in the mean average precision (mAP) on Common Objects in Context (COCO) dataset compared to its predecessor. For Pose tracking, this experiment shows an improvement of 51.8 percent on mAP score on the previous best.

The project is carried out using Python 3.6 on Ubuntu 16.04. The code is developed and tested using four NVIDIA P100 GPU cards.

Link to the GitHub project.

Find the complete paper here.

Automatic Speech Recognition System

This open-source project is based on the research work titled, ‘Adversarial Training of End-to-end Speech Recognition Using a Criticising Language Model’. Implemented mostly with Pytorch, this end-to-end ASR is based on listen, attend, and spell model — a neural network that can transcribe speech utterances to text characters.

GitHub link.

Full paper link

Multi-Class Text Classification

This project demonstrates how multi-class classification can be done using TorchText — a natural language processing library in PyTorch with data processing utilities and popular datasets.

The model is composed of EmbeddingBag layers, which deals with the text entries of varying length by calculating the mean value of embedding ‘bags’. Besides, the model also has a linear layer. This model is trained on DBpedia data which consists of 14 classes. It has 630,000 text instances, 560,000 training instances, and 70,000 test instances.Click here for further details.

PS: The story was written using a keyboard.
Share
Picture of Shraddha Goled

Shraddha Goled

I am a technology journalist with AIM. I write stories focused on the AI landscape in India and around the world with a special interest in analysing its long term impact on individuals and societies. Reach out to me at shraddha.goled@analyticsindiamag.com.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India