Last updated February 1, 2021
In AI Mysteries

Top PyTorch-Based Projects To Try Out In 2021

Published on February 1, 2021

by Shraddha Goled

Facebook released its open-source machine learning library PyTorch in 2016. Developers and researchers were immediately hooked, and PyTorch’s popularity soared. Organisations such as Microsoft and Tesla also use Facebook’s brainchild to drive innovation and solve business challenges.

Below, we list some PyTorch-based projects for all the enthusiasts out there.

CycelGAN for Image-To-Image Translation

Cycle Generative Adversarial Network or CycleGAN is a technique for automatic training of image-to-image translation models without using paired examples.

CycleGAN is made of two kinds of networks–discriminators and generators. While the discriminator classifies images as real or fake, generators create convincing fake images for both types of images. Unlike conventional methods that require generating synthetic image datasets of the given version with a specific modification, CycleGan is easy and inexpensive.

GitHub link.

OneNet

OneNet is an end-to-end fully convolutional one-stage object detector which eliminates the requirement for techniques such as non-maximal suppression. Instead, it introduces new techniques such as minimum cost assignment.

Its code is based on Detectron2 and DETR, and the code requirements are Python-3.6+, Pytorch-1.5+, torchvision. It offers advantages such as:

End-to-end training
No RoI operations
Minimum cost of classification based label assignment as opposed to complex bipartite-matching.

Full paper link.

GitHub link.

PyText

It is a natural language processing framework based on PyTorch for large scale deployment. PyText is a library built on PyTorch and open-sourced by parent company Facebook in 2018. The previous frameworks suffered from latency and memory problems in production. PyText provides a unified framework from research to production, thereby ensuring a simpler workflow with faster implementation.

Facebook has already deployed PyText in their video calling device portal, M suggestions of Facebook Messenger, DeepText, and Conversational AI.

GitHub link.

ArtLine

ArtLine uses deep learning algorithms to produce quality line art portraits. Built using the APDrawing dataset and Anime line art pair, this project generates better high-quality images than the existing methods using PyTorch and Fastai libraries. However, this project’s limitation is that it needs smooth or plain backgrounds with good lighting to output quality results.

GitHub link.

Human Pose Estimation and Tracking

This project is based on the 2018 research work titled ‘Simple Baselines for Human Pose Estimation and Tracking’. The research offers simple and effective baseline methods for evaluating new ideas in the field of pose tracking and estimation.

Post estimation in the experiment is based on deconvolutional layers added on the ResNet (Residual Network), a kind of artificial neural network. In terms of pose estimation, the project achieved an improvement of 0.7 percent in the mean average precision (mAP) on Common Objects in Context (COCO) dataset compared to its predecessor. For Pose tracking, this experiment shows an improvement of 51.8 percent on mAP score on the previous best.

The project is carried out using Python 3.6 on Ubuntu 16.04. The code is developed and tested using four NVIDIA P100 GPU cards.

Link to the GitHub project.

Find the complete paper here.

Automatic Speech Recognition System

This open-source project is based on the research work titled, ‘Adversarial Training of End-to-end Speech Recognition Using a Criticising Language Model’. Implemented mostly with Pytorch, this end-to-end ASR is based on listen, attend, and spell model — a neural network that can transcribe speech utterances to text characters.

GitHub link.

Full paper link

Multi-Class Text Classification

This project demonstrates how multi-class classification can be done using TorchText — a natural language processing library in PyTorch with data processing utilities and popular datasets.

The model is composed of EmbeddingBag layers, which deals with the text entries of varying length by calculating the mean value of embedding ‘bags’. Besides, the model also has a linear layer. This model is trained on DBpedia data which consists of 14 classes. It has 630,000 text instances, 560,000 training instances, and 70,000 test instances.Click here for further details.

PS: The story was written using a keyboard.

Access all our open Survey & Awards Nomination forms in one place

Shraddha Goled

I am a technology journalist with AIM. I write stories focused on the AI landscape in India and around the world with a special interest in analysing its long term impact on individuals and societies. Reach out to me at shraddha.goled@analyticsindiamag.com.

PyTorch Releases torchtune for Easily Fine-Tuning LLMs

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

The Impact of Lok Sabha Election on India’s AI Progress

Vidyashree Srinivas

The BJP aims to safeguard citizen safety and privacy, leaning towards regulation, while the Congress views AI advancements as an opportunity to create jobs.