Machine-Transliteration

This is CS6910 course assignment-3 at IIT Madras. Here you will find detailed information about the assignment. In this assignment, I created a Recurrent Neural Network to transliterate a word from English to Bengali (how we type while chatting with our friends on WhatsApp etc). Different types of cells such as vanilla RNN, LSTM, GRU have been implemented to improve the accuracy of the model. Along with that, an attention network has been added in the model to further increase the accuracy of the model.

The model has been trained by using Aksharantar dataset released by AI4Bharat.

Here is the detailed wandb report for this assignment.

Dependencies

In this assignment, I used these libraies:

import torch
import random
import pandas as pd
import torch.nn as nn
from torch import optim
from torch.utils.data import TensorDataset
from torch.utils.data import DataLoader
from torch.nn.utils.rnn import pad_sequence
import argparse

If you don't have these packages installed then use this command to install:

pip install pytorch
pip install pandas

Supported Command Line Arguments

If you want to train and use this model for your dataset then download the train.py file. Hyperparameters of the model, train and test dataset path can be mentioned using command line arguments. Here are the supported command line arguments:

Argument	Short Option	Type	Default	Description
`--train_dataset_path`	`-trainPath`	`str`	`aksharantar_sampled/ben/ben_train.csv`	Path to the training dataset
`--validation_dataset_path`	`-vaildPath`	`str`	`aksharantar_sampled/ben/ben_valid.csv`	Path to the validation dataset
`--test_dataset_path`	`-testPath`	`str`	`aksharantar_sampled/ben/ben_test.csv`	Path to the test dataset
`--epochs`	`-ep`	`int`	`15`	Number of epochs
`--batch_size`	`-bs`	`int`	`32`	Batch size
`--cell_type`	`-ct`	`str`	`LSTM`	Type of RNN cell (e.g., LSTM, GRU)
`--embedding_size`	`-es`	`int`	`128`	Size of the embeddings
`--hidden_size`	`-hs`	`int`	`256`	Size of the hidden layers
`--encoder_layer`	`-el`	`int`	`3`	Number of encoder layers
`--decoder_layer`	`-dl`	`int`	`3`	Number of decoder layers
`--dropout`	`-dp`	`float`	`0.2`	Dropout rate
`--bidirectional`	`-bd`	`str`	`Yes`	Use bidirectional RNN (Yes/No)
`--attention`	`-atn`	`str`	`Yes`	Use attention mechanism (Yes/No)

rupak-paul / cs6910-assignment-3 Goto Github PK

cs6910-assignment-3's Introduction

Machine-Transliteration

Dependencies

Supported Command Line Arguments

cs6910-assignment-3's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent