📚 Transformers in NLP: RoBERTa and XLNet

🎯 Business Objective

In Part 1 of our Transformer series, Multi-Class Text Classification with Deep Learning using BERT, we traced the evolution of NLP models from simpler text representations like Bag of Words (BOW) and TF-IDF to Transformer architectures like BERT.

In Part 2, we dive into two models that improve on BERT through different pretraining and optimization strategies:

  • RoBERTa: A Robustly Optimized BERT Pretraining Approach
  • XLNet: Generalized Autoregressive Pretraining for Language Understanding

We'll analyze these models, explore their training methods, and use them to classify human emotions from text data.

📄 Data Description

We use the Emotion dataset from the Hugging Face Hub, which consists of English Twitter messages labeled with six basic emotions: anger, fear, joy, love, sadness, and surprise (a loading sketch follows the label list below).

Dataset Breakdown:

  • Train: 16,000 rows
  • Validation: 2,000 rows
  • Test: 2,000 rows

Labels:

  • 0: sadness
  • 1: joy
  • 2: love
  • 3: anger
  • 4: fear
  • 5: surprise
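
A minimal sketch of loading and inspecting the dataset with the Hugging Face `datasets` library; the split sizes and label order above come from the dataset card, and `emotion` is the dataset's Hub identifier:

```python
from datasets import load_dataset

# Download the Emotion dataset from the Hugging Face Hub
emotion = load_dataset("emotion")

print(emotion)              # DatasetDict with train (16,000), validation (2,000), test (2,000)
print(emotion["train"][0])  # e.g. {'text': '...', 'label': 0}

# Integer labels map to names in the order listed above
print(emotion["train"].features["label"].names)
# ['sadness', 'joy', 'love', 'anger', 'fear', 'surprise']
```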

🛠️ Tech Stack

  • Language: Python
  • Libraries: datasets, numpy, pandas, matplotlib, seaborn, ktrain, transformers, tensorflow, sklearn
  • Environment: Jupyter Notebook, Google Colab Pro (Recommended)

🚀 Approach

  1. Install Libraries: Ensure all necessary libraries are installed.
  2. Load Dataset: Load and explore the Emotion dataset.
  3. Data Preprocessing: Convert the dataset splits to DataFrames and create additional features (e.g., message length).
  4. Data Visualization: Use histograms to inspect the length and label distributions (steps 3–4 are sketched after this list).
  5. Model Training:
    • RoBERTa:
      • Create and configure the model.
      • Preprocess the data, compile the model, and find a good learning rate.
      • Fine-tune the model and evaluate its performance.
      • Save and test the model.
    • XLNet:
      • The same workflow as RoBERTa, plus a look at how autoregressive (XLNet) pretraining differs from autoencoder (BERT-style) pretraining (see the XLNet sketch below).
  6. Performance Evaluation: Evaluate both models on the test split and compare their metrics (comparison sketch below).
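
A sketch of steps 3 and 4: converting the splits to pandas DataFrames, adding a simple length feature, and plotting distributions. The `text` and `label` columns come from the dataset; the `num_words` feature and plot choices are illustrative:

```python
import matplotlib.pyplot as plt
import seaborn as sns

# Convert each split to a DataFrame (reuses `emotion` from the loading sketch)
train_df = emotion["train"].to_pandas()
val_df = emotion["validation"].to_pandas()

# Additional feature: message length in words
train_df["num_words"] = train_df["text"].str.split().str.len()

# Histograms: message length and label distribution
fig, axes = plt.subplots(1, 2, figsize=(12, 4))
sns.histplot(train_df["num_words"], bins=30, ax=axes[0])
axes[0].set_title("Words per tweet (train)")
sns.countplot(x="label", data=train_df, ax=axes[1])
axes[1].set_title("Label distribution (train)")
plt.tight_layout()
plt.show()
```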
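One way step 5 might look for RoBERTa with ktrain's `text.Transformer` wrapper; the checkpoint name, sequence length, batch size, learning rate, and epoch count here are illustrative choices, not necessarily the project's exact settings:

```python
import ktrain
from ktrain import text

class_names = ["sadness", "joy", "love", "anger", "fear", "surprise"]

# Wrap a pretrained RoBERTa checkpoint for sequence classification
t = text.Transformer("roberta-base", maxlen=64, class_names=class_names)

trn = t.preprocess_train(train_df["text"].values, train_df["label"].values)
val = t.preprocess_test(val_df["text"].values, val_df["label"].values)

model = t.get_classifier()
learner = ktrain.get_learner(model, train_data=trn, val_data=val, batch_size=32)

# Sweep learning rates, then fine-tune with the one-cycle policy
learner.lr_find(show_plot=True, max_epochs=2)
learner.fit_onecycle(3e-5, 2)

# Save a predictor so the fine-tuned model can be reloaded for inference
predictor = ktrain.get_predictor(learner.model, preproc=t)
predictor.save("output/roberta_emotion")
print(predictor.predict("i feel like dancing all night"))  # e.g. 'joy'
```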
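XLNet reuses the same workflow; conceptually it differs from BERT and RoBERTa by using permutation-based autoregressive pretraining instead of masked-token (autoencoder) pretraining, so in the fine-tuning code only the checkpoint name changes. Again a sketch with illustrative hyperparameters:

```python
# Same ktrain pipeline, different checkpoint
t_xlnet = text.Transformer("xlnet-base-cased", maxlen=64, class_names=class_names)

trn_x = t_xlnet.preprocess_train(train_df["text"].values, train_df["label"].values)
val_x = t_xlnet.preprocess_test(val_df["text"].values, val_df["label"].values)

learner_xlnet = ktrain.get_learner(
    t_xlnet.get_classifier(), train_data=trn_x, val_data=val_x, batch_size=32
)
learner_xlnet.fit_onecycle(3e-5, 2)
```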
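For step 6, ktrain's `validate` prints a per-class classification report and returns a confusion matrix; running it on the held-out test split for each model gives the numbers to compare side by side. A sketch, assuming the objects from the two training sketches above:

```python
# Preprocess the test split with each model's own preprocessor, then evaluate
test_df = emotion["test"].to_pandas()

tst_roberta = t.preprocess_test(test_df["text"].values, test_df["label"].values)
learner.validate(val_data=tst_roberta, class_names=class_names)

tst_xlnet = t_xlnet.preprocess_test(test_df["text"].values, test_df["label"].values)
learner_xlnet.validate(val_data=tst_xlnet, class_names=class_names)
```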

📂 Project Structure

Modular Code

  • src: Contains modularized code for the entire project.
    • Engine.py: Main script to run the project.
    • ML_Pipeline: Folder with functions for data processing and model training.
  • output: Contains trained models for easy loading and reuse.
  • lib: Contains Jupyter notebooks and reference materials.

📝 Project Takeaways

  1. Understand business problems in NLP.
  2. Explore Transformer architectures and self-attention mechanisms.
  3. Gain insights into RoBERTa and XLNet models.
  4. Learn data preprocessing and visualization techniques.
  5. Develop and fine-tune Transformer models.
  6. Compare and evaluate model performances.

📦 Setup Instructions

Prerequisites

  • Ensure git and Python 3 are installed on your machine.

Installation

  1. Clone the repo

    git clone https://github.com/Vidhi1290/Text-Classification-with-Transformers-RoBERTa-and-XLNet-Model.git
  2. Navigate to the project directory

    cd Text-Classification-with-Transformers-RoBERTa-and-XLNet-Model
  3. Install dependencies

    pip install -r modular_code/requirements.txt
  4. Run the project

    python modular_code/src/Engine.py


Feel free to reach out for any questions or collaboration opportunities!