How to train binary classification #4

Open
ps3-app opened this issue Jan 20, 2021 · 2 comments

Comments

ps3-app commented Jan 20, 2021

I'm using sentiment analysis with BERT, but it is multiclass classification. How do I change it to binary text classification?

@kforcodeai

Same as multiclass classification, with a few modifications (see the sketch below):

  1. Set n_classes = 2 in the last layer: self.out = nn.Linear(self.bert.config.hidden_size, n_classes). This is actually handled automatically by the multiclass code itself via model = SentimentClassifier(len(class_names)).
  2. Replace the softmax with a sigmoid here: F.softmax(model(input_ids, attention_mask), dim=1).
  3. Change the loss function to binary cross-entropy, i.e. nn.BCELoss(), instead of loss_fn = nn.CrossEntropyLoss().to(device).
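
A minimal sketch of those three changes put together (not from the original comment; the class below is a hypothetical variant of the tutorial-style SentimentClassifier, and it uses a single output logit so that the sigmoid and nn.BCELoss line up):

import torch
import torch.nn as nn
from transformers import BertModel

class BinarySentimentClassifier(nn.Module):
    def __init__(self, pretrained_model_name='bert-base-cased'):
        super().__init__()
        self.bert = BertModel.from_pretrained(pretrained_model_name)
        self.drop = nn.Dropout(p=0.3)
        # Single output unit instead of len(class_names) outputs
        self.out = nn.Linear(self.bert.config.hidden_size, 1)

    def forward(self, input_ids, attention_mask):
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        return self.out(self.drop(outputs.pooler_output))

model = BinarySentimentClassifier()
loss_fn = nn.BCELoss()  # instead of nn.CrossEntropyLoss()

# probs = torch.sigmoid(model(input_ids, attention_mask))  # instead of F.softmax(..., dim=1)
# loss = loss_fn(probs.view(-1), targets.float())           # targets are 0./1. floats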

@Siddharth-Latthe-07

Here are some steps that might help you achieve the goal:

  1. Adjust the output layer:
    Change the output layer to predict a single value representing the sentiment score.
    Use a sigmoid activation (applied outside the model at prediction time when using BCEWithLogitsLoss) instead of a softmax for binary classification.
  2. Loss function:
    Switch from the multiclass cross-entropy loss (nn.CrossEntropyLoss) to the binary cross-entropy loss (nn.BCEWithLogitsLoss).
  3. Thresholding:
    At evaluation time, apply a sigmoid to the logit and threshold it (e.g. at 0.5) to get the predicted class.

Sample snippet:
import torch
import torch.nn as nn
from transformers import BertModel, BertTokenizer

class SentimentClassifier(nn.Module):
    def __init__(self, pretrained_model_name, num_classes=2):
        super(SentimentClassifier, self).__init__()
        self.bert = BertModel.from_pretrained(pretrained_model_name)
        self.dropout = nn.Dropout(0.1)
        self.linear = nn.Linear(self.bert.config.hidden_size, num_classes)

    def forward(self, input_ids, attention_mask):
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        pooled_output = outputs.pooler_output
        pooled_output = self.dropout(pooled_output)
        logits = self.linear(pooled_output)
        return logits

# Initialize the BERT model and tokenizer
pretrained_model_name = 'bert-base-uncased'
tokenizer = BertTokenizer.from_pretrained(pretrained_model_name)

# Example data
text = "This movie is really good!"
labels = torch.tensor([1])  # 1 for positive sentiment

# Tokenize input
inputs = tokenizer(text, return_tensors='pt')
input_ids = inputs['input_ids']
attention_mask = inputs['attention_mask']

# Initialize and load the model
model = SentimentClassifier(pretrained_model_name, num_classes=1)  # Binary classification

# Forward pass
logits = model(input_ids, attention_mask)

# Loss and optimizer
criterion = nn.BCEWithLogitsLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)

# Binary classification expects labels as float (0 or 1)
labels = labels.float()

# Compute loss
loss = criterion(logits.view(-1), labels)

# Backward pass and update
optimizer.zero_grad()
loss.backward()
optimizer.step()

# Evaluation
predictions = torch.sigmoid(logits) > 0.5  # Threshold at 0.5
predicted_labels = predictions.long()

print("Predicted label:", predicted_labels.item())

Hope this helps
Thanks
